<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:podcast="https://podcastindex.org/namespace/1.0"
    xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/"
    xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"
    xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
    xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:spotify="http://www.spotify.com/ns/rss">
    <channel>
        <title>The Cloud Pod | Weekly AI &amp; Cloud News on AWS, Azure &amp; GCP</title>
        <generator>Castos</generator>
        <atom:link href="https://feeds.castos.com/kqk1" rel="self" type="application/rss+xml" />
        <link>https://www.thecloudpod.net/series/tcp-show/</link>
        <description>The Cloud Pod delivers weekly cloud computing and AI news for engineers, architects, and technology leaders. Join Justin Brodley, Jonathan Baker, Ryan Lucas, and Matt Kohn as they break down the latest from AWS, Azure, and Google Cloud — covering new services, platform updates, FinOps strategies, and the AI innovations reshaping the industry. Stay ahead of the cloud landscape with one of the longest-running cloud computing podcasts available.</description>
        <lastBuildDate>Tue, 05 May 2026 18:20:13 +0000</lastBuildDate>
        <language>en-Us</language>
        <copyright>© 2026 The Cloud Pod</copyright>
        
        <spotify:limit recentCount="500" />
        
        <spotify:countryOfOrigin>
            US  
        </spotify:countryOfOrigin>
                    <image>
                <url>https://episodes.castos.com/5e2d2c4b117f29-10227663/images/podcast/covers/c1a-k5d5-v6w3g0qkc2d0-hkelyt.png</url>
                <title>The Cloud Pod | Weekly AI &amp; Cloud News on AWS, Azure &amp; GCP</title>
                <link>https://www.thecloudpod.net/series/tcp-show/</link>
            </image>
                <itunes:subtitle>The Cloud Pod delivers weekly cloud computing and AI news for engineers, architects, and technology leaders. Join Justin Brodley, Jonathan Baker, Ryan Lucas, and Matt Kohn as they break down the latest from AWS, Azure, and Google Cloud — covering new services, platform updates, FinOps strategies, and the AI innovations reshaping the industry. Stay ahead of the cloud landscape with one of the longest-running cloud computing podcasts available.</itunes:subtitle>
        <itunes:author>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</itunes:author>
        <itunes:type>episodic</itunes:type>
        <itunes:summary>The Cloud Pod delivers weekly cloud computing and AI news for engineers, architects, and technology leaders. Join Justin Brodley, Jonathan Baker, Ryan Lucas, and Matt Kohn as they break down the latest from AWS, Azure, and Google Cloud — covering new services, platform updates, FinOps strategies, and the AI innovations reshaping the industry. Stay ahead of the cloud landscape with one of the longest-running cloud computing podcasts available.</itunes:summary>
        <itunes:owner>
            <itunes:name>TCP.FM</itunes:name>
            <itunes:email>justin@thecloudpod.net</itunes:email>
        </itunes:owner>
        <itunes:explicit>false</itunes:explicit>
                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/podcast/covers/c1a-k5d5-v6w3g0qkc2d0-hkelyt.png"></itunes:image>
        
                                    <itunes:category text="Technology" />
                                                <itunes:category text="Business">
                                            <itunes:category text="Management" />
                                    </itunes:category>
                    
                    <itunes:new-feed-url>https://feeds.castos.com/kqk1</itunes:new-feed-url>
                
        
        <podcast:locked>yes</podcast:locked>
                                    <item>
                <title>
                    <![CDATA[352: Google Next: Rebrandapalooza]]>
                </title>
                <pubDate>Tue, 05 May 2026 18:20:13 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2454749</guid>
                                    <link>https://tcpfm.castos.com/episodes/352-google-next-rebrandapalooza</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 352 of The Cloud Pod, where the weather is always cloudy! Justin, Matt, and Ryan are safely back from Vegas (Ryan and Justin, anyway), and they have all the news and announcements from Google Next. Plus, we have Ryan’s take on Phish, news from Cloudflare, and a shoe company making a pivot. There’s a lot to cover, so let’s get started! 
</p>
<h3>Titles we almost went with this week</h3>
<ul>
<li>Redact Yourself Before You Wreck Yourself OpenAI **Anthropic</li>
<li>Fork Yeah Cloudflare Artifacts Is Here</li>
<li>Git Happens at Scale on Cloudflare</li>
<li>Bucket List Item Checked Lambda Mounts S3 File Systems</li>
<li>Terraform Your Agents Before They Terraform You</li>
<li>Cloud Run Gets GPUs and Finally Hits the Gym</li>
<li>Spanner Goes Rogue, Leaves the Cloud Behind</li>
<li>Knowledge Catalog Knows What Your Agents Did Last Query</li>
<li>One Control Plane to Rule a Million Chips</li>
<li>No More Incognito Windows for Your AWS Identity Crisis</li>
<li>Your Agent Can Now Write Files Without Burning Everything Down</li>
<li>Spend Caps Finally Tell Runaway AI Jobs to Chill</li>
<li>RIP Vertex, long live the agent</li>
<li>Agents all the way down</li>
<li>Google Next: This is the dawning of the Age of Agentic</li>
<li>Allbirds Proves AI Hype Needs No Infrastructure</li>
</ul>
<p> </p>
<p> A big thanks to this week’s sponsors:</p>
<p>There are a lot of cloud cost management tools out there, but only Archera provides insured commitments. It sounds fancy, but it’s really simple. Archera gives you the cost savings of a 1 or 3-year AWS Savings Plan with a commitment as short as 30 days. If you do not use all the cloud resources you have committed to, Archera will literally cover the difference. Other cost management tools may say they offer “insured commitments”, but remember to ask: Will you actually give me my rebate? Because Archera will. </p>
<p>Check out <a href="http://www.thecloudpod.net/archera">thecloudpod.net/archera</a> to schedule a demo today. </p>
<p>We also wanted to tell you about something coming to the US for the first time — WeAreDevelopers World Congress! </p>
<p>They’ve been doing this in Europe for years, 15,000-plus attendees in Berlin, it’s one of the biggest developer events over there. Coté from Software Defined Talk is actually speaking at their Berlin event this summer, so we’ve got some firsthand context here. In September, they’re launching the North America edition. San José, September 23 to 25. 500-plus speakers, 18 tracks — cloud, infrastructure, DevOps, security, AI, data engineering, all of it. Speakers from Datadog, Honeycomb, Sentry, Google, LinkedIn, and Stack Overflow. Olivier Pomel, Christine Yen, Milin Desai, Kelsey Hightower – plus workshops and masterclasses, not just talks. These are people who know how to do a developer conference at scale. wearedevelopers.us, code DEVPOD26 for 15% off. Group rates on top of that for 4 or more.</p>
<h2>General News </h2>
<p>06:12 <a href="https://www.cnbc.com/2026/04/20/amazon-invest-up-to-25-billion-in-anthropic-part-of-ai-infrastructure.html">Amazon invest up to $25 billion in Anthropic part of AI infrastructure</a></p>
<ul>
<li style="font-weight:400;"><a href="http://amazon.com">Amazon</a> has committed up to $25 billion in additional investment in <a href="https://www.anthropic.com/">Anthropic</a>, bringing its total potential investment to $33 billion. The latest $5 billion tranche is based on Anthropic’s $380 billion valuation, with up to $20 billion more tied to commercial milestones.</li>
<li style="font-weight:400;">In exchange, Anthropic has committed to spending over $100 billion on AWS over the next decade, with a specific focus on Trainium custom AI chips, and plans to bring nearly 1 gigawatt of <a href="https://aws.amazon.com/ai/machine-learning/trainium/">Trainium2 and Trainium3</a> capacity online by end of the year.</li>
<li style="font-weight:400;">Anthropic cited real infrastructure strain from growing enterprise...</li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:00) - WerDevelopers World Congress Coming to San Jose</li><li>(00:02:03) - The Eagles at the Sphere in Vegas</li><li>(00:05:37) - The Secret Life of the Sphere</li><li>(00:07:23) - Amazon, Google Invest $40 Million in OpenAI</li><li>(00:11:57) - SpaceX Buys AI Coding Startup Cursor</li><li>(00:14:36) - OpenAI's Agent SDK Unveils</li><li>(00:20:23) - Don't Blame AI for Layoffs</li><li>(00:20:45) - Claude Opus 4.7</li><li>(00:28:37) - Archera and Claude Design: Cloud Design Preview</li><li>(00:31:55) - Cloudflare's First Agents Week</li><li>(00:33:10) - Cloudflare Launches Artifact File System and Private Beta</li><li>(00:36:58) - OpenAI's GPT 5.5 for Cloud & Enterprise Work</li><li>(00:38:44) - OpenAI's ChatGPT: Workflow Agents for Enterprise</li><li>(00:43:54) - AWS Interconnect now generally available for Google Cloud, Azure and</li><li>(00:45:18) - Amazon Quicksight Launches Desktop With New Features</li><li>(00:48:35) - Amazon CloudWatch: Auditing Telemetry Configuration across multiple regions</li><li>(00:52:35) - Anthropic for AWS: S3 Files and More</li><li>(00:57:58) - Amazon Bedrock Agent Core: New Features, New CLI, and</li><li>(00:59:40) - Google's Dev Signal: Text to Speech AI</li><li>(01:01:58) - 2018 Cloud Conference</li><li>(01:03:19) - Orion Comes in Strong With Gemini 3.1 Pro</li><li>(01:05:04) - AI Conference 2017: Who Won?</li><li>(01:08:20) - How to Rank the AI Announcements</li><li>(01:09:04) - Gemini Enterprise Agent Platform Build 1.4</li><li>(01:10:09) - Torch TPU: From Inference to Scale</li><li>(01:11:55) - Wiz AI Expands to AWS, Cloud and Salesforce</li><li>(01:18:22) - GKE Cloud: Tier 3, BigQuery AI and More</li><li>(01:21:46) - Gemini Enterprise Announcements 2017</li><li>(01:22:58) - Google Cloud Agent Skill Repo</li><li>(01:24:57) - Wizard & Agentic Conference 2017: A Bigger Conference than</li><li>(01:29:37) - Microsoft Azure: Intelligent Tiering, and More</li><li>(01:32:23) - Entre 2.0 Announcement and Governance</li><li>(01:33:12) - Azure SRE Agent now supports KQL and Application Insights</li><li>(01:34:19) - Azure 2.8</li><li>(01:34:32) - Azure Key Vault: Migrating to the Modern HSM</li><li>(01:35:36) - Cloud News: Google's AI Marathon</li><li>(01:36:59) - Allbirds to Rebrand as AI Clothes</li><li>(01:41:45) - What Happened to AI?</li><li>(01:45:16) - Facebook Is Trying to Become an AI Company</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 352 of The Cloud Pod, where the weather is always cloudy! Justin, Matt, and Ryan are safely back from Vegas (Ryan and Justin, anyway), and they have all the news and announcements from Google Next. Plus, we have Ryan’s take on Phish, news from Cloudflare, and a shoe company making a pivot. There’s a lot to cover, so let’s get started! 

Titles we almost went with this week

Redact Yourself Before You Wreck Yourself OpenAI **Anthropic
Fork Yeah Cloudflare Artifacts Is Here
Git Happens at Scale on Cloudflare
Bucket List Item Checked Lambda Mounts S3 File Systems
Terraform Your Agents Before They Terraform You
Cloud Run Gets GPUs and Finally Hits the Gym
Spanner Goes Rogue, Leaves the Cloud Behind
Knowledge Catalog Knows What Your Agents Did Last Query
One Control Plane to Rule a Million Chips
No More Incognito Windows for Your AWS Identity Crisis
Your Agent Can Now Write Files Without Burning Everything Down
Spend Caps Finally Tell Runaway AI Jobs to Chill
RIP Vertex, long live the agent
Agents all the way down
Google Next: This is the dawning of the Age of Agentic
Allbirds Proves AI Hype Needs No Infrastructure

 
 A big thanks to this week’s sponsors:
There are a lot of cloud cost management tools out there, but only Archera provides insured commitments. It sounds fancy, but it’s really simple. Archera gives you the cost savings of a 1 or 3-year AWS Savings Plan with a commitment as short as 30 days. If you do not use all the cloud resources you have committed to, Archera will literally cover the difference. Other cost management tools may say they offer “insured commitments”, but remember to ask: Will you actually give me my rebate? Because Archera will. 
Check out thecloudpod.net/archera to schedule a demo today. 
We also wanted to tell you about something coming to the US for the first time — WeAreDevelopers World Congress! 
They’ve been doing this in Europe for years, 15,000-plus attendees in Berlin, it’s one of the biggest developer events over there. Coté from Software Defined Talk is actually speaking at their Berlin event this summer, so we’ve got some firsthand context here. In September, they’re launching the North America edition. San José, September 23 to 25. 500-plus speakers, 18 tracks — cloud, infrastructure, DevOps, security, AI, data engineering, all of it. Speakers from Datadog, Honeycomb, Sentry, Google, LinkedIn, and Stack Overflow. Olivier Pomel, Christine Yen, Milin Desai, Kelsey Hightower – plus workshops and masterclasses, not just talks. These are people who know how to do a developer conference at scale. wearedevelopers.us, code DEVPOD26 for 15% off. Group rates on top of that for 4 or more.
General News 
06:12 Amazon invest up to $25 billion in Anthropic part of AI infrastructure

Amazon has committed up to $25 billion in additional investment in Anthropic, bringing its total potential investment to $33 billion. The latest $5 billion tranche is based on Anthropic’s $380 billion valuation, with up to $20 billion more tied to commercial milestones.
In exchange, Anthropic has committed to spending over $100 billion on AWS over the next decade, with a specific focus on Trainium custom AI chips, and plans to bring nearly 1 gigawatt of Trainium2 and Trainium3 capacity online by end of the year.
Anthropic cited real infrastructure strain from growing enterprise...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[352: Google Next: Rebrandapalooza]]>
                </itunes:title>
                                    <itunes:episode>352</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 352 of The Cloud Pod, where the weather is always cloudy! Justin, Matt, and Ryan are safely back from Vegas (Ryan and Justin, anyway), and they have all the news and announcements from Google Next. Plus, we have Ryan’s take on Phish, news from Cloudflare, and a shoe company making a pivot. There’s a lot to cover, so let’s get started! 
</p>
<h3>Titles we almost went with this week</h3>
<ul>
<li>Redact Yourself Before You Wreck Yourself OpenAI **Anthropic</li>
<li>Fork Yeah Cloudflare Artifacts Is Here</li>
<li>Git Happens at Scale on Cloudflare</li>
<li>Bucket List Item Checked Lambda Mounts S3 File Systems</li>
<li>Terraform Your Agents Before They Terraform You</li>
<li>Cloud Run Gets GPUs and Finally Hits the Gym</li>
<li>Spanner Goes Rogue, Leaves the Cloud Behind</li>
<li>Knowledge Catalog Knows What Your Agents Did Last Query</li>
<li>One Control Plane to Rule a Million Chips</li>
<li>No More Incognito Windows for Your AWS Identity Crisis</li>
<li>Your Agent Can Now Write Files Without Burning Everything Down</li>
<li>Spend Caps Finally Tell Runaway AI Jobs to Chill</li>
<li>RIP Vertex, long live the agent</li>
<li>Agents all the way down</li>
<li>Google Next: This is the dawning of the Age of Agentic</li>
<li>Allbirds Proves AI Hype Needs No Infrastructure</li>
</ul>
<p> </p>
<p> A big thanks to this week’s sponsors:</p>
<p>There are a lot of cloud cost management tools out there, but only Archera provides insured commitments. It sounds fancy, but it’s really simple. Archera gives you the cost savings of a 1 or 3-year AWS Savings Plan with a commitment as short as 30 days. If you do not use all the cloud resources you have committed to, Archera will literally cover the difference. Other cost management tools may say they offer “insured commitments”, but remember to ask: Will you actually give me my rebate? Because Archera will. </p>
<p>Check out <a href="http://www.thecloudpod.net/archera">thecloudpod.net/archera</a> to schedule a demo today. </p>
<p>We also wanted to tell you about something coming to the US for the first time — WeAreDevelopers World Congress! </p>
<p>They’ve been doing this in Europe for years, 15,000-plus attendees in Berlin, it’s one of the biggest developer events over there. Coté from Software Defined Talk is actually speaking at their Berlin event this summer, so we’ve got some firsthand context here. In September, they’re launching the North America edition. San José, September 23 to 25. 500-plus speakers, 18 tracks — cloud, infrastructure, DevOps, security, AI, data engineering, all of it. Speakers from Datadog, Honeycomb, Sentry, Google, LinkedIn, and Stack Overflow. Olivier Pomel, Christine Yen, Milin Desai, Kelsey Hightower – plus workshops and masterclasses, not just talks. These are people who know how to do a developer conference at scale. wearedevelopers.us, code DEVPOD26 for 15% off. Group rates on top of that for 4 or more.</p>
<h2>General News </h2>
<p>06:12 <a href="https://www.cnbc.com/2026/04/20/amazon-invest-up-to-25-billion-in-anthropic-part-of-ai-infrastructure.html">Amazon invest up to $25 billion in Anthropic part of AI infrastructure</a></p>
<ul>
<li style="font-weight:400;"><a href="http://amazon.com">Amazon</a> has committed up to $25 billion in additional investment in <a href="https://www.anthropic.com/">Anthropic</a>, bringing its total potential investment to $33 billion. The latest $5 billion tranche is based on Anthropic’s $380 billion valuation, with up to $20 billion more tied to commercial milestones.</li>
<li style="font-weight:400;">In exchange, Anthropic has committed to spending over $100 billion on AWS over the next decade, with a specific focus on Trainium custom AI chips, and plans to bring nearly 1 gigawatt of <a href="https://aws.amazon.com/ai/machine-learning/trainium/">Trainium2 and Trainium3</a> capacity online by end of the year.</li>
<li style="font-weight:400;">Anthropic cited real infrastructure strain from growing enterprise and consumer demand for <a href="https://claude.ai/new">Claude</a>, noting reliability and performance impacts, which gives this deal a practical operational motivation beyond financial positioning.</li>
<li style="font-weight:400;">Amazon is now a substantial investor in both Anthropic and <a href="https://openai.com/">OpenAI</a>, having committed up to $50 billion to OpenAI in February, which raises notable questions for developers about how AWS positions competing AI platforms on its infrastructure.</li>
<li style="font-weight:400;">With Anthropic also holding compute agreements with <a href="https://azure.microsoft.com/en-us/get-started/azure-portal/">Microsoft Azure</a> and <a href="http://google.com">Google</a>, and now securing up to 5 gigawatts of total capacity, the company is distributing its infrastructure across multiple providers despite naming AWS its primary training partner.</li>
</ul>
<p>08:46  Justin – “The big question is going to be when one of these companies – OpenAI or Anthropic – finally goes public, and they start publishing these things; what people’s actual reaction is to their financials .” </p>
<p>10:48 <a href="https://www.businessinsider.com/spacex-cursor-coding-xai-deal-acquisition-2026-4">SpaceX Strikes $60 Billion Deal for Right to Buy Coding Startup Cursor</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.businessinsider.com/spacex-elon-musk">SpaceX</a> struck a deal with AI coding startup <a href="https://cursor.com/">Cursor</a>, valued at either a $60 billion acquisition or a $10 billion partnership fee, giving Cursor access to <a href="https://x.ai/">xAI</a>‘s <a href="https://x.ai/colossus">Colossus supercomputer</a>, which runs 200,000 <a href="https://www.nvidia.com/en-us/data-center/h100/">Nvidia H100</a>-equivalent GPUs for model training.</li>
<li style="font-weight:400;">Cursor had been compute-constrained despite reaching $1 billion in annual recurring revenue and a $29.3 billion valuation, so this deal directly addresses their infrastructure bottleneck for scaling model intelligence.</li>
<li style="font-weight:400;">The partnership positions SpaceX to compete in the AI coding tools space against <a href="https://www.anthropic.com/">Anthropic</a> and others, notable given xAI’s <a href="https://grok.com/">Grok</a> has publicly acknowledged falling behind competitors in coding capabilities.</li>
<li style="font-weight:400;">For developers and cloud users, this deal signals continued consolidation between compute providers and AI coding tools, which could influence pricing, model availability, and platform lock-in decisions for teams building on AI-assisted development workflows.</li>
<li style="font-weight:400;">SpaceX’s recent acquisition of xAI, combined with this Cursor deal, suggests a vertical integration strategy connecting rocket-company compute infrastructure directly to developer-facing AI products ahead of a potential IPO later this year.</li>
</ul>
<p>12:27  Justin – “The thing I don’t get is the $10 billion partnership versus the $60 billion acquisition. What’s the triggering events on those things? When is it a partnership, versus when is it now an acquisition? And does that mean that these people who are working at Cursor – if it’s a partnership, aren’t getting equity? That’s a bummer.” </p>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>13:36 <a href="https://openai.com/index/the-next-evolution-of-the-agents-sdk">The next evolution of the Agents SDK </a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/">OpenAI</a> has updated its <a href="https://developers.openai.com/api/docs/guides/agents">Agents SDK</a> to general availability, adding native sandbox execution, configurable memory, and filesystem tools modeled after Codex. </li>
<li style="font-weight:400;">Agents can now read and write files, run shell commands, install dependencies, and apply code patches within controlled environments without developers building that infrastructure themselves.</li>
<li style="font-weight:400;">The SDK introduces a Manifest abstraction that standardizes how agent workspaces are defined across sandbox providers, including <a href="https://blaxel.ai/">Blaxel</a>, <a href="https://www.cloudflare.com/">Cloudflare</a>, <a href="https://e2b.dev/">E2B</a>, <a href="https://modal.com/">Modal</a>, and <a href="https://vercel.com/">Vercel</a>, with storage integrations for <a href="https://aws.amazon.com/s3/">AWS S3</a>, <a href="https://cloud.google.com/storage4">Google Cloud Storage</a>, <a href="https://azure.microsoft.com/en-us/products/storage/blobs/">Azure Blob Storage</a>, and <a href="https://www.cloudflare.com/developer-platform/products/r2/">Cloudflare R2</a>. </li>
<li style="font-weight:400;">This gives developers a consistent path from local prototype to production deployment.</li>
<li style="font-weight:400;">Built-in snapshotting and rehydration mean a failed or expired sandbox container does not terminate a long-running agent run, as the SDK can restore state in a fresh container from the last checkpoint. This addresses a practical reliability gap for agents working on multi-step tasks.</li>
<li style="font-weight:400;">The SDK incorporates several emerging agentic standards, including <a href="https://modelcontextprotocol.io/">MCP</a> for tool use, <a href="http://agents.md">AGENTS.md</a> for custom instructions, and the skills spec for progressive capability disclosure. </li>
<li style="font-weight:400;">OpenAI positions this as reducing the maintenance burden on developers as these patterns evolve.</li>
<li style="font-weight:400;">The updated SDK is currently Python-only, with TypeScript support planned for a future release. </li>
<li style="font-weight:400;">Pricing follows standard API rates based on tokens and tool use, and features like code mode and subagents are still in development for both language runtimes.</li>
</ul>
<p>13:59  Ryan – “As long as it also logs and has permissions and some sort of boundaries, I don’t have to kill it. It’s just terrifying because we already have people that are just throwing questions into any chat tool, and just then running whatever command it spits out indiscriminately. And now that’s just going to happen at a faster rate.”</p>
<p>19:56 <a href="https://www.anthropic.com/news/claude-opus-4-7">Introducing Claude Opus 4.7</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/news/claude-opus-4-7">Claude Opus 4.7</a> is now generally available across <a href="https://claude.ai/new">Claude</a> products, the API, <a href="https://aws.amazon.com/bedrock/">Amazon Bedrock</a>, <a href="https://cloud.google.com/vertex-ai">Google Cloud Vertex AI</a>, and <a href="https://azure.microsoft.com/en-us/products/ai-foundry/">Microsoft Foundry</a> at the same pricing as <a href="https://www.anthropic.com/news/claude-opus-4-6">Opus 4.6</a>: $5 per million input tokens and $25 per million output tokens. </li>
<li style="font-weight:400;">The model targets complex, long-running agentic coding workflows, with early testers reporting 13% higher resolution on a 93-task coding benchmark and 3x more production task resolution on Rakuten-SWE-Bench compared to Opus 4.6.</li>
<li style="font-weight:400;">Vision capabilities received a notable upgrade, with Opus 4.7 now supporting images up to 2,576 pixels on the long edge, more than three times the resolution of prior Claude models. </li>
<li style="font-weight:400;">This opens up use cases like computer-use agents reading dense screenshots and data extraction from complex technical diagrams, though higher-resolution images will consume more tokens.</li>
<li style="font-weight:400;">Anthropic is using Opus 4.7 as a testbed for cybersecurity safeguards before any broader release of its more capable <a href="https://red.anthropic.com/2026/mythos-preview/">Mythos Preview model</a>. </li>
<li style="font-weight:400;">The model includes automatic detection and blocking of prohibited cybersecurity uses, with a new <a href="https://islandinthenet.com/claude-opus-4-7-and-cyber-verification-programme/">Cyber Verification Program</a> available for legitimate security professionals doing penetration testing or vulnerability research.</li>
<li style="font-weight:400;">A new high effort level sits between the existing high and max settings, giving developers finer control over the reasoning-versus-latency tradeoff. </li>
<li style="font-weight:400;">Developers migrating from Opus 4.6 should note that the updated tokenizer can increase token counts by roughly 1.0 to 1.35 times, depending on content type, and a migration guide is available on the Claude platform.</li>
<li style="font-weight:400;">File system-based memory improvements allow Opus 4.7 to retain notes across multi-session agentic work, reducing the need to re-establish context at the start of each task. </li>
<li style="font-weight:400;">This is particularly relevant for enterprise teams running parallel agent workflows where continuity across long runs matters.</li>
</ul>
<p>21:50  Ryan – “I didn’t realize it’s the same price because every platform that I’m using this in, Opus 4.7 is so much more expensive than 4.6.”</p>
<p>28:14 <a href="https://www.anthropic.com/news/claude-design-anthropic-labs">Introducing Claude Design by Anthropic Labs </a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/">Anthropic</a> launched <a href="https://claude.com/plugins/design">Claude Design</a> in research preview for <a href="https://claude.com/pricing/pro">Pro</a>, <a href="https://claude.com/pricing/max">Max</a>, <a href="https://claude.com/pricing/team">Team</a>, and <a href="https://claude.com/pricing/enterprise">Enterprise</a> subscribers, powered by <a href="https://www.anthropic.com/news/claude-opus-4-7">Claude Opus 4.7</a>. </li>
<li style="font-weight:400;">It enables users to create interactive prototypes, pitch decks, wireframes, and marketing assets through conversational prompts and inline editing controls.</li>
<li style="font-weight:400;">A notable workflow feature is the <a href="https://claude.com/product/claude-code">Claude Code</a> handoff, where finished designs are packaged into a bundle that developers can pass directly to Claude Code for implementation, creating a tighter loop between design and engineering.</li>
<li style="font-weight:400;">Claude Design builds a team-specific design system during onboarding by reading codebases and design files, then automatically applies brand colors, typography, and components to every subsequent project. </li>
<li style="font-weight:400;">Teams can maintain multiple design systems simultaneously.</li>
<li style="font-weight:400;">Early user data from Brilliant suggests complex pages that required 20-plus prompts in other tools needed only 2 prompts in Claude Design, indicating meaningful efficiency gains for interactive prototype creation.</li>
<li style="font-weight:400;">Export options include Canva, PDF, PPTX, and standalone HTML, with organization-scoped sharing and collaborative editing. </li>
<li style="font-weight:400;">For Enterprise customers, the feature is off by default and must be enabled by admins in Organization settings.</li>
</ul>
<p>30:56 <a href="https://blog.cloudflare.com/agents-week-in-review/">Building the agentic cloud: everything we launched during Agents Week </a><a href="https://blog.cloudflare.com/agents-week-in-review/">2026</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.cloudflare.com/">Cloudflare</a> held its first <a href="https://blog.cloudflare.com/welcome-to-agents-week/">Agents Week</a>, shipping a new set of primitives across compute, security, and tooling specifically designed for running AI agents at scale. </li>
<li style="font-weight:400;">The core premise is that traditional cloud infrastructure built around one app serving many users does not fit a model where individual users each run multiple concurrent agents.</li>
<li style="font-weight:400;">On the compute side, Cloudflare launched new environments supporting both full operating system containers for package installation and terminal commands, and lightweight isolates that start in milliseconds for high-scale deployments. </li>
<li style="font-weight:400;">They also shipped a Git-compatible workspace designed for agent-generated code moving from prototype to production.</li>
<li style="font-weight:400;">Security and identity were treated as built-in defaults rather than add-ons, with new tools for connecting agents to private networks and managing autonomous actions taken on behalf of users across an organization.</li>
<li style="font-weight:400;">The agent toolbox additions include inference, search, memory, voice, email, and a browser primitive, giving agents the ability to perceive, remember, and communicate without developers assembling separate third-party services.</li>
<li style="font-weight:400;">Cloudflare also addressed the web infrastructure side, releasing tools for existing websites to control bot access, package content for agent consumption, and measure their readiness for agent-driven traffic, acknowledging that most of the current web was built for human browsers rather than automated agents.</li>
</ul>
<p>31:40  Justin – “I look forward to Cloudflare taking down Cloudflare, and then writing an RCA with these great tools.” </p>
<p>32:14 <a href="https://blog.cloudflare.com/artifacts-git-for-agents-beta/">Artifacts: versioned storage that speaks Git</a></p>
<ul>
<li style="font-weight:400;">Cloudflare launched Artifacts in private beta, a versioned file system built on Git that lets developers and agents programmatically create, fork, and manage Git repositories at scale via a <a href="https://developers.cloudflare.com/artifacts/api/rest-api/">REST API</a> and native <a href="https://developers.cloudflare.com/workers/ci-cd/builds/">Workers API</a>, with public beta targeted for early May 2026.</li>
<li style="font-weight:400;">The system is built on Durable Objects with a <a href="https://blog.cloudflare.com/artifacts-git-for-agents-beta/#why-git-whats-a-versioned-file-system">Git server</a> written in <a href="https://ziglang.org/">Zig</a> and compiled to a roughly 100KB WebAssembly binary, enabling tens of millions of isolated repo instances per namespace while handling the full Git smart HTTP protocol with zero external dependencies.</li>
<li style="font-weight:400;">Cloudflare is also <a href="https://github.com/cloudflare/artifact-fs">open-sourcing ArtifactFS</a>, a filesystem driver that mounts large Git repos using a blobless clone and lazy file hydration, reducing startup times for multi-gigabyte repos from roughly 2 minutes down to 10-15 seconds, which at 10,000 sandbox jobs per month translates to approximately 2,778 compute hours saved.</li>
<li style="font-weight:400;">Beyond source control, Artifacts supports use cases like per-session agent state persistence, customer config versioning with rollback, and session forking, using Git semantics such as diff, revert, and clone as a general-purpose state management layer rather than just a code storage tool.</li>
<li style="font-weight:400;">Pricing is designed for agent-scale workloads, charging based on storage consumed and operations performed rather than repo count, with plans to bring Artifacts to the Workers Free plan with fair use limits as the <a href="https://forms.gle/DwBoPRa3CWQ8ajFp7">beta</a> progresses.</li>
</ul>
<p>32:54  Justin – “…another way it’s going to take down Cloudflare, so I look forward to that.” </p>
<p>34:53 <a href="https://www.snowflake.com/content/snowflake-site/global/en/blog/enterprise-ai-agent-platform">Cortex Agents: The Platform Powering Snowflake Intelligence and </a><a href="https://www.snowflake.com/content/snowflake-site/global/en/blog/enterprise-ai-agent-platform">Enterprise AI Agents</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.snowflake.com/en/">Snowflake</a> is launching <a href="https://www.snowflake.com/en/blog/enterprise-ai-agent-platform/">Cortex Agents</a> as a full enterprise agent platform with several capabilities now generally available, including multi-tenancy with row-level data isolation, agent versioning with commit-based rollback, resource budgets for per-agent and per-team spending controls, and <a href="https://docs.snowflake.com/en/en/user-guide/snowflake-cortex/cortex-agents-evaluations">Cortex Agent Evaluations</a> using their GPA (Goal-Plan-Action) framework.</li>
<li style="font-weight:400;">MCP connector support is coming soon to GA, allowing Cortex Agents to connect natively to external tools like <a href="https://login.salesforce.com/">Salesforce</a>, <a href="https://www.atlassian.com/software/jira">Jira</a>, <a href="https://github.com/">GitHub</a>, <a href="https://slack.com/signin#/signin">Slack</a>, and <a href="https://workspace.google.com/">Google Workspace</a> using the Model Context Protocol standard, with the same Snowflake role-based governance applied to those external connections.</li>
<li style="font-weight:400;">The <a href="https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-agents-code-execution-tool">Code Execution Tool</a> (public preview soon) gives agents a sandboxed Python environment with session-level isolation, letting agents generate and run code on demand during conversations without accessing data outside the current session scope.</li>
<li style="font-weight:400;">The GPA evaluation framework is a notable technical detail here: in benchmark testing against TRAIL/GAIA, it captured 95% of human-annotated errors compared to a 55% baseline, and localized errors to specific trace spans with 86% accuracy, giving teams a structured alternative to subjective human review.</li>
<li style="font-weight:400;">The cost governance model is more granular than typical platforms, supporting both agent-level and per-team shared budgets with configurable threshold actions, such as alerts at 80% spend and automatic access revocation at 100%, which addresses a practical concern for enterprises deploying agents across multiple business units.</li>
</ul>
<p>35:06  Justin – “If you need your agents close to your data, this is a great way to do it. I definitely would look into cost with this one, because Snowflake is not cheap.” </p>
<p>36:11 <a href="https://openai.com/index/introducing-gpt-5-5">Introducing GPT-5.5</a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/index/introducing-gpt-5-5/">GPT-5.5</a> is now generally available in <a href="https://chatgpt.com/">ChatGPT</a> and <a href="https://openai.com/codex/">Codex</a> for <a href="https://chatgpt.com/plans/plus/">Plus</a>, <a href="https://chatgpt.com/plans/pro/">Pro</a>, <a href="https://chatgpt.com/business/business-plan/?openaicom-did=ad04b478-4d65-49f6-a679-e4f8c7cd9645&amp;openaicom_referred=true">Business</a>, and <a href="https://chatgpt.com/business/enterprise/?openaicom-did=ad04b478-4d65-49f6-a679-e4f8c7cd9645&amp;openaicom_referred=true">Enterprise</a> users, with API access priced at $5 per 1M input tokens and $30 per 1M output tokens, and a Pro variant at $30 input and $180 output per 1M tokens.</li>
<li style="font-weight:400;">The model shows notable agentic coding improvements, scoring 82.7% on Terminal-Bench 2.0 and 58.6% on SWE-Bench Pro, while using fewer tokens than GPT-5.4 to complete the same tasks, which partially offsets the higher per-token cost.</li>
<li style="font-weight:400;">For cloud and enterprise workloads, GPT-5.5 was co-designed with and served on NVIDIA GB200 and GB300 NVL72 systems, with inference optimizations including dynamic load balancing heuristics that increased token generation speeds by over 20%.</li>
<li style="font-weight:400;">Knowledge work benchmarks are worth noting for enterprise buyers: 84.9% on GDPval across 44 occupations, 78.7% on OSWorld-Verified for autonomous computer use, and 98.0% on Tau2-bench Telecom for customer service workflows, suggesting practical applicability across business functions.</li>
<li style="font-weight:400;">OpenAI is classifying GPT-5.5 as High under its Preparedness Framework for both cybersecurity and biological capabilities, and is introducing a <a href="https://openai.com/index/trusted-access-for-cyber/">Trusted Access for Cyber program</a> through Codex that gives verified defenders expanded access with fewer restrictions, which has direct implications for security teams evaluating AI-assisted vulnerability management.</li>
</ul>
<p>37:31  Ryan – “That’s kind of cool. That’s the first I’m hearing of those kind of frameworks for their testing, and testing the safety AI aspects and having a rating, which I like.”   </p>
<p>37:58 <a href="https://openai.com/index/introducing-workspace-agents-in-chatgpt">Introducing workspace agents in ChatGPT</a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/">OpenAI</a> is launching workspace agents in <a href="https://chatgpt.com/">ChatGPT</a> as a research preview for Business, Enterprise, Edu, and Teachers plans, positioning them as an evolution of GPTs powered by <a href="https://openai.com/codex/">Codex</a> and designed for shared team workflows rather than individual use.</li>
<li style="font-weight:400;">These agents run persistently in the cloud, meaning they can continue working on long-running tasks without user interaction, and can be triggered on a schedule or deployed directly in Slack to handle incoming requests automatically.</li>
<li style="font-weight:400;">The practical use cases OpenAI highlights include a lead outreach agent that reduced 5-6 hours of weekly rep work to an automated background process, and an accounting agent that handles month-end close tasks, including journal entries and variance analysis in minutes.</li>
<li style="font-weight:400;">(mention privacy filters) On the enterprise controls side, admins get role-based access management, a Compliance API for auditing every agent configuration and run, built-in prompt injection safeguards, and the ability to suspend agents, which addresses a common concern about autonomous agents operating within sensitive business environments.</li>
<li style="font-weight:400;">Pricing is worth noting for teams evaluating adoption: workspace agents are free until May 6, 2026, after which credit-based pricing kicks in, giving organizations a window to test and build before committing to costs.</li>
</ul>
<p>38:42 <a href="https://openai.com/index/introducing-openai-privacy-filter">Introducing OpenAI Privacy Filter </a></p>
<ul>
<li style="font-weight:400;">OpenAI released <a href="https://openai.com/index/introducing-openai-privacy-filter/">Privacy Filter</a>, an open-weight 1.5B parameter model (with only 50M active parameters) for detecting and redacting PII in text, available now on <a href="https://huggingface.co/openai/privacy-filter">Hugging Face</a> and <a href="https://github.com/openai/privacy-filter">GitHub</a> under the Apache 2.0 license for free commercial use and fine-tuning.</li>
<li style="font-weight:400;">The model uses a bidirectional token-classification architecture with constrained Viterbi span decoding, processing up to 128,000 tokens in a single forward pass across eight PII categories, including private persons, addresses, account numbers, and secrets like API keys and passwords.</li>
<li style="font-weight:400;">A key practical advantage for cloud and on-premise deployments is that the model runs locally, meaning sensitive data never needs to leave the device for de-identification, which directly reduces exposure risk in logging, indexing, and training pipelines.</li>
<li style="font-weight:400;">Performance benchmarks show a 97.43% F1 score on the corrected <a href="https://huggingface.co/datasets/ai4privacy/pii-masking-300k">PII-Masking-300k benchmark</a>, and fine-tuning on small domain-specific datasets can lift accuracy from 54% to 96% F1, making it adaptable for legal, medical, and financial workflows.</li>
<li style="font-weight:400;">OpenAI explicitly notes this is not a compliance certification or anonymization guarantee, and recommends human review in high-stakes settings, which is an important caveat for developers considering it as a drop-in solution for regulated industries.</li>
</ul>
<p>38:52  Justin – “If you’re looking for a lightweight built-in option inside of Codex to find privacy PII, this little model sits on top of it and does great work.”</p>
<p>40:33 <a href="https://openai.com/index/introducing-chatgpt-images-2-0">Introducing ChatGPT Images 2.0 </a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/index/introducing-chatgpt-images-2-0/">ChatGPT Images 2.0</a> can now handle small text, UI elements, icons, and complex layouts at up to 2K resolution; no more getting something “close enough. It actually delivers what you asked for.</li>
<li style="font-weight:400;">Previous versions struggled outside of Latin-based text, but now it has solid support for Japanese, Korean, Chinese, Hindi, and Bengali, where the language is baked into the design itself.</li>
<li style="font-weight:400;">When paired with a reasoning model, it can search the web, plan the image structure, self-check its work, and even produce multiple distinct images from a single prompt.</li>
<li style="font-weight:400;">Images 2.0 supports everything from wide 3:1 banners to tall 1:3 mobile screens. Useful for social graphics, presentations, posters, and more, all without manual resizing.</li>
<li style="font-weight:400;">This replaces the back-and-forth between prompting, designing, and editing. You describe what you need, it researches, writes, and visualizes from start to finish.</li>
</ul>
<p>41:29  Matt – “I like that it can do multiple at the same time. That’s a nice feature.” </p>
<h2>Cloud Tools </h2>
<p>42:19 <a href="https://blog.cloudflare.com/registrar-api-beta/">Register domains wherever you build: Cloudflare Registrar API now in beta</a></p>
<ul>
<li style="font-weight:400;"><a href="https://blog.cloudflare.com/registrar-api-beta/">Cloudflare Registrar API</a> is now in beta, allowing developers to search, check availability, and register domains programmatically through three straightforward API endpoints, keeping the entire workflow inside editors, terminals, or agent-driven tools.</li>
<li style="font-weight:400;">The API integrates directly with Cloudflare’s MCP server, meaning <a href="https://www.cloudflare.com/learning/ai/what-is-agentic-ai/">agents</a> in environments like <a href="https://cursor.com/home">Cursor</a> or <a href="https://claude.com/product/claude-code">Claude Code</a> can already discover and call <a href="https://www.cloudflare.com/products/registrar/">Registrar</a> endpoints without any additional integration or custom tool definitions.</li>
<li style="font-weight:400;">Cloudflare maintains its at-cost pricing model through the API, charging exactly what the registry charges with no markup, and <a href="https://www.cloudflare.com/learning/dns/what-is-domain-privacy/">WHOIS privacy protection</a> is enabled by default at no extra charge.</li>
<li style="font-weight:400;">Registration typically completes synchronously within seconds, but the API also handles longer operations by returning a 202 Accepted with a polling URL, using the same response shape either way to simplify agent logic.</li>
<li style="font-weight:400;">The beta currently covers search, check, and registration for a curated set of TLDs, with Cloudflare actively working to expand the API to include transfers, renewals, contact updates, and eventually a broader registrar-as-a-service offering for multi-tenant platforms.</li>
</ul>
<h2>AWS</h2>
<p>43:29 <a href="https://aws.amazon.com/blogs/aws/aws-interconnect-is-now-generally-available-with-a-new-option-to-simplify-last-mile-connectivity/">AWS Interconnect is now generally available, with a new option to simplify </a><a href="https://aws.amazon.com/blogs/aws/aws-interconnect-is-now-generally-available-with-a-new-option-to-simplify-last-mile-connectivity/">last-mile connectivity</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/interconnect/latest/userguide/what-is.html">AWS Interconnect</a> is now generally available in two flavors: <a href="https://docs.aws.amazon.com/interconnect/latest/userguide/what-is.html">multicloud</a> for private Layer 3 connections between AWS and other cloud providers (starting with Google Cloud, Azure coming later in 2026), and <a href="https://aws.amazon.com/interconnect/lastmile/">last-mile for connecting</a> on-premises locations to AWS through network providers like Lumen, AT&amp;T, and Megaport.</li>
<li style="font-weight:400;">The multicloud option uses <a href="https://1.ieee802.org/security/802-1ae/">IEEE 802.1AE MACsec encryption</a> by default on physical links, routes traffic entirely over private backbones without touching the public internet, and includes built-in redundancy across at least two physical facilities. </li>
<li style="font-weight:400;">Pricing is a flat hourly rate based on bandwidth tier and region pair, so check the pricing page before sizing your connection.</li>
<li style="font-weight:400;">Provisioning is handled through the <a href="https://aws.amazon.com/directconnect/">AWS Direct Connect</a> console in a few clicks, generating an activation key that completes the handshake on the partner cloud side. </li>
<li style="font-weight:400;">However, there are gotchas to watch for, including non-overlapping IP ranges, matching MTU settings between VPCs, and consistent IPv4/IPv6 configuration on both sides.</li>
<li style="font-weight:400;">Last-mile connectivity automatically provisions four redundant connections, configures BGP routing, enables MACsec and Jumbo Frames by default, and supports 1 Gbps to 100 Gbps with bandwidth adjustable from the console without reprovisioning. It includes a 99.99% availability SLA up to the Direct Connect port.</li>
<li style="font-weight:400;">Current multicloud availability covers five region pairs across US East, US West, and Europe, connecting to Google Cloud, with last-mile launching in US East N. Virginia only. </li>
<li style="font-weight:400;">The open specification published on GitHub under Apache 2.0 allows other cloud providers to implement the standard and become Interconnect partners.</li>
<li style="font-weight:400;">AWS Interconnect -multicloud pricing is available <a href="https://aws.amazon.com/interconnect/multicloud/pricing/">here</a>, and last-mile pricing can be found <a href="https://aws.amazon.com/interconnect/lastmile/pricing/">here</a>. </li>
</ul>
<p>44:29  Justin – “Good to see it in GA; hopefully it gets expanded out pretty quickly.” </p>
<p>44:43 <a href="https://aws.amazon.com/blogs/machine-learning/amazon-quick-for-marketing-from-scattered-data-to-strategic-action/">Amazon Quick for marketing: From scattered data to strategic action</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/machine-learning/category/artificial-intelligence/amazon-quick-suite/">Amazon Quick</a> is an AI-powered marketing intelligence tool built on AWS that connects to existing tools like HubSpot, Salesforce, Slack, and Adobe to create a unified knowledge graph from scattered marketing data. </li>
<li style="font-weight:400;">Pricing is available <a href="http://aws.amazon.com/quick/pricing">here</a>, with support for MCP and OpenAPI integrations for extending to other systems.</li>
<li style="font-weight:400;">The tool addresses three specific marketing pain points: campaign performance reporting, competitive intelligence, and content creation. Quick claims to reduce competitive analysis from days to 30 minutes and content production from three hours to under 20 minutes.</li>
<li style="font-weight:400;">Quick Flows allow teams to automate recurring tasks like weekly performance summaries and monthly competitive reports on a schedule, shifting work from manual queries to automated delivery. </li>
<li style="font-weight:400;">This is a notable distinction from standard AI chat assistants that require active prompting.</li>
<li style="font-weight:400;">On the security side, Quick runs within the customer’s AWS environment, queries and responses are not used to train external models, and role-based access controls are included. </li>
<li style="font-weight:400;">This positions it as an enterprise-focused offering rather than a consumer AI tool.</li>
<li style="font-weight:400;">The product references an MIT study showing AI cut document creation time by 40% and improved output quality by 18% among 444 professionals, which gives some external grounding to the productivity claims. </li>
<li style="font-weight:400;">Teams considering this should evaluate it against existing point solutions like dedicated BI tools or standalone AI writing assistants they may already have in place.</li>
</ul>
<p>48:12 <a href="https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-cloudwatch-cross-region-enablement-rules/">Amazon CloudWatch now supports cross-region telemetry auditing and </a><a href="https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-cloudwatch-cross-region-enablement-rules/">enablement rules</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/cloudwatch/">CloudWatch</a> now lets customers audit telemetry configuration and enable telemetry from services like <a href="https://aws.amazon.com/ec2/">EC2</a>, <a href="https://docs.aws.amazon.com/vpc/latest/userguide/what-is-amazon-vpc.html">VPC</a>, and <a href="https://docs.aws.amazon.com/cloudtrail/">CloudTrail</a> across multiple regions from a single control point, reducing the operational overhead of managing observability at scale.</li>
<li style="font-weight:400;">Enablement rules can be scoped to specific regions or all supported regions, and rules set to cover all regions automatically expand to include new regions as they become available, which is useful for organizations with growing AWS footprints.</li>
<li style="font-weight:400;">A practical use case is a central security team creating one organization-wide rule for <a href="https://docs.aws.amazon.com/vpc/latest/userguide/flow-logs.html">VPC Flow Logs</a> that consistently applies across every account and region, eliminating gaps in telemetry coverage that could create blind spots.</li>
<li style="font-weight:400;">The feature is available in all AWS commercial regions with standard CloudWatch pricing applying to telemetry ingestion, so costs will scale with the volume of logs and metrics collected rather than the feature itself carrying an additional charge.</li>
<li style="font-weight:400;">For teams managing multi-account AWS Organizations setups, this reduces the risk of misconfigured or missing telemetry in individual accounts, which has historically required custom automation or third-party tooling to enforce consistently.</li>
</ul>
<p>47:58  Ryan – “…this has always been a challenge, even before I was doing security and trying to do log governance across these things, trying to have different serving farms basically in multiple regions and having to log into different web pages to view the metrics on each one. They sort of fix that with the ability to reference metrics in a foreign site a little while ago, but you could only do it for metrics. And so this is definitely something I’m glad to see that you can use.”</p>
<p>50:15 <a href="https://aws.amazon.com/blogs/machine-learning/introducing-granular-cost-attribution-for-amazon-bedrock/">Introducing granular cost attribution for Amazon Bedrock</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/machine-learning/category/artificial-intelligence/amazon-machine-learning/amazon-bedrock/">Amazon Bedrock</a> now automatically attributes inference costs to the <a href="https://docs.aws.amazon.com/IAM/latest/UserGuide/intro-structure.html">IAM principal</a> making the call, with data flowing into <a href="https://docs.aws.amazon.com/cur/latest/userguide/table-dictionary-cur2.html">CUR 2.0</a> via a new line_item_iam_principal column. </li>
<li style="font-weight:400;">This works across all Bedrock models at no additional cost and requires no changes to existing workflows.</li>
<li style="font-weight:400;">The feature supports four distinct access patterns: direct IAM users or <a href="https://docs.aws.amazon.com/bedrock/latest/userguide/api-keys.html">API keys</a>, application roles on AWS compute, federated identity through providers like Okta or Azure AD, and LLM gateway architectures. </li>
<li style="font-weight:400;">Each scenario has different configuration requirements, with the gateway scenario being the most complex since it requires per-user AssumeRole session management to avoid all traffic appearing under a single identity.</li>
<li style="font-weight:400;">Cost allocation tags can be attached to IAM users or roles, or passed dynamically as session tags through identity providers, and once activated in <a href="https://console.aws.amazon.com/billing/">AWS Billing</a>, they appear in Cost Explorer under an iamPrincipal prefix. This enables chargeback reporting by team, project, cost center, or tenant without building custom tracking infrastructure.</li>
<li style="font-weight:400;">For organizations running LLM gateways like LiteLLM or custom proxies, the solution requires the gateway to call AssumeRole per user and cache those credentials for up to one hour, which keeps STS call volume manageable but introduces architectural changes. </li>
<li style="font-weight:400;">The default STS rate limit of 500 AssumeRole calls per second per account may require a limit increase for high-throughput deployments.</li>
<li style="font-weight:400;">Tags take 24 to 48 hours to appear in <a href="https://aws.amazon.com/aws-cost-management/aws-cost-explorer/">Cost Explorer</a> and CUR 2.0 after activation, and IAM principal data must be explicitly enabled in the CUR 2.0 data export configuration before any attribution data will appear.</li>
</ul>
<p>52:12 <a href="https://aws.amazon.com/about-aws/whats-new/2026/04/aws-lambda-amazon-s3/">AWS Lambda functions can now mount Amazon S3 buckets as file systems </a><a href="https://aws.amazon.com/about-aws/whats-new/2026/04/aws-lambda-amazon-s3/">with S3 Files</a></p>
<ul>
<li style="font-weight:400;">Lambda functions can now mount <a href="https://aws.amazon.com/s3/features/files/">S3</a> buckets as file systems using S3 Files, which is built on Amazon EFS, allowing standard file operations without the overhead of downloading objects or managing ephemeral storage limits.</li>
<li style="font-weight:400;">Multiple <a href="https://docs.aws.amazon.com/lambda/latest/dg/durable-functions.html">Lambda functions</a> can connect to the same S3 Files file system simultaneously, enabling shared workspaces without custom synchronization logic, which is particularly useful for multi-step AI and machine learning pipelines.</li>
<li style="font-weight:400;">The integration pairs well with Lambda durable functions, where an orchestrator can clone a repository to a shared workspace while parallel agent functions analyze it, with automatic checkpointing handling execution state.</li>
<li style="font-weight:400;">Configuration is supported through the Lambda console, <a href="https://aws.amazon.com/cli/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">AWS CLI</a>, SDKs, <a href="https://docs.aws.amazon.com/cloudformation/">CloudFormation</a>, and SAM, though the feature is limited to Lambda functions not configured with a capacity provider.</li>
<li style="font-weight:400;">Pricing adds no additional charge beyond standard Lambda and S3 rates, and the feature is available in all AWS regions where both Lambda and S3 Files are supported.</li>
</ul>
<p>52:19  Justin – “Thanks. Could have announced that last week.” </p>
<p>53:41 <a href="https://aws.amazon.com/blogs/machine-learning/from-developer-desks-to-the-whole-organization-running-claude-cowork-in-amazon-bedrock/">From developer desks to the whole organization: Running Claude Cowork </a><a href="https://aws.amazon.com/blogs/machine-learning/from-developer-desks-to-the-whole-organization-running-claude-cowork-in-amazon-bedrock/">in Amazon Bedrock</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/product/claude-cowork">Claude Cowork</a> is a desktop application (macOS and Windows) that lets knowledge workers delegate research, document analysis, data processing, and report generation to <a href="https://claude.ai/new">Claude</a>, with all model inference routed through <a href="https://aws.amazon.com/bedrock/">Amazon Bedrock</a> in your AWS account rather than Anthropic’s infrastructure.</li>
<li style="font-weight:400;">Pricing is consumption-based through your existing AWS agreement with no per-seat licensing from Anthropic, which is a notable distinction from <a href="https://claude.com/pricing/enterprise">Claude Enterprise</a> and could make cost modeling more predictable for organizations with variable usage patterns.</li>
<li style="font-weight:400;">Enterprise security controls are central to the integration, including <a href="https://aws.amazon.com/iam/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">AWS IAM</a> or Bedrock API key authentication, VPC endpoint network isolation, CloudTrail audit logging, and OpenTelemetry export to <a href="https://aws.amazon.com/cloudwatch/">CloudWatch</a>, with Anthropic receiving only aggregate telemetry that can be disabled.</li>
<li style="font-weight:400;">Setup relies on device management tools like Jamf, Microsoft Intune, or Group Policy to push a managed configuration to <a href="https://code.claude.com/docs/en/desktop-quickstart">Claude Desktop</a>, specifying the model ID, Bedrock inference profile, and auth method, which means IT teams control rollout rather than individual users configuring their own credentials.</li>
<li style="font-weight:400;">Organizations already using <a href="https://claude.com/product/claude-code">Claude Code</a> in Amazon Bedrock can reuse the same infrastructure setup for <a href="https://www.anthropic.com/product/claude-cowork">Cowork</a>, and both in-region and cross-region inference profiles are supported to address data residency requirements across different geographies.</li>
</ul>
<p>56:51  Justin – “The problem is that instead of building a proper enterprise backend that would do all the things they want, they partnered with Work OS. And so while Work OS has a bunch of things, it doesn’t have all the things that you would want, and this is a problem also for OpenAI, as well, because they also partner the same way. And Snowflake partners with them. But some have done a better job than others in how they lay out some of these tools.”</p>
<p>57:54 <a href="https://aws.amazon.com/blogs/machine-learning/get-to-your-first-working-agent-in-minutes-announcing-new-features-in-amazon-bedrock-agentcore/">Get to your first working agent in minutes: Announcing new features in </a><a href="https://aws.amazon.com/blogs/machine-learning/get-to-your-first-working-agent-in-minutes-announcing-new-features-in-amazon-bedrock-agentcore/">Amazon Bedrock AgentCore</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/machine-learning/category/artificial-intelligence/amazon-machine-learning/amazon-bedrock/amazon-bedrock-agentcore/">Amazon Bedrock AgentCore</a> now includes a managed agent harness feature that lets developers define an agent’s model, tools, and instructions via API calls without writing orchestration code, reducing initial setup from days to minutes. </li>
<li style="font-weight:400;">It supports popular frameworks, including <a href="https://www.langchain.com/langgraph">LangGraph</a>, <a href="https://www.llamaindex.ai/">LlamaIndex</a>, <a href="https://crewai.com/">CrewAI</a>, and <a href="https://strandsagents.com/">Strands Agents</a>.</li>
<li style="font-weight:400;">The new <a href="https://github.com/aws/agentcore-cli">AgentCore CLI</a> (available on GitHub at github.com/aws/agentcore-cli) keeps the full agent lifecycle in one workflow, covering local prototyping, deployment, and operations from a single terminal with CDK support and Terraform coming soon.</li>
<li style="font-weight:400;">AgentCore now includes persistent session state via a durable filesystem, enabling agents to suspend mid-task and resume where they left off, which makes human-in-the-loop workflows practical without custom storage plumbing.</li>
<li style="font-weight:400;">Pre-built coding agent skills give tools like Claude Code and Kiro curated knowledge of AgentCore best practices rather than just raw API access, with plugins for Codex and Cursor coming by the end of April.</li>
<li style="font-weight:400;">The managed agent harness is in preview across <a href="https://docs.aws.amazon.com/bedrock-agentcore/latest/devguide/agentcore-regions.html">four regions</a> (Oregon, N. Virginia, Sydney, Frankfurt) with no additional charge for the CLI, harness, or skills beyond standard resource consumption. </li>
<li style="font-weight:400;">Full pricing details are <a href="https://aws.amazon.com/bedrock/agentcore/pricing/">here</a>.</li>
</ul>
<p>58:49  Ryan – “This is a great feature; this now makes it competitive with Vertex AI’s AgentBuilder, and so now it’s a useable option on Amazon. Awesome.” </p>
<h2>GCP</h2>
<p> Pre-Next Announcements</p>
<p>59:40 <a href="https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-1-flash-tts/">Gemini 3.1 Flash TTS: New text-to-speech AI model</a></p>
<ul>
<li style="font-weight:400;"><a href="https://ai.google.dev/gemini-api/docs/models/gemini-3.1-flash-tts-preview">Gemini 3.1 Flash TTS</a> is now available in <a href="https://aistudio.google.com/?model=gemini-3.1-flash-tts-preview">preview</a> across three surfaces: the Gemini API and <a href="http://aistudio.google.com/generate-speech">Google AI Studio</a> for developers, <a href="https://console.cloud.google.com/vertex-ai/studio/media/speech">Vertex AI for enterprises</a>, and <a href="https://docs.google.com/videos/create?usp=blog">Google Vids for Workspace</a> users, giving GCP customers multiple integration paths depending on their use case.</li>
<li style="font-weight:400;">The model scored an Elo of 1,211 on the <a href="https://artificialanalysis.ai/text-to-speech/models">Artificial Analysis TTS leaderboard</a> based on blind human preference testing, and was placed in the top <a href="https://artificialanalysis.ai/text-to-speech/models?quality=quality-vs-price">quadrant</a> for balancing speech quality with low cost, though specific per-character or per-request pricing was not disclosed in the announcement.</li>
<li style="font-weight:400;">A new audio <a href="https://ai.google.dev/gemini-api/docs/speech-generation#transcript-tags">tags</a> system lets developers embed natural language commands directly into text input to control vocal style, pace, tone, and accent at a granular level, including mid-sentence expression changes, which reduces the need for custom voice training pipelines.</li>
<li style="font-weight:400;">The model supports native multi-speaker dialogue and more than 70 languages with localized style and accent controls, making it a practical option for developers building global or multilingual audio applications.</li>
<li style="font-weight:400;">All generated audio is automatically watermarked using <a href="https://deepmind.google/models/synthid/">Google’s SynthID</a> technology, embedding an imperceptible signal that allows detection of AI-generated content, which is a relevant consideration for enterprises with compliance or content authenticity requirements.</li>
</ul>
<p>1:00:01 <a href="https://blog.google/innovation-and-ai/products/gemini-app/gemini-app-now-on-mac-os/">The Gemini App is now available on Mac OS</a></p>
<ul>
<li style="font-weight:400;">Google has released a native <a href="https://blog.google/innovation-and-ai/products/gemini-app/gemini-app-now-on-mac-os/">Gemini app</a> for macOS, available free to all <a href="https://gemini.google.com/">Gemini</a> users on macOS 15 and above, downloadable at gemini.google/mac. </li>
<li style="font-weight:400;">This is a desktop client rather than a GCP infrastructure announcement, so its relevance to enterprise GCP customers is indirect.</li>
<li style="font-weight:400;">The app includes a screen-sharing feature that lets users pass local files and on-screen content directly to Gemini for context-aware assistance, which could be useful for analysts or developers reviewing complex outputs without leaving their workflow.</li>
<li style="font-weight:400;">A keyboard shortcut (Option + Space) surfaces Gemini from any application, positioning it as a system-level assistant similar to Spotlight, aimed at reducing context-switching during tasks like spreadsheet work or document drafting.</li>
<li style="font-weight:400;">The app integrates with Google’s existing generative media tools, including image generation via <a href="https://deepmind.google/models/imagen/">Imagen</a> (<a href="https://gemini.google/overview/image-generation/">Nano Banana</a>) and video generation via <a href="https://aistudio.google.com/models/veo-3">Veo</a>, giving creative users access to those capabilities without opening a browser.</li>
<li style="font-weight:400;">Google has indicated this initial release is a foundation for a broader desktop assistant strategy, with additional features planned, so organizations evaluating AI assistant tooling for their teams should monitor how this evolves alongside Workspace and GCP integrations.</li>
</ul>
<p>1:01:33 <a href="https://cloud.google.com/blog/topics/developers-practitioners/create-expert-content-deploying-a-multi-agent-system-with-terraform-and-cloud-run/">Create Expert Content: Deploying a Multi-Agent System with Terraform </a><a href="https://cloud.google.com/blog/topics/developers-practitioners/create-expert-content-deploying-a-multi-agent-system-with-terraform-and-cloud-run/">and Cloud Run</a></p>
<ul>
<li style="font-weight:400;">Google’s <a href="https://cloud.google.com/blog/topics/developers-practitioners/create-expert-content-deploying-a-multi-agent-system-with-terraform-and-cloud-run/">Dev Signal</a> is a four-part tutorial series showing how to build and deploy a production multi-agent system using <a href="https://docs.cloud.google.com/gemini-enterprise-agent-platform/build/adk">Google ADK</a>, <a href="https://cloud.google.com/discover/what-is-model-context-protocol">MCP</a>, <a href="https://cloud.google.com/products/gemini-enterprise-agent-platform">Vertex AI memory bank</a>, and <a href="https://cloud.google.com/run">Cloud Run</a>, with the full code available at the GoogleCloudPlatform devrel-demos GitHub repository.</li>
<li style="font-weight:400;">The deployment architecture uses Terraform to provision least-privilege service accounts, Artifact Registry, and Secret Manager integrations, following the Agent Starter Pack patterns to avoid common security pitfalls like over-permissioned default compute accounts.</li>
<li style="font-weight:400;">Observability is handled through OpenTelemetry integration with a single otel_to_cloud=True flag in the FastAPI server, which exports agent traces to Cloud Console showing LLM invocations and MCP tool calls, though production traces are sampled, so targeted evaluation runs are needed for full request visibility.</li>
<li style="font-weight:400;">The system distinguishes between two types of monitoring: system traces for identifying latency and timeout issues at scale, and reasoning traces for targeted evaluation of specific agent decisions, which is a practical distinction teams often miss when moving prototypes to production.</li>
<li style="font-weight:400;">Pricing for this stack depends on Cloud Run usage, Vertex AI memory bank calls, and <a href="https://docs.cloud.google.com/secret-manager/docs/best-practices#coding-practices">Secret Manager</a> API requests, all billed separately at standard GCP rates, so teams should factor in the multi-service cost model when estimating production expenses.</li>
</ul>
<p>1:02:05 Google Next: the Conference</p>
<ul>
<li style="font-weight:400;">32K attendees, 3 keynotes, 25 spotlights, 700+ breakouts, 260 announcements (yeah, we counted.) </li>
</ul>
<p>Justin: </p>
<ul>
<li>Wiz + Google Cloud Security/Product Offering</li>
</ul>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;">Antigravity IDE + Gemini CLI (agent mode) enhancements
<ul>
<li style="font-weight:400;">Data Agent Kit with VS Code/ Claude Code and Gemini CLI (close but no cigar)</li>
</ul>
</li>
</ul>
</li>
<li>Ironwood TPU GA and/or dedicated Inference-based CHIP</li>
</ul>
<p>Ryan</p>
<ul>
<li>Gemini 3.1 Pro GA &amp; Teasing Gemini 3.5 or 4 or future model</li>
</ul>
<ul>
<li>Enhancements with agents and Agentic (THE ENTIRE CONFERENCE)</li>
<li style="font-weight:400;">VMware interruption based on Kubernetes? (Opposite of Tanzu)</li>
</ul>
<p>Matt</p>
<ul>
<li>Default Guardrails in AI in general. How Gemini will have guard rails via Vertex. </li>
</ul>
<ul>
<li>Agent Identity, Agent Gateway, and Model Armor</li>
</ul>
<ul>
<li>Agentic coding tooling and how developers are leveraging Agentic (SDLC)</li>
</ul>
<ul>
<li>Data Agent Kit &amp; Agentic Task Force</li>
<li style="font-weight:400;">3 Non AI Announcements (at the conference, but not on stage, so…)</li>
</ul>
<p>This is genuinely the best we’ve ever done. Time to go buy a lotto ticket and lose. </p>
<p>Runner Ups</p>
<ul>
<li>A2A protocol 1.0 released – Donated to CNCF</li>
</ul>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;">Turboquant Ships in Vertex AI</li>
<li style="font-weight:400;">Something waymo</li>
</ul>
</li>
</ul>
<ul>
<li>Biqquery AI Agents – Part of Data Agent Kit</li>
</ul>
<ul>
<li>Gemini 3.1 Flash GA</li>
</ul>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;">Axion Gen 2</li>
</ul>
</li>
</ul>
<ul>
<li>Nano bananas updates</li>
<li style="font-weight:400;">Sovereign Cloud AI</li>
<li style="font-weight:400;">Gemini Robotics API Preview</li>
<li style="font-weight:400;">Hugging Face</li>
<li style="font-weight:400;">AWS Activate type program </li>
<li style="font-weight:400;">AP2 Payment Protocol</li>
<li style="font-weight:400;">AI in Android</li>
<li style="font-weight:400;">Gemini + Boston Dynamics</li>
<li style="font-weight:400;">Glasswing Answer</li>
</ul>
<p>How many times is AI said on stage? </p>
<p>JUST THE FIRST KEYNOTE WAS 132 Times!! </p>
<p>2nd Keynote: 55 Times</p>
<p>Matt – 99</p>
<p>Ryan – 75</p>
<p>Justin – 115 Winner</p>
<p>That makes Justin the overall winner for this year’s NEXT predictions. </p>
<p>Here’s our Claude-based tier ranking of the 260 announcements:</p>
<p>1:09:21 TIER S — Headline</p>
<p>Agent platform (Vertex AI evolves)</p>
<ul>
<li style="font-weight:400;">The story of the keynote. Vertex AI is being repositioned as the Gemini Enterprise Agent Platform</li>
<li style="font-weight:400;">16 named sub-features in three buckets:
<ul>
<li style="font-weight:400;">Build: ADK (graph-based sub-agent networks), Agent Studio (no-code → ADK export), Agent Designer</li>
<li style="font-weight:400;">Run: Agent Runtime (sub-second cold starts), Agent Sandbox (now everyone), Memory Bank, Sessions, long-running agents</li>
<li style="font-weight:400;">Govern + Optimize: Agent Identity (cryptographic ID per agent), Agent Registry, Agent Gateway, Anomaly Detection, Security dashboard, Simulation, Evaluation, Optimizer</li>
</ul>
</li>
<li style="font-weight:400;">Strategic frame: the agent is now the unit of work, not the model call</li>
<li style="font-weight:400;">Hot takes to fight over:
<ul>
<li style="font-weight:400;">Is “delegating business outcomes” the new “infrastructure as code”?</li>
<li style="font-weight:400;">Does the 16-feature stack feel cohesive or like marketing bolt-ons?</li>
<li style="font-weight:400;">Does this simplify Vertex’s SKU sprawl or make it worse?</li>
</ul>
</li>
<li style="font-weight:400;">Matt’s guardrails prediction lands naturally inside the Govern bucket.</li>
</ul>
<p>1:10:27 Customer scale — agents actually in production</p>
<ul>
<li style="font-weight:400;">Strongest competitive flex of the keynote. Pick 4-5 to land on-mic:
<ul>
<li style="font-weight:400;">Mars — Gemini Enterprise as the primary AI operating system for the global workforce (the headline customer)</li>
<li style="font-weight:400;">Merck — agentic platform across R&amp;D, manufacturing, commercial; 75K employees</li>
<li style="font-weight:400;">GE Appliances — 800+ agents across manufacturing, logistics, supply chain</li>
<li style="font-weight:400;">Tata Steel — 300+ specialized agents in 9 months</li>
<li style="font-weight:400;">Deutsche Telekom MINDR — 95%+ reduction in event management times (best ROI quote)</li>
<li style="font-weight:400;">Citadel Securities — TPU research workloads 4x faster, 30% lower cost, days → minutes</li>
<li style="font-weight:400;">Highmark Health Sidekick — $27.9M in value in 2025 alone</li>
</ul>
</li>
<li style="font-weight:400;">Frame: Last year was “agents are coming,” this year was “here’s the receipts”</li>
<li style="font-weight:400;">Hot take: name the equivalent customer slate from AWS, Azure, or Snowflake. You can’t.</li>
<li style="font-weight:400;">Skip if tight on time: Home Depot Magic Apron, Macy’s “Ask Macy’s”, Papa John’s, Virgin Voyages Rovey, Capcom, Citi Sky, Vodafone, Unilever</li>
</ul>
<p>1:11:44 TPU 8t and 8i (the silicon split)</p>
<ul>
<li style="font-weight:400;">8t (training): ~3x compute vs Ironwood</li>
<li style="font-weight:400;">8i (inference): purpose-built, 80% better perf/$, optimized for MoE + agentic workloads</li>
<li style="font-weight:400;">TorchTPU: native PyTorch, full Eager Mode — kills the JAX-only friction</li>
<li style="font-weight:400;">Strategic: only hyperscaler shipping dedicated inference silicon this generation</li>
<li style="font-weight:400;">Practitioner angle: agent workloads (lots of small inference calls) tilt economically toward Google if 80% perf/$ holds in production</li>
<li style="font-weight:400;">Justin’s prediction wins twice — he specifically called the dedicated inference chip</li>
</ul>
<p>1:12:22 TIER A — Strong second tier </p>
<p>Wiz expands (multi-cloud agent visibility)</p>
<ul>
<li style="font-weight:400;">Lead with: acquisition formally closed</li>
<li style="font-weight:400;">Wiz AI-APP — code-to-cloud-to-runtime AI Application Protection Platform</li>
<li style="font-weight:400;">Killer move: Wiz now supports AWS Agentcore, Azure Copilot Studio, Salesforce Agentforce, Databricks
<ul>
<li style="font-weight:400;">Google is selling security to customers who’ll never run a workload on GCP</li>
<li style="font-weight:400;">Different posture than they’ve had historically</li>
</ul>
</li>
<li style="font-weight:400;">Other Wiz news worth a mention:
<ul>
<li style="font-weight:400;">Inline AI security hooks in IDEs</li>
<li style="font-weight:400;">Wiz Skills — validated attack-surface findings exposed to coding agents for auto-remediation</li>
<li style="font-weight:400;">AI-Bill of Materials — auto-inventory of every AI framework, model, IDE extension across your environment (shadow-AI killer)</li>
<li style="font-weight:400;">Lovable vibe-coding integration (security scanning inside Lovable)</li>
</ul>
</li>
<li style="font-weight:400;">Hot take: most strategically interesting acquisition payoff Google has shipped in years.</li>
</ul>
<p>1:13:46 Partner fund — $750M + Forward-Deployed Engineers</p>
<ul>
<li style="font-weight:400;">$750M innovation fund for partner agent development</li>
<li style="font-weight:400;">Agent Marketplace + Agent Gallery — 70+ partner-built agents at launch
<ul>
<li style="font-weight:400;">Accenture, Adobe, Atlassian, Deloitte, Lovable, Oracle, Palo Alto, Replit, S&amp;P Global, Salesforce, ServiceNow, Workday</li>
</ul>
</li>
<li style="font-weight:400;">Forward-Deployed Engineers with Accenture, Deloitte, McKinsey — Google making its own engineers available through partner GTM</li>
<li style="font-weight:400;">Hot take: this is a Palantir-style move. Google admitting agent adoption needs hand-holding and putting money + bodies behind it</li>
<li style="font-weight:400;">Open question: Does this reshape the SI economics, or is it just GTM theater?</li>
</ul>
<p>1:14:48 Antigravity + Data Agent Kit + Gemini 3.1 Pro</p>
<ul>
<li style="font-weight:400;">Gemini 3.1 Pro in preview across Vertex / Gemini Enterprise / Antigravity / Android Studio / Gemini CLI / AI Studio</li>
<li style="font-weight:400;">Data Agent Kit — portable suite of skills, MCP tools, plugins; turns VS Code and Gemini CLI into native data workspaces</li>
<li style="font-weight:400;">Full-stack vibe coding from AI Studio → Cloud Run is now GA (Firestore + auth out of the box)</li>
<li style="font-weight:400;">Hot take: this is the developer story. Cursor / Claude Code / Replit competitors take note.</li>
<li style="font-weight:400;">Justin and Ryan both have prediction wins here</li>
</ul>
<p>1:15:25 Agentic Data Cloud — Knowledge Catalog + Cross-cloud Lakehouse + Spanner Omni</p>
<ul>
<li style="font-weight:400;">Knowledge Catalog — universal context engine; maps business meaning across the data estate. Foundation for accurate agent execution.</li>
<li style="font-weight:400;">Cross-cloud Lakehouse (BigLake renamed) — Iceberg REST Catalog, federation with AWS Glue / Databricks / Snowflake / SAP, cross-cloud caching cuts egress</li>
<li style="font-weight:400;">Spanner Omni — Spanner runs multi-cloud, on-prem, even on a laptop
<ul>
<li style="font-weight:400;">This is the most underrated announcement of the keynote</li>
<li style="font-weight:400;">Fight over: is this the new Aurora-anywhere? Does it actually pull workloads off RDS / Cosmos?</li>
</ul>
</li>
<li style="font-weight:400;">Lakehouse federation for AlloyDB — live joins between transactional + analytical without ETL.</li>
</ul>
<p>1:17:17 TIER B — Solid block </p>
<p>Workspace AI — Workspace Intelligence + Studio</p>
<ul>
<li style="font-weight:400;">Workspace Intelligence — unified semantic understanding across Docs / Slides / Gmail / projects / org domain knowledge</li>
<li style="font-weight:400;">Workspace Studio — no-code agent builder; skills deployable across Workspace</li>
<li style="font-weight:400;">M365 → Workspace migration tool — competitive shot at Microsoft, easy to move emails/files/conversations</li>
<li style="font-weight:400;">Sovereign controls + client-side encryption — lock processing to US/EU; CSE means even Google can’t see</li>
<li style="font-weight:400;">Auto browse with Gemini in Chrome Enterprise (US)</li>
</ul>
<p>1:17:53 Cloud Run grew up</p>
<ul>
<li style="font-weight:400;">Full-stack vibe coding deploy from AI Studio (GA)</li>
<li style="font-weight:400;">NVIDIA RTX PRO 6000 Blackwell support — run 70B+ parameter models without managing GPU infra, scales to zero</li>
<li style="font-weight:400;">Billing caps (long-requested!) — set max monthly spend, resources de-activate when hit</li>
<li style="font-weight:400;">Cloud Run sandboxes for ephemeral isolated agent execution</li>
<li style="font-weight:400;">SSH into running containers (preview)</li>
<li style="font-weight:400;">Hot take: Cloud Run is positioning itself as the default agent runtime, period</li>
</ul>
<p>Gemini Enterprise for CX</p>
<ul>
<li style="font-weight:400;">Shopping agent + Food Ordering agent (Papa John’s first user)</li>
<li style="font-weight:400;">Omnichannel Gateway — agent context across web / mobile / voice</li>
<li style="font-weight:400;">Agent Assist — coaching mode for human agents in complex situations</li>
</ul>
<p>1:19:04 BigQuery AI </p>
<ul>
<li style="font-weight:400;">AI.PARSE_DOCUMENT — single SQL function for OCR + layout + chunking via Gemini’s layout parser</li>
<li style="font-weight:400;">TabularFM — zero-shot regression/classification, no feature engineering</li>
<li style="font-weight:400;">BigQuery Graph — entity/relationship modeling natively in the warehouse</li>
<li style="font-weight:400;">Reverse ETL — one-click sync from lakehouse to AlloyDB/Spanner for low-latency serving</li>
<li style="font-weight:400;">Connected Sheets with TimesFM — zero-shot forecasting in Google Sheets</li>
<li style="font-weight:400;">BigQuery hybrid search — semantic + full-text in one function</li>
<li style="font-weight:400;">35% YoY perf improvement, lower processing cost</li>
<li style="font-weight:400;">Hot take: biggest “Monday morning” change for data teams in the entire keynote
</li>
</ul>
<p>1:19:32 TIER C — Lightning round </p>
<p>Virgo Network</p>
<ul>
<li style="font-weight:400;">Custom interconnect: 134K TPUs in a single fabric, 1M+ across sites</li>
<li style="font-weight:400;">A5X with NVIDIA Vera Rubin NVL72 — up to 960K GPUs cross-site</li>
<li style="font-weight:400;">The “we can scale further than anyone else” mic drop</li>
</ul>
<p>1:20:05 Rapid storage</p>
<ul>
<li style="font-weight:400;">Rapid Bucket — 15 TB/s bandwidth, 20M req/s, sub-millisecond latency, single-zone</li>
<li style="font-weight:400;">Rapid Cache (formerly Anywhere Cache) — 2.5 TB/s aggregate read; 2.2x faster checkpoint restores</li>
<li style="font-weight:400;">Managed Lustre at 10 TB/s throughput; 2.6x faster checkpoints</li>
</ul>
<p>1:20:54 Axion expands</p>
<ul>
<li style="font-weight:400;">N4A GA — 2x price/perf vs x86; 30% better perf/$ for GKE Agent Sandbox vs other hyperscalers</li>
<li style="font-weight:400;">C4A.metal preview — first Axion bare metal (Android dev, automotive sim, custom hypervisors)</li>
<li style="font-weight:400;">Confidential Computing on G4 (Blackwell) + C4 (Granite Rapids) — confidential AI workloads</li>
</ul>
<p>1:21:54 Fraud Defense</p>
<ul>
<li style="font-weight:400;">reCAPTCHA evolves into a platform that distinguishes bots, humans, AND agents</li>
<li style="font-weight:400;">Agent-specific capabilities coming for the digital commerce journey (account → payment → checkout)</li>
<li style="font-weight:400;">Closest thing in the wrap-up to the AP2 protocol prediction nobody hit</li>
</ul>
<p>1:21:50 Post-quantum crypto</p>
<ul>
<li style="font-weight:400;">KMS Quantum Safe Key Imports (preview)</li>
<li style="font-weight:400;">PQC in Cross-Cloud Network</li>
<li style="font-weight:400;">Boring but important — Google front-running the regulatory ask</li>
</ul>
<p>1:22:00 GKE upgrades</p>
<ul>
<li style="font-weight:400;">4x faster node startup, 80% faster pod startup, 5x faster model loading</li>
<li style="font-weight:400;">GKE hypercluster — single control plane, millions of accelerators, multi-region (private GA)</li>
<li style="font-weight:400;">Predictive latency boost in GKE Inference Gateway — up to 70% lower time-to-first-token</li>
<li style="font-weight:400;">KV Cache tiering across RAM / Local SSD / Cloud Storage / Lustre</li>
<li style="font-weight:400;">RL Scheduler, RL Sandbox, RL Observability for reinforcement learning workloads
</li>
</ul>
<p>1:22:33 Three themes that emerged</p>
<ul>
<li style="font-weight:400;">Agent platform is the new operating system. Vertex’s rebrand to Gemini Enterprise Agent Platform isn’t cosmetic — Google restructured the portfolio so the unit of work is an agent, not a model call.</li>
<li style="font-weight:400;">Wiz is now Google’s multi-cloud trojan horse. Supporting AWS Agentcore + Azure Copilot Studio + Salesforce Agentforce means Google is happy to sell security to customers who’ll never run on GCP. New posture.</li>
<li style="font-weight:400;">Customer scale is the real flex. Mars, Merck (75K employees), GE Appliances (800 agents), Tata Steel (300 in 9 months), Deutsche Telekom (95% MTTR reduction). Other hyperscalers can match the silicon. They can’t yet match this deployment depth on stage.</li>
</ul>
<p>1:23:00 Conspicuously absent</p>
<ul>
<li style="font-weight:400;">A2A 1.0 / CNCF donation — third-party press reported it, not in the official wrap-up</li>
<li style="font-weight:400;">No Boston Dynamics or Waymo crossover</li>
<li style="font-weight:400;">No Gemini Robotics API preview</li>
<li style="font-weight:400;">No Hugging Face deal</li>
<li style="font-weight:400;">No AP2 Payment Protocol (Cloud Fraud Defense is the closest cousin)</li>
<li style="font-weight:400;">No Nano Banana update</li>
<li style="font-weight:400;">No Glasswing answer</li>
<li style="font-weight:400;">No Turboquant in Vertex</li>
</ul>
<p>1:23:24 Less important stuff</p>
<ul>
<li style="font-weight:400;">Bigtable in-memory; Memorystore for Valkey 9.0</li>
<li style="font-weight:400;">AlloyDB AI search at 10B vectors; new AlloyDB AI functions</li>
<li style="font-weight:400;">Firestore Enterprise edition (full-text + geospatial + JOINs)</li>
<li style="font-weight:400;">Firebase SQL Connect; Firebase Phone Number Verification</li>
<li style="font-weight:400;">NetApp Volumes Flex Unified + ONTAP-mode</li>
<li style="font-weight:400;">Filestore for GKE; Hyperdisk Exapools / ML / Balanced improvements</li>
<li style="font-weight:400;">Cloud WAN expansion to 25+ countries; NCC Gateway with Palo Alto + Symantec</li>
<li style="font-weight:400;">Cloud Armor managed rules (Thales Imperva); Cloud NGFW Advanced Malware Sandbox</li>
<li style="font-weight:400;">Private Service Connect: 40+ published services, endpoint-based security</li>
<li style="font-weight:400;">Looker Studio renamed to Data Studio; Looker Dashboard Agents; AI assistants</li>
<li style="font-weight:400;">CME Group ultra-low-latency partnership for financial exchanges</li>
<li style="font-weight:400;">Google for Startups AI Agents Challenge ($90K prize, $500 credits)</li>
</ul>
<p><a href="https://cloud.google.com/blog/topics/google-cloud-next/google-cloud-next-2026-wrap-up/">Google Cloud Next 2026 Wrap Up</a></p>
<ul>
<li style="font-weight:400;">Google Cloud Next 26 featured 260 announcements centered on what Google calls the “Agentic Era,” with the headline being the Gemini Enterprise Agent Platform, which replaces Vertex AI as the primary platform for building, scaling, and governing AI agents with new components like Agent Runtime (sub-second cold starts), Agent Memory Bank, Agent Identity with cryptographic IDs, and Agent Gateway for fleet management.</li>
<li style="font-weight:400;">On the infrastructure side, Google announced 8th-generation TPUs split into two variants: TPU 8t for training workloads delivering roughly 3x higher compute than the previous generation, and TPU 8i for inference and reinforcement learning with up to 80% better performance-per-dollar, alongside new Axion-based N4A VMs now generally available at up to 2x better price-performance than comparable x86 VMs.</li>
<li style="font-weight:400;">The Agentic Data Cloud introduces a Knowledge Catalog as a universal context engine, a Cross-Cloud Lakehouse (formerly BigLake) built on Iceberg REST Catalog spanning AWS and Azure, and Spanner Omni, which extends Spanner’s globally consistent database to run on-premises or on other clouds, addressing the challenge of agents needing consistent data access across fragmented environments.</li>
<li style="font-weight:400;">Security got notable attention with the completed Wiz acquisition now reflected in integrated tooling, Model Armor expanding to Agent Gateway and Firebase, a new Google Cloud Fraud Defense platform (evolved from reCAPTCHA) now generally available, and post-quantum cryptography support in Cloud KMS for quantum-safe key imports, all aimed at securing agentic workloads specifically.</li>
<li style="font-weight:400;">Storage announcements include the new Cloud Storage Rapid Bucket delivering over 15 TB/s bandwidth with sub-millisecond latency now generally available, Managed Lustre Dynamic tier priced at $0.06/GB-month, and Hyperdisk ML throughput increased to 2 TB/s aggregate, all targeting the checkpoint and model loading bottlenecks common in large-scale AI training.</li>
</ul>
<p><a href="https://cloud.google.com/blog/topics/google-cloud-next/next26-day-1-recap/">Next ‘26 day 1 recap</a></p>
<ul>
<li style="font-weight:400;">Google Cloud Next ’26 centered on moving AI into production at enterprise scale, with the Gemini Enterprise platform serving as the connective tissue across a unified stack spanning chips, models, data, agents, and security. The Gemini Enterprise Agent Platform is essentially a rebranded and expanded Vertex AI with new tools for building, scaling, governing, and optimizing agents.</li>
<li style="font-weight:400;">On the infrastructure side, Google announced two new TPU 8 variants with distinct purposes: TPU 8t for training scales to 9,600 TPUs with 2 petabytes of shared memory, while TPU 8i for inference delivers 80% better performance per dollar than the prior generation using a new Boardfly topology. The new Virgo Network and Google Cloud Managed Lustre at 10 terabytes per second throughput round out the infrastructure updates.</li>
<li style="font-weight:400;">The Agentic Data Cloud rebrands and expands Google’s data platform with notable additions, including a Knowledge Catalog for contextual grounding, a Lightning Engine for Apache Spark claiming 4.5x speed over open-source alternatives, and a Cross-Cloud Lakehouse based on Apache Iceberg that lets customers query data in AWS or Azure without copying it.</li>
<li style="font-weight:400;">Security got substantial attention with three new agents in Google Security Operations for threat hunting, detection engineering, and third-party context enrichment, all currently in preview. The Wiz acquisition is now complete, and new Wiz integrations include inline security scanning in IDEs, an AI Bill of Materials for inventorying AI frameworks and models, and a Lovable platform integration generally available in May.</li>
<li style="font-weight:400;">Google Workspace is being repositioned from a productivity suite into what Google calls a semantic intelligence layer, with new features like AI Inbox in Gmail, Drive Projects as an active collaborator, and an Ask Gemini interface in Google Chat that can take actions like scheduling meetings or creating documents directly from the chat window.</li>
</ul>
<p><a href="https://cloud.google.com/blog/topics/google-cloud-next/next26-day-2-recap/">Next ’26 day 2 recap</a></p>
<ul>
<li style="font-weight:400;">Google Cloud Next Day 2 centered on the Gemini Enterprise Agent Platform, positioned as the evolution of Vertex AI, offering tools to build, scale, govern, and optimize autonomous agents. The keynote used a multi-agent marathon route planner for Las Vegas as a practical demonstration of the platform’s capabilities.</li>
<li style="font-weight:400;">The Agent Development Kit, remote MCP servers, and Agent Runtime work together to give agents instructions, skills, and tools, while Agent Registry functions as a DNS-like directory for discovering and connecting deployed agents across a system.</li>
<li style="font-weight:400;">Agent Platform Sessions and Memory Bank address a common problem in agentic systems by allowing agents to retain learned knowledge across interactions without stuffing raw text into every request, which improves performance over time.</li>
<li style="font-weight:400;">Debugging and observability are handled through Agent Runtime trace view and Gemini Cloud Assist, which let developers use natural language to investigate logs and pinpoint issues, with fixes applied directly from an IDE connected via MCP and redeployed automatically.</li>
<li style="font-weight:400;">Security is addressed through Agent Identity, which gives each agent a unique, immutable credential, and Agent Gateway, which enforces IAM policies to restrict agent actions to approved sources. Wiz integration adds code and infrastructure scanning with remediation suggestions, and notably supports Anthropic Claude Code as an alternative tooling option alongside Google’s own tools.</li>
</ul>
<p><a href="https://cloud.google.com/blog/products/ai-machine-learning/partner-built-agents-available-in-gemini-enterprise/">Partner-built agents available in Gemini Enterprise </a></p>
<ul>
<li style="font-weight:400;">Google has added partner-built agents from its Agent Marketplace directly into the Agent Gallery inside the Gemini Enterprise app, with partners including Salesforce, ServiceNow, Workday, Oracle, Atlassian, and Palo Alto Networks, among others. Each agent must pass a four-step evaluation covering basic functionality, output accuracy, autonomous execution, and enterprise standards to earn the Google Cloud Ready – Gemini Enterprise designation.</li>
<li style="font-weight:400;">The governance model is worth noting for enterprise IT teams: employees can browse and request agents, but administrators retain approval control over deployments and can manage access at a granular level. Every agent also gets a cryptographically secure identity for audit trail purposes, and Agent Gateway plus Model Armor screen traffic to prevent data from being used for model training.</li>
<li style="font-weight:400;">Google announced a 750 million dollar partner fund for agentic development alongside this launch, and partners selling through the Marketplace are reportedly closing deals 112 percent larger, with purchasing cycles accelerating by up to 50 percent. This creates a clear commercial incentive for ISVs to build and list agents on the platform.</li>
<li style="font-weight:400;">The agent catalog covers a wide range of industries and functions, including supply chain optimization from Accenture, tariff management from Deloitte, financial analysis from S&amp;P Global, identity security from Saviynt, and healthcare intake workflows from Synthpop. This breadth suggests Google is positioning the Agent Gallery as a general-purpose enterprise AI distribution channel rather than a niche tool.</li>
<li style="font-weight:400;">Pricing for individual agents will vary by partner and likely requires existing subscriptions in some cases, such as the Alteryx AI Insights Agent requiring an Alteryx One subscription. Gemini Enterprise offers a 30-day free trial at console.cloud.google.com/freetrial for organizations wanting to evaluate the platform before committing.</li>
</ul>
<p><a href="https://cloud.google.com/blog/topics/developers-practitioners/level-up-your-agents-announcing-googles-official-skills-repository/">Level Up Your Agents: Announcing Google’s Official Skills Repository</a></p>
<ul>
<li style="font-weight:400;">Google announced an official Agent Skills repository at Cloud Next 2026, launching with 13 skills covering products like BigQuery, Cloud Run, GKE, Firebase, and Gemini API, plus Well-Architected Framework pillars and recipe-style guides for common tasks. The repository is available at github.com/google/skills and is free to use.</li>
<li style="font-weight:400;">Agent Skills address a practical problem called context bloat, where loading too much information into an AI agent’s context window increases token costs and degrades model performance. Skills are compact Markdown-based documents that agents load only when needed, rather than pulling in full documentation sets.</li>
<li style="font-weight:400;">The format is described as open, meaning it is not locked to Google’s own tooling. Skills work with Google’s Antigravity and Gemini CLI agents as well as third-party agents, and installation is handled via a single npx command.</li>
<li style="font-weight:400;">The announcement positions Skills as a complement to existing approaches like the Google developer documentation, the MCP server, giving practitioners a lighter-weight alternative when full real-time documentation grounding is unnecessary or too costly.</li>
<li style="font-weight:400;">For teams building AI agents on top of Google Cloud services, this provides a structured way to keep agents accurate on GCP-specific APIs and best practices without manual prompt engineering or expensive context loading. Google indicated that more skills will be added in the coming weeks.</li>
</ul>
<p><a href="https://cloud.google.com/blog/products/ai-machine-learning/introducing-gemini-enterprise-agent-platform/">Introducing Gemini Enterprise Agent Platform </a></p>
<ul>
<li style="font-weight:400;">Google launched the Gemini Enterprise Agent Platform, which consolidates Vertex AI capabilities with new agent-specific tooling for building, scaling, governing, and optimizing AI agents. All future Vertex AI services and roadmap updates will be delivered exclusively through this platform rather than as a standalone service.</li>
<li style="font-weight:400;">The platform introduces four governance-focused components: Agent Identity assigns each agent a unique cryptographic ID for auditable trails, Agent Registry maintains a central library of approved tools, Agent Gateway enforces security policies across environments, and Agent Anomaly Detection flags unusual reasoning using an LLM-as-a-judge framework.</li>
<li style="font-weight:400;">Agent Runtime now supports long-running agents that maintain state for multiple days, with sub-second cold starts and a Memory Bank for persistent context across sessions. This addresses a practical gap where most agent frameworks previously lost context between interactions.</li>
<li style="font-weight:400;">Developers can access over 200 models through Model Garden, including Gemini 3.1 Pro, Gemma 4, and third-party models like Anthropic Claude, with a low-code Agent Studio path and a code-first Agent Development Kit that processes over six trillion tokens monthly. Agent Garden provides pre-built templates for use cases like invoice processing, financial analysis, and code modernization.</li>
<li style="font-weight:400;">Real-world deployments mentioned include Comcast rebuilding its Xfinity Assistant, Color Health using agents to schedule cancer screenings, and PayPal using Agent Payment Protocol for secure agent-based commerce. Pricing details are not specified in the announcement and would need to be confirmed through the Google Cloud console at console.cloud.google.com/agent-platform/overview.</li>
</ul>
<p><a href="https://cloud.google.com/blog/products/application-development/gemini-cloud-assist-at-next26/">Gemini Cloud Assist at Next ‘26 </a></p>
<ul>
<li style="font-weight:400;">Gemini Cloud Assist is shifting from a reactive assistant to a proactive operations platform, using an agentic architecture to handle tasks like infrastructure troubleshooting, cost anomaly detection, and application design without waiting for user prompts.</li>
<li style="font-weight:400;">The redesigned Application Design Center lets teams describe infrastructure goals in plain language and get back visual architectures with deployable Terraform templates, integrated with Security Command Center to enforce organizational policies from the start.</li>
<li style="font-weight:400;">A 24/7 FinOps agent monitors for cost anomalies and correlates spending spikes with specific triggers like auto-scaling events or new resource creation, allowing teams to query cost data in natural language instead of manually aggregating reports.</li>
<li style="font-weight:400;">MCP server support extends Gemini Cloud Assist beyond the Google Cloud console into IDEs, CLIs, and third-party tools like ServiceNow and Slack, reducing context switching for development and operations teams.</li>
<li style="font-weight:400;">Petco reported a 60% reduction in Google Cloud-related questions to their cloud team after adopting Gemini Cloud Assist, suggesting meaningful productivity gains for platform teams supporting large developer organizations. Pricing details are not specified in the announcement, so teams should check the Gemini Cloud Assist admin console for current costs.</li>
</ul>
<p><a href="https://cloud.google.com/blog/products/databases/unify-analytical-and-operational-data-for-ai/">Unify analytical and operational data for AI </a></p>
<ul>
<li style="font-weight:400;">Google announced what it calls an “Agentic Data Cloud” at Google Cloud Next, focused on eliminating the separation between operational and analytical data systems. The goal is to let AI agents query both live transactional data and historical analytical data without complex data movement pipelines.</li>
<li style="font-weight:400;">Three specific capabilities are now available or in preview: Lakehouse federation for AlloyDB lets operational systems query BigQuery data directly, Reverse ETL for BigQuery pushes analytical results into AlloyDB, Bigtable, or Spanner with sub-millisecond read latency, and the Spanner Columnar Engine is now GA with analytical queries running up to 200 times faster than standard transactional queries.</li>
<li style="font-weight:400;">Datastream now supports real-time Change Data Capture into Apache Iceberg tables from AlloyDB, Cloud SQL, Spanner, and Oracle, streaming operational changes directly into the open Lakehouse format for immediate use in BigQuery ML and feature engineering workflows.</li>
<li style="font-weight:400;">Knowledge Catalog, formerly Dataplex, is being extended with integrations across AlloyDB, BigQuery, Bigtable, Cloud SQL, and Spanner to provide a unified metadata layer. The intent is to reduce inconsistent data definitions that can cause AI agents to produce inaccurate outputs.</li>
<li style="font-weight:400;">Native vector and full-text search are being embedded directly into AlloyDB, Bigtable, Cloud SQL, Firestore, and Spanner, and graph federation is being added across BigQuery and Spanner. This removes the need to move data into separate search or graph engines for hybrid retrieval and GraphRAG patterns. Pricing for these features is not specified in the announcement and would vary by service and usage.</li>
</ul>
<p><a href="https://cloud.google.com/blog/products/data-analytics/introducing-the-google-cloud-knowledge-catalog/">Introducing the Google Cloud Knowledge Catalog</a></p>
<ul>
<li style="font-weight:400;">Google is evolving its existing Dataplex service into the Knowledge Catalog, a context engine designed to feed AI agents accurate business semantics, data relationships, and verified SQL patterns to reduce hallucinations and improve query accuracy.</li>
<li style="font-weight:400;">The service aggregates metadata from a broad range of sources, including BigQuery, AlloyDB, Spanner, Cloud SQL, and third-party catalogs like Collibra and Atlan, plus enterprise platforms like SAP, Salesforce, and Workday through a preview feature called Enterprise Connectivity.</li>
<li style="font-weight:400;">A notable enrichment capability is Smart Storage, which automatically tags and embeds metadata for files as they land in Google Cloud Storage buckets, making unstructured data immediately discoverable by agents without manual curation steps.</li>
<li style="font-weight:400;">The search layer uses hybrid retrieval with access control awareness, meaning agents can only retrieve data assets they are explicitly authorized to see, which addresses a practical governance concern when deploying autonomous agents at enterprise scale.</li>
<li style="font-weight:400;">Bloomberg Media is cited as an early customer, using Knowledge Catalog to power an internal Data Access AI Agent that translates business questions against their data lake. Pricing details are not publicly listed, so teams evaluating this should check cloud.google.com/products/knowledge-catalog for current information.</li>
</ul>
<p><a href="https://cloud.google.com/blog/products/data-analytics/the-future-of-data-lakehouse-for-the-agentic-era/">The future of data lakehouse for the agentic era </a></p>
<ul>
<li style="font-weight:400;">Google Cloud announced a next-generation cross-cloud Lakehouse built around Apache Iceberg, offering fully managed Iceberg storage with read/write interoperability across BigQuery, Managed Apache Spark, and third-party engines like Databricks and Snowflake (Preview). The goal is to let teams process the same data across multiple engines without duplication, which Spotify is already doing across BigQuery and Dataflow.</li>
<li style="font-weight:400;">A new cross-cloud interconnect and caching capability (Preview) gives BigQuery and Managed Apache Spark high-performance access to data stored in AWS S3 Iceberg tables, with claimed price-performance comparable to AWS-native solutions. Catalog federation (Preview) extends this to AWS Glue, Databricks, SAP, and Snowflake, with Confluent Tableflow support coming later this year.</li>
<li style="font-weight:400;">The Lightning Engine for Apache Spark claims up to 2x price-performance over competing high-speed Spark alternatives using vectorized execution and optimized I/O, with no code changes required. This runs within Managed Service for Apache Spark, formerly known as Dataproc.</li>
<li style="font-weight:400;">Knowledge Catalog (formerly Dataplex) now provides always-on context for AI agents by continuously learning how enterprise data is used and mapping relationships within unstructured files. This feeds grounded context to agents built with tools like Agent Developer Kit and Model Context Protocol.</li>
<li style="font-weight:400;">Real-time change replication from Spanner, AlloyDB, and Cloud SQL into BigQuery is now GA, with Iceberg replication in Preview, enabling operational data to feed directly into lakehouse workloads. Pricing is not specified in the announcement and would vary based on storage, compute, and cross-cloud data transfer usage.</li>
</ul>
<p><a href="https://cloud.google.com/blog/products/data-analytics/whats-new-in-the-agentic-data-cloud/">What’s New in the Agentic Data Cloud </a></p>
<ul>
<li style="font-weight:400;">Google is rebranding and expanding Dataplex Universal Catalog into the Knowledge Catalog, which aggregates business context from third-party platforms like Salesforce, SAP, ServiceNow, and Workday, then uses hybrid search with access-control-aware retrieval so agents only act on data they are authorized to see.</li>
<li style="font-weight:400;">The new Google Cloud Data Agent Kit (Preview) drops into existing developer environments like VS Code, Gemini CLI, and Claude Code, automatically selecting frameworks like dbt, Spark, or Airflow and generating production-ready code, with three specialized agents for data engineering, data science, and database observability now available at various GA and Preview stages.</li>
<li style="font-weight:400;">Google is expanding MCP support across BigQuery, Spanner, AlloyDB, Cloud SQL, and Looker, using existing IAM policies and VPC Service Controls to govern agent interactions rather than requiring separate security configurations.</li>
<li style="font-weight:400;">The cross-cloud lakehouse now supports bi-directional federation with Databricks Unity Catalog, Snowflake Polaris, and AWS Glue Data Catalog using the open Iceberg REST Catalog standard, and Spanner Omni (Preview) extends the Spanner engine to run on-premises or across other clouds for the first time.</li>
<li style="font-weight:400;">On the performance side, Google is citing up to 2x price-performance improvement for Apache Spark via Lightning Engine, up to 34% cost reduction for BigQuery autoscaling workloads, sub-millisecond Bigtable reads via a new in-memory tier, and up to 10 terabytes per second throughput with Managed Lustre, though specific pricing details were not disclosed in the announcement.</li>
</ul>
<p><a href="https://cloud.google.com/blog/products/storage-data-transfer/next26-storage-announcements/">Next 26 storage announcements </a></p>
<ul>
<li style="font-weight:400;">Cloud Storage Rapid is now generally available in two forms: Rapid Bucket, which uses Google’s internal Colossus system to deliver over 15 TB/s bandwidth and sub-millisecond latency, and Rapid Cache, which provides 2.5 TB/s aggregate read throughput for existing buckets with no code changes. The headline numbers for AI training are checkpoint writes 3.2x faster and restores 5x faster compared to traditional object storage.</li>
<li style="font-weight:400;">Google Cloud Managed Lustre now delivers up to 10 TB/s throughput, a 10x increase from last year, and adds a new Dynamic tier priced at $0.06 per GB per month that serves data from persistent disk rather than object-based caching to avoid performance degradation under load.</li>
<li style="font-weight:400;">Smart Storage adds automated metadata annotation directly in Cloud Storage, so objects get labels, extracted entities, and compliance signals attached at write time without custom pipelines. A new Cloud Storage MCP server lets AI agents read, write, and analyze Cloud Storage data using the standard Model Context Protocol, which reduces the need for separate retrieval layers.</li>
<li style="font-weight:400;">Storage Intelligence, already used by 70% of Google Cloud’s largest customers managing over 50 billion objects each, gets zero-configuration dashboards that surface cost anomalies and integrate Security Command Center’s data governance signals with no setup required, plus enhanced batch operations supporting multi-bucket actions on billions of objects at once.</li>
<li style="font-weight:400;">The ecosystem additions include NetApp Volumes Flex Unified, supporting both block and file protocols on the same storage pool with ONTAP API compatibility, Filestore for GKE scaling down to 100 GiB shares, and Google Cloud Backup and DR gaining agentic AI capabilities to autonomously audit and remediate backup coverage gaps with new GA support for AlloyDB and Filestore.</li>
</ul>
<p><a href="https://cloud.google.com/blog/products/networking/introducing-virgo-megascale-data-center-fabric/">Introducing Virgo Network megascale data center fabric </a></p>
<ul>
<li style="font-weight:400;">Google introduced Virgo Network, a specialized scale-out data center fabric designed for AI workloads, built on a flat two-layer non-blocking topology that reduces network tiers and latency compared to traditional data center architectures. It underpins the AI Hypercomputer platform and connects up to 134,000 TPU chips with up to 47 petabits per second of non-blocking bi-sectional bandwidth in a single fabric.</li>
<li style="font-weight:400;">The architecture separates east-west accelerator traffic (handled by Virgo) from north-south storage and compute traffic (handled by the existing Jupiter network), allowing each layer to evolve independently without system-wide disruptions. This decoupling also means bandwidth dedicated to accelerator-to-accelerator communication is non-blocking and not competing with general data center traffic.</li>
<li style="font-weight:400;">Virgo delivers 4x the bandwidth per accelerator and 40% lower unloaded fabric latency compared to the previous generation, which matters specifically for latency-sensitive inference workloads and large synchronized training jobs where a single slow node can degrade the entire cluster.</li>
<li style="font-weight:400;">Reliability at this scale is addressed through independent switching planes for fault isolation, sub-millisecond telemetry for observability, and automated straggler and hang detection to minimize training job interruptions. Google frames this around maximizing “goodput,” meaning the useful work completed relative to total time, rather than just raw throughput.</li>
<li style="font-weight:400;">No pricing details were provided in the announcement, as Virgo Network is infrastructure-level and costs would surface through TPU and AI Hypercomputer product pricing rather than as a standalone purchasable service.</li>
</ul>
<p><a href="https://cloud.google.com/blog/products/databases/whats-new-for-google-cloud-databases-at-next26/">What’s new for Google Cloud databases at Next’26 </a></p>
<ul>
<li style="font-weight:400;">Google announced Spanner Omni, a downloadable edition of Spanner that runs outside of Google Cloud, including on-premises data centers, other clouds, and edge environments. This gives organizations using Spanner’s distributed database capabilities more deployment flexibility without being locked into a single cloud region or provider.</li>
<li style="font-weight:400;">AlloyDB received notable vector search improvements, scaling to 10 billion vectors using Google’s ScaNN index and delivering up to 6 times faster vector queries compared to standard PostgreSQL HNSW indexes. The addition of native BM25 support, coming soon, enables hybrid search combining vector retrieval with full-text search in a single database.</li>
<li style="font-weight:400;">Managed remote MCP servers are now generally available for AlloyDB, Bigtable, Cloud SQL, Firestore, and Spanner, with preview support for Memorystore, Datastream, and Oracle Database at Google Cloud. This removes the operational burden of self-hosting Model Context Protocol infrastructure for teams building AI agents that need secure, reliable access to enterprise data.</li>
<li style="font-weight:400;">The lakehouse integration announcements bridge the gap between transactional and analytical workloads, with AlloyDB now able to query live BigQuery and Iceberg tables directly from the PostgreSQL data plane without data movement. Datastream also now supports continuous replication from AlloyDB to Iceberg tables, which is useful for real-time ML feature engineering pipelines.</li>
<li style="font-weight:400;">Bigtable is adding a new in-memory tier with sub-millisecond read latency as part of a new Enterprise Plus edition, and Memorystore for Valkey 9.0 is now generally available with a managed migration path from self-managed Redis. Both updates reflect Google’s push to offer managed caching and low-latency storage options with enterprise security features like ACLs and token-based authentication.</li>
</ul>
<p><a href="https://cloud.google.com/blog/products/databases/introducing-spanner-omni/">Introducing Spanner Omni</a></p>
<ul>
<li style="font-weight:400;">Spanner Omni is a downloadable version of Google’s Spanner database now in preview, allowing deployment on-premises, across clouds, on Kubernetes clusters, or even a laptop, rather than being limited to Google Cloud infrastructure. The developer edition is available for free download today at the link in the show notes, with a commercial edition requiring direct contact with Google.</li>
<li style="font-weight:400;">On the technical side, Google had to replace two core Spanner dependencies to make this work. Colossus, Google’s proprietary distributed file system, was replaced with a software abstraction layer that writes to local file systems, and TrueTime’s atomic clock and GPS-based synchronization was replaced with a software-based alternative that still provides error-bounded time synchronization.</li>
<li style="font-weight:400;">Internal benchmarks show Spanner Omni can process millions of queries per second across petabytes of data in a single regional deployment, and it supports the full multimodal feature set, including SQL, graph, key-value, full-text search, vector search, and columnar analytics.</li>
<li style="font-weight:400;">Three primary use cases are emerging from early adopters: hybrid failover, where managed Spanner in Google Cloud serves as primary, and Spanner Omni handles disaster recovery on-premises, a write-once-run-anywhere approach for ISVs and SaaS providers, and on-premises modernization for organizations with regulatory or data sovereignty requirements that prevent full cloud adoption.</li>
<li style="font-weight:400;">Pricing for the commercial edition is not publicly listed yet, so organizations interested in production use will need to engage Google directly at cloud.google.com/consulting/spanner-omni to discuss terms.</li>
</ul>
<p><a href="https://cloud.google.com/blog/products/compute/tpu-8t-and-tpu-8i-technical-deep-dive/">TPU 8t and TPU 8i technical deep dive</a></p>
<ul>
<li style="font-weight:400;">Google’s eighth-generation TPUs split into two specialized chips: TPU 8t for large-scale pre-training and TPU 8i for inference and reasoning workloads. This specialization reflects a recognition that training and serving have distinct hardware bottlenecks that a single chip design cannot optimally address.</li>
<li style="font-weight:400;">TPU 8t introduces native FP4 support, SparseCore for embedding lookups, and a new Virgo Network fabric that can link over 134,000 chips with 47 petabits per second of non-blocking bandwidth. Combined with TPUDirect Storage and Managed Lustre 10T, Google claims 10x faster storage access compared to seventh-generation Ironwood TPUs.</li>
<li style="font-weight:400;">TPU 8i uses a new Boardfly network topology inspired by Dragonfly principles, reducing chip-to-chip communication from 16 hops to 7 hops in a 1,024-chip pod. This 56% reduction in network diameter directly benefits Mixture-of-Experts and reasoning models that require frequent all-to-all communication patterns.</li>
<li style="font-weight:400;">On performance-per-dollar, Google claims TPU 8t delivers 2.7x improvement over Ironwood for training, while TPU 8i delivers 80% improvement for low-latency inference on large MoE models. Both chips also deliver up to 2x better performance-per-watt, which matters for customers managing energy costs at scale.</li>
<li style="font-weight:400;">The software stack supports JAX, native PyTorch (currently in preview), Keras, and vLLM, with XLA handling hardware-specific translation transparently. Customers interested in access can submit an interest form at cloud.google.com/resources/tpu-interest, though pricing details have not been publicly disclosed.</li>
</ul>
<p><a href="https://cloud.google.com/blog/topics/cost-management/introducing-spend-caps-ai-cost-visibility-next26/">Introducing Spend Caps AI Cost Visibility Next ’26</a></p>
<ul>
<li style="font-weight:400;">Google Cloud announced Spend Caps in private preview, allowing FinOps and DevOps managers to set hard budget limits at the project level for services including AI Studio, Gemini Agent Platform, Cloud Run, Cloud Run Functions, and Maps. Unlike traditional budget alerts, Spend Caps automatically pause API traffic when a budget threshold is reached while leaving underlying resources intact, addressing the risk of runaway AI training jobs or unoptimized models draining budgets quickly.</li>
<li style="font-weight:400;">A new FinOps Explainability Agent, built on Gemini and accessible through Google Cloud Billing, autonomously analyzes AI cost drivers and answers natural language queries such as breaking down spend by API key or comparing input versus output token costs across specific Gemini models. This addresses the challenge of AI costs blending into general infrastructure spend, making ROI attribution more straightforward.</li>
<li style="font-weight:400;">Google reported that since launching Gemini Cloud Assist for FinOps, cost reporting adoption increased 75% and time spent on cost analysis decreased 18%, providing some baseline context for the value customers are seeing from AI-assisted billing tools.</li>
<li style="font-weight:400;">Two additional private previews were announced alongside Spend Caps: enhanced billing account hierarchies that aggregate spend across multiple billing accounts, including Other Eligible Services, and contract commitment reporting that shows burndown progress within Enterprise Agreements. Both features target larger organizations managing complex commercial arrangements with Google Cloud.</li>
<li style="font-weight:400;">Spend Caps are currently in private preview with a signup form available, and no specific pricing details were provided for the new FinOps tooling beyond its availability in the Google Cloud Billing console.</li>
</ul>
<p><a href="https://cloud.google.com/blog/products/identity-security/next26-redefining-security-for-the-ai-era-with-google-cloud-and-wiz/">Next ‘26: Redefining security for the AI era with Google Cloud and Wiz </a></p>
<ul>
<li style="font-weight:400;">Google Cloud announced three new security agents in Google Security Operations at Next 26: a Threat Hunting agent, a Detection Engineering agent, and a Third-Party Context agent, all in preview. The existing Triage and Investigation agent has already processed over 5 million alerts, reducing the typical 30-minute manual analysis to 60 seconds.</li>
<li style="font-weight:400;">Wiz, now fully part of Google Cloud, is expanding its AI-Application Protection Platform to cover new agent studios, including AWS Agentcore, Microsoft Azure Copilot Studio, and Salesforce Agentforce, plus Databricks. New capabilities include inline AI security hooks for IDEs, agent-based remediation via Wiz Skills, and an AI Bill of Materials to inventory shadow AI tools across an environment.</li>
<li style="font-weight:400;">Google Cloud is introducing Agent Identity and Agent Gateway as part of the Gemini Enterprise Agent Platform, giving AI agents unique identities with scoped permissions and enforcing policy on all agent-to-agent and agent-to-tool traffic. Model Armor now integrates with Agent Gateway, LangChain, and Firebase to provide runtime protection against prompt injection and data leakage without code changes.</li>
<li style="font-weight:400;">On the data security side, Confidential Computing support is coming to G4 VMs with NVIDIA RTX PRO 6000 Blackwell GPUs and C4 VMs with Intel TDX, both in preview. KMS is also adding quantum-safe key imports in preview, addressing organizations starting to plan for post-quantum cryptography requirements.</li>
<li style="font-weight:400;">ReCAPTCHA is being rebranded and expanded into Google Cloud Fraud Defense, now generally available, with agent-specific capabilities for distinguishing bots, humans, and AI agents coming in preview. Chrome Enterprise is adding shadow AI reporting and AI-aware extension threat detection to help organizations manage unsanctioned AI tool usage at the browser level.</li>
</ul>
<p><a href="https://cloud.google.com/blog/products/business-intelligence/looker-updates-for-agentic-bi-at-next26/">Looker updates for agentic BI at Next ‘26</a></p>
<ul>
<li style="font-weight:400;">Google announced Looker BI Agents at Cloud Next, introducing Dashboard Agents and Agentic Workflows that go beyond static answers to trigger downstream business actions, all grounded in the Looker semantic layer and existing enterprise governance frameworks.</li>
<li style="font-weight:400;">Several features moved to GA, including Embedded Conversational Analytics, Visualization Assistant, Self-service Explores with CSV and Excel blending, and CI/CD pipeline support, giving teams more production-ready options without waiting on preview limitations.</li>
<li style="font-weight:400;">The new MCP integration adds a managed MCP server native to Looker, and a VS Code extension introduces a LookML AI Agent that translates natural language descriptions into production-ready LookML code, reducing the technical barrier for model authoring.</li>
<li style="font-weight:400;">Knowledge Catalog integration in preview allows Looker to transform metadata into a semantic graph, which is positioned as a way to reduce AI hallucinations by giving agents the context needed to complete tasks autonomously.</li>
<li style="font-weight:400;">Pricing details were not disclosed in the announcement, so teams evaluating these features should check cloud.google.com/looker directly, particularly for the preview features, which may have different availability or cost structures once they reach GA.</li>
</ul>
<p><a href="https://cloud.google.com/blog/products/identity-security/next26-announcing-new-partner-supported-workflows-for-google-security-operations/">Next ‘26: Announcing new partner-supported workflows for Google </a><a href="https://cloud.google.com/blog/products/identity-security/next26-announcing-new-partner-supported-workflows-for-google-security-operations/">Security Operations </a></p>
<ul>
<li style="font-weight:400;">Google Security Operations is expanding its partner ecosystem with 13 new integrations announced at Next ’26, bringing the total vendor count to over 300. The new partners span data ingestion, automated response, and bi-directional API workflows, covering gaps in areas like SAP logs, VMware ESXi threats, and application-layer attacks.</li>
<li style="font-weight:400;">Three distinct integration patterns are supported: data feed integrations that pre-map telemetry to Google’s Unified Data Model schema, response integrations that automate alert triage and case management, and bi-directional API workflows that let partner platforms pull Chronicle detections without requiring analysts to switch consoles.</li>
<li style="font-weight:400;">Notable technical additions include Synqly Mesh offering bi-directional normalization between UDM and the Open Cybersecurity Schema Framework (OCSF), and Contrast Security streaming verified runtime attack telemetry to surface confirmed application exploits as cases correlated with WAF and EDR signals.</li>
<li style="font-weight:400;">AI-assisted triage shows up across multiple integrations, with Torq applying agentic AI to filter detections and autonomously execute response actions like endpoint isolation, and Prophet Security using natural language threat hunting with bidirectional sync back to Google Security Operations.</li>
<li style="font-weight:400;">Vendors interested in joining the ecosystem can download the Google Security Operations Build Partner Guide and request a development environment through the Google Cloud Security Tech Partners team. Pricing for individual integrations is not specified in the announcement and would vary by partner.</li>
</ul>
<p><a href="https://cloud.google.com/blog/products/ai-machine-learning/the-new-gemini-enterprise-one-platform-for-agent-development/">The new Gemini Enterprise: one platform for agent development </a></p>
<ul>
<li style="font-weight:400;">Google rebranded and expanded Vertex AI into Gemini Enterprise Agent Platform, consolidating model access, agent development, governance, and deployment tooling into a single system aimed at enterprise-scale agent management.</li>
<li style="font-weight:400;">The platform introduces Agent Identity, which assigns each agent a unique cryptographic ID for auditability, alongside Agent Gateway for securing agent-to-agent communications and Model Armor for protection against prompt injection and data leakage.</li>
<li style="font-weight:400;">A new Memory Bank and Memory Profiles feature gives agents persistent long-term context across sessions, allowing them to retain user preferences and historical interactions rather than starting fresh each time.</li>
<li style="font-weight:400;">The Gemini Enterprise app adds a no-code Agent Designer for non-technical users, a centralized Inbox for monitoring long-running agents, and a Projects workspace that preserves team context as a persistent company asset rather than individual chat history.</li>
<li style="font-weight:400;">The partner ecosystem integration brings agents from Adobe, Salesforce, ServiceNow, Workday, and others directly into the in-app Agent Gallery, with Google Cloud validation for security and interoperability before deployment. Pricing details were not disclosed in the announcement, so listeners should check cloud.google.com/ai for current pricing information.</li>
</ul>
<p><a href="https://cloud.google.com/blog/products/ai-machine-learning/whats-new-in-gemini-enterprise/">What’s new in Gemini Enterprise </a></p>
<ul>
<li style="font-weight:400;">Google is expanding Gemini Enterprise with long-running agents that can autonomously execute multi-step workflows for hours or days, handling tasks like financial reconciliation or sales prospecting without constant human supervision. This is managed through a new Inbox command center that categorizes agent activity into actionable groups.</li>
<li style="font-weight:400;">The Enhanced Agent Designer lets non-technical users build agents using natural language or a visual interface, with reusable Skills that codify specific workflows and human-in-the-loop checkpoints for review and approval at critical steps.</li>
<li style="font-weight:400;">Governance is built into the platform at no additional cost through three key controls: Agent Identity for unique digital IDs and least-privilege access, Agent Registry for IT-managed agent catalogs, and Agent Gateway for centralized network policies and protection against risks like prompt injection.</li>
<li style="font-weight:400;">Projects and Canvas introduce team-level collaboration by creating shared workspaces where humans and agents co-create together, with cross-platform support spanning Google Workspace, Microsoft 365, and OneDrive, plus the ability to export directly to Microsoft Office formats.</li>
<li style="font-weight:400;">The new Agent Marketplace integrates into the existing Agent Gallery, allowing organizations to browse and deploy third-party agents from partners like Accenture, Oracle, and ServiceNow, while BYO-MCP support lets admins connect custom or third-party business tools without writing code. New features will roll out over the coming months, and pricing details are available at cloud.google.com/gemini-enterprise.</li>
</ul>
<p><a href="https://cloud.google.com/blog/products/identity-security/introducing-google-cloud-fraud-defense-the-next-evolution-of-recaptcha/">Introducing Google Cloud Fraud Defense, the next evolution of </a><a href="https://cloud.google.com/blog/products/identity-security/introducing-google-cloud-fraud-defense-the-next-evolution-of-recaptcha/">reCAPTCHA</a></p>
<ul>
<li style="font-weight:400;">Google Cloud Fraud Defense is the rebranded and expanded version of reCAPTCHA, now positioned as a broader trust platform that handles not just bot detection but also AI agent verification and multi-stage fraud across entire user journeys. Existing reCAPTCHA customers are automatically migrated with no action required and no pricing changes.</li>
<li style="font-weight:400;">The platform introduces an agentic policy engine that lets businesses allow or block traffic based on risk scores, automation types, and agent identity, addressing the growing reality that AI agents are being used to complete end-to-end transactions on behalf of users.</li>
<li style="font-weight:400;">A notable new mitigation tool is a QR code-based challenge designed to require human presence when suspicious agent activity is detected, replacing traditional CAPTCHA puzzles with a method intended to make automated fraud economically impractical rather than just technically difficult.</li>
<li style="font-weight:400;">Google cites a 51% average reduction in account takeover for customers using the unified trust model, and the platform currently protects over 14 million domains globally, including 50% of Fortune 100 companies, giving it broad signal coverage that individual site data cannot replicate.</li>
<li style="font-weight:400;">The platform integrates with emerging standards like Web Bot Auth and SPIFFE for agent identity verification, which is worth watching for teams building or securing agentic workflows since standardized agent identity is still an evolving area across the industry.</li>
</ul>
<p><a href="https://cloud.google.com/blog/products/serverless/whats-new-for-cloud-run-at-next26/">What’s new for Cloud Run at Next ‘26 </a></p>
<ul>
<li style="font-weight:400;">Cloud Run is adding support for NVIDIA RTX PRO 6000 Blackwell GPUs, now generally available, allowing teams to serve models with 70 billion or more parameters without managing underlying infrastructure, including automatic scale-to-zero when idle to avoid unnecessary GPU costs.</li>
<li style="font-weight:400;">Google AI Studio now supports full-stack app deployment directly to Cloud Run with a single click, combining server-side code, Firestore, and user authentication in a generally available workflow aimed at lowering the barrier for new developers.</li>
<li style="font-weight:400;">A new Cloud Run MCP server is now generally available, giving developers and AI agents a standardized way to deploy and manage applications programmatically, which fits into the broader push toward agentic workflows.</li>
<li style="font-weight:400;">Cloud Run is introducing individual instances as a primitive resource, separate from services or jobs, allowing teams to run long-running background agents more directly, though this feature is currently in preview with select customers only.</li>
<li style="font-weight:400;">Billing caps are coming soon, letting teams set a monthly spend ceiling after which Cloud Run resources are deactivated, which addresses a common concern for teams running unpredictable or experimental workloads on pay-per-use infrastructure.</li>
</ul>
<p><a href="https://cloud.google.com/blog/products/containers-kubernetes/whats-new-in-gke-at-next26/">What’s new in GKE at Next 26</a></p>
<ul>
<li style="font-weight:400;">GKE Agent Sandbox launches as a new isolated execution environment for running untrusted AI agent code, using gVisor kernel isolation to support 300 sandboxes per second at sub-second latency, with up to 30% better price-performance on Axion processors compared to other cloud providers.</li>
<li style="font-weight:400;">GKE hypercluster enters private GA, enabling a single Kubernetes-conformant control plane to manage up to one million chips across 256,000 nodes spanning multiple Google Cloud regions, reducing the operational burden of managing hundreds of disconnected clusters for large AI training workloads.</li>
<li style="font-weight:400;">Inference performance improvements include ML-driven Predictive Latency Boost in GKE Inference Gateway, reducing time-to-first-token latency by up to 70%, plus automatic KV Cache storage tiering that delivered over 40% TTFT reduction when offloading to RAM and nearly 70% throughput improvement when offloading to Local SSD for long-context workloads.</li>
<li style="font-weight:400;">New reinforcement learning capabilities in preview include an RL Scheduler to address straggler effects, an RL Sandbox for millisecond-scale kernel-level isolation during reward evaluation, and out-of-the-box observability dashboards, targeting the GPU and TPU idle time that occurs between RL pipeline steps.</li>
<li style="font-weight:400;">Intent-based autoscaling adds native custom metrics support to the Horizontal Pod Autoscaler, reducing autoscaling reaction time from 25 seconds to 5 seconds while eliminating dependencies on external monitoring stacks that could cause autoscaling failures if they go down.</li>
</ul>
<p><a href="https://cloud.google.com/blog/products/compute/ai-infrastructure-at-next26/">AI infrastructure at Next ‘26</a></p>
<ul>
<li style="font-weight:400;">Google announced eighth-generation TPUs at Cloud Next, split into two specialized chips: TPU 8t for training (delivering nearly 3x higher compute performance than prior generation, with 121 exaflops in a single superpod) and TPU 8i for inference (offering 80% better performance per dollar with 5x lower on-chip latency). This is the first time Google has offered distinct TPU chips optimized for different workload types rather than a single general-purpose design.</li>
<li style="font-weight:400;">The Virgo Network is a new data center fabric with 4x the bandwidth of previous generations, capable of connecting 134,000 TPUs in a single data center or over one million TPUs across multiple sites into a unified training cluster. Google is also making it available for NVIDIA-based A5X instances, supporting up to 960,000 GPUs across multiple sites.</li>
<li style="font-weight:400;">Storage improvements include Google Cloud Managed Lustre now delivering 10 TB/s of bandwidth (10x improvement over last year) with 80 petabytes of capacity, plus a new Rapid Buckets feature on Cloud Storage offering sub-millisecond latency and 20 million operations per second to keep accelerator utilization at 95% or higher during training checkpoints.</li>
<li style="font-weight:400;">GKE received notable orchestration updates targeting agentic workloads, including node startup times 4x faster, pod startup reduced by up to 80%, and an updated Inference Gateway using ML-driven routing that cuts time-to-first-token latency by more than 70% without manual tuning.</li>
<li style="font-weight:400;">Native PyTorch support for TPUs (called TorchTPU) is now in preview, joining existing JAX and vLLM support, which reduces friction for teams who want to run existing PyTorch models on TPU hardware without significant code changes. Pricing for these new offerings has not yet been publicly detailed, with availability described as coming soon.</li>
</ul>
<h2>Azure</h2>
<p>1:30:36 <a href="https://azure.microsoft.com/en-us/blog/optimize-object-storage-costs-automatically-with-smart-tier-now-generally-available/">Optimize object storage costs automatically with smart tier—now </a><a href="https://azure.microsoft.com/en-us/blog/optimize-object-storage-costs-automatically-with-smart-tier-now-generally-available/">generally available</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/storage/blobs/access-tiers-smart?tabs=azure-portal">Azure Smart Tier</a> for <a href="https://azure.microsoft.com/en-us/products/storage/blobs/?ef_id=_k_8e8c8f1b4fbc13f92d2ee7ed2b7b3035_k_&amp;OCID=AIDcmm5edswduu_SEM__k_8e8c8f1b4fbc13f92d2ee7ed2b7b3035_k_&amp;msclkid=8e8c8f1b4fbc13f92d2ee7ed2b7b3035">Blob</a> and <a href="https://azure.microsoft.com/en-us/products/storage/data-lake-storage/">Data Lake Storage</a> is now generally available, automatically moving objects between hot, cool, and cold tiers based on actual access patterns. </li>
<li style="font-weight:400;">Data inactive for 30 days shifts to cool, then cold after another 60 days, and immediately returns to hot upon re-access with no retrieval or early deletion charges.</li>
<li style="font-weight:400;">The feature eliminates the need to manually configure and maintain lifecycle rules, which is particularly useful for organizations managing large analytics workloads, telemetry data, or data lakes with unpredictable access patterns. </li>
<li style="font-weight:400;">During preview, over 50% of smart-tier-managed capacity automatically shifted to cooler tiers.</li>
<li style="font-weight:400;">Pricing includes standard hot, cool, and cold capacity rates with no tier transition fees, but a per-object monthly monitoring fee applies to objects managed by the smart tier. </li>
<li style="font-weight:400;">Objects smaller than 128 KiB stay in hot tier permanently and do not incur the monitoring fee, so workloads with many small files should factor that into cost planning.</li>
<li style="font-weight:400;">Setup requires a storage account with zonal redundancy and is available via the Azure portal or API, either at account creation or by switching an existing account’s default tier to smart. Legacy account types like GPv1 and page or append blobs are not supported.</li>
<li style="font-weight:400;">Smart tier is available now in nearly all zonal public cloud regions, with broader regional coverage and updated Storage SDK support planned in upcoming releases. More details and pricing are at azure.microsoft.com/en-us/pricing/details/storage/blobs.</li>
</ul>
<p>1:31:21  Justin – “Thanks, you finally got what Amazon’s had for a while.” </p>
<p>1:38:37 <a href="https://techcommunity.microsoft.com/blog/microsoft-entra-blog/what%E2%80%99s-new-in-microsoft-entra-%E2%80%93-march-2026/4502150">What’s new in Microsoft Entra – March 2026 </a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/entra/identity/authentication/how-to-authentication-passkeys-fido2">Microsoft Entra ID is adding synced passkeys</a>, <a href="https://learn.microsoft.com/en-us/entra/identity/authentication/how-to-authentication-passkeys-fido2">passkey profiles</a>, and <a href="https://learn.microsoft.com/en-us/entra/identity/devices/sso-linux?tabs=password-auth%2Cdebian-install%2Cdebian-update%2Cdebian-uninstall%2Cdebian-sc-example">phish-resistant MFA</a> support for Linux SSO, giving organizations more options to move away from passwords while meeting compliance requirements for stronger authentication.</li>
<li style="font-weight:400;">Starting June 1, 2026, <a href="https://learn.microsoft.com/en-us/entra/identity/hybrid/connect/how-to-connect-sync-whatis">Entra Connect Sync</a> and <a href="https://learn.microsoft.com/en-us/entra/identity/hybrid/cloud-sync/what-is-cloud-sync">Cloud Sync</a> will block hard-match operations for users with assigned Entra roles, closing a potential attack path where on-premises AD attribute manipulation could be used to take over privileged cloud accounts. </li>
<li style="font-weight:400;">Admins should review their hybrid sync configurations before that date.</li>
<li style="font-weight:400;">The <a href="https://support.microsoft.com/en-us/authenticator/download-microsoft-authenticator">Microsoft Authenticator app</a> now includes jailbreak and root detection for Android, with a phased rollout moving from warning to blocking to wipe mode, meaning users on non-compliant devices will eventually lose access to Entra credentials entirely.</li>
<li style="font-weight:400;">Agent management is consolidating under Agent 365 as the single control plane, with the existing Entra admin center Agent registry and collections blades retiring May 1, 2026, and the current registry Graph API being deprecated and replaced, requiring re-registration of agents using the old API.</li>
<li style="font-weight:400;">Entra ID Governance added several notable features this quarter, including SCIM 2.0 API support, delegated workflow management in Lifecycle Workflows, and a new billing meter for guest users, which organizations relying on governance features for external identities should review for potential cost impact.</li>
<li style="font-weight:400;">Why June 1st? Turn this on today!</li>
</ul>
<p>1:34:17 <a href="https://techcommunity.microsoft.com/blog/appsonazureblog/new-in-azure-sre-agent-log-analytics-and-application-insights-connectors/4509649">New in Azure SRE Agent: Log Analytics and Application Insights </a><a href="https://techcommunity.microsoft.com/blog/appsonazureblog/new-in-azure-sre-agent-log-analytics-and-application-insights-connectors/4509649">Connectors</a></p>
<ul>
<li style="font-weight:400;"><a href="https://sre.azure.com/docs">Azure SRE Agent</a> now supports Log Analytics and Application Insights as native connectors, allowing the agent to run KQL queries directly against workspaces and App Insights resources during incident investigations, replacing the previous approach of shelling out to Azure CLI commands. (REALLY? Bombastic side eye.) </li>
<li style="font-weight:400;">Setup is simplified compared to the manual RBAC approach: selecting a resource from the dropdown automatically grants the agent’s managed identity Log Analytics Reader and Monitoring Reader on the target resource group, with a manual entry fallback if resource discovery fails.</li>
<li style="font-weight:400;">The feature is backed by the <a href="https://github.com/Azure/azure-mcp">Azure MCP Server</a> using the monitor namespace, giving the agent read-only tools like monitor_workspace_log_query and monitor_table_list, with no ability to modify alerts, retention settings, or workspace configuration.</li>
<li style="font-weight:400;">Practical use cases include AKS cluster investigations where the agent can automatically query ContainerLog, KubeEvents, and application traces across multiple connected workspaces to surface errors and failure patterns without manual intervention.</li>
<li style="font-weight:400;">The connectors are currently behind an early access flag under Settings &gt; Basics, though Azure SRE Agent itself is generally available. </li>
<li style="font-weight:400;">Pricing is not detailed in the announcement, so listeners should check sre.azure.com/docs for current cost information.</li>
</ul>
<p>1:35:14  Justin – “So they REALLY want you to burn tokens.” </p>
<p>1:35:41 <a href="https://techcommunity.microsoft.com/blog/microsoft-security-blog/azure-key-vault-hsm-platform-one-retirement-what-purview-byok-customers-need-to-/4510371">Azure Key Vault HSM Platform One Retirement: What Purview BYOK </a><a href="https://techcommunity.microsoft.com/blog/microsoft-security-blog/azure-key-vault-hsm-platform-one-retirement-what-purview-byok-customers-need-to-/4510371">Customers Need to Know </a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/purview/rights-management-byok#determine-if-your-key-is-on-hsmplatform-1">Azure Key Vault</a> is retiring its legacy HSM Platform One on September 15, 2028, and customers using Microsoft Purview Information Protection with Bring Your Own Key (BYOK) will need to migrate their tenant root keys to the modern FIPS 140-2 Level 3 certified HSM platform before that date or risk losing encryption and decryption capabilities.</li>
<li style="font-weight:400;">The migration is not straightforward because Azure Key Vault does not support exporting keys once imported, meaning customers must re-import their original on-premises key material into a new vault, which can be a lengthy process if that original key material is no longer readily accessible.</li>
<li style="font-weight:400;">Microsoft is recommending customers start planning now, despite the 2028 deadline, particularly because coordinating across security, compliance, and HSM teams to recover or regenerate lost key material can take considerable time.</li>
<li style="font-weight:400;">The practical steps involve confirming whether your tenant key sits on the legacy HSM platform, creating a new Key Vault on the modern platform, and updating your Purview configuration to reference the new vault, with Microsoft support available for customers who no longer have access to the original key material.</li>
<li style="font-weight:400;">This announcement is most relevant to enterprise customers in regulated industries who have adopted BYOK for compliance reasons, and they should review the updated guidance at the Microsoft Learn documentation for tenant root key management to understand prerequisites and supported migration paths.</li>
</ul>
<p>1:36:19  Matt – “The thing is, Microsoft does give you a decent amount of time to do stuff, but what’s always fun is if you buy a three-year reservation you’re stuck with it, and you have to deal with returning it right now, because otherwise you’d have negative time…”</p>
<h2>After Show</h2>
<p>1:38:11 <a href="https://www.bbc.com/news/articles/c98mrepzgj7o">Allbirds shares soar 580% after pivot from shoes to AI</a></p>
<ul>
<li style="font-weight:400;">Allbirds announced a $50 million deal to rebrand as NewBird AI, shifting its business model from footwear to GPU compute infrastructure and on-demand cloud services built for AI workloads.</li>
<li style="font-weight:400;">The company’s stated rationale is a supply gap in AI compute capacity, with plans to purchase GPUs and offer them as on-demand cloud resources to businesses that cannot access sufficient computing power through existing providers.</li>
<li style="font-weight:400;">Analysts are skeptical, with one branding consultant describing the move as using the company’s existing stock market shell for an unrelated business rather than a genuine operational pivot.</li>
<li style="font-weight:400;">The 580% share surge on a press release, despite no demonstrated product or AI-related revenue, has led retail analysts to categorize this as a meme stock situation driven by AI sentiment rather than fundamentals.</li>
<li style="font-weight:400;">For cloud podcast listeners, this story is a useful data point on how GPU scarcity narratives are influencing capital markets, and raises questions about the credibility of new entrants claiming to address AI compute shortages without established infrastructure or track records.</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2454749/c1e-2okobqwd48s93gp0-qdpp4112s9xo-tsczi8.mp3" length="203477592"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 352 of The Cloud Pod, where the weather is always cloudy! Justin, Matt, and Ryan are safely back from Vegas (Ryan and Justin, anyway), and they have all the news and announcements from Google Next. Plus, we have Ryan’s take on Phish, news from Cloudflare, and a shoe company making a pivot. There’s a lot to cover, so let’s get started! 

Titles we almost went with this week

Redact Yourself Before You Wreck Yourself OpenAI **Anthropic
Fork Yeah Cloudflare Artifacts Is Here
Git Happens at Scale on Cloudflare
Bucket List Item Checked Lambda Mounts S3 File Systems
Terraform Your Agents Before They Terraform You
Cloud Run Gets GPUs and Finally Hits the Gym
Spanner Goes Rogue, Leaves the Cloud Behind
Knowledge Catalog Knows What Your Agents Did Last Query
One Control Plane to Rule a Million Chips
No More Incognito Windows for Your AWS Identity Crisis
Your Agent Can Now Write Files Without Burning Everything Down
Spend Caps Finally Tell Runaway AI Jobs to Chill
RIP Vertex, long live the agent
Agents all the way down
Google Next: This is the dawning of the Age of Agentic
Allbirds Proves AI Hype Needs No Infrastructure

 
 A big thanks to this week’s sponsors:
There are a lot of cloud cost management tools out there, but only Archera provides insured commitments. It sounds fancy, but it’s really simple. Archera gives you the cost savings of a 1 or 3-year AWS Savings Plan with a commitment as short as 30 days. If you do not use all the cloud resources you have committed to, Archera will literally cover the difference. Other cost management tools may say they offer “insured commitments”, but remember to ask: Will you actually give me my rebate? Because Archera will. 
Check out thecloudpod.net/archera to schedule a demo today. 
We also wanted to tell you about something coming to the US for the first time — WeAreDevelopers World Congress! 
They’ve been doing this in Europe for years, 15,000-plus attendees in Berlin, it’s one of the biggest developer events over there. Coté from Software Defined Talk is actually speaking at their Berlin event this summer, so we’ve got some firsthand context here. In September, they’re launching the North America edition. San José, September 23 to 25. 500-plus speakers, 18 tracks — cloud, infrastructure, DevOps, security, AI, data engineering, all of it. Speakers from Datadog, Honeycomb, Sentry, Google, LinkedIn, and Stack Overflow. Olivier Pomel, Christine Yen, Milin Desai, Kelsey Hightower – plus workshops and masterclasses, not just talks. These are people who know how to do a developer conference at scale. wearedevelopers.us, code DEVPOD26 for 15% off. Group rates on top of that for 4 or more.
General News 
06:12 Amazon invest up to $25 billion in Anthropic part of AI infrastructure

Amazon has committed up to $25 billion in additional investment in Anthropic, bringing its total potential investment to $33 billion. The latest $5 billion tranche is based on Anthropic’s $380 billion valuation, with up to $20 billion more tied to commercial milestones.
In exchange, Anthropic has committed to spending over $100 billion on AWS over the next decade, with a specific focus on Trainium custom AI chips, and plans to bring nearly 1 gigawatt of Trainium2 and Trainium3 capacity online by end of the year.
Anthropic cited real infrastructure strain from growing enterprise...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2454749/c1a-k5d5-ndrr41qxi834-ji5bsd.jpg"></itunes:image>
                                                                            <itunes:duration>01:45:30</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2454749/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[351: IAM the One Spending All Your AI Money]]>
                </title>
                <pubDate>Wed, 22 Apr 2026 00:52:46 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2428621</guid>
                                    <link>https://tcpfm.castos.com/episodes/351-iam-the-one-spending-all-your-ai-money</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 351 of The Cloud Pod, where the weather is always cloudy! Justin, Matt, and Ryan are in the studio today and ready to bring you the latest in cloud and AI news. And it’s that time of year again – we’re coming up quickly on Google Next, place your so we’ve got our yearly predictions for what’s coming from Vegas, as well as more news about Mythos, Amazon finally becoming a utility, and even an aftershow where we discuss the computing power of Artemis. It’s a great show, so let’s get started!  
</p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> Three StorageClasses Walk Into an AI Workload</li>
<li> Deprecated Models Don’t Die, They Just Fail Your API Calls</li>
<li> SQL Walks Into a Graph Bar and Stays</li>
<li> Too Many Agents Spoil the Workflow</li>
<li> One Registry to Rule All Your Rogue AI Agents</li>
<li> Eight CPUs Walk Into Space, Only One Comes Back</li>
<li> Stop Retyping the Same Gemini Prompt Like a Caveman</li>
<li> Claude Code Routines Let AI Work While You Sleep</li>
<li> AWS Builds a Yellow Pages for Your AI Agents</li>
<li> GPT Finally Stops Refusing to Talk About Hacking</li>
<li> None of the hosts is ready for Next</li>
<li> We are once again trying to look into our next next next crystal ball and failing</li>
<li> Google is gonna announce AI, it’s just mandatory now</li>
<li> Las Vegas is calling, our Livers are crying</li>
</ul>
<p> A big thanks to this week’s sponsors:</p>
<p>There are a lot of cloud cost management tools out there, but only Archera provides insured commitments. It sounds fancy, but it’s really simple. Archera gives you the cost savings of a 1 or 3-year AWS Savings Plan with a commitment as short as 30 days. If you do not use all the cloud resources you have committed to, Archera will literally cover the difference. Other cost management tools may say they offer “insured commitments”, but remember to ask: Will you actually give me my rebate? Because Archera will. </p>
<p>Check out <a href="http://www.thecloudpod.net/archera">thecloudpod.net/archera</a> to schedule a demo today. </p>
<p>We also wanted to tell you about something coming to the US for the first time — WeAreDevelopers World Congress! </p>
<p>They’ve been doing this in Europe for years, 15,000-plus attendees in Berlin, it’s one of the biggest developer events over there. Coté from Software Defined Talk is actually speaking at their Berlin event this summer, so we’ve got some firsthand context here. In September, they’re launching the North America edition. San José, September 23 to 25. 500-plus speakers, 18 tracks — cloud, infrastructure, DevOps, security, AI, data engineering, all of it. Speakers from Datadog, Honeycomb, Sentry, Google, LinkedIn, and Stack Overflow. Olivier Pomel, Christine Yen, Milin Desai, Kelsey Hightower – plus workshops and masterclasses, not just talks. These are people who know how to do a developer conference at scale. wearedevelopers.us, code DEVPOD26 for 15% off. Group rates on top of that for 4 or more.</p>
<h2>Follow Up</h2>
<p>01:47 <a href="https://aisle.com/blog/ai-cybersecurity-after-mythos-the-jagged-frontier">AI Cybersecurity After Mythos: The Jagged Frontier </a></p>
<ul>
<li style="font-weight:400;">Since the original<a href="https://www.anthropic.com/glasswing"> Mythos/Project Glasswing announcement</a>, AISLE published follow-up testing showing that small, inexpensive open-weight models can replicate much of the vulnerability detection work <a href="https://www.anthropic.com/">Anthropic</a> attributed to Mythos, with all 8 tested models detecting the flagship FreeBSD NFS buffer overflow, including a 3.6B parameter model costing $0.11 per million tokens.</li>
<li style="font-weight:400;">A notable correction to the framing of the original announcement: cybersecurity AI capability does not scale smoothly with model size or cost. </li>
<li style="font-weight:400;">Model rankings reshuffle completely across different security tasks, meaning there is no single b...</li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:00) - We Are Developers: Coming to North America</li><li>(00:01:58) - Vacation Hits the Beach While It Pours Down Rain</li><li>(00:03:02) - Will Cloudfla Find Vulnerabilities?</li><li>(00:08:37) - Google Next: First Predictation</li><li>(00:10:47) - Gemini 2.8</li><li>(00:11:30) - Gemini: Big Announcement for Dev and Enterprise</li><li>(00:13:38) - Third and Final Pick: Inference-based Chips</li><li>(00:14:37) - Three Things to Watch Out For From VMware</li><li>(00:17:25) - Top 3 AI Announcements of 2017</li><li>(00:19:09) - Gemini Robotics: Private Preview, AI Expansion</li><li>(00:21:55) - 2017 Conference Keynotes: How Many Times Will They Say AI?</li><li>(00:23:49) - Cloud Managed Agents</li><li>(00:28:57) - Meta AI Launches Muspark Model</li><li>(00:33:15) - Cloud Code: Installing Automated Workflows</li><li>(00:38:00) - OpenAI Launches GPT 5.4 Cyber, a Fine</li><li>(00:41:04) - Cloud Code: The New App Release</li><li>(00:45:13) - Amazon Bedrock Projects: Cost Analysis by IAM User</li><li>(00:48:36) - Amazon Bedrock Agent Core: Stateful MCP (</li><li>(00:53:39) - Amazon's Bedrock Agent Core</li><li>(00:57:25) - Amazon LEO to Power iPhone 14 and Apple Watch</li><li>(01:01:44) - GMC: 3-D Models in Cloud Storage</li><li>(01:05:40) - Google Cloud: Data Studio and Security</li><li>(01:11:13) - Google's 'Skills' in Chrome</li><li>(01:16:02) - Azure Agent Stack: More Confusing Than Google Cloud or AWS</li><li>(01:18:02) - Week in the Cloud: AI, Google Cloud, and Azure</li><li>(01:19:27) - NASA's Two Fault Tolerant Computer</li><li>(01:25:50) - AI in Healthcare: The Challenges</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 351 of The Cloud Pod, where the weather is always cloudy! Justin, Matt, and Ryan are in the studio today and ready to bring you the latest in cloud and AI news. And it’s that time of year again – we’re coming up quickly on Google Next, place your so we’ve got our yearly predictions for what’s coming from Vegas, as well as more news about Mythos, Amazon finally becoming a utility, and even an aftershow where we discuss the computing power of Artemis. It’s a great show, so let’s get started!  

Titles we almost went with this week

 Three StorageClasses Walk Into an AI Workload
 Deprecated Models Don’t Die, They Just Fail Your API Calls
 SQL Walks Into a Graph Bar and Stays
 Too Many Agents Spoil the Workflow
 One Registry to Rule All Your Rogue AI Agents
 Eight CPUs Walk Into Space, Only One Comes Back
 Stop Retyping the Same Gemini Prompt Like a Caveman
 Claude Code Routines Let AI Work While You Sleep
 AWS Builds a Yellow Pages for Your AI Agents
 GPT Finally Stops Refusing to Talk About Hacking
 None of the hosts is ready for Next
 We are once again trying to look into our next next next crystal ball and failing
 Google is gonna announce AI, it’s just mandatory now
 Las Vegas is calling, our Livers are crying

 A big thanks to this week’s sponsors:
There are a lot of cloud cost management tools out there, but only Archera provides insured commitments. It sounds fancy, but it’s really simple. Archera gives you the cost savings of a 1 or 3-year AWS Savings Plan with a commitment as short as 30 days. If you do not use all the cloud resources you have committed to, Archera will literally cover the difference. Other cost management tools may say they offer “insured commitments”, but remember to ask: Will you actually give me my rebate? Because Archera will. 
Check out thecloudpod.net/archera to schedule a demo today. 
We also wanted to tell you about something coming to the US for the first time — WeAreDevelopers World Congress! 
They’ve been doing this in Europe for years, 15,000-plus attendees in Berlin, it’s one of the biggest developer events over there. Coté from Software Defined Talk is actually speaking at their Berlin event this summer, so we’ve got some firsthand context here. In September, they’re launching the North America edition. San José, September 23 to 25. 500-plus speakers, 18 tracks — cloud, infrastructure, DevOps, security, AI, data engineering, all of it. Speakers from Datadog, Honeycomb, Sentry, Google, LinkedIn, and Stack Overflow. Olivier Pomel, Christine Yen, Milin Desai, Kelsey Hightower – plus workshops and masterclasses, not just talks. These are people who know how to do a developer conference at scale. wearedevelopers.us, code DEVPOD26 for 15% off. Group rates on top of that for 4 or more.
Follow Up
01:47 AI Cybersecurity After Mythos: The Jagged Frontier 

Since the original Mythos/Project Glasswing announcement, AISLE published follow-up testing showing that small, inexpensive open-weight models can replicate much of the vulnerability detection work Anthropic attributed to Mythos, with all 8 tested models detecting the flagship FreeBSD NFS buffer overflow, including a 3.6B parameter model costing $0.11 per million tokens.
A notable correction to the framing of the original announcement: cybersecurity AI capability does not scale smoothly with model size or cost. 
Model rankings reshuffle completely across different security tasks, meaning there is no single b...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[351: IAM the One Spending All Your AI Money]]>
                </itunes:title>
                                    <itunes:episode>351</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 351 of The Cloud Pod, where the weather is always cloudy! Justin, Matt, and Ryan are in the studio today and ready to bring you the latest in cloud and AI news. And it’s that time of year again – we’re coming up quickly on Google Next, place your so we’ve got our yearly predictions for what’s coming from Vegas, as well as more news about Mythos, Amazon finally becoming a utility, and even an aftershow where we discuss the computing power of Artemis. It’s a great show, so let’s get started!  
</p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> Three StorageClasses Walk Into an AI Workload</li>
<li> Deprecated Models Don’t Die, They Just Fail Your API Calls</li>
<li> SQL Walks Into a Graph Bar and Stays</li>
<li> Too Many Agents Spoil the Workflow</li>
<li> One Registry to Rule All Your Rogue AI Agents</li>
<li> Eight CPUs Walk Into Space, Only One Comes Back</li>
<li> Stop Retyping the Same Gemini Prompt Like a Caveman</li>
<li> Claude Code Routines Let AI Work While You Sleep</li>
<li> AWS Builds a Yellow Pages for Your AI Agents</li>
<li> GPT Finally Stops Refusing to Talk About Hacking</li>
<li> None of the hosts is ready for Next</li>
<li> We are once again trying to look into our next next next crystal ball and failing</li>
<li> Google is gonna announce AI, it’s just mandatory now</li>
<li> Las Vegas is calling, our Livers are crying</li>
</ul>
<p> A big thanks to this week’s sponsors:</p>
<p>There are a lot of cloud cost management tools out there, but only Archera provides insured commitments. It sounds fancy, but it’s really simple. Archera gives you the cost savings of a 1 or 3-year AWS Savings Plan with a commitment as short as 30 days. If you do not use all the cloud resources you have committed to, Archera will literally cover the difference. Other cost management tools may say they offer “insured commitments”, but remember to ask: Will you actually give me my rebate? Because Archera will. </p>
<p>Check out <a href="http://www.thecloudpod.net/archera">thecloudpod.net/archera</a> to schedule a demo today. </p>
<p>We also wanted to tell you about something coming to the US for the first time — WeAreDevelopers World Congress! </p>
<p>They’ve been doing this in Europe for years, 15,000-plus attendees in Berlin, it’s one of the biggest developer events over there. Coté from Software Defined Talk is actually speaking at their Berlin event this summer, so we’ve got some firsthand context here. In September, they’re launching the North America edition. San José, September 23 to 25. 500-plus speakers, 18 tracks — cloud, infrastructure, DevOps, security, AI, data engineering, all of it. Speakers from Datadog, Honeycomb, Sentry, Google, LinkedIn, and Stack Overflow. Olivier Pomel, Christine Yen, Milin Desai, Kelsey Hightower – plus workshops and masterclasses, not just talks. These are people who know how to do a developer conference at scale. wearedevelopers.us, code DEVPOD26 for 15% off. Group rates on top of that for 4 or more.</p>
<h2>Follow Up</h2>
<p>01:47 <a href="https://aisle.com/blog/ai-cybersecurity-after-mythos-the-jagged-frontier">AI Cybersecurity After Mythos: The Jagged Frontier </a></p>
<ul>
<li style="font-weight:400;">Since the original<a href="https://www.anthropic.com/glasswing"> Mythos/Project Glasswing announcement</a>, AISLE published follow-up testing showing that small, inexpensive open-weight models can replicate much of the vulnerability detection work <a href="https://www.anthropic.com/">Anthropic</a> attributed to Mythos, with all 8 tested models detecting the flagship FreeBSD NFS buffer overflow, including a 3.6B parameter model costing $0.11 per million tokens.</li>
<li style="font-weight:400;">A notable correction to the framing of the original announcement: cybersecurity AI capability does not scale smoothly with model size or cost. </li>
<li style="font-weight:400;">Model rankings reshuffle completely across different security tasks, meaning there is no single best model for cybersecurity work, which challenges the narrative that a restricted frontier model is required for this category.</li>
<li style="font-weight:400;">The current status of the broader AI security space is that AISLE reports 180-plus externally validated CVEs across 30-plus projects since mid-2025, predating Project Glasswing, and their system now runs on OpenSSL and curl pull requests in production, suggesting the category was already operational before the Anthropic announcement.</li>
<li style="font-weight:400;">A practical update for cloud practitioners is that specificity, meaning correctly identifying patched or safe code, remains a significant weak point across most models tested. Only one model was reliable in both directions, which reinforces that the orchestration layer and triage pipeline around the model matter more than the model itself for production security tooling.</li>
<li style="font-weight:400;">The broader ecosystem implication is that defensive AI security capabilities are accessible today with open or low-cost models, meaning organizations do not need to wait for access to restricted frontier models to begin building vulnerability discovery pipelines, though the scaffolding, security expertise, and maintainer trust-building remain the harder problems to solve.</li>
</ul>
<p>03:09  Justin – “If you’re in the security space and you want to have it poke holes at your app, it uses really complicated patterns to basically figure out different attack vectors and can actually link different vulnerabilities together.” </p>
<h2>General News </h2>
<p>06:11 <a href="https://techcrunch.com/2026/04/08/aws-boss-explains-why-investing-billions-in-both-anthropic-and-openai-is-an-ok-conflict/">AWS boss explains why investing billions in both Anthropic and OpenAI is </a><a href="https://techcrunch.com/2026/04/08/aws-boss-explains-why-investing-billions-in-both-anthropic-and-openai-is-an-ok-conflict/">an OK conflict</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/?nc2=h_home">Amazon</a> has invested $8 billion in Anthropic and $50 billion in <a href="https://openai.com/">OpenAI</a>, creating a situation where it holds significant financial stakes in two directly competing AI model companies.</li>
<li style="font-weight:400;">AWS CEO Matt Garman frames this as consistent with Amazon’s long-standing practice of partnering with companies it also competes against, citing Oracle selling its database services on AWS as an established precedent.</li>
<li style="font-weight:400;">The dual investment was partly driven by competitive necessity, as both Anthropic and OpenAI models were already available on <a href="https://azure.microsoft.com/en-us/get-started/azure-portal/">Microsoft Azure</a>, AWS’s primary rival in the cloud market.</li>
<li style="font-weight:400;">AI model-routing services are emerging as a key battleground, where cloud providers let customers automatically select different models for different tasks, which also creates a path for cloud providers to insert their own first-party models into customer workflows.</li>
<li style="font-weight:400;">Investor loyalty in AI is broadly eroding, with at least a dozen OpenAI backers also investing in Anthropic’s recent $30 billion round, including Microsoft, suggesting this multi-sided investment pattern is becoming standard across the industry.</li>
</ul>
<p>07:34 <a href="https://www.googlecloudevents.com/next-vegas">Google Next</a> Predictions</p>
<p>Justin</p>
<ol>
<li style="font-weight:400;">Wiz + Google Cloud Security/Product Offering</li>
<li style="font-weight:400;">Antigravity IDE + Gemini CLI (agent mode) enhancements</li>
<li style="font-weight:400;">Ironwood TPU GA and/or dedicated Inference-based CHIP</li>
</ol>
<p>Ryan</p>
<ol>
<li style="font-weight:400;">Gemini 3.1 Pro GA &amp; Teasing Gemini 3.5 or 4 or future model</li>
<li style="font-weight:400;">Enhancements with agents and Agentic</li>
<li style="font-weight:400;">VMware interruption based on Kubernetes? (Opposite of Tanzu)</li>
</ol>
<p>Matt</p>
<ol>
<li style="font-weight:400;">Default Guardrails in AI in general. How Gemini will have guard rails via Vertex. </li>
<li style="font-weight:400;">Agentic coding tooling and how developers are leveraging Agentic (SDLC)</li>
<li style="font-weight:400;">3 Non AI Announcements</li>
</ol>
<p>Runner Ups</p>
<ul>
<li style="font-weight:400;">A2A protocol 1.0 released</li>
<li style="font-weight:400;">Turboquant Ships in Vertex AI</li>
<li style="font-weight:400;">Something waymo</li>
<li style="font-weight:400;">Biqquery AI Agents</li>
<li style="font-weight:400;">Gemini 3.1 Flash GA</li>
<li style="font-weight:400;">Axion Gen 2</li>
<li style="font-weight:400;">Nano bananas updates</li>
<li style="font-weight:400;">Sovereign Cloud AI</li>
<li style="font-weight:400;">Gemini Robotics API Preview</li>
<li style="font-weight:400;">Hugging Face</li>
<li style="font-weight:400;">AWS Activate type program </li>
<li style="font-weight:400;">AP2 Payment Protocol</li>
<li style="font-weight:400;">AI in Android</li>
<li style="font-weight:400;">Gemini + Boston Dynamics</li>
<li style="font-weight:400;">Glasswing Answer</li>
</ul>
<p>How many times is AI said on stage? </p>
<ul>
<li>Matt- 99</li>
<li>Ryan- 75</li>
<li>Justin- 115</li>
</ul>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>24:35 <a href="https://claude.com/blog/claude-managed-agents">Claude Managed Agents: get to production 10x faster</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/">Anthropic</a> launched <a href="https://claude.com/blog/claude-managed-agents">Claude Managed Agents</a> in public beta on April 8, 2026, a suite of composable APIs that handle production infrastructure like sandboxed code execution, state management, credential handling, and end-to-end tracing so developers can focus on defining tasks and guardrails rather than building backend systems.</li>
<li style="font-weight:400;">The platform includes long-running autonomous sessions, multi-agent coordination (in research preview), and trusted governance with scoped permissions and identity management, with internal testing showing up to 10 percentage points improvement in task success over standard prompting loops on structured file generation tasks.</li>
<li style="font-weight:400;"><a href="https://claude.com/pricing">Pricing</a> is consumption-based at standard Claude Platform token rates plus $0.08 per session-hour for active runtime, which positions this as a managed alternative to self-hosted agent infrastructure where teams would otherwise spend months on setup before shipping anything to users.</li>
<li style="font-weight:400;">Early adopters include Rakuten, which deployed specialist enterprise agents across five business functions within a week each, and Sentry, which shipped a bug-to-PR pipeline in weeks instead of months by pairing their existing Seer debugging agent with a Claude-powered patching agent.</li>
<li style="font-weight:400;">Developers can get started via the Claude Console, the new CLI, or by using Claude Code with the built-in claude-api Skill, with multi-agent coordination and self-evaluation features still gated behind a <a href="http://claude.com/form/claude-managed-agents">research preview access request form</a>.</li>
</ul>
<p>25:51  Ryan – “So I don’t have to get a fleet of Mac Minis to run all my AI things?” </p>
<p>26:41 <a href="https://openai.com/index/next-phase-of-enterprise-ai">The next phase of enterprise AI</a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/">OpenAI</a> reports enterprise now accounts for more than 40% of revenue and is projected to reach parity with consumer revenue by the end of 2026, with APIs processing over 15 billion tokens per minute and <a href="https://openai.com/codex/">Codex</a> reaching 3 million weekly active users.</li>
<li style="font-weight:400;"><a href="https://openai.com/business/frontier/">OpenAI Frontier</a> is positioned as a company-wide agent deployment and management layer, distinct from single-product agent implementations, allowing agents to operate across an organization’s tools, systems, and data with centralized governance and permissions.</li>
<li style="font-weight:400;">A <a href="https://openai.com/index/amazon-partnership/">Stateful Runtime Environment</a> being co-developed with AWS is designed to give agents persistent context and memory across sessions, addressing a core limitation for complex enterprise workflows that span multiple tools and data sources.</li>
<li style="font-weight:400;">OpenAI is building toward a unified AI superapp that consolidates ChatGPT, Codex, and agentic browsing into a single employee-facing interface, with the stated goal of reducing enterprise rollout friction by leveraging ChatGPT’s existing 900 million weekly users who are already familiar with the interface.</li>
<li style="font-weight:400;"><a href="https://openai.com/index/frontier-alliance-partners/">Frontier Alliances</a>‘ partnerships with McKinsey, BCG, Accenture, Capgemini, Databricks, and Snowflake indicate OpenAI is pursuing an integration-first enterprise strategy, meeting customers within existing data infrastructure rather than requiring migration to new platforms.</li>
</ul>
<p>27:44  Ryan – “This sounds great; all these AI models are only as good as the data they have access to, and when you get into the Enterprise, you’re trying to integrate with all the IT services and other platforms that are used for development or other parts of the business, design tools – there’s all kinds of stuff. And it’s really tricky to sort of manage that. I’ve seen two models where you’re kind of left to your own devices, setting up your own MCP server or your own local integration somehow, or, if there is a platform, you know, sort of a sparse support of that. So I’m really happy to see this developed, and I’m really eager for this type of framework to be more prevalent.”</p>
<p>29:11 <a href="https://ai.meta.com/blog/introducing-muse-spark-msl/">Introducing Muse Spark: Scaling Towards Personal Superintelligence</a></p>
<ul>
<li style="font-weight:400;"><a href="https://ai.meta.com/">Meta</a> launched <a href="https://ai.meta.com/blog/introducing-muse-spark-msl/">Muse Spark</a>, the first model from its new Meta Superintelligence Labs division, available now at <a href="http://meta.ai">meta.ai</a> with a private API preview opening to select users. </li>
<li style="font-weight:400;">It is a natively multimodal reasoning model supporting tool-use, visual chain of thought, and multi-agent orchestration.</li>
<li style="font-weight:400;">A new Contemplating mode orchestrates multiple agents reasoning in parallel, achieving 58% on <a href="https://lastexam.ai/">Humanity’s Last Exam</a> and 38% on <a href="https://www.frontierscience.org/public/">FrontierScience Research</a>, positioning it alongside extreme reasoning modes from <a href="https://deepmind.google/models/gemini/deep-think/">Gemini Deep Think</a> and <a href="https://chatgpt.com/plans/pro/">GPT Pro</a>.</li>
<li style="font-weight:400;">Meta claims its new pretraining stack reaches equivalent capabilities with over an order of magnitude less compute than <a href="https://ai.meta.com/blog/llama-4-multimodal-intelligence/">Llama 4 Maverick</a>, which has direct implications for infrastructure costs and efficiency at scale, including their new Hyperion data center investment.</li>
<li style="font-weight:400;">The model uses a multi-agent test-time scaling approach that delivers stronger performance at comparable latency versus single-agent extended thinking, and applies token compression via thinking time penalties to optimize reasoning efficiency for serving at scale.</li>
<li style="font-weight:400;">A notable safety finding from <a href="https://www.apolloresearch.ai/">Apollo Research</a> identified that Muse Spark showed the highest rate of evaluation awareness of any model they have tested, frequently identifying scenarios as alignment traps. Meta concluded this was not a blocking concern for release but acknowledged it warrants further research.</li>
</ul>
<p>33:22  Justin – “So the thing about what’s on Humanity’s last exam right now is that the last update is from February 20th. So we’re just waiting to see when Mythos and this new Meta one get added to it, so that’ll be interesting.”</p>
<p>33:41 <a href="https://claude.com/blog/introducing-routines-in-claude-code">Introducing routines in Claude Code</a></p>
<ul>
<li style="font-weight:400;">Anthropic launched <a href="https://claude.com/blog/introducing-routines-in-claude-code">routines in Claude Code</a> as a research preview, letting developers configure automated workflows once with a prompt, repo, and connectors, then run them on a schedule, via API call, or in response to GitHub events without requiring a local machine to be running.</li>
<li style="font-weight:400;">Three trigger types are supported: scheduled cadences (hourly, nightly, or weekly), API-triggered endpoints where each routine gets its own URL and auth token, and GitHub webhook events that spin up a new session per matching PR and continue feeding it updates like comments and CI failures.</li>
<li style="font-weight:400;">The cloud-hosted infrastructure removes the need for developers to manage their own cron jobs, MCP servers, or additional tooling, since routines ship with built-in access to repos and connectors.</li>
<li style="font-weight:400;">Daily routine limits are tiered by plan: <a href="https://claude.ai/login?plan=pro">Pro</a> users get 5 per day, <a href="https://claude.ai/login?plan=max">Max</a> users get 15, and <a href="https://claude.com/pricing/team">Team</a> and <a href="https://claude.com/pricing/enterprise">Enterprise</a> users get 25, with additional runs available through extra usage at the same subscription usage rate as interactive sessions.</li>
<li style="font-weight:400;">Practical use cases already emerging include nightly bug triage that pulls from Linear and opens draft PRs, on-call alert summarization posted to Slack, and automated PR review flagging for sensitive code modules like authentication providers.</li>
</ul>
<p>38:32 <a href="https://openai.com/index/scaling-trusted-access-for-cyber-defense">Trusted access for the next era of cyber defense</a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/">OpenAI</a> launched <a href="https://openai.com/index/scaling-trusted-access-for-cyber-defense/">GPT-5.4-Cyber</a>, a fine-tuned variant of <a href="https://openai.com/index/introducing-gpt-5-4/">GPT-5.4</a> specifically designed for cybersecurity work, with reduced refusal boundaries for legitimate defensive tasks and new binary reverse engineering capabilities that let security professionals analyze compiled software without source code access.</li>
<li style="font-weight:400;">The Trusted Access for Cyber program is expanding from a limited pilot to thousands of individual verified defenders and hundreds of teams, with tiered access levels based on identity verification through <a href="http://chatgpt.com/cyber">chatgpt.com/cyber</a> for individuals and a separate enterprise request process for organizations.</li>
<li style="font-weight:400;"><a href="https://openai.com/index/codex-security-now-in-research-preview/">Codex Security</a>, which has been in preview, has contributed to fixing over 3,000 critical and high-severity vulnerabilities across the ecosystem, and OpenAI is positioning it as a shift from periodic security audits to continuous automated vulnerability detection integrated into developer workflows.</li>
<li style="font-weight:400;">Access to GPT-5.4-Cyber comes with notable tradeoffs for cloud and API users, specifically that Zero-Data Retention options may be restricted for higher-tier cyber-permissive access, which is a meaningful consideration for enterprises that rely on ZDR for compliance or data privacy requirements.</li>
<li style="font-weight:400;">OpenAI is framing this as a dual-use risk management challenge rather than a simple model release, explicitly acknowledging that cyber capabilities depend on user context and trust signals rather than model capability alone, and building automated verification systems to scale that judgment without manual review.</li>
</ul>
<p>33:52  Justin – “So weird. A week after Mythos.” </p>
<p>41:53 <a href="https://claude.com/blog/claude-code-desktop-redesign">Redesigning Claude Code on desktop for parallel agents </a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/">Anthropic</a> released a redesigned <a href="https://claude.com/blog/claude-code-desktop-redesign">Claude Code desktop app</a> built specifically for managing parallel agentic coding sessions, with a new sidebar that lets developers run simultaneous tasks across multiple repos and filter sessions by status, project, or environment.</li>
<li style="font-weight:400;">The app introduces a drag-and-drop layout system where developers can arrange the terminal, diff viewer, file editor, and chat in custom grid configurations, reducing the need to switch between external tools during code review and shipping.</li>
<li style="font-weight:400;">A side chat feature (Command/Ctrl + semicolon) lets developers ask questions mid-task without polluting the main session context, a practical way to keep long-running agentic tasks on track.</li>
<li style="font-weight:400;">The redesign adds three view modes (Verbose, Normal, Summary) to control how much detail is shown about Claude’s tool calls, plus a usage indicator showing both context window and session consumption at a glance, which matters for teams managing API costs.</li>
<li style="font-weight:400;">The updated app is now available for <a href="https://claude.com/pricing/pro">Pro</a>, <a href="https://claude.com/pricing/max">Max</a>, <a href="https://claude.com/pricing/team">Team</a>, and <a href="https://claude.com/pricing/enterprise">Enterprise</a> plan users, as well as via the <a href="https://claude.com/platform/api">Claude API</a>, with SSH support now extended to Mac in addition to Linux for pointing sessions at remote machines.</li>
</ul>
<p>43:05  Ryan – “So this is everything I was just complaining about earlier. This is perfect. This is why – not having this level of tools – why I haven’t really adopted Claude Code for my main workflows. Because everything that they’re announcing here is exactly what I use GitHub Copilot for.” </p>
<h2>AWS</h2>
<p>46:02 <a href="https://aws.amazon.com/blogs/machine-learning/manage-ai-costs-with-amazon-bedrock-projects/">Manage AI costs with Amazon Bedrock Projects </a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/bedrock/latest/userguide/projects.html">Amazon Bedrock Projects</a> lets organizations attribute AI inference costs to specific workloads by passing a project ID in API calls, which then flows into <a href="https://aws.amazon.com/aws-cost-management/aws-cost-explorer/">AWS Cost Explorer</a> and <a href="https://docs.aws.amazon.com/cur/latest/userguide/dataexports-create.html">AWS Data Export</a>s for analysis. </li>
<li style="font-weight:400;">This addresses a real operational gap for teams doing chargebacks or investigating cost spikes across multiple AI applications.</li>
<li style="font-weight:400;">The feature works by attaching resource tags to projects and activating them as cost allocation tags in <a href="https://console.aws.amazon.com/costmanagement/">AWS Billing</a>, using the same tagging and cost management tools organizations already use for other AWS services. </li>
<li style="font-weight:400;">Tags can cover dimensions like application, environment, team, and cost center.</li>
<li style="font-weight:400;">Bedrock Projects currently supports the OpenAI-compatible APIs, including the Responses API and Chat Completions API, meaning teams already using the <a href="https://openai.github.io/openai-agents-python/">OpenAI SDK</a> can adopt this with minimal code changes by simply adding a project ID parameter. Requests without a project ID automatically fall to a default project, which could create attribution gaps if not managed carefully.</li>
<li style="font-weight:400;">Organizations can create up to 1,000 projects per AWS account, and there is a 24-hour delay before tags propagate to Cost Explorer and Data Exports, so activating tags immediately after creating the first project is recommended to avoid gaps in billing data.</li>
<li style="font-weight:400;">Pricing for this feature is not separately itemized since it layers on top of existing Bedrock inference costs, but the value is in visibility rather than new spend, helping teams identify where AI budget is actually going before costs scale further.</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/about-aws/whats-new/2026/04/bedrock-iam-cost-allocation/">https://aws.amazon.com/about-aws/whats-new/2026/04/bedrock-iam-cost-allocation/</a> </li>
</ul>
<p>46:19  Justin – “I can tell you that this is a must-have. Every cloud provider needs to provide this capability. This is a major problem in Vertex. It’s a major problem in Bedrock. And even the project level is probably not granular enough. I need it at IAM identity level.”     </p>
<p>50:56 <a href="https://aws.amazon.com/blogs/machine-learning/introducing-stateful-mcp-client-capabilities-on-amazon-bedrock-agentcore-runtime/">Introducing stateful MCP client capabilities on Amazon Bedrock AgentCore </a><a href="https://aws.amazon.com/blogs/machine-learning/introducing-stateful-mcp-client-capabilities-on-amazon-bedrock-agentcore-runtime/">Runtime</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/bedrock-agentcore/latest/devguide/mcp-stateful-features.html">Amazon Bedrock AgentCore Runtime</a> now supports stateful <a href="https://modelcontextprotocol.io/specification/2025-11-25">MCP</a> servers, enabling bidirectional communication between MCP servers and clients. </li>
<li style="font-weight:400;">The key change is a single flag, stateless_http=False, which provisions a dedicated microVM per user session lasting up to 8 hours.</li>
<li style="font-weight:400;">Three new client capabilities are now available: elicitation for pausing tool execution to collect user input mid-workflow, sampling for delegating LLM generation back to the client without the server needing its own model credentials, and progress notifications for streaming real-time status updates during long-running operations.</li>
<li style="font-weight:400;">The sampling capability is particularly notable for enterprise use cases because it allows MCP servers to leverage the client’s connected LLM without holding API keys or model credentials directly, keeping model access control on the client side.</li>
<li style="font-weight:400;">Each stateful session gets CPU, memory, and filesystem isolation via microVMs, with sessions tracked through an Mcp-Session-Id header. </li>
<li style="font-weight:400;">Sessions expire after 15 minutes of inactivity or a maximum of 8 hours, after which clients must reinitialize.</li>
<li style="font-weight:400;">Practical use cases include multi-step financial workflows that confirm transactions before writing to <a href="https://aws.amazon.com/dynamodb/">DynamoDB</a>, travel booking tools that search options and then ask users to choose, and batch processing jobs that report incremental progress rather than leaving users waiting on a blank screen.</li>
</ul>
<p>51:53  Justin – “This can be dangerous. So definitely this one, if you’re implementing stateful MCPs, I would make sure you have a very good security model for them.” </p>
<p>54:53 <a href="https://aws.amazon.com/about-aws/whats-new/2026/04/aws-agent-registry-in-agentcore-preview/">AWS Agent Registry for centralized agent discovery and governance is </a><a href="https://aws.amazon.com/about-aws/whats-new/2026/04/aws-agent-registry-in-agentcore-preview/">now available in Preview</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/bedrock-agentcore/latest/devguide/registry.html">AWS Agent Registry</a>, part of <a href="https://aws.amazon.com/bedrock/agentcore/">Amazon Bedrock AgentCore</a>, is now in preview as a centralized catalog for discovering and governing AI agents, tools, MCP servers, and custom resources within an organization, helping teams avoid rebuilding capabilities that already exist.</li>
<li style="font-weight:400;">The registry supports URL-based discovery that automatically pulls metadata like tool schemas from live agent endpoints, plus an approval workflow so admins can gate what becomes discoverable, with <a href="https://aws.amazon.com/cloudtrail/">CloudTrail</a> providing full audit trails for compliance.</li>
<li style="font-weight:400;">Developers can search the registry using natural language semantic search or keyword search, and can access it via the console, <a href="https://aws.amazon.com/cli/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">AWS CLI</a>, SDK, or directly from their IDEs as an MCP server, supporting both IAM and OAuth with custom JWT.</li>
<li style="font-weight:400;">The preview is available in five <a href="https://aws.amazon.com/about-aws/global-infrastructure/regions_az/">regions</a>: US East (N. Virginia), US West (Oregon), Europe (Ireland), Asia Pacific (Tokyo), and Asia Pacific (Sydney), with no pricing details published yet for this preview feature.</li>
<li style="font-weight:400;">For organizations running multiple AI agent projects across teams, this addresses a practical governance gap by providing visibility into what agents exist and enforcing policies before new ones are deployed or discovered.</li>
</ul>
<p>55:44  Ryan – “It’s funny cause I don’t really think about Bedrock AgentCore for Enterprise, but maybe it would allow that, maybe in a sideways kind of way.”</p>
<p>56:46 <a href="https://kiro.dev/blog/cli-2-0/">Kiro CLI 2.0: a new look and feel, headless CI/CD pipelines, and Windows </a><a href="https://kiro.dev/blog/cli-2-0/">support</a></p>
<ul>
<li style="font-weight:400;"><a href="https://kiro.dev/cli/">Kiro CLI 2.0</a> introduces headless mode, allowing developers to run the agentic terminal programmatically via API key and environment variables, enabling integration into CI/CD pipelines and build scripts without user interaction.</li>
<li style="font-weight:400;">Native Windows support removes the need for workarounds like WSL, letting developers use Kiro agents directly in Windows Terminal for tasks like codebase navigation, bug tracing, and workflow automation.</li>
<li style="font-weight:400;">The updated TUI is now generally available after an experimental period, adding a subagent monitoring view accessible via Ctrl+G, real-time task lists, and parallel subagent execution that protects parent agent context on complex tasks.</li>
<li style="font-weight:400;">The headless mode is particularly relevant for teams looking to automate pull request generation and deployment troubleshooting workflows, reducing the need for continuous manual monitoring in release pipelines.</li>
<li style="font-weight:400;">Pricing details are not specified in the announcement, so listeners interested in production use should check kiro.dev for current plan information before building automation workflows around the headless API.</li>
</ul>
<p>58:37 <a href="https://amazon2022tf.q4web.com/news/news-details/2026/Amazon-to-Acquire-Globalstar-and-Expand-Amazon-Leo-Satellite-Network/default.aspx">Amazon.com, Inc. – Amazon to Acquire Globalstar and Expand Amazon Leo </a><a href="https://amazon2022tf.q4web.com/news/news-details/2026/Amazon-to-Acquire-Globalstar-and-Expand-Amazon-Leo-Satellite-Network/default.aspx">Satellite Network</a></p>
<ul>
<li style="font-weight:400;">Amazon is acquiring <a href="https://cts.businesswire.com/ct/CT?id=smartlink&amp;url=http%3A%2F%2Fwww.globalstar.com&amp;esheet=54502325&amp;newsitemid=20260414237496&amp;lan=en-US&amp;anchor=www.globalstar.com&amp;index=4&amp;md5=46c87939dcf76a2643f76672cfbc37bf">Globalstar</a> in a deal expected to close in 2027, gaining its LEO satellite fleet, MSS spectrum licenses with global authorizations, and direct-to-device technology to expand the <a href="https://cts.businesswire.com/ct/CT?id=smartlink&amp;url=https%3A%2F%2Fleo.amazon.com%2F&amp;esheet=54502325&amp;newsitemid=20260414237496&amp;lan=en-US&amp;anchor=Learn+more+about+Amazon+Leo&amp;index=3&amp;md5=77262786b0d4506ed833806b0bece066">Amazon Leo</a> satellite network beyond its current broadband focus.</li>
<li style="font-weight:400;">Starting in 2028, Amazon Leo will deploy a next-generation Direct-to-Device satellite system enabling voice, text, and data services on standard mobile phones without specialized hardware, targeting coverage gaps where terrestrial cellular networks cannot reach.</li>
<li style="font-weight:400;">Amazon and Apple have signed an agreement for Amazon Leo to power satellite features on iPhone 14 and later and Apple Watch Ultra 3, continuing services like Emergency SOS, Messages, Find My, and Roadside Assistance via satellite that Globalstar currently provides to Apple.</li>
<li style="font-weight:400;">The combined network is designed to support hundreds of millions of endpoints globally, with practical applications spanning consumer emergency messaging, enterprise IoT, fleet tracking, disaster response fallback connectivity, and rural broadband extension.</li>
<li style="font-weight:400;">For AWS customers and partners, this positions Amazon as a vertically integrated connectivity provider competing directly with Starlink and other satellite operators, which could eventually influence how edge computing, IoT, and hybrid cloud architectures are designed for remote and mobile deployments.</li>
</ul>
<p>59:28  Justin – “I guess we can finally say that the conversion from Amazon the bookstore to Amazon the utility is finally complete.” </p>
<h2>GCP</h2>
<p>1:03:02  <a href="https://cloud.google.com/blog/products/containers-kubernetes/optimize-aiml-workloads-with-gke-cloud-storage-fuse-profiles/">Optimize AI/ML workloads with GKE Cloud Storage FUSE Profiles </a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/blog/products/containers-kubernetes/optimize-aiml-workloads-with-gke-cloud-storage-fuse-profiles/">GKE Cloud Storage FUSE Profiles</a>, now generally available in <a href="https://cloud.google.com/kubernetes-engine">GKE</a> version 1.35.1-gke.1616000, automate storage configuration for AI/ML workloads by replacing manual tuning with three pre-built StorageClasses: gcsfusecsi-training, gcsfusecsi-serving, and gcsfusecsi-checkpointing.</li>
<li style="font-weight:400;">The feature addresses a real operational pain point where customers were leaving performance on the table or experiencing Pod Out-of-Memory kills due to misconfigured Cloud Storage FUSE settings that previously required navigating dozens of pages of documentation.</li>
<li style="font-weight:400;">The system dynamically scans your bucket and analyzes node resources, including RAM, Local SSD, and accelerator type, to calculate optimal cache sizes at deployment time, removing the need to manually account for these variables across different infrastructure configurations.</li>
<li style="font-weight:400;">The serving profile includes automated <a href="https://cloud.google.com/storage/docs/anywhere-cache">Rapid Cache</a> integration, and Google reports a notable real-world result: model loading time for a Qwen3-235B-A22B workload on TPUs dropped from 39 hours to 14 minutes using the inference profile.</li>
<li style="font-weight:400;">Pricing for this feature follows standard GKE and Cloud Storage pricing since the profiles are pre-installed StorageClasses within the CSI driver, though teams should factor in Local SSD and RAM usage costs that the system may allocate automatically based on node resources.</li>
</ul>
<p>1:04:12 <a href="https://blog.google/innovation-and-ai/products/gemini-app/3d-models-charts/">Generate 3D models and interactive charts with the Gemini app</a></p>
<ul>
<li style="font-weight:400;"><a href="http://gemini.google.com/">Gemini</a> now generates interactive 3D models and charts directly in chat at gemini.google.com, moving beyond static text and diagrams to functional simulations users can manipulate in real time. </li>
<li style="font-weight:400;">This is available by selecting the Pro model and prompting Gemini to “show me” or “help me visualize” a concept.</li>
<li style="font-weight:400;">The feature supports adjustable parameters like sliders and numeric inputs, so users can modify variables such as gravity or velocity and immediately see updated results. </li>
<li style="font-weight:400;">This makes it practical for exploring scientific concepts, physics simulations, and molecular structures without external tools.</li>
<li style="font-weight:400;">The rollout is global for standard <a href="https://gemini.google/subscriptions/">Gemini app users</a>, though Education and Workspace accounts are currently excluded. No additional cost is mentioned beyond existing Gemini Pro access, so pricing appears to be included within current subscription tiers.</li>
<li style="font-weight:400;">Likely use cases include education, research, and data analysis workflows where visual exploration of complex systems adds clarity. </li>
<li style="font-weight:400;">Industries like life sciences, engineering, and academic institutions stand to benefit most from interactive molecular and physics visualizations.</li>
<li style="font-weight:400;">For GCP customers, this signals Google’s direction toward embedding richer, interactive AI outputs into its Gemini ecosystem, which could eventually extend to Workspace and enterprise tools once the Education and Workspace exclusion is lifted.</li>
</ul>
<p>1:04:56  Ryan – “This is something that makes me think about actively getting a Gemini Pro account, which I don’t have today. Just the amount of stuff that I do with 3D printing, and being able to generate a model that I can then import into a tool, and fuse and tweak it, or maybe just would generate G code directly. So this is, I like this, and it’s definitely something I can see myself using.”</p>
<p>1:06:59 <a href="https://cloud.google.com/blog/products/identity-security/essential-ai-and-cloud-security-now-on-by-default/">Essential AI and cloud security now on by default</a></p>
<ul>
<li style="font-weight:400;">Google Cloud is automatically enabling an enhanced Security Command Center Standard tier for eligible customers at no cost, adding AI protection features, including a unified dashboard that detects unprotected Gemini inference and reports on LLM guardrail violations, with general availability expected by the end of June 2025.</li>
<li style="font-weight:400;">The free Standard tier now includes more than 44 misconfiguration checks based on the <a href="https://docs.cloud.google.com/security-command-center/docs/compliance-manager-frameworks#security-essentials">Google Cloud Security Essentials</a> compliance framework, up from the previous count by 21 checks, along with agentless critical vulnerability scanning and graph-driven risk prioritization.</li>
<li style="font-weight:400;">Data security posture management has been added to the free tier, allowing teams to discover and visualize data across <a href="https://cloud.google.com/vertex-ai">Vertex AI</a>, <a href="https://cloud.google.com/bigquery">BigQuery</a>, and Cloud Storage, with Compliance Manager included for automated monitoring against the GCSE framework.</li>
<li style="font-weight:400;">SCC now surfaces in-context security findings directly inside <a href="https://console.cloud.google.com/cloud-hub/security-and-compliance">Cloud Hub</a>, <a href="https://console.cloud.google.com/compute/security">GCE</a>, and <a href="https://console.cloud.google.com/kubernetes/security/dashboard">GKE </a>dashboards, giving infrastructure administrators security insights without switching between tools, which should reduce time to remediation.</li>
<li style="font-weight:400;">Organizations needing advanced capabilities like threat intelligence, <a href="https://cloud.google.com/blog/products/identity-security/how-virtual-red-teams-can-find-high-risk-cloud-issues-before-attackers-do">virtual red team</a> risk analysis, or malware scanning can start a 30-day free trial of SCC Premium directly from the console, with the Standard tier serving as a no-cost baseline for teams not yet ready for premium features.</li>
</ul>
<p>1:07:52  Ryan – “I really like this, and especially the free tier aspect of this, just because it is already such a challenge to know where your AI workloads are. And then having the specific configuration checks is great. I do think that the checks themselves – I played around with the 21 – they were a little basic, so it wasn’t that great. I do think it’s a great thing to have. The data scanning is super key, because that’s typically been really expensive to run and classify your data, and know where your sense of data is. So very cool.” </p>
<p>1:08:40 <a href="https://cloud.google.com/blog/products/data-analytics/looker-studio-is-data-studio/">Looker Studio is Data Studio </a></p>
<ul>
<li style="font-weight:400;">Google is rebranding Looker Studio back to its original name, <a href="https://cloud.google.com/looker-studio">Data Studio</a>,  positioning it as a hub for personal data exploration and ad-hoc reporting across Google data sources, including <a href="https://cloud.google.com/bigquery">BigQuery</a>, <a href="https://workspace.google.com/products/sheets/">Google Sheets</a>, and <a href="https://business.google.com/us/google-ads/?pli=1">Google Ads</a>.</li>
<li style="font-weight:400;">The platform now serves as a single location for multiple asset types beyond traditional reports, including BigQuery conversational agents and data apps built in Colab notebooks, reflecting a broader shift toward AI-era analytics workflows.</li>
<li style="font-weight:400;">Data Studio will coexist with Looker rather than replace it, with <a href="https://cloud.google.com/looker">Looker</a> remaining the enterprise BI platform focused on governed data and semantic modeling, while Data Studio targets individual and small team use cases.</li>
<li style="font-weight:400;">Pricing follows a two-tier model: the standard Data Studio remains free for individual use, while Data Studio Pro adds AI features, enterprise security, and compliance capabilities at a paid tier purchasable through the Google Cloud console or Google Workspace Admin Console (specific Pro pricing was not disclosed in the announcement).</li>
<li style="font-weight:400;">Existing users should see no disruption, as all current reports, data sources, and assets will migrate automatically to the new experience without any required action.</li>
</ul>
<p>1:09:34  Justin – “That was one of the big problems with Looker Studio, was that it wasn’t really meant for enterprise. So this Data Studio Pro version gives you that capability, finally.” </p>
<p>1:10:51 <a href="https://cloud.google.com/blog/products/data-analytics/introducing-bigquery-graph/">Introducing BigQuery Graph</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/blog/products/data-analytics/introducing-bigquery-graph/">BigQuery Graph</a> is now in preview, bringing native graph analytics into <a href="https://cloud.google.com/bigquery">BigQuery</a> using the ISO GQL standard. </li>
<li style="font-weight:400;">This lets analysts run multi-hop relationship queries without leaving BigQuery or learning a separate graph database system.</li>
<li style="font-weight:400;">The key technical distinction is that graph schemas are created on top of existing relational tables with no data duplication or movement. Users can mix SQL and GQL in the same queries, which lowers the barrier for teams already invested in SQL skills.</li>
<li style="font-weight:400;">Integration with <a href="https://docs.cloud.google.com/spanner/docs/graph/overview">Spanner Graph</a> is a notable addition, allowing federated queries that combine real-time Spanner data with historical BigQuery data in a single virtual graph. This addresses a common pain point where operational and analytical graph data live in separate systems.</li>
<li style="font-weight:400;">Real-world results from early adopters give some concrete numbers to consider: Curve reported roughly 9.1 million pounds in fraud savings by replacing SQL-based network analysis with graph queries, and Virgin Media O2 is running 4-hop queries to map relationships between accounts, devices, and activities.</li>
<li style="font-weight:400;">Pricing is not explicitly stated in the announcement, as this is a preview feature, so listeners should check the BigQuery documentation <a href="https://docs.cloud.google.com/bigquery/docs/graph-overview">here</a> for current details. </li>
<li style="font-weight:400;">Primary use cases include fraud detection, supply chain analysis, drug discovery, and customer relationship modeling.</li>
</ul>
<p>1:12:40 <a href="https://blog.google/products-and-platforms/products/chrome/skills-in-chrome/">Turn your best AI prompts into one-click tools in Chrome</a></p>
<ul>
<li style="font-weight:400;">Google launched <a href="https://blog.google/products-and-platforms/products/chrome/skills-in-chrome/">Skills in Chrome</a>, a feature that lets users save custom <a href="https://gemini.google.com/app">Gemini</a> prompts and rerun them with a single click using the forward slash or plus button interface, eliminating the need to retype repeated prompts across browsing sessions.</li>
<li style="font-weight:400;">Skills can operate across multiple tabs simultaneously, which makes it practical for tasks like comparing product specs or scanning several documents at once without manual prompt repetition.</li>
<li style="font-weight:400;">Google is also shipping a pre-built Skills library for common workflows like ingredient breakdowns, gift selection, and macro calculations, with options to customize any library Skill by editing the underlying prompt.</li>
<li style="font-weight:400;">On the privacy and security side, Skills inherits <a href="https://www.google.com/chrome/">Chrome’s</a> existing Gemini safeguards, including automated red-teaming and confirmation prompts before sensitive actions like sending email or adding calendar events.</li>
<li style="font-weight:400;">Saved Skills sync across signed-in Chrome desktop devices, making this more of a persistent personal workflow tool than a one-off browser feature, though it is limited to desktop, and there is no mention of separate pricing beyond existing Gemini in Chrome access.</li>
</ul>
<p>1:14:42  Ryan – “I’m trying to figure out whether I like this or not, right? Because I can think of some things that are kind of cool. And I’m trying to get around the, you know, the silliness of just executing things without really knowing what’s going on. That’s usually how security problems get introduced.”</p>
<h2>Azure</h2>
<p>1:17:36 <a href="https://www.forbes.com/sites/janakirammsv/2026/04/06/microsofts-agent-stack-confuses-developers-while-rivals-simplify/">Microsoft’s Agent Stack Confuses Developers While Rivals Simplify</a></p>
<ul>
<li style="font-weight:400;">Microsoft <a href="https://devblogs.microsoft.com/agent-framework/microsoft-agent-framework-version-1-0/">released Agent Framework 1.0</a> on April 3, merging Semantic Kernel and AutoGen into a single SDK after maintaining them as incompatible parallel frameworks. </li>
<li style="font-weight:400;">AutoGen will now receive only bug fixes and security patches, meaning developers on either framework face meaningful migration work to adopt the new unified tool.</li>
<li style="font-weight:400;">The Azure agent stack still spans multiple distinct surfaces, including <a href="https://github.com/microsoft/agent-framework">Agent Framework</a> for pro-code development, <a href="https://learn.microsoft.com/en-us/microsoft-copilot-studio/fundamentals-what-is-copilot-studio">Copilot Studio</a> for low-code, <a href="https://azure.microsoft.com/en-us/products/ai-foundry/agent-service/">Foundry Agent Service</a> as the managed runtime, and the <a href="https://learn.microsoft.com/en-us/microsoft-365/agents-sdk/">Microsoft 365 Agents SDK</a> for Teams distribution. Each surface has its own documentation and deployment model, requiring enterprise teams to make platform decisions before writing any agent logic.</li>
<li style="font-weight:400;"><a href="https://www.microsoft.com/en-us/microsoft-agent-365">Agent 365</a>, a governance and compliance control plane for monitoring agents at enterprise scale, reaches general availability on May 1 at $15 per user per month. This adds another procurement decision on top of the existing build and runtime layers rather than consolidating them.</li>
<li style="font-weight:400;">By comparison, Google Cloud’s Agent Development Kit feeds directly into Agent Engine on Vertex AI with a single CLI command for deployment, and AWS positions Strands Agents SDK as a thin framework that pairs cleanly with AgentCore as its managed runtime. Both competitors offer a more direct path from local development to production without requiring lateral platform decisions.</li>
<li style="font-weight:400;">Enterprise teams evaluating Azure for agentic workloads should map which surfaces their development, operations, and security teams will standardize on at each layer and account for the organizational cost of those decisions, including migration effort from Semantic Kernel or AutoGen.</li>
</ul>
<p>1:19:11  Matt – “Microsoft making things harder and more confusing? Never. ”\</p>
<h2>After Show</h2>
<p>54:04 <a href="https://cacm.acm.org/news/how-nasa-built-artemis-iis-fault-tolerant-computer/">How NASA Built Artemis II’s Fault-Tolerant Computer – Communications of </a></p>
<p><a href="https://cacm.acm.org/news/how-nasa-built-artemis-iis-fault-tolerant-computer/">the ACM</a></p>
<ul>
<li style="font-weight:400;">Artemis II’s Orion capsule runs eight CPUs in parallel across four Flight Control Modules, using a fail-silent design where faulty processors drop out rather than transmit bad data, and the system can lose three of four modules within 22 seconds and still operate safely on the remaining one.</li>
<li style="font-weight:400;">The architecture enforces strict determinism through time-triggered Ethernet and an ARINC653 scheduler, ensuring all processors see identical inputs and produce identical outputs, which is a notable contrast to modern Agile and DevOps practices, where this level of architectural discipline is increasingly uncommon.</li>
<li style="font-weight:400;">NASA uses dissimilar redundancy for the backup system, meaning different hardware, a different operating system, and independently written, simplified software, specifically to prevent a common software bug from taking down both primary and backup systems simultaneously.</li>
<li style="font-weight:400;">The verification process relies on supercomputer-scale fault injection and Monte Carlo stress testing to simulate full mission timelines with catastrophic hardware failures introduced, which offers a practical model for how cloud and infrastructure teams might approach resilience testing at scale.</li>
<li style="font-weight:400;">The broader industry implication is that as software takes over functions previously handled by mechanical or manual controls, whether in spacecraft, autonomous vehicles, or industrial systems, the engineering patterns developed here around fail-silent design and layered redundancy become increasingly relevant outside of aerospace.</li>
</ul>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2428621/c1e-nj4jtzz687t09q21-0v0k11pdszwj-ulnfch.mp3" length="168248934"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 351 of The Cloud Pod, where the weather is always cloudy! Justin, Matt, and Ryan are in the studio today and ready to bring you the latest in cloud and AI news. And it’s that time of year again – we’re coming up quickly on Google Next, place your so we’ve got our yearly predictions for what’s coming from Vegas, as well as more news about Mythos, Amazon finally becoming a utility, and even an aftershow where we discuss the computing power of Artemis. It’s a great show, so let’s get started!  

Titles we almost went with this week

 Three StorageClasses Walk Into an AI Workload
 Deprecated Models Don’t Die, They Just Fail Your API Calls
 SQL Walks Into a Graph Bar and Stays
 Too Many Agents Spoil the Workflow
 One Registry to Rule All Your Rogue AI Agents
 Eight CPUs Walk Into Space, Only One Comes Back
 Stop Retyping the Same Gemini Prompt Like a Caveman
 Claude Code Routines Let AI Work While You Sleep
 AWS Builds a Yellow Pages for Your AI Agents
 GPT Finally Stops Refusing to Talk About Hacking
 None of the hosts is ready for Next
 We are once again trying to look into our next next next crystal ball and failing
 Google is gonna announce AI, it’s just mandatory now
 Las Vegas is calling, our Livers are crying

 A big thanks to this week’s sponsors:
There are a lot of cloud cost management tools out there, but only Archera provides insured commitments. It sounds fancy, but it’s really simple. Archera gives you the cost savings of a 1 or 3-year AWS Savings Plan with a commitment as short as 30 days. If you do not use all the cloud resources you have committed to, Archera will literally cover the difference. Other cost management tools may say they offer “insured commitments”, but remember to ask: Will you actually give me my rebate? Because Archera will. 
Check out thecloudpod.net/archera to schedule a demo today. 
We also wanted to tell you about something coming to the US for the first time — WeAreDevelopers World Congress! 
They’ve been doing this in Europe for years, 15,000-plus attendees in Berlin, it’s one of the biggest developer events over there. Coté from Software Defined Talk is actually speaking at their Berlin event this summer, so we’ve got some firsthand context here. In September, they’re launching the North America edition. San José, September 23 to 25. 500-plus speakers, 18 tracks — cloud, infrastructure, DevOps, security, AI, data engineering, all of it. Speakers from Datadog, Honeycomb, Sentry, Google, LinkedIn, and Stack Overflow. Olivier Pomel, Christine Yen, Milin Desai, Kelsey Hightower – plus workshops and masterclasses, not just talks. These are people who know how to do a developer conference at scale. wearedevelopers.us, code DEVPOD26 for 15% off. Group rates on top of that for 4 or more.
Follow Up
01:47 AI Cybersecurity After Mythos: The Jagged Frontier 

Since the original Mythos/Project Glasswing announcement, AISLE published follow-up testing showing that small, inexpensive open-weight models can replicate much of the vulnerability detection work Anthropic attributed to Mythos, with all 8 tested models detecting the flagship FreeBSD NFS buffer overflow, including a 3.6B parameter model costing $0.11 per million tokens.
A notable correction to the framing of the original announcement: cybersecurity AI capability does not scale smoothly with model size or cost. 
Model rankings reshuffle completely across different security tasks, meaning there is no single b...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2428621/c1a-k5d5-jpxdoogjajxd-yeva85.jpg"></itunes:image>
                                                                            <itunes:duration>01:27:14</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2428621/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[350: It looks like you're trying to send an email from 250,000 miles away! Would you like help with that?]]>
                </title>
                <pubDate>Thu, 16 Apr 2026 16:30:13 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2423813</guid>
                                    <link>https://tcpfm.castos.com/episodes/350-it-looks-like-youre-trying-to-send-an-email-from-250000-miles-away-would-you-like-help-with</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 350 of The Cloud Pod, where the weather is always cloudy! Justin, Jonathan, and Matt are this week’s hosts, and they’ve scoured the clouds for all the latest news and announcements, including that Mythos drop. Is it the AI apocalypse that everyone is claiming? We’ve also got news from DigitalOcean, an email from Space, Claude and even some Guardrails. There’s a lot to cover, so let’s get started!</p>
<p>Titles we almost went with this week</p>
<ul>
<li> Two AIs Walk Into a Studio and Actually Sound Good </li>
<li> No More Idle GPUs Twiddling Their Tensor Cores </li>
<li> When AWS Availability Zones Become Unavailability Zones</li>
<li> Token by Token Codex Pricing Finally Makes Cents</li>
<li> Just Ask AWS Where All Your Money Went</li>
<li> You’ve Got mTLS: Amazon SES Locks Down Email Security</li>
<li> Cost Explorer Finally Speaks Plain English</li>
<li> Missiles Make AWS Multi-Region Strategy Mandatory</li>
<li> Shell Yeah Your Agent State Now Persists</li>
<li> S3 Files Finally Lets You ls Your Bucket</li>
<li> Claude Found Your Zero-Day Before Lunch</li>
<li> One Guardrail to Rule All Your AWS Accounts</li>
<li>Premium SSD Wins Azure VDI but Your Wallet Cries</li>
<li> No More Amnesia: Your Bedrock Agent Keeps Its Memories</li>
<li> Pay Per Claw Anthropic Sharpens Its Pricing Policy</li>
<li> Even Astronauts Need IT Support for Microsoft Outlook</li>
<li> AWS still can’t answer the question of what EC2 Other is</li>
<li> AWS announces several new Unavailability Zones
</li>
</ul>
<p> A big thanks to this week’s sponsor:</p>
<p>There are a lot of cloud cost management tools out there, but only Archera provides insured commitments. It sounds fancy, but it’s really simple. Archera gives you the cost savings of a 1 or 3-year AWS Savings Plan with a commitment as short as 30 days. If you do not use all the cloud resources you have committed to, Archera will literally cover the difference. Other cost management tools may say they offer “insured commitments”, but remember to ask: Will you actually give me my rebate? Because Archera will. </p>
<p>Check out <a href="http://www.thecloudpod.net/archera">thecloudpod.net/archera</a> to schedule a demo today. </p>
<h2>Follow Up</h2>
<p>00:45 <a href="https://www.geekwire.com/2026/ground-control-to-microsoft-artemis-2-astronauts-deal-with-outlook-hiccup-in-deep-space/">Ground control to Microsoft: Artemis 2 astronauts deal with Outlook hiccup </a><a href="https://www.geekwire.com/2026/ground-control-to-microsoft-artemis-2-astronauts-deal-with-outlook-hiccup-in-deep-space/">in deep space</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.geekwire.com/2026/nasa-countdown-artemis-moon-2/">Artemis 2 astronauts aboard NASA’s Orion spacecraft</a> encountered a common <a href="https://www.microsoft.com/en-us/microsoft-365/outlook/email-and-calendar-software-microsoft-outlook?deeplink=%2Fmail%2F&amp;sdf=0">Outlook</a> configuration issue on their first day in space, requiring <a href="https://techcrunch.com/2026/04/02/nasa-artemis-microsoft-outlook-astronauts/">remote IT support from Mission Control</a> to resolve it by reloading the commander’s files.</li>
<li style="font-weight:400;"><a href="https://www.nasa.gov/">NASA</a> uses commercial off-the-shelf software like Microsoft Outlook for crew scheduling and personal communications, while keeping primary flight systems on separate radiation-hardened hardware, illustrating a practical separation of concerns in mission-critical environments.</li>
<li style="font-weight:400;">The Outlook issue stemmed from the app having configuration problems when no direct network connection is available, which the flight director noted is not uncommon, raising questions about offline-readiness for software deployed in connectivity-constrained environments.</li>
<li style="font-weight:400;">This incident is a useful reminder for cloud and enterprise software users that applications heavily dependent on network connectivity can...</li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:00) - Episode 350</li><li>(00:00:51) - NASA: Outgoing Hiccup in Deep Space</li><li>(00:03:36) - Iran Declares AWS, Google and Microsoft Data Centers as Military Targ</li><li>(00:07:53) - Codex Only Pricing with Pay as You Go</li><li>(00:09:50) - Will Bedrock prioritize its higher-priced plans?</li><li>(00:16:04) - Anthropic Expands Cloud Hardware Partnership With Google, Broadcom</li><li>(00:17:42) - Anthropic's Cloud Mythos Preview Announced</li><li>(00:21:51) - Amazon SES adds managed daemons to Mail Manager</li><li>(00:25:35) - Bedrock Guardrails 1.8 in AWS Cost Management</li><li>(00:33:20) - Amazon's EFS Proxy for S3 Files</li><li>(00:35:10) - NetApp: No S3Fs for AI & ML</li><li>(00:39:09) - GK Inference Gateway now supports real-time and async workload</li><li>(00:40:36) - Gemini API Documentation and Coding Agents</li><li>(00:43:35) - Azure Network Watcher: Firewall Comparison vs. Standard SSD</li><li>(00:50:33) - DigitalOcean Launches Cloud Security PAM</li><li>(00:52:28) - Week in the Cloud: September 7, 2016</li><li>(00:53:41) - A Top Microsoft Engineer Reveals How Microsoft Vaporized a Tr</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 350 of The Cloud Pod, where the weather is always cloudy! Justin, Jonathan, and Matt are this week’s hosts, and they’ve scoured the clouds for all the latest news and announcements, including that Mythos drop. Is it the AI apocalypse that everyone is claiming? We’ve also got news from DigitalOcean, an email from Space, Claude and even some Guardrails. There’s a lot to cover, so let’s get started!
Titles we almost went with this week

 Two AIs Walk Into a Studio and Actually Sound Good 
 No More Idle GPUs Twiddling Their Tensor Cores 
 When AWS Availability Zones Become Unavailability Zones
 Token by Token Codex Pricing Finally Makes Cents
 Just Ask AWS Where All Your Money Went
 You’ve Got mTLS: Amazon SES Locks Down Email Security
 Cost Explorer Finally Speaks Plain English
 Missiles Make AWS Multi-Region Strategy Mandatory
 Shell Yeah Your Agent State Now Persists
 S3 Files Finally Lets You ls Your Bucket
 Claude Found Your Zero-Day Before Lunch
 One Guardrail to Rule All Your AWS Accounts
Premium SSD Wins Azure VDI but Your Wallet Cries
 No More Amnesia: Your Bedrock Agent Keeps Its Memories
 Pay Per Claw Anthropic Sharpens Its Pricing Policy
 Even Astronauts Need IT Support for Microsoft Outlook
 AWS still can’t answer the question of what EC2 Other is
 AWS announces several new Unavailability Zones


 A big thanks to this week’s sponsor:
There are a lot of cloud cost management tools out there, but only Archera provides insured commitments. It sounds fancy, but it’s really simple. Archera gives you the cost savings of a 1 or 3-year AWS Savings Plan with a commitment as short as 30 days. If you do not use all the cloud resources you have committed to, Archera will literally cover the difference. Other cost management tools may say they offer “insured commitments”, but remember to ask: Will you actually give me my rebate? Because Archera will. 
Check out thecloudpod.net/archera to schedule a demo today. 
Follow Up
00:45 Ground control to Microsoft: Artemis 2 astronauts deal with Outlook hiccup in deep space

Artemis 2 astronauts aboard NASA’s Orion spacecraft encountered a common Outlook configuration issue on their first day in space, requiring remote IT support from Mission Control to resolve it by reloading the commander’s files.
NASA uses commercial off-the-shelf software like Microsoft Outlook for crew scheduling and personal communications, while keeping primary flight systems on separate radiation-hardened hardware, illustrating a practical separation of concerns in mission-critical environments.
The Outlook issue stemmed from the app having configuration problems when no direct network connection is available, which the flight director noted is not uncommon, raising questions about offline-readiness for software deployed in connectivity-constrained environments.
This incident is a useful reminder for cloud and enterprise software users that applications heavily dependent on network connectivity can...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[350: It looks like you're trying to send an email from 250,000 miles away! Would you like help with that?]]>
                </itunes:title>
                                    <itunes:episode>350</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 350 of The Cloud Pod, where the weather is always cloudy! Justin, Jonathan, and Matt are this week’s hosts, and they’ve scoured the clouds for all the latest news and announcements, including that Mythos drop. Is it the AI apocalypse that everyone is claiming? We’ve also got news from DigitalOcean, an email from Space, Claude and even some Guardrails. There’s a lot to cover, so let’s get started!</p>
<p>Titles we almost went with this week</p>
<ul>
<li> Two AIs Walk Into a Studio and Actually Sound Good </li>
<li> No More Idle GPUs Twiddling Their Tensor Cores </li>
<li> When AWS Availability Zones Become Unavailability Zones</li>
<li> Token by Token Codex Pricing Finally Makes Cents</li>
<li> Just Ask AWS Where All Your Money Went</li>
<li> You’ve Got mTLS: Amazon SES Locks Down Email Security</li>
<li> Cost Explorer Finally Speaks Plain English</li>
<li> Missiles Make AWS Multi-Region Strategy Mandatory</li>
<li> Shell Yeah Your Agent State Now Persists</li>
<li> S3 Files Finally Lets You ls Your Bucket</li>
<li> Claude Found Your Zero-Day Before Lunch</li>
<li> One Guardrail to Rule All Your AWS Accounts</li>
<li>Premium SSD Wins Azure VDI but Your Wallet Cries</li>
<li> No More Amnesia: Your Bedrock Agent Keeps Its Memories</li>
<li> Pay Per Claw Anthropic Sharpens Its Pricing Policy</li>
<li> Even Astronauts Need IT Support for Microsoft Outlook</li>
<li> AWS still can’t answer the question of what EC2 Other is</li>
<li> AWS announces several new Unavailability Zones
</li>
</ul>
<p> A big thanks to this week’s sponsor:</p>
<p>There are a lot of cloud cost management tools out there, but only Archera provides insured commitments. It sounds fancy, but it’s really simple. Archera gives you the cost savings of a 1 or 3-year AWS Savings Plan with a commitment as short as 30 days. If you do not use all the cloud resources you have committed to, Archera will literally cover the difference. Other cost management tools may say they offer “insured commitments”, but remember to ask: Will you actually give me my rebate? Because Archera will. </p>
<p>Check out <a href="http://www.thecloudpod.net/archera">thecloudpod.net/archera</a> to schedule a demo today. </p>
<h2>Follow Up</h2>
<p>00:45 <a href="https://www.geekwire.com/2026/ground-control-to-microsoft-artemis-2-astronauts-deal-with-outlook-hiccup-in-deep-space/">Ground control to Microsoft: Artemis 2 astronauts deal with Outlook hiccup </a><a href="https://www.geekwire.com/2026/ground-control-to-microsoft-artemis-2-astronauts-deal-with-outlook-hiccup-in-deep-space/">in deep space</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.geekwire.com/2026/nasa-countdown-artemis-moon-2/">Artemis 2 astronauts aboard NASA’s Orion spacecraft</a> encountered a common <a href="https://www.microsoft.com/en-us/microsoft-365/outlook/email-and-calendar-software-microsoft-outlook?deeplink=%2Fmail%2F&amp;sdf=0">Outlook</a> configuration issue on their first day in space, requiring <a href="https://techcrunch.com/2026/04/02/nasa-artemis-microsoft-outlook-astronauts/">remote IT support from Mission Control</a> to resolve it by reloading the commander’s files.</li>
<li style="font-weight:400;"><a href="https://www.nasa.gov/">NASA</a> uses commercial off-the-shelf software like Microsoft Outlook for crew scheduling and personal communications, while keeping primary flight systems on separate radiation-hardened hardware, illustrating a practical separation of concerns in mission-critical environments.</li>
<li style="font-weight:400;">The Outlook issue stemmed from the app having configuration problems when no direct network connection is available, which the flight director noted is not uncommon, raising questions about offline-readiness for software deployed in connectivity-constrained environments.</li>
<li style="font-weight:400;">This incident is a useful reminder for cloud and enterprise software users that applications heavily dependent on network connectivity can behave unpredictably in low or no-connectivity scenarios, and offline mode reliability remains an important consideration for software selection.</li>
<li style="font-weight:400;"><a href="https://www.microsoft.com/en-us">Microsoft</a> has not issued a public comment, but the episode highlights how widely deployed enterprise software is, reaching use cases well beyond what vendors typically design or test for.</li>
</ul>
<p>03:31 <a href="https://siliconcanals.com/sc-d-iran-declared-aws-google-and-microsoft-data-centers-military-targets-the-legal-and-strategic-fallout-is-just-beginning/">Iran declared AWS, Google, and Microsoft data centers military targets. The Legal</a><a href="https://siliconcanals.com/sc-d-iran-declared-aws-google-and-microsoft-data-centers-military-targets-the-legal-and-strategic-fallout-is-just-beginning/"> and strategic fallout is just beginning</a></p>
<ul>
<li style="font-weight:400;">Iran’s April 2025 declaration named the <a href="https://www.war.gov/News/Releases/Release/Article/3239378/department-of-defense-announces-joint-warfighting-cloud-capability-procurement/">Joint Warfighting Cloud Capability</a> (JWCC) contract specifically, arguing that <a href="https://aws.amazon.com/?nc2=h_home">AWS</a>, <a href="https://cloud.google.com/">Google</a>, <a href="https://www.microsoft.com/en-us">Microsoft</a>, and <a href="https://www.oracle.com/">Oracle</a> data centers hosting Pentagon AI and intelligence workloads have lost civilian status under the <a href="https://www.unesco.org/en/memory-world/geneva-conventions-1864-1906-1929-and-1949-well-additional-protocols-1977-and-2005">Geneva Conventions</a> principle of distinction. </li>
<li style="font-weight:400;">The legal argument centers on the fact that classified military workloads share physical infrastructure with banking, healthcare, and consumer services.</li>
<li style="font-weight:400;">The JWCC contract, worth up to $9 billion, was deliberately designed to distribute military workloads across multiple commercial providers to avoid vendor lock-in, but this decision inadvertently spread the targeting problem across every major hyperscaler simultaneously rather than containing it to a single provider.</li>
<li style="font-weight:400;">Northern Virginia, the densest data center concentration on Earth, sits near the Pentagon’s most sensitive cloud workloads, meaning a single facility in Ashburn could simultaneously process classified Pentagon data, hospital records, and financial transactions with no practical way to separate them once a conflict begins.</li>
<li style="font-weight:400;">Insurance and operational costs are already responding to this risk, with businesses in geopolitically sensitive regions facing substantially higher premiums for multi-region redundancy and war-risk coverage, costs that will eventually pass through to end customers regardless of whether any strike occurs.</li>
<li style="font-weight:400;">The article identifies three structural fixes: DoD physically isolating JWCC workloads from civilian infrastructure, Congress updating defense cloud procurement rules to account for civilian collateral risk, and hyperscalers disclosing to commercial customers whether their specific facilities host military workloads. </li>
<li style="font-weight:400;">None of these changes are currently underway at a meaningful scale.</li>
</ul>
<p>04:05  Justin – “In the case of FedRAMP and JWCC, those are typically in the FedRAMP data centers in the US, so it’s a little bit of an interesting distinction, but there’s no guarantee that they’re not putting FedRAMP-type workloads into regions closer to the war zone. There’s no conversation about that, so I can see Iran’s point in this. And this will definitely make insurance and operating in clouds more expensive for companies who are very politically sensitive.” 
</p>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>08:07 <a href="https://openai.com/index/codex-flexible-pricing-for-teams">Codex now offers pay-as-you-go pricing for teams</a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/">OpenAI</a> is introducing pay-as-you-go pricing for <a href="https://openai.com/codex/">Codex</a>-only seats within <a href="https://chatgpt.com/business/">ChatGPT Business</a> and <a href="https://chatgpt.com/business/enterprise/">Enterprise</a> workspaces, billing on token consumption with no rate limits instead of a fixed per-seat fee, giving teams more cost visibility across workflows.</li>
<li style="font-weight:400;">ChatGPT Business annual pricing drops from $25 to $20 per seat for teams that want standard ChatGPT access with Codex usage limits included, while the new Codex-only seat option serves teams that want dedicated coding agent access without the broader ChatGPT bundle.</li>
<li style="font-weight:400;">OpenAI is offering eligible Business workspaces $100 in credits per new Codex-only team member added, capped at $500 per team for a limited time, which lowers the barrier for initial pilots.</li>
<li style="font-weight:400;">Codex now supports <a href="https://developers.openai.com/codex/plugins">Plugins</a> and <a href="https://developers.openai.com/codex/app/automations">Automations</a> through its <a href="https://openai.com/index/introducing-the-codex-app/">macOS and Windows </a>apps, allowing teams to connect the coding agent to existing internal systems and tooling rather than treating it as a standalone tool.</li>
<li style="font-weight:400;">OpenAI reports over 2 million weekly active Codex builders and a 6x growth in Codex users within Business and Enterprise accounts since January, with named customers including <a href="https://www.notion.so/login">Notion</a>, <a href="https://ramp.com/">Ramp</a>, and <a href="https://www.usebraintrust.com/">Braintrust</a> using it to standardize engineering workflows.</li>
</ul>
<p>09:31  Jonathan – “I think if you want the best performance, you’re going to have to pay for what you use. I think anyone that’s paying for a bundle is always going to be second class.” </p>
<p>13:57 <a href="https://techcrunch.com/2026/04/04/anthropic-says-claude-code-subscribers-will-need-to-pay-extra-for-openclaw-support/">Anthropic says Claude Code subscribers will need to pay extra for </a><a href="https://techcrunch.com/2026/04/04/anthropic-says-claude-code-subscribers-will-need-to-pay-extra-for-openclaw-support/">OpenClaw usage</a></p>
<ul>
<li style="font-weight:400;">Starting April 4, <a href="https://www.anthropic.com/">Anthropic</a> is requiring <a href="https://claude.com/product/claude-code">Claude Code</a> subscribers to pay separately on a pay-as-you-go basis for usage through third-party tools like <a href="https://openclaw.ai/">OpenClaw</a>, rather than drawing from their existing subscription limits. </li>
<li style="font-weight:400;">This affects all third-party harnesses, with more platforms to follow.</li>
<li style="font-weight:400;">Anthropic’s head of Claude Code cited infrastructure constraints and unsustainable usage patterns from third-party tools as the reason for the change, and noted the company is offering full refunds to subscribers who were unaware of the policy shift.</li>
<li style="font-weight:400;">The timing is notable given that OpenClaw’s creator, Peter Steinberger, recently <a href="https://techcrunch.com/2026/02/15/openclaw-creator-peter-steinberger-joins-openai/">joined OpenAI</a>, and OpenClaw continues as an open source project with OpenAI backing. Steinberger publicly stated he attempted to negotiate with Anthropic and only managed to delay the pricing change by one week.</li>
<li style="font-weight:400;">For developers building on or using AI coding assistants through third-party integrations, this signals a broader industry pattern where AI providers may separate subscription pricing from API-level or harness-level consumption, adding cost complexity for teams relying on open source tooling around proprietary models.</li>
<li style="font-weight:400;">OpenAI recently <a href="https://techcrunch.com/2026/03/29/why-openai-really-shut-down-sora/">shut down its Sora app</a> to reallocate compute resources, reflecting that both major AI providers are actively managing infrastructure capacity as demand from software engineering use cases like Claude Code continues to grow.</li>
</ul>
<p>14:59  Jonathan – “I understand *why* they’re doing it, because there’s a big difference between somebody having a conversation or somebody doing coding, where you are mostly using cache hits for the majority of the work, versus OpenClaw where the context changes constantly, and making calls every 60 seconds. It is a completely different type of workload. At the same time, I’m paying $200 a month…” </p>
<p>16:38 <a href="https://www.anthropic.com/news/google-broadcom-partnership-compute">Anthropic expands partnership with Google and Broadcom for multiple </a><a href="https://www.anthropic.com/news/google-broadcom-partnership-compute">gigawatts of next-generation compute</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/">Anthropic</a> has signed a multi-gigawatt <a href="https://www.anthropic.com/news/expanding-our-use-of-google-cloud-tpus-and-services">TPU capacity</a> agreement with <a href="http://google.com">Google</a> and <a href="https://www.broadcom.com/">Broadcom</a>, with infrastructure expected to come online starting in 2027. This builds on an existing October 2025 TPU expansion and deepens Anthropic’s reliance on Google Cloud alongside AWS and NVIDIA hardware.</li>
<li style="font-weight:400;">Anthropic’s run-rate revenue has grown from roughly $9 billion at the end of 2025 to over $30 billion, with enterprise customers spending over $1 million annually, doubling from 500 to over 1,000 in under two months. The compute expansion is a direct response to this accelerating demand.</li>
<li style="font-weight:400;">Anthropic continues a multi-cloud hardware strategy, running <a href="https://claude.ai/new">Claude</a> on <a href="https://aws.amazon.com/ai/machine-learning/trainium/">AWS Trainium</a>, <a href="https://cloud.google.com/tpu">Google TPUs</a>, and NVIDIA GPUs to match workloads to appropriate chips. Amazon remains the primary cloud and training partner, with ongoing work on <a href="https://www.aboutamazon.com/news/aws/aws-project-rainier-ai-trainium-chips-compute-cluster">Project Rainier</a>.</li>
<li style="font-weight:400;">Claude is currently the only frontier AI model available across all three major cloud platforms: <a href="https://aws.amazon.com/bedrock/">AWS Bedrock</a>, <a href="https://cloud.google.com/vertex-ai">Google Cloud Vertex AI</a>, and <a href="https://azure.microsoft.com/en-us/products/ai-foundry/">Microsoft Azure Foundry</a>. </li>
<li style="font-weight:400;">This broad availability has practical implications for enterprises already committed to any of the three major cloud providers.</li>
<li style="font-weight:400;">The majority of new compute will be US-based, extending Anthropic’s November 2025 pledge to invest $50 billion in American AI infrastructure. </li>
<li style="font-weight:400;">For cloud practitioners, this signals continued long-term capacity constraints driving large-scale, multi-year infrastructure commitments across the industry.</li>
</ul>
<p>08:24 <a href="https://www.anthropic.com/glasswing">Project Glasswing: Securing critical software for the AI era </a></p>
<ul>
<li style="font-weight:400;">Anthropic announced <a href="https://www.anthropic.com/glasswing">Project Glasswing</a>, a coalition including AWS, Google, Microsoft, Apple, Cisco, NVIDIA, and others, built around a new unreleased model called <a href="https://www-cdn.anthropic.com/08ab9158070959f88f296514c21b7facce6f52bc.pdf">Claude Mythos Preview</a> that is focused specifically on finding and fixing software vulnerabilities in critical infrastructure.</li>
<li style="font-weight:400;">Mythos Preview has already identified thousands of high-severity vulnerabilities autonomously, including a 27-year-old flaw in <a href="https://www.openbsd.org/">OpenBSD</a>, a 16-year-old bug in FFmpeg that survived 5 million automated test runs, and a Linux kernel privilege escalation chain, all of which have since been patched.</li>
<li style="font-weight:400;">The model will not be generally available, but partners can access it via <a href="https://claude.com/platform/api">Claude API</a>, <a href="https://aws.amazon.com/bedrock/">Amazon Bedrock</a>, <a href="https://cloud.google.com/vertex-ai">Google Cloud Vertex AI</a>, and <a href="https://azure.microsoft.com/en-us/products/ai-foundry/">Microsoft Foundry</a> at $25 per million input tokens and $125 per million output tokens after an initial period covered by $100M in Anthropic usage credits.</li>
<li style="font-weight:400;">Anthropic is donating $4M to open-source security organizations, including <a href="https://alphaomega.com/">Alpha-Omega</a>, <a href="https://openssf.org/">OpenSSF</a> through the <a href="https://www.linuxfoundation.org/">Linux Foundation</a>, and the <a href="https://apache.org/">Apache Software Foundation</a>, to help maintainers respond to vulnerabilities the model surfaces.</li>
<li style="font-weight:400;">The initiative signals a shift in how AI safety and capability tradeoffs are being handled in practice, with Anthropic planning to test new cybersecurity safeguards on an upcoming Claude Opus model before considering any broader deployment of Mythos-class capabilities.</li>
</ul>
<p>20:44  Justin – “…is it probably really great at finding stuff? Is it really good at chaining things together to find these attacks? Yes. Is it as scary as they may get out to be? Maybe, maybe not. I don’t know; time will tell. I’m not going to be spending money on Mythos tokens to find out, but I am curious to see what people are coming out with now that it’s out in the wild.”</p>
<h2>AWS</h2>
<p>21:56 <a href="https://aws.amazon.com/blogs/aws/announcing-managed-daemon-support-for-amazon-ecs-managed-instances/">Announcing managed daemon support for Amazon ECS Managed </a><a href="https://aws.amazon.com/blogs/aws/announcing-managed-daemon-support-for-amazon-ecs-managed-instances/">Instances</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/ecs/managed-instances/">ECS</a> Managed Daemons lets platform engineers deploy and update monitoring, logging, and tracing agents independently from application teams, eliminating the need to coordinate task definition changes or service redeployments across hundreds of services.</li>
<li style="font-weight:400;">Daemons are guaranteed to start before application tasks and drain last, ensuring operational tooling like the <a href="https://docs.aws.amazon.com/AmazonCloudWatch/latest/monitoring/Install-CloudWatch-Agent.html">CloudWatch Agent</a> is always available throughout the application lifecycle, including during rolling updates.</li>
<li style="font-weight:400;">A new daemon_bridge network mode keeps daemon containers isolated from application networking while still allowing communication, and daemons support privileged container access and host filesystem mounts for deep system-level visibility.</li>
<li style="font-weight:400;">Each instance runs exactly one daemon copy shared across all application tasks on that instance, which optimizes resource utilization and allows CPU and memory parameters to be managed centrally without rebuilding AMIs or modifying application task definitions.</li>
<li style="font-weight:400;">The feature is available now in all <a href="https://aws.amazon.com/about-aws/global-infrastructure/regions_az/">AWS regions</a> at no additional charge beyond standard compute costs for the daemon tasks themselves, and can be configured through the ECS console or the new managed daemons API documented <a href="https://docs.aws.amazon.com/AmazonECS/latest/developerguide/getting-started-managed-instances.html">here</a>. </li>
</ul>
<p>22:55  Jonathan – “What a useful feature!” </p>
<p>23:36 <a href="https://aws.amazon.com/about-aws/whats-new/2026/04/ses-mail-manager-introduces-new-features/">Amazon SES Mail Manager adds new features for enhanced security and </a><a href="https://aws.amazon.com/about-aws/whats-new/2026/04/ses-mail-manager-introduces-new-features/">email processing</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/ses/">Amazon SES Mail Manager</a> now supports optional STARTTLS configuration, allowing legacy systems that lack full STARTTLS support to still connect to Mail Manager without requiring a full infrastructure overhaul.</li>
<li style="font-weight:400;">Mutual TLS (mTLS) adds certificate-based authentication at the Ingress Endpoint level, giving organizations a stronger identity verification layer for inbound email connections beyond standard encryption.</li>
<li style="font-weight:400;">Two new rule actions expand email processing flexibility: Invoke <a href="https://aws.amazon.com/lambda/">Lambda</a> lets you trigger custom code directly from rule sets for advanced routing or transformation logic, while the Bounce action sends RFC-compliant SMTP rejection responses back to sending servers.</li>
<li style="font-weight:400;">These features are available now across most SES Mail Manager regions, with the notable exception of Middle East UAE and Middle East Bahrain, so customers in those regions will need to wait for expansion.</li>
<li style="font-weight:400;">Pricing for SES Mail Manager follows existing SES usage-based pricing, so the cost impact of these new features will depend on Lambda invocation volume and overall email processing scale rather than any new flat fees.</li>
</ul>
<p>24:38  Justin – “They’ve actually added a lot of features to Mail Manager recently; the fact that it now can handle bounce protection and handles all of that stuff that you used to have to build your own toil for, it’s nice that that stuff is now there.” </p>
<p>26:07 <a href="https://aws.amazon.com/blogs/aws/amazon-bedrock-guardrails-supports-cross-account-safeguards-with-centralized-control-and-management/">Amazon Bedrock Guardrails supports cross-account safeguards with </a><a href="https://aws.amazon.com/blogs/aws/amazon-bedrock-guardrails-supports-cross-account-safeguards-with-centralized-control-and-management/">centralized control and management</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/category/artificial-intelligence/amazon-machine-learning/amazon-bedrock/amazon-bedrock-guardrails/">Amazon Bedrock Guardrails</a> now supports cross-account safeguards in general availability, letting organizations enforce a single guardrail policy across all AWS accounts and organizational units from a central management account, covering every <a href="https://aws.amazon.com/blogs/aws/category/artificial-intelligence/amazon-machine-learning/amazon-bedrock/">Bedrock</a> model invocation automatically.</li>
<li style="font-weight:400;">There are two enforcement levels: organization-level enforcement applies guardrails via <a href="https://aws.amazon.com/blogs/aws/category/security-identity-compliance/aws-organizations/">AWS Organizations</a> policies to all member accounts and OUs, while account-level enforcement applies guardrails to all Bedrock inference calls within a single account, giving teams flexibility to layer controls.</li>
<li style="font-weight:400;">A notable configuration option lets admins choose between Comprehensive mode, which enforces guardrails on all content regardless of caller tagging, and Selective mode, which only applies guardrails to content that callers explicitly tag, useful for mixed workloads with pre-validated and user-generated content.</li>
<li style="font-weight:400;">One practical gotcha worth flagging: specifying an incorrect guardrail ARN in the policy does not just fail silently; it blocks all Bedrock model inference for affected accounts, so ARN accuracy is critical before attaching policies to production OUs.</li>
<li style="font-weight:400;">The feature is available now across all AWS commercial and <a href="https://aws.amazon.com/govcloud-us/">GovCloud</a> regions where Bedrock Guardrails is supported, with pricing tied to each enforced guardrail based on its configured safeguards per the Amazon Bedrock pricing page. </li>
<li style="font-weight:400;">Automated Reasoning checks are not supported with this capability.</li>
</ul>
<p>27:02  Justin – “If you are not careful, you can lock yourself out of your domain.” </p>
<p>28:09 <a href="https://aws.amazon.com/about-aws/whats-new/2026/04/AWS-Cost-Explorer-Natural-Language-Query/">AWS Cost Explorer launches Natural Language Query capabilities powered </a><a href="https://aws.amazon.com/about-aws/whats-new/2026/04/AWS-Cost-Explorer-Natural-Language-Query/">by Amazon Q</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/aws-cost-management/aws-cost-explorer/">AWS Cost Explorer</a> now supports natural language queries powered by <a href="https://aws.amazon.com/q/developer/">Amazon Q Developer</a>, letting users ask plain-English questions like “Show me my top spending services this month” and receive both written insights and automatically updated charts, filters, and groupings simultaneously.</li>
<li style="font-weight:400;">The feature supports conversational follow-up questions with maintained context, meaning users can move from a quick cost check to a detailed investigation without switching tools or manually reconfiguring visualizations.</li>
<li style="font-weight:400;">When Amazon Q pulls from additional datasets beyond raw cost and usage data, such as pricing catalogs or anomaly detection, those results appear in a separate artifacts panel rather than the main Cost Explorer view, which is a useful distinction to understand when interpreting outputs.</li>
<li style="font-weight:400;">This is available at no additional charge across all commercial AWS Regions today, making it accessible without budget justification for teams already using Cost Explorer.</li>
<li style="font-weight:400;">The practical impact is that non-technical stakeholders, like finance or product teams, can now query AWS spend directly without needing to understand Cost Explorer’s filter and grouping mechanics, potentially reducing the bottleneck on cloud or DevOps teams for routine cost reporting.</li>
</ul>
<p>28:44  Justin – “I did play with, this because I was curious and I’ve done a lot of really cool things with AI for cost management recently. It’s not very good. Like most Amazon Q things, it’s not great.” </p>
<p>31:38 <a href="https://aws.amazon.com/blogs/machine-learning/building-real-time-conversational-podcasts-with-amazon-nova-2-sonic/">Building real-time conversational podcasts with Amazon Nova 2 Sonic</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/nova/models/">Amazon Nova 2 Sonic</a> is a speech-to-speech model available through <a href="https://aws.amazon.com/blogs/machine-learning/category/artificial-intelligence/amazon-machine-learning/amazon-bedrock/">Amazon Bedrock</a> that handles real-time conversational AI with support for seven languages and a 1 million token context window, making it practical for voice-first applications like customer support and interactive learning.</li>
<li style="font-weight:400;">The AWS blog post demonstrates a proof-of-concept podcast generator that uses two Nova Sonic instances to simulate a host-and-expert dialogue, streaming audio in real time using a Flask and AsyncIO architecture with RxPy for reactive event processing.</li>
<li style="font-weight:400;">A notable technical detail is the stage-aware content filtering system, which distinguishes between SPECULATIVE and FINAL generation stages to eliminate duplicate audio chunks and prevent artifacts, using a combination of interruption markers, text deduplication, and audio hash fingerprinting.</li>
<li style="font-weight:400;">The architecture captures audio at 16kHz PCM input and returns synthesized speech at 24kHz PCM output through a bidirectional event stream brokered by Amazon Bedrock, with the blog noting that PyAudio is suitable for server-side demos, but production deployments should use Web Audio API or WebRTC for browser clients.</li>
<li style="font-weight:400;">Practical use cases beyond podcasting include multilingual content localization, ecommerce product commentary, and enterprise training content, with pricing tied to Amazon Bedrock consumption-based rates rather than a fixed subscription, so costs scale with actual usage volume.</li>
</ul>
<p>32:32  Matt – “Still not out of a podcasting job yet. Got it.” </p>
<p>33:40 <a href="https://aws.amazon.com/blogs/aws/launching-s3-files-making-s3-buckets-accessible-as-file-systems/?ck_subscriber_id=512838477">Launching S3 Files, making S3 buckets accessible as file systems </a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/s3/features/files/">Amazon S3 Files</a> lets you mount any general-purpose <a href="https://aws.amazon.com/s3/">S3 bucket</a> as a native NFS v4.1 file system on <a href="https://aws.amazon.com/ec2/">EC2</a>, <a href="https://aws.amazon.com/ecs/">ECS</a>, <a href="https://aws.amazon.com/eks/">EKS</a>, and <a href="https://aws.amazon.com/lambda/">Lambda</a>, meaning you can use standard file commands like ls, cp, and echo while changes sync back to S3 within minutes. </li>
<li style="font-weight:400;">This eliminates the longstanding tradeoff between S3’s durability and cost versus a file system’s interactive capabilities.</li>
<li style="font-weight:400;">Under the hood, S3 Files is built on <a href="https://aws.amazon.com/efs">EFS</a> and delivers approximately 1ms latency for active data, with intelligent pre-fetching and byte-range reads to minimize unnecessary data transfer and costs. </li>
<li style="font-weight:400;">Files not on high-performance storage are served directly from S3 to maximize throughput for large sequential reads.</li>
<li style="font-weight:400;">The feature is positioned specifically for workloads where multiple compute resources need shared, concurrent access to the same data without duplication, including agentic AI systems using file-based Python tools and ML training pipelines. It supports NFS close-to-open consistency for collaborative workloads.</li>
<li style="font-weight:400;">Pricing is based on data stored in the file system, small file reads, write operations, and S3 requests during synchronization, so costs will vary significantly by access pattern and workload type. </li>
<li style="font-weight:400;">Full pricing details are on the S3 <a href="https://aws.amazon.com/s3/pricing/">pricing page</a>, and the service is available now in all commercial AWS regions.</li>
<li style="font-weight:400;">AWS is careful to position S3 Files alongside rather than replacing EFS and <a href="https://aws.amazon.com/fsx/">FSx</a>, noting FSx remains the better choice for on-premises NAS migrations, HPC workloads with Lustre, and workloads requiring NetApp ONTAP or Windows File Server compatibility.</li>
<li style="font-weight:400;">Last Week in AWS blog: <a href="https://www.lastweekinaws.com/blog/s3-is-not-a-filesystem-but-now-theres-one-in-front-of-it/">S3 Is Not a Filesystem (But Now There’s One In Front of It)</a></li>
</ul>
<p>35:32  Jonathan – “In other possibly related news, the NetApp stock price is down from $104 to $96 in the past 24 hours, because that’s basically what NetApp tiered storage does.” </p>
<h2>GCP</h2>
<p>39:37  <a href="https://cloud.google.com/blog/products/containers-kubernetes/unifying-real-time-and-async-inference-with-gke-inference-gateway/">Unifying real-time and async inference with GKE Inference Gateway</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.cloud.google.com/kubernetes-engine/docs/concepts/about-gke-inference-gateway">GKE Inference Gateway</a> now supports both real-time and async inference workloads on the same shared GPU/TPU accelerator pool, eliminating the need to maintain separate clusters for each traffic type. </li>
<li style="font-weight:400;">This addresses a common infrastructure inefficiency where real-time clusters sit idle during off-peak hours while async jobs run on underutilized secondary hardware.</li>
<li style="font-weight:400;">The async component works by integrating a <a href="https://ai.google.dev/gemini-api/docs/batch-api?batch=file">Batch Processing Agent</a> with Cloud Pub/Sub, where latency-tolerant requests are pulled from a queue and routed to the Inference Gateway as lower-priority “sheddable” traffic that fills unused compute cycles between real-time spikes.</li>
<li style="font-weight:400;">Testing showed that without the Async Processor Agent, unmanaged multiplexing of low-priority requests caused 99% message drop, while using the agent resulted in 100% of latency-tolerant requests being served during available capacity windows. </li>
<li style="font-weight:400;">This demonstrates that the priority enforcement mechanism is doing meaningful work, not just theoretical traffic shaping.</li>
<li style="font-weight:400;">The project is open source and available on GitHub at <a href="http://github.com/llm-d-incubation/llm-d-async">github.com/llm-d-incubation/llm-d-async</a>, meaning teams can use it across multiple cloud environments rather than being locked into GKE specifically. Pricing would follow standard GKE and Pub/Sub usage costs with no separate charge for the gateway component itself.</li>
<li style="font-weight:400;">The next development phase will add deadline-aware scheduling, letting users set soft completion windows for batch jobs so the system can make more informed decisions about when to process filler traffic relative to real-time demand.</li>
</ul>
<p>40:48   Jonathan – “…that’s very cool; especially works in the interest of the cloud vendors now who can maximize their utilization of the GPUs. There’s still a lot CPUs sitting there idle though, like 60 to 70% CPU idle while those GPUs are full on.” </p>
<p>41:04 <a href="https://blog.google/innovation-and-ai/technology/developers-tools/gemini-api-docsmcp-agent-skills/">Improve coding agents’ performance with Gemini API Docs MCP and Agent </a><a href="https://blog.google/innovation-and-ai/technology/developers-tools/gemini-api-docsmcp-agent-skills/">Skills.</a></p>
<ul>
<li style="font-weight:400;">Google released two tools to address a core limitation of coding agents: outdated API knowledge due to training data cutoffs. </li>
<li style="font-weight:400;"><a href="https://ai.google.dev/gemini-api/docs/coding-agents#mcp-setup">The Gemini API Docs MCP</a> connects agents to live Gemini API documentation via the Model Context Protocol, while <a href="https://ai.google.dev/gemini-api/docs/coding-agents#available-skills">Gemini API Developer Skills</a> adds best-practice patterns and SDK guidance.</li>
<li style="font-weight:400;">Using both tools together shows measurable improvements in evals, achieving a 96.3% pass rate with 63% fewer tokens per correct answer compared to standard prompting. </li>
<li style="font-weight:400;">The token reduction is worth noting for developers concerned about cost and latency in agentic workflows.</li>
<li style="font-weight:400;">The MCP server is accessible at gemini-api-docs-mcp.dev and works with any MCP-compatible coding agent, making it broadly applicable beyond just Google-native tooling. Setup documentation is available at <a href="https://ai.google.dev/gemini-api/docs/coding-agents">ai.google.dev/gemini-api/docs/coding-agents</a>.</li>
<li style="font-weight:400;">This approach of pairing a live documentation server with a skills layer is a practical pattern that other API providers could adopt, and it highlights a growing need for real-time context injection as AI coding tools become more common in developer workflows.</li>
</ul>
<h2>Azure</h2>
<p>44:09 <a href="https://azure.microsoft.com/en-us/updates?id=559876">Public Preview: Rule impact analysis on Azure Network Watcher </a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/network-watcher/traffic-analytics-rule-impact-analyzer">Azure Network Watcher </a>now offers a public preview feature called Rule Impact Analysis, which lets network admins simulate the effect of security admin rules before actually applying them to their environment, reducing the risk of unintended connectivity disruptions.</li>
<li style="font-weight:400;">The feature is particularly useful for teams managing <a href="https://learn.microsoft.com/en-us/azure/virtual-network-manager/overview">Azure Virtual Network Manager</a> security configurations, as it helps identify rule conflicts and validate that connectivity requirements are met before deployment.</li>
<li style="font-weight:400;">This addresses a common operational pain point where applying network security rules in production environments can cause outages or unexpected behavior that is difficult to roll back quickly.</li>
<li style="font-weight:400;">Target users are network and security engineers in organizations with complex <a href="https://azure.microsoft.com/en-us/get-started/azure-portal/">Azure</a> networking topologies who need a safer change management process for security policy updates.</li>
<li style="font-weight:400;">The feature is currently in public preview, which typically means no additional cost beyond standard Network Watcher pricing, though customers should verify final pricing at general availability via the Azure pricing calculator at azure.microsoft.com/pricing.</li>
</ul>
<p>44:35  Justin – “2026 and we still dealing with rule conflicts and firewalls.” </p>
<p>48:01 <a href="https://www.go-euc.com/azure-vdi-storage-benchmark-premium-ssd-vs-standard-ssd-performance-and-cost-breakdown/">Azure VDI Storage Benchmark: Premium SSD vs Standard SSD </a><a href="https://www.go-euc.com/azure-vdi-storage-benchmark-premium-ssd-vs-standard-ssd-performance-and-cost-breakdown/">Performance and Cost Breakdown </a></p>
<ul>
<li style="font-weight:400;">GO-EUC’s benchmark research comparing <a href="https://learn.microsoft.com/en-us/azure/virtual-machines/disks-types#premium-ssds">Premium SSD</a> and <a href="https://learn.microsoft.com/en-us/azure/virtual-machines/disks-types#standard-ssds">Standard SSD</a> for Azure VDI workloads found that Premium SSD delivers up to 8 times higher IOPS and 80-90% lower latency than Standard SSD, with the performance gap widening as disk size increases.</li>
<li style="font-weight:400;">Standard SSD shows a fixed performance ceiling of roughly 850-980 IOPS regardless of disk size, while Premium SSD scales from about 1800 IOPS at 128GB up to 8100 IOPS at 2048GB, making disk sizing a meaningful architectural lever only for Premium SSD.</li>
<li style="font-weight:400;">The cost comparison is less straightforward than it appears because Standard SSD carries transaction fees that can push its total cost close to Premium SSD pricing under heavy VDI workloads, making Premium SSD a more predictable cost option despite its higher base price.</li>
<li style="font-weight:400;">The 2048GB Premium SSD at $284.94 per month emerges as the recommended sweet spot, since moving to 4096GB costs $545.10 with only marginal performance gains, and at 2500-seat scale that sizing decision translates to over $7.8 million in annual cost difference.</li>
<li style="font-weight:400;">The research used synthetic DiskSpd testing rather than real user load simulation, so results reflect maximum disk capabilities under controlled conditions and may differ from production environments, with GO-EUC noting a load simulation follow-up is planned.</li>
</ul>
<p>49:25  Justin – “It’s not Microsoft, and it’s not something Microsoft paid for and was something done independently, so I approve.” </p>
<h2>Emerging Clouds </h2>
<p>51:26 <a href="https://www.digitalocean.com/blog/now-available-cloud-security-posture-management">Now Available: DigitalOcean Cloud Security Posture Management (CSPM) </a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.digitalocean.com/">DigitalOcean</a> has launched a native <a href="https://www.digitalocean.com/blog/now-available-cloud-security-posture-management">Cloud Security Posture Management tool</a> that continuously evaluates resources like <a href="https://www.digitalocean.com/products/droplets">Droplets</a> and <a href="https://docs.digitalocean.com/products/databases/">Databases</a> for misconfigurations without requiring agents or third-party tools, making it accessible to smaller teams without dedicated security staff.</li>
<li style="font-weight:400;">The tool is built directly into the DigitalOcean dashboard and API, addressing a common pain point where security visibility requires separate tooling and context switching across platforms.</li>
<li style="font-weight:400;">Unlimited free scans are available to all DigitalOcean customers, with advanced rules, automated guidance, and API integrations on upgraded plans, lowering the barrier to entry for basic security posture monitoring.</li>
<li style="font-weight:400;">A feature called <a href="https://docs.digitalocean.com/products/cspm/how-to/use-security-advisor/">Security Advisor</a> adds an AI layer that summarizes findings and surfaces high-priority risks, helping teams focus on the most impactful issues first and reducing alert fatigue.</li>
<li style="font-weight:400;">This offering is positioned toward startups and SMBs running production workloads, including AI inference, who may lack the resources to implement enterprise-grade security tooling but still need consistent visibility into infrastructure risk.</li>
</ul>
<p>52:23  Matt – “It’s definitely a nice feature to give the general developer or security person that might not know the intricacies of DigitalOcean a ‘here’s a read flag go look at this’.” </p>
<h2>After Show</h2>
<p>54:04 <a href="https://isolveproblems.substack.com/p/how-microsoft-vaporized-a-trillion">How Microsoft Vaporized a Trillion Dollars</a></p>
<ul>
<li style="font-weight:400;">The author, a senior Microsoft engineer who rejoined Azure Core in May 2023, discovered on his first day that a 122-person org was seriously planning to port large portions of Windows to a tiny, low-power ARM chip on the Azure Boost accelerator card — a plan he immediately recognized as physically impossible given the hardware constraints.</li>
<li style="font-weight:400;">Nobody at Microsoft could explain why up to 173 agents were needed to manage each Azure node, what they all did, or how they interacted — a sprawl that created enormous fragility in the system orchestrating VMs for OpenAI, government clouds, and other mission-critical workloads.</li>
<li style="font-weight:400;">After the elimination of dedicated testers in 2014 and a talent exodus of original Azure architects, much of the org was staffed by junior engineers with 1–2 years of experience, led by managers without deep systems backgrounds, creating a persistent gap in senior technical leadership.</li>
<li style="font-weight:400;">The node management stack suffered millions of unattributed crashes per month, memory leaks, resource leaks, and “zombie VMs,” with each monthly release introducing more bugs than it fixed and most rollouts ending in panicked rollbacks.</li>
<li style="font-weight:400;">A publicly exposed web server (WireServer) running on the secure host OS held unencrypted tenant data from multiple customers in shared memory caches — a serious security liability in a hostile multi-tenancy environment — while crashing 300,000–500,000 times per month fleet-wide.</li>
<li style="font-weight:400;">Despite public claims at Ignite conferences from 2023–2025 that key components had been offloaded to Azure Boost and rewritten in Rust, the author states that as of late 2024, zero of 64 identified work items had been completed, and work hadn’t started on roughly 60 of them.</li>
<li style="font-weight:400;">“Digital escort sessions” — where $18/hour employees executed commands on production nodes under direction from overseas support staff, including from China — became routine, with nearly 200 JIT access requests per day observed over a two-month period, directly contradicting the original “no human touch” design vision.</li>
<li style="font-weight:400;">The author proposed an incremental componentization strategy to modernize the node stack from first principles — including a cross-platform component model, a new message bus, and security-hardened caches — but lower-level management responded with defensiveness and the org eventually terminated his employment.</li>
<li style="font-weight:400;">The consequences materialized: OpenAI signed an $11.9B deal with CoreWeave in March 2025 and later a $300B deal with Oracle, the Secretary of Defense publicly cited “a breach of trust” with Microsoft, and Microsoft’s stock dropped over 30% from its late-October 2025 peak, erasing more than a trillion dollars in market cap.</li>
<li style="font-weight:400;">The author escalated his concerns in formal letters to the Cloud + AI EVP (November 2024), the CEO (January 2025), and the Board of Directors — all sent before the public unraveling — and received no acknowledgment, reply, or request for clarification from any of them.</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2423813/c1e-2okobqqkr8cm26j0-qdpwwr7rtg1-mjuc2z.mp3" length="74965216"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 350 of The Cloud Pod, where the weather is always cloudy! Justin, Jonathan, and Matt are this week’s hosts, and they’ve scoured the clouds for all the latest news and announcements, including that Mythos drop. Is it the AI apocalypse that everyone is claiming? We’ve also got news from DigitalOcean, an email from Space, Claude and even some Guardrails. There’s a lot to cover, so let’s get started!
Titles we almost went with this week

 Two AIs Walk Into a Studio and Actually Sound Good 
 No More Idle GPUs Twiddling Their Tensor Cores 
 When AWS Availability Zones Become Unavailability Zones
 Token by Token Codex Pricing Finally Makes Cents
 Just Ask AWS Where All Your Money Went
 You’ve Got mTLS: Amazon SES Locks Down Email Security
 Cost Explorer Finally Speaks Plain English
 Missiles Make AWS Multi-Region Strategy Mandatory
 Shell Yeah Your Agent State Now Persists
 S3 Files Finally Lets You ls Your Bucket
 Claude Found Your Zero-Day Before Lunch
 One Guardrail to Rule All Your AWS Accounts
Premium SSD Wins Azure VDI but Your Wallet Cries
 No More Amnesia: Your Bedrock Agent Keeps Its Memories
 Pay Per Claw Anthropic Sharpens Its Pricing Policy
 Even Astronauts Need IT Support for Microsoft Outlook
 AWS still can’t answer the question of what EC2 Other is
 AWS announces several new Unavailability Zones


 A big thanks to this week’s sponsor:
There are a lot of cloud cost management tools out there, but only Archera provides insured commitments. It sounds fancy, but it’s really simple. Archera gives you the cost savings of a 1 or 3-year AWS Savings Plan with a commitment as short as 30 days. If you do not use all the cloud resources you have committed to, Archera will literally cover the difference. Other cost management tools may say they offer “insured commitments”, but remember to ask: Will you actually give me my rebate? Because Archera will. 
Check out thecloudpod.net/archera to schedule a demo today. 
Follow Up
00:45 Ground control to Microsoft: Artemis 2 astronauts deal with Outlook hiccup in deep space

Artemis 2 astronauts aboard NASA’s Orion spacecraft encountered a common Outlook configuration issue on their first day in space, requiring remote IT support from Mission Control to resolve it by reloading the commander’s files.
NASA uses commercial off-the-shelf software like Microsoft Outlook for crew scheduling and personal communications, while keeping primary flight systems on separate radiation-hardened hardware, illustrating a practical separation of concerns in mission-critical environments.
The Outlook issue stemmed from the app having configuration problems when no direct network connection is available, which the flight director noted is not uncommon, raising questions about offline-readiness for software deployed in connectivity-constrained environments.
This incident is a useful reminder for cloud and enterprise software users that applications heavily dependent on network connectivity can...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2423813/c1a-k5d5-9jgnn0r7tn5k-zszpxe.jpg"></itunes:image>
                                                                            <itunes:duration>01:02:29</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2423813/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[349: Gmail Finally Lets You Ditch xXDragonSlayer2004Xx]]>
                </title>
                <pubDate>Wed, 08 Apr 2026 20:06:53 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2416816</guid>
                                    <link>https://tcpfm.castos.com/episodes/349-gmail-finally-lets-you-ditch-xxdragonslayer2004xx</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 349 of The Cloud Pod, where the weather is always cloudy! Justin and Jonathan managed to make it into the studio this week, and they brought a guest! Dave Garaway jas joined us, and brought some on-the-ground knowledge from GTC, plus a slew of supply chain attacks, Gmail username changes and Claude’s code debacle. We’ve got all this and more – so let’s get started! </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> AWS Console Gets a Makeover Nobody Asked For</li>
<li> From Eight Hours to 22 Seconds, Hackers Got Fast</li>
<li> AWS Spring Cleaning Hits Nine Services Hard</li>
<li> Trivy Pursuit Turns Into a 500K Credential Heist</li>
<li> Skip the Consultant, AWS Security Now Hacks Itself</li>
<li> AWS Pen Testing Agent Pokes Your Cloud Around the Clock</li>
<li> Your Cringey Gmail Address Gets a Second Chance</li>
<li> Stop Babysitting Servers, Let Google Handle MCP</li>
<li> AI Agent Untangles Your Kubernetes Networking Spaghetti</li>
<li> One Bad Actor Poisons a Hundred Million Downloads</li>
<li> Lambda Finally Hits the Gym with 32 GB</li>
<li> From GPU Hype to Production Inference Without the Hyperscaler Headache
</li>
</ul>
<h2>Follow Up</h2>
<p>01:28 <a href="https://arstechnica.com/tech-policy/2026/03/hegseth-trump-had-no-authority-to-order-anthropic-to-be-blacklisted-judge-says/">Hegseth, Trump had no authority to order Anthropic to be blacklisted, </a><a href="https://arstechnica.com/tech-policy/2026/03/hegseth-trump-had-no-authority-to-order-anthropic-to-be-blacklisted-judge-says/">judge says</a></p>
<ul>
<li style="font-weight:400;">A US District Judge granted <a href="https://www.anthropic.com/">Anthropic</a> a preliminary injunction blocking the <a href="https://arstechnica.com/tech-policy/2026/03/anthropic-sues-us-over-blacklisting-white-house-calls-firm-radical-left-woke/">Department of War’s blacklisting</a>, ruling the designation was First Amendment retaliation rather than a legitimate national security action.</li>
<li style="font-weight:400;">The court found officials lacked authority to blacklist Anthropic without considering less restrictive alternatives or providing evidence of an urgent security risk, noting the designation was triggered by Anthropic’s “hostile manner through the press.”</li>
<li style="font-weight:400;">The practical business impact was already substantial before the ruling, with three trade deals cancelled and other potential partners delaying negotiations, representing potentially billions in lost contracts over five years.</li>
<li style="font-weight:400;">Anthropic continues to balance the legal fight with maintaining its government relationships, <a href="https://www.anthropic.com/news/where-stand-department-war">publicly emphasizing alignment with the Department of War’s mission</a> around safe AI deployment even while litigating against it.</li>
<li style="font-weight:400;">For cloud and AI vendors, this case establishes a notable precedent around government procurement decisions and First Amendment protections, with implications for how companies publicly challenge federal contracting positions.</li>
</ul>
<p>02:35  Jonathan – “I’m guessing Anthropic is super busy with all the people coming to them for deals right now, because it seems to me that Anthropic is getting all the business customers and OpenAI are getting the personal customers.”  </p>
<p>04:08 <a href="https://delve.co/blog/delve-announces-changes-and-new-customer-support-measures">Delve Announces Changes and New Customer Support Measures </a></p>
<ul>
<li style="font-weight:400;"><a href="https://delve.co/">Delve</a> has <a href="https://delve.co/blog/response-to-misleading-claims">responded to allegations</a> from an anonymous <a href="https://substack.com/">Substack</a> post by denying claims of faked evidence, clarifying that independent AICPA-accredited auditors, not Delve, issue SOC 2 reports and ISO 27001 certifications. </li>
<li style="font-weight:400;">The company published...</li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:07) - The Cloud Pod</li><li>(00:01:37) - Anthropic Wins Preliminary Injunction Against US Blacklist</li><li>(00:04:14) - Delve: We're Not Filling Their Own Audits With</li><li>(00:08:03) - Nvidia's GTC 2017 Announcement</li><li>(00:16:21) - Will Microsoft Fill the Kubernetes Demand?</li><li>(00:24:09) - Are Data Centers Bad for the Environment?</li><li>(00:24:57) - Is the Cloud in a Bubble?</li><li>(00:27:07) - Gmail Lets You Change Your Username Every 12 Months</li><li>(00:28:56) - Supply Chain Hackers Hit</li><li>(00:32:47) - Anthropic's Cloud Code Leaks</li><li>(00:36:09) - Amazon's New Console Enhancements</li><li>(00:37:48) - AWS Lambda: Up to 32 GB of RAM and 16</li><li>(00:40:17) - AWS Security Agents and AWS DevOps Agent</li><li>(00:44:35) - Amazon Bedrock Agent Core Evaluations</li><li>(00:52:55) - Edge Computing: Custom AI Agents</li><li>(00:55:01) - Amazon's Reference Architecture for Building a FinOps Agent Using Amazon Bed</li><li>(00:58:23) - Google's TurboQuant Compresses LLM Data to 3 Bits</li><li>(01:01:08) - Google's Open Source AI playbook for sustainability reporting</li><li>(01:01:51) - Azure Launches AI Agent to Troubleshoot Kuber</li><li>(01:03:12) - Week in the Cloud: Nvidia GTC 2017</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 349 of The Cloud Pod, where the weather is always cloudy! Justin and Jonathan managed to make it into the studio this week, and they brought a guest! Dave Garaway jas joined us, and brought some on-the-ground knowledge from GTC, plus a slew of supply chain attacks, Gmail username changes and Claude’s code debacle. We’ve got all this and more – so let’s get started! 
Titles we almost went with this week

 AWS Console Gets a Makeover Nobody Asked For
 From Eight Hours to 22 Seconds, Hackers Got Fast
 AWS Spring Cleaning Hits Nine Services Hard
 Trivy Pursuit Turns Into a 500K Credential Heist
 Skip the Consultant, AWS Security Now Hacks Itself
 AWS Pen Testing Agent Pokes Your Cloud Around the Clock
 Your Cringey Gmail Address Gets a Second Chance
 Stop Babysitting Servers, Let Google Handle MCP
 AI Agent Untangles Your Kubernetes Networking Spaghetti
 One Bad Actor Poisons a Hundred Million Downloads
 Lambda Finally Hits the Gym with 32 GB
 From GPU Hype to Production Inference Without the Hyperscaler Headache


Follow Up
01:28 Hegseth, Trump had no authority to order Anthropic to be blacklisted, judge says

A US District Judge granted Anthropic a preliminary injunction blocking the Department of War’s blacklisting, ruling the designation was First Amendment retaliation rather than a legitimate national security action.
The court found officials lacked authority to blacklist Anthropic without considering less restrictive alternatives or providing evidence of an urgent security risk, noting the designation was triggered by Anthropic’s “hostile manner through the press.”
The practical business impact was already substantial before the ruling, with three trade deals cancelled and other potential partners delaying negotiations, representing potentially billions in lost contracts over five years.
Anthropic continues to balance the legal fight with maintaining its government relationships, publicly emphasizing alignment with the Department of War’s mission around safe AI deployment even while litigating against it.
For cloud and AI vendors, this case establishes a notable precedent around government procurement decisions and First Amendment protections, with implications for how companies publicly challenge federal contracting positions.

02:35  Jonathan – “I’m guessing Anthropic is super busy with all the people coming to them for deals right now, because it seems to me that Anthropic is getting all the business customers and OpenAI are getting the personal customers.”  
04:08 Delve Announces Changes and New Customer Support Measures 

Delve has responded to allegations from an anonymous Substack post by denying claims of faked evidence, clarifying that independent AICPA-accredited auditors, not Delve, issue SOC 2 reports and ISO 27001 certifications. 
The company published...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[349: Gmail Finally Lets You Ditch xXDragonSlayer2004Xx]]>
                </itunes:title>
                                    <itunes:episode>349</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 349 of The Cloud Pod, where the weather is always cloudy! Justin and Jonathan managed to make it into the studio this week, and they brought a guest! Dave Garaway jas joined us, and brought some on-the-ground knowledge from GTC, plus a slew of supply chain attacks, Gmail username changes and Claude’s code debacle. We’ve got all this and more – so let’s get started! </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> AWS Console Gets a Makeover Nobody Asked For</li>
<li> From Eight Hours to 22 Seconds, Hackers Got Fast</li>
<li> AWS Spring Cleaning Hits Nine Services Hard</li>
<li> Trivy Pursuit Turns Into a 500K Credential Heist</li>
<li> Skip the Consultant, AWS Security Now Hacks Itself</li>
<li> AWS Pen Testing Agent Pokes Your Cloud Around the Clock</li>
<li> Your Cringey Gmail Address Gets a Second Chance</li>
<li> Stop Babysitting Servers, Let Google Handle MCP</li>
<li> AI Agent Untangles Your Kubernetes Networking Spaghetti</li>
<li> One Bad Actor Poisons a Hundred Million Downloads</li>
<li> Lambda Finally Hits the Gym with 32 GB</li>
<li> From GPU Hype to Production Inference Without the Hyperscaler Headache
</li>
</ul>
<h2>Follow Up</h2>
<p>01:28 <a href="https://arstechnica.com/tech-policy/2026/03/hegseth-trump-had-no-authority-to-order-anthropic-to-be-blacklisted-judge-says/">Hegseth, Trump had no authority to order Anthropic to be blacklisted, </a><a href="https://arstechnica.com/tech-policy/2026/03/hegseth-trump-had-no-authority-to-order-anthropic-to-be-blacklisted-judge-says/">judge says</a></p>
<ul>
<li style="font-weight:400;">A US District Judge granted <a href="https://www.anthropic.com/">Anthropic</a> a preliminary injunction blocking the <a href="https://arstechnica.com/tech-policy/2026/03/anthropic-sues-us-over-blacklisting-white-house-calls-firm-radical-left-woke/">Department of War’s blacklisting</a>, ruling the designation was First Amendment retaliation rather than a legitimate national security action.</li>
<li style="font-weight:400;">The court found officials lacked authority to blacklist Anthropic without considering less restrictive alternatives or providing evidence of an urgent security risk, noting the designation was triggered by Anthropic’s “hostile manner through the press.”</li>
<li style="font-weight:400;">The practical business impact was already substantial before the ruling, with three trade deals cancelled and other potential partners delaying negotiations, representing potentially billions in lost contracts over five years.</li>
<li style="font-weight:400;">Anthropic continues to balance the legal fight with maintaining its government relationships, <a href="https://www.anthropic.com/news/where-stand-department-war">publicly emphasizing alignment with the Department of War’s mission</a> around safe AI deployment even while litigating against it.</li>
<li style="font-weight:400;">For cloud and AI vendors, this case establishes a notable precedent around government procurement decisions and First Amendment protections, with implications for how companies publicly challenge federal contracting positions.</li>
</ul>
<p>02:35  Jonathan – “I’m guessing Anthropic is super busy with all the people coming to them for deals right now, because it seems to me that Anthropic is getting all the business customers and OpenAI are getting the personal customers.”  </p>
<p>04:08 <a href="https://delve.co/blog/delve-announces-changes-and-new-customer-support-measures">Delve Announces Changes and New Customer Support Measures </a></p>
<ul>
<li style="font-weight:400;"><a href="https://delve.co/">Delve</a> has <a href="https://delve.co/blog/response-to-misleading-claims">responded to allegations</a> from an anonymous <a href="https://substack.com/">Substack</a> post by denying claims of faked evidence, clarifying that independent AICPA-accredited auditors, not Delve, issue SOC 2 reports and ISO 27001 certifications. </li>
<li style="font-weight:400;">The company published a formal rebuttal and is now rolling out operational changes to address customer concerns.</li>
<li style="font-weight:400;">To support customers facing questions from their own clients and procurement teams, Delve is offering complimentary re-audits through independent auditors, complimentary grey-box penetration tests, and formal engagement letters from auditors, all at no cost.</li>
<li style="font-weight:400;">On the transparency side, Delve is moving auditor communications directly into customer Slack channels or shared email threads, so customers have full visibility into the audit process rather than relying on Delve as an intermediary.</li>
<li style="font-weight:400;">The platform is also adding clearer disclosures to templates and forms to explicitly identify them as guidance tools aligned to industry standards, addressing a core point of confusion raised in the controversy.</li>
<li style="font-weight:400;">For cloud practitioners, this situation highlights the importance of understanding the distinction between compliance automation platforms and the independent auditors who issue attestations, a boundary that procurement teams are increasingly scrutinizing when evaluating vendor security posture.</li>
</ul>
<p>06:12  Justin – “I think the reality is that, and we talked about this last week, is that SOC 2 audits are very heavily templatized. That’s how these companies make them, and they work them. They do need to be edited, reviewed, and approved, and the right things need to be done, but they can’t always start as a template. A template’s not the problem. It’s what appears to be the automation and then the rubber-stamping by these auditors.”</p>
<p>06:39 <a href="https://substack.com/home/post/p-192144506">Delve – Fake Compliance as a Service – Part II – Day 1 of 5</a></p>
<ul>
<li style="font-weight:400;">This article covers allegations against Delve, a compliance automation startup, and represents a follow-up to earlier reporting. It does not directly relate to cloud platform news typically covered on The Cloud Pod, but here are the relevant talking points for context.</li>
<li style="font-weight:400;">A whistleblower from Delve provided internal screenshots and recordings after the initial article, including conversations suggesting the company’s auditing partner, Accorp, may not conduct thorough evidence reviews before issuing SOC 2 reports.</li>
<li style="font-weight:400;">Internal communications indicate Delve built an automated report generation tool, which contradicts the company’s public claim that it does not generate compliance reports on behalf of clients.</li>
<li style="font-weight:400;">Leaked internal notes from Karun Kaushik, dated November 2024, acknowledge that Delve’s platform had not released any new compliance frameworks since January 2025, a period that overlaps with the company’s Series A fundraise, raising questions about the accuracy of investor materials.</li>
<li style="font-weight:400;">Delve has transitioned clients to a new auditing firm called Ezzy and Associates, telling clients they will not need to restart SOC 2 Type 2 observation periods despite the auditor change, which compliance professionals would generally consider irregular, given the reported evidence quality concerns.</li>
<li style="font-weight:400;">For cloud practitioners, this situation is a reminder that compliance automation tools require scrutiny of both the underlying audit processes and the third-party auditors involved, as the validity of certifications like SOC 2 depends on the rigor of evidence collection and review.</li>
</ul>
<p>06:57  Justin – “It’s just getting worse. I don’t know that Delve actually survives this.” </p>
<h2>General News </h2>
<p>08:17 <a href="https://www.exxactcorp.com/blog/news/nvidia-gtc-2026-recap-era-of-tokens-and-inference">NVIDIA GTC 2026 Recap: Tokens &amp; Inference</a></p>
<ul>
<li style="font-weight:400;">Jensen Huang reframed how AI infrastructure ROI should be measured, shifting from raw compute specs to tokens per watt and token speed at a fixed power budget. </li>
<li style="font-weight:400;"><a href="https://www.nvidia.com/en-us/data-center/technologies/rubin/">Vera Rubin</a> is projected to deliver approximately 5x more revenue potential per gigawatt compared to <a href="https://www.nvidia.com/en-us/data-center/technologies/blackwell-architecture/">Blackwell</a>, which has direct implications for how cloud operators and enterprises evaluate hardware investments.</li>
<li style="font-weight:400;">The Vera Rubin platform integrates the acquired <a href="https://www.nvidia.com/en-us/data-center/lpx/">Groq 3 LPX chip</a> alongside the Rubin GPU, with <a href="https://developer.nvidia.com/blog/introducing-nvidia-dynamo-a-low-latency-distributed-inference-framework-for-scaling-reasoning-ai-models/">NVIDIA’s Dynamo</a> software splitting inference workloads between the two chips. This heterogeneous approach delivers 35x more throughput per megawatt for latency-sensitive workloads compared to running Vera Rubin GPUs alone.</li>
<li style="font-weight:400;">NVIDIA introduced <a href="https://docs.openclaw.ai/providers/nvidia">OpenClaw</a>, an open-source agentic AI framework, alongside an enterprise-hardened version called <a href="https://www.nvidia.com/en-us/ai/nemoclaw/">NeMo Claw</a> that adds policy enforcement, network guardrails, and a privacy router to prevent data exfiltration. The security layer addresses a real concern for organizations deploying agents with access to internal infrastructure.</li>
<li style="font-weight:400;">NVIDIA released six domain-specific open model families, including <a href="https://developer.nvidia.com/nemotron">Nemotron</a> for language tasks, <a href="https://docs.nvidia.com/bionemo-framework/latest/">BioNeMo</a> for drug discovery, <a href="https://www.nvidia.com/en-us/ai/cosmos/">Cosmos</a> for robotics simulation, and <a href="https://resources.nvidia.com/en-us-climate/earth-2">Earth2</a> for climate forecasting, positioning these as the foundation for sovereign AI deployments where organizations want to avoid dependence on a small number of external model providers.</li>
<li style="font-weight:400;">The DSX digital twin platform uses <a href="https://www.nvidia.com/en-us/omniverse/">Omniverse</a> to simulate thermal, electrical, and network conditions before a data center is physically built, with NVIDIA estimating roughly a factor of two in recoverable efficiency across a typical AI factory deployment through better design and live operational optimization.</li>
</ul>
<p>09:51  Dave – “Being in technology, that is a great place to go to put your finger on the pulse of where things are.” </p>
<p>27:28 <a href="https://www.digitalocean.com/blog/production-inference-era-nvidia-gtc">GTC 2026 Confirmed It: The Inference Era Is Here </a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.digitalocean.com/">DigitalOcean</a> is positioning itself specifically around production inference workloads, announcing a new <a href="https://www.digitalocean.com/solutions/global-infrastructure">Richmond data center</a> built with <a href="https://www.digitalocean.com/products/gradient/gpu-droplets">NVIDIA HGX B300 systems</a> and a 400 Gbps non-blocking RDMA fabric designed for reasoning and agentic use cases.</li>
<li style="font-weight:400;">The company is bringing <a href="https://www.digitalocean.com/blog/nvidia-dynamo-1-now-available">NVIDIA Dynamo 1.0</a> to its Kubernetes offering and expanding model access for reasoning, long-context, multimodal, and agentic workloads, which addresses the operational complexity developers face when moving AI from experimentation into production.</li>
<li style="font-weight:400;">DigitalOcean reported over 43,000 <a href="https://marketplace.digitalocean.com/apps/openclaw">OpenClaw</a> deployments since launch, suggesting meaningful developer adoption for always-on assistant and agentic application use cases on their platform.</li>
<li style="font-weight:400;">The broader industry signal from <a href="https://www.nvidia.com/gtc/">NVIDIA GTC 2026</a> is that cost per token, time to first token, and uptime are becoming as important as model quality, shifting infrastructure conversations from raw compute to full-system optimization, including CPUs alongside accelerators.</li>
<li style="font-weight:400;">For smaller AI builders and startups, DigitalOcean’s focus on reducing setup friction through tools like <a href="https://www.digitalocean.com/community/tutorials/how-to-set-up-nemoclaw">1-Click Droplets</a> for <a href="https://www.nvidia.com/en-us/ai/nemoclaw/">NemoClaw</a> and direct deployment from <a href="http://build.nvidia.com">build.nvidia.com</a> to Serverless Inference represents a practical alternative to hyperscaler complexity for running agents at scale.</li>
</ul>
<p>27:42  Dave – “They are talking about a bubble – the people I’ve been talking to – but one of the neoclouds I was talking about said, ‘when we get to the point when we don’t have the need, we’re going to start powering the neighborhoods for free, so we’re just going to start giving out power for free’ so hopefully the good neighbor will extend out there.” </p>
<p>28:12 <a href="https://arstechnica.com/gadgets/2026/03/you-can-finally-change-the-goofy-gmail-address-you-chose-years-ago/">You can finally change the goofy Gmail address you chose years ago</a></p>
<ul>
<li style="font-weight:400;">Gmail turns 22 years old on April 1, and Google is <a href="https://blog.google/products-and-platforms/products/workspace/google-account-username-change/">marking the occasion</a> by finally allowing US-based users to change their Gmail username without creating an entirely new account, addressing a long-standing limitation of the platform.</li>
<li style="font-weight:400;">The change is limited to once every 12 months per account, which Google has not formally explained but likely serves as a spam mitigation measure to prevent abuse of the feature.</li>
<li style="font-weight:400;">For cloud and IT professionals managing Google Workspace environments, this raises practical questions around identity management, email routing, and how username changes interact with existing integrations and third-party services tied to a Gmail address.</li>
<li style="font-weight:400;">The feature is rolling out gradually in the US, so not all accounts will see the option immediately, and it remains to be seen when international users outside the initial test group will get access. You can check <a href="http://myaccount.google.com/google-account-email">here</a> to see if the feature is available to you. </li>
<li style="font-weight:400;">This highlights a broader tension in long-lived identity platforms where usernames chosen decades ago become liabilities, and how platforms balance user flexibility with the operational complexity of allowing address changes at scale.</li>
</ul>
<p>30:00 <a href="https://x.com/TFTC21/status/2036561276193866064?s=20">TeamPCP Attack</a></p>
<ul>
<li style="font-weight:400;">On March 19, threat actor group TeamPCP compromised <a href="https://trivy.dev/">Trivy</a>, a widely used open-source vulnerability scanner from <a href="https://www.aquasec.com/">Aqua Security</a>, by injecting credential-stealing malware into 75 GitHub Action tags, Docker images, and CI/CD pipelines, turning the security tool itself into the attack vector.</li>
<li style="font-weight:400;">The malware collected SSH keys, cloud credentials, Kubernetes secrets, and environment files from affected systems, with attackers then using those stolen credentials to pivot into LiteLLM, a Python framework for AI model API management, pushing two malicious versions to PyPI that executed automatically on Python process startup.</li>
<li style="font-weight:400;">The LiteLLM compromise reportedly yielded approximately 500,000 stolen credentials, and the attackers deployed privileged pods across Kubernetes clusters and installed persistent backdoors on nodes, demonstrating how a single supply chain entry point can cascade across entire production environments.</li>
<li style="font-weight:400;">This attack illustrates a notable pattern in modern supply chain compromises where each set of stolen credentials unlocks the next target, moving from CI/CD pipelines to public package repositories to production infrastructure in a deliberate escalation chain.</li>
<li style="font-weight:400;">Organizations relying on open-source security tooling in automated pipelines should audit recent Trivy and LiteLLM usage, check for the specific compromised versions noted, and review whether any credentials or secrets were exposed in affected environments.</li>
</ul>
<p>Con’t <a href="https://www.aquasec.com/blog/trivy-supply-chain-attack-what-you-need-to-know/">Update: Ongoing Investigation and Continued Remediation</a></p>
<ul>
<li style="font-weight:400;">The Trivy supply chain attack began in late February 2026 when attackers exploited a GitHub Actions misconfiguration to extract a privileged access token, then used residual credentials after an incomplete rotation to publish malicious artifacts on March 19, affecting version 0.69.4 and 76 of 77 trivy-action version tags.</li>
<li style="font-weight:400;">The attack’s most notable technique was force-pushing existing version tags to point at malicious commits, meaning CI/CD pipelines referencing those tags continued running without any visible indication of change, while the payload silently exfiltrated cloud credentials, SSH keys, Kubernetes tokens, and other secrets before legitimate scanning logic executed.</li>
<li style="font-weight:400;">Any organization that ran affected versions during the compromise window should treat all secrets accessible to those pipeline environments as exposed and rotate them immediately, including cloud provider credentials, container registry tokens, Git credentials, and NPM publish tokens, which researchers confirmed are being actively weaponized across the NPM ecosystem.</li>
<li style="font-weight:400;">The core hardening lesson from this incident is to pin GitHub Actions to full immutable commit SHA hashes rather than mutable version tags, since version tags can be silently redirected to malicious code without any workflow changes on the consumer side.</li>
<li style="font-weight:400;"><a href="https://www.aquasec.com/">Aqua’s</a> commercial platform was isolated from the compromise because it uses a separate build system with no shared GitHub infrastructure, CI/CD pipelines, or signing systems, and its controlled integration process meant the malicious release was never incorporated into commercial products.</li>
</ul>
<p>30:51 <a href="https://techcrunch.com/2026/03/31/hacker-hijacks-axios-open-source-project-used-by-millions-to-push-malware/">Hacker hijacks Axios open-source project, used by millions, to push </a><a href="https://techcrunch.com/2026/03/31/hacker-hijacks-axios-open-source-project-used-by-millions-to-push-malware/">malware</a></p>
<ul>
<li style="font-weight:400;">A hacker compromised a maintainer account for the Axios JavaScript <a href="https://www.npmjs.com/package/axios">library on npm</a>, pushing malicious versions that included a remote access trojan targeting Windows, macOS, and Linux users. </li>
<li style="font-weight:400;">Axios <a href="https://security.snyk.io/package/npm/axios#:~:text=WEEKLY%20DOWNLOADS%20(100.3M)">receives over 100 million weekly downloads</a>, making the potential exposure substantial.</li>
<li style="font-weight:400;">The attack window was approximately three hours before being detected and stopped, but security firm Aikido advises anyone who downloaded Axios during that period to treat their system as compromised. The self-deleting malware complicates <a href="https://www.stepsecurity.io/blog/axios-compromised-on-npm-malicious-versions-drop-remote-access-trojan#:~:text=had%20been%20live%20for%20approximately%202%20hours%2053%20minutes">forensic investigation and detection</a>.</li>
<li style="font-weight:400;">Account takeover was the entry point here, with the attacker replacing the legitimate maintainer’s email to delay recovery. This highlights how a single compromised developer credential can weaponize a widely trusted package against an entire downstream ecosystem.</li>
<li style="font-weight:400;">This is another example of a software <a href="http://techcrunch.com/2022/11/29/software-supply-chain-security-is-broader-than-solarwinds-and-log4j/">supply chain attack</a>, a pattern that has affected <a href="https://techcrunch.com/2020/12/21/after-the-fireeye-and-solarwinds-breaches-whats-your-failsafe/">SolarWinds</a>, <a href="https://techcrunch.com/2021/12/10/apple-icloud-twitter-and-minecraft-vulnerable-to-ubiquitous-zero-day-exploit/">Log4j</a>, and <a href="http://polyfill.io">Polyfill.io</a> in recent years. Developers and security teams should be reviewing dependency monitoring practices and considering tools that detect unexpected package version changes automatically.</li>
<li style="font-weight:400;">For cloud-focused teams, any CI/CD pipeline or serverless function that auto-installs npm dependencies without version pinning or integrity checks is a potential exposure point. Locking dependency versions and using tools like StepSecurity or Aikido for supply chain monitoring are practical mitigations worth discussing.</li>
</ul>
<p>31:49  Jonathan – “I just can’t believe how much trust, blind trust, dumb trust, if you want to call it that, is involved in an awful lot of open source projects. I mean, the entirety of PyPy – I’ve got a module on PyPy – I could commit some bad code to my repo in 15 minutes; if somebody installs my package, it’s going to run. I’m not aware of a great deal of security checks that happen automatically on the backend there, but that entire ecosystem is built on trust. It’s not good at all.”</p>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>34:06 <a href="https://arstechnica.com/ai/2026/03/entire-claude-code-cli-source-code-leaks-thanks-to-exposed-map-file/">Entire Claude Code CLI source code leaks thanks to exposed map file</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/">Anthropic</a> accidentally shipped <a href="https://claude.com/product/claude-code">Claude Code</a> npm version 2.1.88 with an exposed source map file, revealing nearly 2,000 TypeScript files and over 512,000 lines of code for the CLI tool. </li>
<li style="font-weight:400;">Anthropic confirmed it was a packaging error, not a security breach, and stated that no customer data or credentials were exposed.</li>
<li style="font-weight:400;">The leaked code has already been archived, posted to a public GitHub repository, and forked tens of thousands of times, meaning the codebase is effectively public regardless of any takedown efforts. </li>
<li style="font-weight:400;">This gives competitors and developers a detailed look at how Anthropic built its agentic coding tool.</li>
<li style="font-weight:400;">Developers analyzing the code have surfaced technical details about Claude Code’s memory architecture, including background memory rewriting and memory validity verification steps. </li>
<li style="font-weight:400;">These implementation details were previously undocumented and give insight into how the tool manages context across long coding sessions.</li>
<li style="font-weight:400;">For cloud developers and teams evaluating AI coding tools, the leak provides an unusually transparent view into the engineering decisions behind a production agentic CLI, which could inform how teams build or evaluate similar tooling. It also raises a practical reminder about source map hygiene in npm package publishing pipelines.</li>
</ul>
<p>35:51  Jonathan – “The question is, did you really need the unobfuscated source code anyway? You’ve got AI tools. You can literally point Claude at it and say, hey, how does this work? I know because I did it a year ago.”</p>
<h2>AWS</h2>
<p>37:39 <a href="https://aws.amazon.com/blogs/aws/customize-your-aws-management-console-experience-with-visual-settings-including-account-color-region-and-service-visibility/">Customize your AWS Management Console experience with visual settings </a><a href="https://aws.amazon.com/blogs/aws/customize-your-aws-management-console-experience-with-visual-settings-including-account-color-region-and-service-visibility/">including account color, region and service visibility</a></p>
<ul>
<li style="font-weight:400;">AWS introduced <a href="https://docs.aws.amazon.com/awsconsolehelpdocs/latest/gsg/getting-started-uxc.html?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">User Experience Customization (UXC)</a> in August 2025 and is now expanding it with the ability to hide unused Regions and services from the console, reducing visual clutter for teams working in scoped environments.</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/about-aws/whats-new/2025/08/aws-management-console-assigning-color-aws-account/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">Account color coding</a> is a practical multi-account management tool, letting administrators assign colors like red for production and orange for development to reduce the risk of accidental changes in the wrong environment.</li>
<li style="font-weight:400;">The visibility settings are cosmetic only and do not restrict access via <a href="https://aws.amazon.com/cli/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">AWS CLI</a>, <a href="https://builder.aws.com/build/tools?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">SDKs</a>, or APIs, so teams should not confuse this with a security or governance control like Service Control Policies.</li>
<li style="font-weight:400;">Administrators can manage these settings programmatically using a new AWS CloudFormation resource type AWS::UXC::AccountCustomization with visibleServices and visibleRegions parameters, making it deployable at scale across accounts.</li>
<li style="font-weight:400;">There is no additional cost mentioned for UXC customization features, and they are available today in the <a href="https://console.aws.amazon.com/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">AWS Management Console</a> with configuration options accessible through the unified settings gear icon.</li>
</ul>
<p>39:26 <a href="https://aws.amazon.com/about-aws/whats-new/2026/03/lambda-32-gb-memory-16-vcpus/">AWS Lambda supports up to 32 GB of memory and 16 vCPUs for Lambda </a><a href="https://aws.amazon.com/about-aws/whats-new/2026/03/lambda-32-gb-memory-16-vcpus/">Managed Instances</a></p>
<ul>
<li style="font-weight:400;">Lambda Managed Instances now supports up to 32 GB of memory and 16 vCPUs, tripling the previous limits of 10 GB and roughly 6 vCPUs, which opens the door for workloads like media transcoding, large-scale data processing, and scientific simulations to run serverlessly.</li>
<li style="font-weight:400;">A notable addition here is the configurable memory-to-vCPU ratio at 2:1, 4:1, or 8:1, giving developers actual control over resource balance rather than the fixed proportional scaling that standard Lambda has always used.</li>
<li style="font-weight:400;">Lambda Managed Instances run functions on managed EC2 instances with built-in routing, load balancing, and auto-scaling, so customers get specialized compute configurations, including the latest-generation processors and high-bandwidth networking without taking on operational overhead.</li>
<li style="font-weight:400;">Pricing will be worth watching closely since Lambda Managed Instances sit in a different cost tier than standard Lambda, and teams should evaluate whether the compute gains justify the cost difference compared to running equivalent workloads on ECS or EKS.</li>
<li style="font-weight:400;">Configuration is available through the AWS Console, CLI, CloudFormation, CDK, and SAM in all regions where Lambda Managed Instances are generally available, so adoption fits into existing infrastructure-as-code workflows without requiring new tooling.</li>
</ul>
<p>40:18  Jonathan – “Lambda’s already pretty cheap to begin with, though. I wonder quite how much they could charge for managing the control plane, and are you still paying for the compute? Not a lot, I would think. Maybe they charge per host, or a small fixed fee per invocation, or something. It’s going to be interesting.”</p>
<p>41:56 <a href="https://aws.amazon.com/blogs/machine-learning/aws-launches-frontier-agents-for-security-testing-and-cloud-operations/">AWS launches frontier agents for security testing and cloud operations | </a><a href="https://aws.amazon.com/blogs/machine-learning/aws-launches-frontier-agents-for-security-testing-and-cloud-operations/">Artificial Intelligence</a></p>
<ul>
<li style="font-weight:400;">AWS has launched two generally available frontier agents: <a href="https://aws.amazon.com/security-agent/">AWS Security Agent</a> for autonomous penetration testing and <a href="https://aws.amazon.com/devops-agent/">AWS DevOps Agent</a> for incident resolution and SRE tasks. </li>
<li style="font-weight:400;">These differ from typical AI assistants in that they operate independently for hours or days without constant human direction to complete complex, multi-step workflows.</li>
<li style="font-weight:400;">AWS Security Agent ingests source code, architecture diagrams, and documentation to identify attack chains that traditional scanners miss, compressing penetration testing timelines by over 90% according to early customers. This shifts pen testing from a periodic, cost-constrained activity to an on-demand capability available 24/7 across an entire application portfolio.</li>
<li style="font-weight:400;">AWS DevOps Agent integrates with a broad set of existing tools, including <a href="https://docs.aws.amazon.com/cloudwatch/">CloudWatch</a>, <a href="https://www.datadoghq.com/">Datadog</a>, <a href="https://www.dynatrace.com/">Dynatrace</a>, <a href="https://www.splunk.com/">Splunk</a>, GitHub, and <a href="https://azure.microsoft.com/en-us/products/devops/">Azure DevOps</a>, making it usable across multicloud and on-premises environments. Preview customers report up to 75% lower MTTR and 94% root cause accuracy, with WGU cutting one incident resolution from two hours to 28 minutes.</li>
<li style="font-weight:400;">The DevOps Agent can work alongside tools like Kiro and Claude Code to not only identify root causes but generate validated fixes that feed back into CI/CD pipelines, moving the capability beyond investigation into actual remediation.</li>
<li style="font-weight:400;">Pricing details are not specified in the announcement, so teams evaluating these services should check the AWS Security Agent and AWS DevOps Agent product pages directly for current cost information before planning adoption.</li>
</ul>
<p>43:07  Jonathan – “Let me just scratch DevOps off my list of potential jobs.” </p>
<p>46:22 <a href="https://aws.amazon.com/about-aws/whats-new/2026/03/agentcore-evaluations-generally-available/">Amazon Bedrock AgentCore Evaluations is now generally available</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/bedrock-agentcore/latest/devguide/evaluations.html">Amazon Bedrock AgentCore Evaluations</a> is now generally available, offering automated quality assessment for AI agents through two modes: online evaluation that continuously samples and scores live production traffic, and on-demand evaluation that plugs into CI/CD pipelines for regression testing.</li>
<li style="font-weight:400;">The service ships with 13 built-in evaluators covering response quality, safety, task completion, and tool usage, reducing the need for teams to build custom scoring logic from scratch before they can start measuring agent behavior.</li>
<li style="font-weight:400;">For teams with domain-specific needs, custom evaluators can be configured using your own prompts and model choice for LLM-based scoring, or implemented as Python or JavaScript functions hosted in <a href="https://aws.amazon.com/lambda/">Lambda</a> for code-based evaluation logic.</li>
<li style="font-weight:400;">Ground Truth support lets developers measure agents against reference answers, behavioral assertions at the session level, and expected tool execution sequences, giving teams a structured way to define and validate what correct agent behavior actually looks like.</li>
<li style="font-weight:400;">AgentCore Evaluations integrates with AgentCore Observability for unified monitoring and real-time alerts, and is available across nine <a href="https://aws.amazon.com/about-aws/global-infrastructure/regions_az/">AWS regions</a>, including US East, US West, multiple Asia Pacific regions, and two European regions. Pricing details are not specified in the announcement, so check the AWS pricing page for current costs.</li>
</ul>
<p>47:17  Justin – “I like the idea of this, but then if you’re continuously monitoring it and it degrades, what do you do? What’s step two? Like, we detected it, cool, now what?”</p>
<p>57:10 <a href="https://aws.amazon.com/blogs/machine-learning/build-a-finops-agent-using-amazon-bedrock-agentcore/">Build a FinOps agent using Amazon Bedrock AgentCore</a></p>
<ul>
<li style="font-weight:400;">AWS published a reference architecture for building a FinOps agent using <a href="https://aws.amazon.com/bedrock/agentcore/">Amazon Bedrock AgentCore</a> that consolidates data from <a href="https://aws.amazon.com/aws-cost-management/aws-cost-explorer/">Cost Explorer</a>, <a href="https://aws.amazon.com/aws-cost-management/aws-budgets/">AWS Budgets</a>, and <a href="https://aws.amazon.com/compute-optimizer/">Compute Optimizer</a> into a single conversational interface, giving finance teams natural language access to cost analysis without navigating multiple consoles.</li>
<li style="font-weight:400;">The solution uses five <a href="https://aws.amazon.com/cdk/">CDK</a> stacks to wire together <a href="https://docs.aws.amazon.com/bedrock-agentcore/latest/devguide/agents-tools-runtime.html">AgentCore Runtime</a>, <a href="https://docs.aws.amazon.com/bedrock-agentcore/latest/devguide/gateway.html">Gateway</a>, <a href="https://docs.aws.amazon.com/bedrock-agentcore/latest/devguide/memory.html">Memory</a>, and <a href="https://docs.aws.amazon.com/bedrock-agentcore/latest/devguide/identity.html">Identity</a> components alongside the <a href="https://strandsagents.com/%5C">Strands Agent SDK</a> and <a href="https://aws.amazon.com/solutions/guidance/deploying-model-context-protocol-servers-on-aws/">Model Context Protocol</a> servers, showing how these newer AgentCore building blocks fit together in a production-style deployment that takes roughly 15-20 minutes to stand up.</li>
<li style="font-weight:400;">AgentCore Memory retains 30 days of conversation context, which means users can ask follow-up questions like “what about the second one?” without re-explaining prior context, a practical improvement for teams doing iterative cost investigations.</li>
<li style="font-weight:400;">The architecture transforms open-source AWS Labs MCP servers from stdio transport to streamable HTTP, builds them as <a href="https://aws.amazon.com/ec2/graviton/">ARM64 Graviton</a> container images, and hosts them on AgentCore Runtime with JWT authorization, which is a useful pattern for teams looking to adapt existing MCP tooling for hosted agent environments.</li>
<li style="font-weight:400;">Pricing for this solution involves multiple services, including Bedrock model inference with <a href="https://www.anthropic.com/news/claude-sonnet-4-5">Claude Sonnet 4.5</a>, AgentCore Runtime and Memory, <a href="https://aws.amazon.com/cognito/">Cognito</a>, <a href="https://aws.amazon.com/codebuild/">CodeBuild</a>, and <a href="https://aws.amazon.com/ecr/">ECR</a>, so costs will vary based on query volume and conversation history retention rather than a flat rate.</li>
</ul>
<p>58:31  Dave – “I can’t wait to kick the tires on that one!” </p>
<p>59:03 <a href="https://aws.amazon.com/blogs/machine-learning/building-an-ai-powered-system-for-compliance-evidence-collection/">Building an AI-powered system for compliance evidence collection</a></p>
<ul>
<li style="font-weight:400;">AWS published a reference architecture for automating compliance evidence collection using <a href="https://aws.amazon.com/bedrock/">Amazon Bedrock</a> with the <a href="https://aws.amazon.com/nova/">Amazon Nova 2 Lite</a> model and a browser extension for <a href="http://chrome">Chrome</a> and <a href="https://www.firefox.com/en-US/?redirect_source=mozilla-org">Firefox</a>. </li>
<li style="font-weight:400;">The solution replaces manual screenshot workflows by executing pre-defined JSON workflows that navigate web applications, capture timestamped screenshots, and store organized evidence in S3.</li>
<li style="font-weight:400;">The AI layer operates in three modes: chat for ad-hoc compliance questions, designer mode for generating workflow JSON from uploaded compliance documents, and report generation mode that produces an HTML report delivered via Amazon SES after workflow completion.</li>
<li style="font-weight:400;">Authentication uses <a href="https://aws.amazon.com/cognito/">Amazon Cognito</a> with <a href="https://docs.aws.amazon.com/STS/latest/APIReference/welcome.html">AWS STS</a> to provide scoped, least-privilege credentials to the browser extension, meaning the extension only gets access to Bedrock, S3, and <a href="https://aws.amazon.com/ses/">SES</a> rather than broad account permissions.</li>
<li style="font-weight:400;">The entire infrastructure deploys via a single CloudFormation template that creates the Cognito user pool, identity pool, S3 bucket with encryption and versioning, IAM roles, and Lambda functions in minutes. The sample code is available at the aws-samples GitHub repository.</li>
<li style="font-weight:400;">Costs will vary based on Amazon Bedrock Nova 2 Lite inference usage, S3 storage for screenshots and reports, and SES sending volume, so organizations with frequent audit cycles should model their expected workflow execution frequency before deploying at scale.</li>
</ul>
<p>1:00:00  Jonathan – “Screenshots? Why are we using screenshots in 2026?” </p>
<h2>GCP</h2>
<p>1:00:46  <a href="https://research.google/blog/turboquant-redefining-ai-efficiency-with-extreme-compression/">TurboQuant: Redefining AI efficiency with extreme compression</a></p>
<ul>
<li style="font-weight:400;"><a href="https://research.google/">Google Research</a> has published TurboQuant, a <a href="https://en.wikipedia.org/wiki/Vector_quantization">vector quantization</a> algorithm that compresses LLM <a href="https://huggingface.co/blog/not-lain/kv-caching">key-value cache</a> data to as low as 3 bits without requiring model retraining or fine-tuning, while maintaining accuracy on standard benchmarks like LongBench and Needle In A Haystack using <a href="https://deepmind.google/models/gemma/gemma-4/">Gemma</a> and <a href="https://mistral.ai/">Mistral</a> models.</li>
<li style="font-weight:400;">The core technical approach combines two sub-algorithms: <a href="https://arxiv.org/abs/2502.02617">PolarQuant</a>, which converts vectors to polar coordinates to eliminate normalization overhead, and <a href="https://dl.acm.org/doi/10.1609/aaai.v39i24.34773">QJL (Quantized Johnson-Lindenstrauss)</a>, which uses a single sign bit per value to achieve zero memory overhead error correction.</li>
<li style="font-weight:400;">Performance results show 4-bit TurboQuant achieves up to 8x speedup in computing attention logits compared to 32-bit unquantized keys on H100 GPUs, and reduces key-value memory footprint by at least 6x, which is relevant for teams running inference at scale.</li>
<li style="font-weight:400;">For vector search use cases, <a href="https://arxiv.org/abs/2504.19874">TurboQuant</a> outperforms existing methods like PQ and RabbiQ on recall ratios without requiring dataset-specific tuning or large codebooks, making it a practical option for semantic search systems operating over billions of vectors.</li>
<li style="font-weight:400;">Google notes this research applies directly to Gemini’s key-value cache bottlenecks and large-scale search infrastructure, though no specific GCP product integration or pricing details have been announced alongside the research publication.</li>
</ul>
<p>1:02:28  Jonathan – “What’s funny about this whole technology is that the video game industry has been using exactly the same algorithms for 25 years. And this is just a new application of the same technology. It’s kind of funny. Hey guys, we’ve got a new paper out!”</p>
<p>1:03:41 <a href="https://cloud.google.com/blog/topics/sustainability/ai-tools-for-sustainable-infrastructure-and-reporting/">AI Tools for Sustainable Infrastructure and Reporting</a></p>
<ul>
<li style="font-weight:400;">Google published an <a href="https://sustainability.google/reports/ai-playbook-for-sustainability-reporting/">open-source AI playbook</a> for sustainability reporting, documenting how they used Gemini to cross-reference environmental claims against internal policies and <a href="https://notebooklm.google.com/notebook/62e5c8db-3dd2-407c-8d19-32ae4ae799db">NotebookLM</a> to turn their static <a href="https://sustainability.google/google-2025-environmental-report/">Environmental Report</a> into a queryable knowledge base. </li>
<li style="font-weight:400;">The playbook includes specific prompts and lessons learned, making it a practical resource for teams building similar workflows.</li>
<li style="font-weight:400;"><a href="https://sustainability.equinix.com/">Equinix</a> built a sustainability data lake in <a href="https://cloud.google.com/bigquery">BigQuery</a> that automatically ingests data from 240+ global sites, reducing their reporting cycle from weeks of manual spreadsheet work to on-demand insights. This was driven by a 46% year-over-year increase in customer sustainability data requests, which made manual processes unsustainable at scale.</li>
<li style="font-weight:400;">The Equinix case illustrates a cost and efficiency argument for serverless architecture, where moving to BigQuery eliminated idle compute resources, reduced energy consumption, and improved performance per watt. Google frames this as a triple win of price, performance, and environmental footprint.</li>
<li style="font-weight:400;">Google is connecting this work to their <a href="https://cloud.google.com/transform/betting-on-efficient-ai-the-4-ms?e=48754805">Well-Architected Framework sustainability pillar, using a 4Ms model</a> covering Machine, Model, Mechanization, and Map as a structured approach for customers designing efficient AI and data infrastructure. </li>
<li style="font-weight:400;">The WAF sustainability pillar documentation is available <a href="https://docs.cloud.google.com/architecture/framework/sustainability">here</a>.</li>
<li style="font-weight:400;">The practical takeaway for GCP customers is that sustainability reporting can shift from a manual compliance exercise to a data product with strategic value, particularly for organizations managing large real estate or infrastructure footprints where energy and resource data is already being collected across many sites.</li>
</ul>
<h2>Azure</h2>
<p>1:04:36 <a href="https://azure.microsoft.com/en-us/updates?id=557887">Public Preview: AI Agent for container networking troubleshooting</a></p>
<ul>
<li style="font-weight:400;"><a href="https://azure.microsoft.com/en-us">Azure</a> has launched a public preview of an AI agent designed to help engineers troubleshoot Kubernetes networking issues through a lightweight web-based interface, addressing the common problem of logs and metrics being scattered across multiple tools.</li>
<li style="font-weight:400;">The core value here is reducing manual correlation work during incidents, where engineers typically have to jump between <a href="https://kubernetes.io/docs/reference/kubectl/generated/kubectl/">kubectl</a>, <a href="https://azure.microsoft.com/en-us/products/monitor/?ef_id=_k_aac6810bef481d333049e8de1a7565a7_k_&amp;OCID=AIDcmm5edswduu_SEM__k_aac6810bef481d333049e8de1a7565a7_k_&amp;msclkid=aac6810bef481d333049e8de1a7565a7">Azure Monitor</a>, and other diagnostics tools to piece together what went wrong in a cluster network.</li>
<li style="font-weight:400;">This fits into Microsoft’s broader push to embed AI assistance directly into operational workflows rather than requiring engineers to leave their environment and consult separate documentation or support channels.</li>
<li style="font-weight:400;">Target users are platform and DevOps engineers running containerized workloads on Azure Kubernetes Service who deal with networking incidents and want faster root cause identification without deep networking expertise.</li>
<li style="font-weight:400;">The feature is currently in public preview, so pricing details are not yet confirmed, and teams should evaluate it with that in mind before building it into critical incident response workflows. More details are available at the Azure Updates page at azure.microsoft.com/en-us/updates with ID 557887.</li>
</ul>
<p>1:05:33  Dave – “Well, my first thought on this is that if most teams, at least that I’ve built, are already pulling all that data in there and finding a way to correlate the data and we resolve those issues quicker. So good for them for just automating that.”</p>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2416816/c1e-5rkrb71843hq6xko-ww4zzgjvh9gp-0sqnto.mp3" length="124523984"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 349 of The Cloud Pod, where the weather is always cloudy! Justin and Jonathan managed to make it into the studio this week, and they brought a guest! Dave Garaway jas joined us, and brought some on-the-ground knowledge from GTC, plus a slew of supply chain attacks, Gmail username changes and Claude’s code debacle. We’ve got all this and more – so let’s get started! 
Titles we almost went with this week

 AWS Console Gets a Makeover Nobody Asked For
 From Eight Hours to 22 Seconds, Hackers Got Fast
 AWS Spring Cleaning Hits Nine Services Hard
 Trivy Pursuit Turns Into a 500K Credential Heist
 Skip the Consultant, AWS Security Now Hacks Itself
 AWS Pen Testing Agent Pokes Your Cloud Around the Clock
 Your Cringey Gmail Address Gets a Second Chance
 Stop Babysitting Servers, Let Google Handle MCP
 AI Agent Untangles Your Kubernetes Networking Spaghetti
 One Bad Actor Poisons a Hundred Million Downloads
 Lambda Finally Hits the Gym with 32 GB
 From GPU Hype to Production Inference Without the Hyperscaler Headache


Follow Up
01:28 Hegseth, Trump had no authority to order Anthropic to be blacklisted, judge says

A US District Judge granted Anthropic a preliminary injunction blocking the Department of War’s blacklisting, ruling the designation was First Amendment retaliation rather than a legitimate national security action.
The court found officials lacked authority to blacklist Anthropic without considering less restrictive alternatives or providing evidence of an urgent security risk, noting the designation was triggered by Anthropic’s “hostile manner through the press.”
The practical business impact was already substantial before the ruling, with three trade deals cancelled and other potential partners delaying negotiations, representing potentially billions in lost contracts over five years.
Anthropic continues to balance the legal fight with maintaining its government relationships, publicly emphasizing alignment with the Department of War’s mission around safe AI deployment even while litigating against it.
For cloud and AI vendors, this case establishes a notable precedent around government procurement decisions and First Amendment protections, with implications for how companies publicly challenge federal contracting positions.

02:35  Jonathan – “I’m guessing Anthropic is super busy with all the people coming to them for deals right now, because it seems to me that Anthropic is getting all the business customers and OpenAI are getting the personal customers.”  
04:08 Delve Announces Changes and New Customer Support Measures 

Delve has responded to allegations from an anonymous Substack post by denying claims of faked evidence, clarifying that independent AICPA-accredited auditors, not Delve, issue SOC 2 reports and ISO 27001 certifications. 
The company published...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2416816/c1a-k5d5-xxkvvznmsx0k-7knxu4.jpg"></itunes:image>
                                                                            <itunes:duration>01:04:27</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2416816/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[348: Compliance Theater Now Available as a Subscriptions]]>
                </title>
                <pubDate>Thu, 02 Apr 2026 05:54:07 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2412061</guid>
                                    <link>https://tcpfm.castos.com/episodes/348-compliance-theater-now-available-as-a-subscriptions</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 348 of The Cloud Pod, where the weather is always cloudy! Justin, Ryan, and Matt are in the studio this week to bring you all the latest news in AI and Cloud, inclduing Strykers troubles, AWS’ birthday, Bedrock Agents, and Claude Code – plus so much more. Let’s get started! </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> SOC 2 It to Me Delve Fires Back </li>
<li> Shell Yeah Bedrock Agents Just Got Command Line Powers</li>
<li>When Your SOC 2 Report Is Just Fan Fiction</li>
<li> uv, Ruff, and ty Walk Into an OpenAI Acquisition</li>
<li> Hash Field Expiration Is Here, and It’s No Redis Herring</li>
<li> Stop Paying Full Price for Tokens You Already Bought</li>
<li> Fake It Till You Audit It</li>
<li> Cache Me If You Can CNCF Sandbox Edition</li>
<li> Microsoft Learns Consent Matters in Copilot Rollout</li>
<li> Microsoft’s Stinky Cloud Gets Federal Seal of Approval</li>
<li> When Your Audit Trail Leads to a Blog Fight</li>
<li> Ping Your AI Agent on Discord Like a Millennial</li>
<li> Twenty Years of AWS and the Bill Never Stops</li>
<li>The LLM hack that feels a lot like Node Shift Left Package issues</li>
<li> Claude Code Auto Mode Lets AI Work Unsupervised</li>
<li> Stop Babysitting Your AI Claude Code Goes Solo</li>
<li> Auto Mode Gives Claude Code the Keys to the Car</li>
<li> Java comes to the coffee shop with AI</li>
</ul>
<h2>General News </h2>
<p>01:21 <a href="https://www.stryker.com/us/en/about/news/2026/a-message-to-our-customers-03-2026.html">Customer Updates: Stryker Network Disruption </a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.stryker.com/us/en/index.html">Stryker</a> confirmed a cyberattack on March 11, 2026, that disrupted their internal <a href="https://www.microsoft.com/en-us">Microsoft</a> corporate environment, affecting order processing, manufacturing, and shipping, but notably not their connected medical devices or cloud-hosted products.</li>
<li style="font-weight:400;">The attack vector was specific to Stryker’s Microsoft environment, which meant products running on <a href="https://aws.amazon.com/">AWS</a> (<a href="https://www.stryker.com/us/en/smart-care/products/vocera-edge.html">Vocera Edge</a>, <a href="https://www.stryker.com/us/en/smart-care/products/vocera-ease.html">Vocera Ease</a>) and <a href="https://cloud.google.com/">Google Cloud</a> Platform (care.ai) were architecturally isolated and unaffected, demonstrating a practical benefit of multi-cloud separation.</li>
<li style="font-weight:400;">Stryker explicitly stated this was not ransomware or malware, and government agencies, including CISA, FBI, and the White House National Cyber Director, were engaged, with domain seizures linked to threat actors already executed.</li>
<li style="font-weight:400;">The incident highlights how healthcare organizations can architect medical device and cloud product infrastructure to be independent of corporate IT environments, as every product from <a href="https://www.stryker.com/us/en/joint-replacement/systems/Mako_SmartRobotics_Overview.html">Mako</a> to <a href="https://www.stryker.com/us/en/surgical-technologies/products/surgicount-safety-sponge-system/surgicount-index.html">SurgiCount</a> to <a href="https://www.stryker.com/us/en/emergency-care/products/lifepak-35.html">LIFEPAK</a> operated normally due to network segmentation.</li>
<li style="font-weight:400;">Real-world patient impact was limited but present, with some personalized implant cases rescheduled due to shipping delays, underscoring that even contained corporate IT incidents can have downstream effects on physical supply chains.</li>
</ul>
<p>02:30  Justin – “HugOps to the entire Stryker team; I couldn’t imagine having to rebuild my entire Windows estate at a company the size of Stryker in the middle of trying to do business and everything else.” </p>
<p>05:00 <a href="https://arstechnica.com/information-technology/2026/03/federal-cyber-experts-called-microsofts-cloud-a-pile-of-shit..."></a></p>
<h3>Chapters</h3>
<ul><li>(00:00:00) - Episode 348</li><li>(00:01:31) - Stryker Attack: How to Survive an Attack</li><li>(00:05:09) - Critics: Microsoft Cloud Was a Pile of SHIT</li><li>(00:06:50) - Dell Says Delve is a Fraudster and Should Be Removed</li><li>(00:14:12) - Dell vs Delve: The Smear</li><li>(00:18:50) - Light LLM: Supply Chain Attack</li><li>(00:23:04) - Kubernetes: Open sourcing the GKE Cluster Autos</li><li>(00:25:59) - Kubecon 2018: Azure Kubernetes Networking</li><li>(00:27:48) - Snowflake Announces Project Snow AI Platform</li><li>(00:29:36) - Codex to Acquire Astral</li><li>(00:32:15) - Cloud Code 2.8: Connect to Telegram and Discord</li><li>(00:34:54) - Cloud Cowork Launches Computer Use Capabilities</li><li>(00:37:46) - OpenClaw: Auto-Mode for Cloud Code & Research</li><li>(00:40:31) - Amazon Bedrock Agent Core: Invoke Agent Runtime with a Shell</li><li>(00:42:03) - Amazon EC2 Scanning with Chain Guard</li><li>(00:46:03) - AWS Turns 20 Years Old</li><li>(00:47:36) - AWS MCP Server in Preview: CloudWatch 2.8</li><li>(00:48:52) - GCP Cloud SQL Read Pools: Auto-Scaling</li><li>(00:51:17) - How to Design with AI in 2020</li><li>(00:53:48) - Microsoft at GTC 2017: Nvidia and Azure</li><li>(00:56:34) - Microsoft Temporarily Halt Copilot App Deployment</li><li>(00:58:12) - Microsoft's SQL Server Management Plan at SQLCon 2026</li><li>(01:03:36) - Azure Skills Plugin: What's Included?</li><li>(01:05:51) - Microsoft's Azure DevOps Remote MCP Server</li><li>(01:07:56) - Java 26: AI Integration, More</li><li>(01:09:01) - Oracle Announces AI-in-The-</li><li>(01:10:19) - This Week in Cloud: AI News</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 348 of The Cloud Pod, where the weather is always cloudy! Justin, Ryan, and Matt are in the studio this week to bring you all the latest news in AI and Cloud, inclduing Strykers troubles, AWS’ birthday, Bedrock Agents, and Claude Code – plus so much more. Let’s get started! 
Titles we almost went with this week

 SOC 2 It to Me Delve Fires Back 
 Shell Yeah Bedrock Agents Just Got Command Line Powers
When Your SOC 2 Report Is Just Fan Fiction
 uv, Ruff, and ty Walk Into an OpenAI Acquisition
 Hash Field Expiration Is Here, and It’s No Redis Herring
 Stop Paying Full Price for Tokens You Already Bought
 Fake It Till You Audit It
 Cache Me If You Can CNCF Sandbox Edition
 Microsoft Learns Consent Matters in Copilot Rollout
 Microsoft’s Stinky Cloud Gets Federal Seal of Approval
 When Your Audit Trail Leads to a Blog Fight
 Ping Your AI Agent on Discord Like a Millennial
 Twenty Years of AWS and the Bill Never Stops
The LLM hack that feels a lot like Node Shift Left Package issues
 Claude Code Auto Mode Lets AI Work Unsupervised
 Stop Babysitting Your AI Claude Code Goes Solo
 Auto Mode Gives Claude Code the Keys to the Car
 Java comes to the coffee shop with AI

General News 
01:21 Customer Updates: Stryker Network Disruption 

Stryker confirmed a cyberattack on March 11, 2026, that disrupted their internal Microsoft corporate environment, affecting order processing, manufacturing, and shipping, but notably not their connected medical devices or cloud-hosted products.
The attack vector was specific to Stryker’s Microsoft environment, which meant products running on AWS (Vocera Edge, Vocera Ease) and Google Cloud Platform (care.ai) were architecturally isolated and unaffected, demonstrating a practical benefit of multi-cloud separation.
Stryker explicitly stated this was not ransomware or malware, and government agencies, including CISA, FBI, and the White House National Cyber Director, were engaged, with domain seizures linked to threat actors already executed.
The incident highlights how healthcare organizations can architect medical device and cloud product infrastructure to be independent of corporate IT environments, as every product from Mako to SurgiCount to LIFEPAK operated normally due to network segmentation.
Real-world patient impact was limited but present, with some personalized implant cases rescheduled due to shipping delays, underscoring that even contained corporate IT incidents can have downstream effects on physical supply chains.

02:30  Justin – “HugOps to the entire Stryker team; I couldn’t imagine having to rebuild my entire Windows estate at a company the size of Stryker in the middle of trying to do business and everything else.” 
05:00 ]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[348: Compliance Theater Now Available as a Subscriptions]]>
                </itunes:title>
                                    <itunes:episode>348</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 348 of The Cloud Pod, where the weather is always cloudy! Justin, Ryan, and Matt are in the studio this week to bring you all the latest news in AI and Cloud, inclduing Strykers troubles, AWS’ birthday, Bedrock Agents, and Claude Code – plus so much more. Let’s get started! </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> SOC 2 It to Me Delve Fires Back </li>
<li> Shell Yeah Bedrock Agents Just Got Command Line Powers</li>
<li>When Your SOC 2 Report Is Just Fan Fiction</li>
<li> uv, Ruff, and ty Walk Into an OpenAI Acquisition</li>
<li> Hash Field Expiration Is Here, and It’s No Redis Herring</li>
<li> Stop Paying Full Price for Tokens You Already Bought</li>
<li> Fake It Till You Audit It</li>
<li> Cache Me If You Can CNCF Sandbox Edition</li>
<li> Microsoft Learns Consent Matters in Copilot Rollout</li>
<li> Microsoft’s Stinky Cloud Gets Federal Seal of Approval</li>
<li> When Your Audit Trail Leads to a Blog Fight</li>
<li> Ping Your AI Agent on Discord Like a Millennial</li>
<li> Twenty Years of AWS and the Bill Never Stops</li>
<li>The LLM hack that feels a lot like Node Shift Left Package issues</li>
<li> Claude Code Auto Mode Lets AI Work Unsupervised</li>
<li> Stop Babysitting Your AI Claude Code Goes Solo</li>
<li> Auto Mode Gives Claude Code the Keys to the Car</li>
<li> Java comes to the coffee shop with AI</li>
</ul>
<h2>General News </h2>
<p>01:21 <a href="https://www.stryker.com/us/en/about/news/2026/a-message-to-our-customers-03-2026.html">Customer Updates: Stryker Network Disruption </a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.stryker.com/us/en/index.html">Stryker</a> confirmed a cyberattack on March 11, 2026, that disrupted their internal <a href="https://www.microsoft.com/en-us">Microsoft</a> corporate environment, affecting order processing, manufacturing, and shipping, but notably not their connected medical devices or cloud-hosted products.</li>
<li style="font-weight:400;">The attack vector was specific to Stryker’s Microsoft environment, which meant products running on <a href="https://aws.amazon.com/">AWS</a> (<a href="https://www.stryker.com/us/en/smart-care/products/vocera-edge.html">Vocera Edge</a>, <a href="https://www.stryker.com/us/en/smart-care/products/vocera-ease.html">Vocera Ease</a>) and <a href="https://cloud.google.com/">Google Cloud</a> Platform (care.ai) were architecturally isolated and unaffected, demonstrating a practical benefit of multi-cloud separation.</li>
<li style="font-weight:400;">Stryker explicitly stated this was not ransomware or malware, and government agencies, including CISA, FBI, and the White House National Cyber Director, were engaged, with domain seizures linked to threat actors already executed.</li>
<li style="font-weight:400;">The incident highlights how healthcare organizations can architect medical device and cloud product infrastructure to be independent of corporate IT environments, as every product from <a href="https://www.stryker.com/us/en/joint-replacement/systems/Mako_SmartRobotics_Overview.html">Mako</a> to <a href="https://www.stryker.com/us/en/surgical-technologies/products/surgicount-safety-sponge-system/surgicount-index.html">SurgiCount</a> to <a href="https://www.stryker.com/us/en/emergency-care/products/lifepak-35.html">LIFEPAK</a> operated normally due to network segmentation.</li>
<li style="font-weight:400;">Real-world patient impact was limited but present, with some personalized implant cases rescheduled due to shipping delays, underscoring that even contained corporate IT incidents can have downstream effects on physical supply chains.</li>
</ul>
<p>02:30  Justin – “HugOps to the entire Stryker team; I couldn’t imagine having to rebuild my entire Windows estate at a company the size of Stryker in the middle of trying to do business and everything else.” </p>
<p>05:00 <a href="https://arstechnica.com/information-technology/2026/03/federal-cyber-experts-called-microsofts-cloud-a-pile-of-shit-approved-it-anyway/">Federal cyber experts called Microsoft’s cloud a “pile of shit,” and </a><a href="https://arstechnica.com/information-technology/2026/03/federal-cyber-experts-called-microsofts-cloud-a-pile-of-shit-approved-it-anyway/">approved it anyway</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.fedramp.gov/">FedRAMP</a> authorized <a href="https://www.fedramp.gov/marketplace/products/MSO365MT/">Microsoft’s Government Community Cloud</a> High despite internal reviewers finding insufficient security documentation, issuing an unusual “buyer beware” notice to agencies considering the product. </li>
<li style="font-weight:400;">This raises questions about the integrity of the federal cloud authorization process when commercial pressures intersect with security evaluations.</li>
<li style="font-weight:400;">The <a href="https://learn.microsoft.com/en-us/office365/servicedescriptions/office-365-platform-service-description/office-365-us-government/gcc-high-and-dod">GCC High</a> offering is specifically designed to handle some of the US government’s most sensitive data, making the documentation gaps particularly consequential, given that Microsoft had already been linked to two significant federal breaches involving Russian and Chinese state actors.</li>
<li style="font-weight:400;">The core technical concern was Microsoft’s inability to adequately document how data is protected as it moves between servers within their cloud infrastructure, leaving reviewers unable to assess the system’s overall security posture with confidence.</li>
<li style="font-weight:400;">For cloud practitioners and federal agencies, this situation highlights the risk of relying on vendor-provided security documentation without independent verification, especially for high-sensitivity workloads where compliance approval does not necessarily equal verified security.</li>
<li style="font-weight:400;">The outcome has broader implications for FedRAMP’s credibility as a security benchmark, since agencies selecting cloud providers often treat authorization as a meaningful security signal rather than a conditional or incomplete endorsement.</li>
</ul>
<p>06:00  Ryan – “If you can’t adequately explain how basic things like encryption and security controls are handled in your environment, that’s not good, right? Because while it’s not completely indicative of a security problem, it’s highly suspect.” </p>
<p>06:51 <a href="https://substack.com/home/post/p-191342187">Delve – Fake Compliance as a Service – Part I </a></p>
<ul>
<li style="font-weight:400;">A detailed investigation alleges that <a href="https://delve.co/">Delve</a>, a compliance automation platform, fabricates audit evidence, including board meeting records and test results, then uses Indian certification mills operating through US shell entities to rubber-stamp reports rather than conduct independent verification.</li>
<li style="font-weight:400;">The core technical concern is that Delve reportedly generates identical audit reports across all clients, meaning the auditor independence required by AICPA and ISO standards is structurally violated since Delve itself is effectively acting as both platform and auditor.</li>
<li style="font-weight:400;">Companies using Delve for HIPAA or GDPR compliance may face significant regulatory exposure, as the article claims the platform skips major framework requirements while telling clients they have achieved 100% compliance, potentially creating criminal liability under HIPAA and fines up to 4% of global revenue under GDPR.</li>
<li style="font-weight:400;">The investigation highlights a broader issue in the compliance automation space where AI and automation claims may not reflect actual product capabilities, with the article describing Delve as essentially a template pack with a SaaS wrapper rather than a genuinely automated compliance tool.</li>
<li style="font-weight:400;">For cloud-focused companies evaluating compliance platforms, this case underscores the importance of verifying auditor independence credentials, requesting evidence of actual testing procedures, and understanding whether a platform produces genuinely customized documentation or pre-populated templates adopted with minimal review.</li>
<li style="font-weight:400;">Interested in reading the leaked spreadsheet? Find those <a href="https://archive.ph/6ZSzX">here</a> and the leaked documents <a href="https://mega.nz/folder/3ZNi3DqZ#ZH-M2Au1zErISCPD5Hgegg">here</a>. </li>
</ul>
<p>08:47  Ryan – “I’m not a big fan of checkbox security and having that around just for compliance purposes. But it’s also like, this is really a misrepresentation. You look at things and, and it’s certified by Delve; it’s not certified by these other companies. And if all that evidence, the specifics they listed in the report are crazy, just how, like, this is not cool. It’s just generated. It’s not even real in the slightest.”</p>
<p>11:37 <a href="https://delve.co/blog/response-to-misleading-claims">Response to Misleading Claims </a></p>
<ul>
<li style="font-weight:400;">Delve is a SOC 2 compliance automation platform serving over 1,700 customers, and this response addresses a Substack post making claims about the legitimacy of its audit processes. </li>
<li style="font-weight:400;">The core distinction Delve makes is that it automates evidence collection and provides templates, while independent licensed auditors retain sole authority to issue final reports.</li>
<li style="font-weight:400;">The debate touches on a broader industry practice where compliance platforms provide standardized control sets based on AICPA and ISO frameworks, meaning structural overlap across reports is expected rather than evidence of fraud. </li>
<li style="font-weight:400;">This is worth discussing because buyers of compliance software often do not fully understand where the platform ends and the auditor begins.</li>
<li style="font-weight:400;">Delve claims 120+ automated integrations, which is a notable gap from the 14 cited in the original criticism, and speaks to how quickly compliance tooling has evolved in the cloud ecosystem. </li>
<li style="font-weight:400;">For cloud-native companies pursuing SOC 2, the depth of integrations directly affects how much manual evidence collection is required.</li>
<li style="font-weight:400;">The use of pre-filled templates for board minutes and policies is standard practice across compliance platforms, but it raises a legitimate question about whether customers treat these as starting points or simply submit them unchanged. </li>
<li style="font-weight:400;">This is a real risk area for organizations where compliance becomes a checkbox exercise rather than a genuine security posture.</li>
<li style="font-weight:400;">The competitive compliance automation market, which includes players like Vanta and Drata, means disputes like this are likely to continue as vendors differentiate on auditor quality, automation depth, and pricing. </li>
<li style="font-weight:400;">Listeners evaluating compliance tools should independently verify auditor accreditation regardless of which platform they use.</li>
</ul>
<p>13:08  Ryan – “I would argue the use of pre-filled templates is common…prefilled and direct copied templates from between companies.” </p>
<p>19:04 <a href="https://futuresearch.ai/blog/litellm-pypi-supply-chain-attack/">Supply Chain Attack in litellm 1.82.8 on PyPI</a></p>
<ul>
<li style="font-weight:400;"><a href="https://github.com/BerriAI/litellm">Litellm</a> versions 1.82.7 and 1.82.8 on <a href="https://pypi.org/">PyPI</a> were found to contain a malicious .pth file that executes automatically on every Python process startup, with no corresponding release on the official <a href="https://github.com/">GitHub</a> repository, indicating the PyPI account was likely compromised.</li>
<li style="font-weight:400;">The malware follows a three-stage attack pattern: collecting SSH keys, cloud credentials, .env files, and Kubernetes configs; encrypting and exfiltrating them to a domain unrelated to legitimate litellm infrastructure; then attempting persistent backdoor installation via systemd and privileged <a href="https://kubernetes.io/">Kubernetes</a> pod creation.</li>
<li style="font-weight:400;">The attack was discovered because a bug in the malware caused an exponential fork bomb through a recursive .pth file, triggering, which crashed the host machine and made the compromise visible rather than silent.</li>
<li style="font-weight:400;">Any developer or CI/CD pipeline that pulled litellm as a transitive dependency after March 24, 2026, should treat all credentials on that machine as compromised and rotate SSH keys, cloud provider tokens, API keys, and database passwords immediately.</li>
<li style="font-weight:400;">This incident highlights the risk of supply chain attacks through transitive dependencies, where a package you never directly installed can introduce malicious code into your environment, making dependency auditing and package integrity verification important practices for cloud-connected development workflows.</li>
</ul>
<p>21:21 Justin – “Yeah… that’s bad too.” </p>

<p>KUBECON EU</p>
<p>23:24 <a href="https://cloud.google.com/blog/products/containers-kubernetes/gke-and-oss-innovation-at-kubecon-eu-2026/">GKE and OSS innovation at KubeCon EU 2026</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.cloud.google.com/kubernetes-engine/docs/concepts/autopilot-overview">GKE Autopilot</a> is no longer a cluster-level decision made at creation time. Standard clusters can now enable Autopilot compute classes on a per-workload basis, removing the need to create entirely new clusters when workload requirements change.</li>
<li style="font-weight:400;">Google is open-sourcing the <a href="https://docs.cloud.google.com/kubernetes-engine/docs/concepts/cluster-autoscaler">GKE Cluster Autoscaler</a>, one of the core infrastructure provisioning components, with the goal of making it available to the broader Kubernetes community as a vendor-neutral tool.</li>
<li style="font-weight:400;"><a href="https://llm-d.ai/">llm-d</a>, a Kubernetes-native distributed inference framework built with Red Hat and NVIDIA, has been accepted as a <a href="https://www.cncf.io/announcements/2025/11/11/cncf-launches-certified-kubernetes-ai-conformance-program-to-standardize-ai-workloads-on-kubernetes/">CNCF</a> Sandbox project. It addresses inference-aware traffic management, multi-node replica orchestration, and KV cache offloading in a hardware-agnostic way.</li>
<li style="font-weight:400;">Google released an open-source <a href="https://kubernetes.io/docs/concepts/scheduling-eviction/dynamic-resource-allocation/">DRA</a> driver for TPUs, coordinated alongside <a href="https://blogs.nvidia.com/blog/nvidia-at-kubecon-2026">NVIDIA</a>, donating their own DRA driver, establishing Dynamic Resource Allocation as a shared standard for describing specialized hardware across Kubernetes workloads.</li>
<li style="font-weight:400;">TPU support is coming to Ray v2.55 with backing from both Google and Anyscale, and a new Ray History Server in alpha allows users to debug completed or terminated RayJobs using persisted logs, state, and metrics through the <a href="https://docs.cloud.google.com/kubernetes-engine/docs/add-on/ray-on-gke/quickstarts/ray-gpu-cluster">Ray Dashboard on GKE</a>.</li>
</ul>
<p>24:29  Ryan – “It’s super nice of them to open source that, because it does seem like a very powerful thing to use. I love the idea of having individual workloads on a cluster, and be able to delegate to managed and unmanaged… it’s kind of neat.”  </p>
<p>24:49 <a href="https://cloud.google.com/blog/products/containers-kubernetes/llm-d-officially-a-cncf-sandbox-project/">llm-d officially a CNCF Sandbox project</a></p>
<ul>
<li style="font-weight:400;">llm-d has been accepted as a CNCF Sandbox project, with Google Cloud as a founding contributor alongside Red Hat, IBM Research, CoreWeave, and NVIDIA. </li>
<li style="font-weight:400;">The project aims to extend Kubernetes for LLM inference workloads under an open-source model with no vendor lock-in, available at llm-d.ai.</li>
<li style="font-weight:400;">The core technical contribution is model-aware request routing through the llm-d Endpoint Picker, which considers KV-cache hit rates, in-flight requests, and queue depth to direct traffic to optimal backends. </li>
<li style="font-weight:400;">In <a href="https://cloud.google.com/blog/products/containers-kubernetes/how-gke-inference-gateway-improved-latency-for-vertex-ai?e=48754805">production testing on Vertex</a> AI, this approach reduced Time-to-First-Token latency by over 35% for coding workloads and improved P95 tail latency by 52% for bursty chat workloads.</li>
<li style="font-weight:400;">A notable outcome of the routing intelligence was doubling Vertex AI’s prefix cache hit rate from 35% to 70%, which directly reduces re-computation overhead and lowers cost-per-token for high-volume inference deployments.</li>
<li style="font-weight:400;">Google leads development of the Kubernetes <a href="https://lws.sigs.k8s.io/docs/overview/">LeaderWorkerSet API</a>, which llm-d uses to orchestrate prefill and decode disaggregation across independently scalable pods, supporting both TPU and GPU fleets at scale.</li>
<li style="font-weight:400;">Google has also <a href="https://vllm.ai/blog/vllm-tpu">extended vLLM natively for Cloud TPUs</a> with a unified PyTorch and JAX backend, delivering up to 5x throughput gains over the initial release. Pricing for running llm-d workloads depends on underlying GKE and accelerator costs, which vary by instance type and region.</li>
</ul>
<p>26:21 <a href="https://opensource.microsoft.com/blog/2026/03/24/whats-new-with-microsoft-in-open-source-and-kubernetes-at-kubecon-cloudnativecon-europe-2026/">What’s new with Microsoft in open-source and Kubernetes at KubeCon + </a><a href="https://opensource.microsoft.com/blog/2026/03/24/whats-new-with-microsoft-in-open-source-and-kubernetes-at-kubecon-cloudnativecon-europe-2026/">CloudNativeCon Europe 2026 </a></p>
<ul>
<li style="font-weight:400;">Dynamic Resource Allocation has reached general availability in Kubernetes, and Microsoft’s DRANet now includes upstream support for Azure RDMA NICs, meaning GPU-to-NIC topology alignment is handled at the scheduler level rather than through manual configuration. 
<ul>
<li style="font-weight:400;">This matters for teams running distributed training workloads where network topology directly affects performance.</li>
</ul>
</li>
<li style="font-weight:400;"><a href="https://github.com/kaito-project/kubeairunway">AI Runway</a> is a new open-source project under the KAITO umbrella that provides a common Kubernetes API for inference workloads, with a web interface, HuggingFace model discovery, GPU memory fit indicators, and real-time cost estimates. 
<ul>
<li style="font-weight:400;">It supports multiple runtimes, including <a href="https://developer.nvidia.com/blog/introducing-nvidia-dynamo-a-low-latency-distributed-inference-framework-for-scaling-reasoning-ai-models/">NVIDIA Dynamo</a> and <a href="https://ray-project.github.io/kuberay/">KubeRay</a>, giving platform teams a single control plane for model deployments without requiring end users to know Kubernetes.</li>
</ul>
</li>
<li style="font-weight:400;">AKS networking gets several notable updates, including <a href="https://aka.ms/aks/application-network">Azure Kubernetes Application Network</a> for identity-aware mTLS and traffic telemetry without a full service mesh, WireGuard encryption at the node level via <a href="https://github.com/cilium/">Cilium</a>, and <a href="https://aka.ms/aks/pod-cidr-expansion">Pod CIDR expansion</a> that lets clusters grow IP ranges in place rather than requiring a full rebuild. 
<ul>
<li style="font-weight:400;">Pricing for Advanced Container Networking Services features like <a href="https://aka.ms/acns/cilium-mtls">Cilium mTLS</a> is not specified in the announcement.</li>
</ul>
</li>
<li style="font-weight:400;">On the observability side, <a href="https://azure.microsoft.com/en-us/products/kubernetes-service">AKS</a> now surfaces <a href="https://aka.ms/aks/managed-gpu-metrics">GPU utilization</a> directly into managed <a href="https://prometheus.io/">Prometheus</a> and <a href="https://grafana.com/">Grafana</a>, closing a monitoring gap that previously required manual exporter configuration. 
<ul>
<li style="font-weight:400;">A new <a href="https://learn.microsoft.com/en-us/azure/aks/advanced-container-networking-services-overview">agentic container networking</a> interface also lets operators run natural-language diagnostic queries against live telemetry, reducing time to identify network issues.</li>
</ul>
</li>
<li style="font-weight:400;">Blue-green agent pool upgrades and agent pool rollback are now available in AKS, letting teams provision a parallel node pool with the new configuration, validate it, and revert to the previous Kubernetes version and node image if problems appear. </li>
<li style="font-weight:400;">AKS Desktop also reached general availability, giving developers a local environment that mirrors production AKS configuration.</li>
</ul>
<p>27:42  Ryan – “And if you’ve ever debugged an issue on Kubernetes, then you know that there’s logs everywhere that you have to go and review and correlate across each other, so having an agent that can go and look across all those places and diagnose issues is fantastic.” </p>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>28:22 <a href="https://www.snowflake.com/en/blog/snowflake-for-work-business-users/">Project SnowWork: The easiest way for business users to get work done</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.snowflake.com/en/">Snowflake</a> announced Project SnowWork in Research Preview, an agentic AI platform targeting business users in finance, sales, marketing, and operations who need to complete multi-step data workflows without writing code or relying on technical teams.</li>
<li style="font-weight:400;">The platform differentiates itself from general AI assistants by grounding outputs in an organization’s existing Snowflake data and automatically enforcing existing RBAC and governance policies, meaning users only see data they are already authorized to access.</li>
<li style="font-weight:400;">Project SnowWork ships with pre-built persona profiles for specific business functions, so a finance user gets workflows tuned to FP&amp;A KPIs and close narratives while a sales user gets pipeline risk summaries, rather than a one-size-fits-all interface.</li>
<li style="font-weight:400;">Practical use cases highlighted include compressing financial close storytelling from days to a single workflow and replacing manual pipeline rollups with automated executive briefs, which gives listeners a concrete sense of the time savings being targeted.</li>
<li style="font-weight:400;">Access is currently limited to a select group of customers in a collaborative research preview, so this is not a general availability release, and organizations interested in early access would need to engage directly with Snowflake.</li>
</ul>
<p>27:42  Ryan – “I do like the idea of bringing AI to the data rather than the data to the AI, which is a common problem, especially in enterprise platforms. I worry a little bit;  The RBAC and authorization in Snowflake is very complex, and I wonder if people are actually going through and actually defining those in a way that would be proper segmentation? But I guess, you know, they have access to it today, they just have to know how to query it.”</p>
<p>30:10 <a href="https://openai.com/index/openai-to-acquire-astral">OpenAI to acquire Astral </a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/">OpenAI</a> is acquiring <a href="https://astral.sh/">Astral</a>, the company behind three widely adopted Python developer tools: <a href="https://docs.astral.sh/uv/">uv</a> for dependency and environment management, <a href="https://docs.astral.sh/ruff/">Ruff</a> for linting and formatting, and <a href="https://docs.astral.sh/ty/">ty</a> for type safety enforcement. </li>
<li style="font-weight:400;">The Astral team will join the <a href="https://openai.com/codex/">Codex</a> team after the deal closes, pending regulatory approval.</li>
<li style="font-weight:400;">Codex has reached over 2 million weekly active users, with 3x user growth and 5x usage increase since the start of 2025. This acquisition appears aimed at deepening Codex’s ability to operate across the full Python development lifecycle rather than just generating code snippets.</li>
<li style="font-weight:400;">The stated goal is to move Codex toward participating in complete development workflows, including planning changes, modifying codebases, running tools, verifying results, and maintaining software over time. Integrating Astral’s tooling directly into that workflow gives Codex agents access to infrastructure developers already use daily.</li>
<li style="font-weight:400;">OpenAI has committed to continuing support for Astral’s open source projects after closing, which matters to the Python community given how widely these tools are already embedded in developer workflows. Developers using uv or Ruff should not expect immediate disruption to those projects.</li>
<li style="font-weight:400;">For cloud and platform teams, this signals a trend toward AI coding agents that are tightly coupled with language-specific toolchains rather than operating as generic code generators, which could influence how development environments and CI/CD pipelines are structured going forward.</li>
</ul>
<p>30:47  Justin – “I don’t know why they needed to buy the company to do all this, it is open source already.” </p>
<p>32:50 <a href="https://venturebeat.com/orchestration/anthropic-just-shipped-an-openclaw-killer-called-claude-code-channels">Anthropic just shipped an OpenClaw killer called Claude Code Channels, </a><a href="https://venturebeat.com/orchestration/anthropic-just-shipped-an-openclaw-killer-called-claude-code-channels">letting you message it over Telegram and Discord </a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/">Anthropic</a> released <a href="https://code.claude.com/docs/en/channels-reference">Claude Code Channels</a> in version 2.1.80, enabling developers to connect their Claude Code sessions to <a href="https://web.telegram.org/">Telegram</a> and <a href="https://discord.com/">Discord</a> bots, shifting from a synchronous chat model to an asynchronous, persistent agent that can work autonomously and notify users when tasks are completed.</li>
<li style="font-weight:400;">The feature is built on Anthropic’s open-source <a href="https://modelcontextprotocol.info/">Model Context Protocol</a>, which acts as a standardized bridge between Claude Code and external messaging platforms. </li>
<li style="font-weight:400;">The setup uses the <a href="https://bun.sh/">Bun JavaScript runtime</a> to run a polling service that injects incoming messages as session events, allowing Claude to execute code, run tests, and reply back through the messaging app.</li>
<li style="font-weight:400;">Practically, this eliminates the need for developers to maintain dedicated hardware like a Mac Mini running open-source agent frameworks 24/7, since Claude Code itself now handles session persistence when run in a background terminal or on a VPS.</li>
<li style="font-weight:400;">The plugin architecture is open, with official Telegram and Discord connectors hosted on GitHub under Anthropic repositories, meaning the community can build additional connectors for platforms like Slack or WhatsApp without waiting for Anthropic to ship them natively.</li>
<li style="font-weight:400;">The feature remains tied to <a href="https://claude.com/pricing">Anthropic’s commercial subscriptions</a> (Pro, Max, and Enterprise), so while the MCP layer is open, the underlying Claude model and Claude Code harness are proprietary, which is an important cost and vendor-lock consideration for teams evaluating this against self-hosted alternatives.</li>
</ul>
<p>33:50  Justin – “I tried to use this, and it don’t work for me, but I didn’t have enough time to test it, I had too many Claude sessions going, and I needed to kill all of them and update properly to the 2.1.80 version. But I am curious to play with it a little more.”      </p>
<p>35:34 <a href="https://claude.com/blog/dispatch-and-computer-use">Put Claude to work on your computer </a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/">Anthropic</a> has launched computer use capabilities in <a href="https://www.anthropic.com/product/claude-cowork">Claude Cowork</a> and <a href="https://claude.com/product/claude-code">Claude Code</a>, now in research preview for Pro and Max subscribers on macOS. Claude can directly control a browser, mouse, keyboard, and screen to complete tasks when no direct connector exists, with no setup required.</li>
<li style="font-weight:400;">The feature follows a tool priority hierarchy, reaching for service connectors like Slack or Google Calendar first, then falling back to direct computer control. Claude requests explicit permission before accessing new applications and can be stopped at any point.</li>
<li style="font-weight:400;">Anthropic has built in prompt injection safeguards by scanning model activations during computer use sessions. They acknowledge that the capability is still early and recommend users avoid sensitive data and start with trusted applications only.</li>
<li style="font-weight:400;"><a href="https://support.claude.com/en/articles/13947068-assign-tasks-to-claude-from-anywhere-in-cowork">Dispatch</a>, released alongside this update, enables a continuous conversation thread between mobile and desktop, letting users assign tasks from their phone and pick up completed work on their computer. 
<ul>
<li style="font-weight:400;">Use cases include automated morning email checks, scheduled metric pulls, and triggering Claude Code sessions for pull requests.</li>
</ul>
</li>
<li style="font-weight:400;">The combination of Dispatch and computer use means Claude can execute multi-step workflows on a desktop while the user is away, such as making IDE changes, running tests, and submitting a PR. </li>
<li style="font-weight:400;">Current limitations include macOS-only support, slower execution compared to direct integrations, and occasional need for retries on complex tasks.</li>
</ul>
<p>36:28  Ryan – “I didn’t know this was macOS only, because I was going to put it on my Linux server so I could get compute that wasn’t my laptop.” </p>
<p>38:32 <a href="https://claude.com/blog/auto-mode">Auto mode for Claude Code</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/">Anthropi</a>c launched auto mode for <a href="https://claude.com/product/claude-code">Claude Code</a> in research preview for Team plan users, with Enterprise and API access coming soon. It works with both Claude Sonnet 4.6 and Opus 4.6, offering a middle ground between the default conservative permission prompts and the risky dangerously-skip-permissions flag.</li>
<li style="font-weight:400;">The core mechanism is a classifier that reviews each tool call before execution, automatically blocking potentially destructive actions like mass file deletion, sensitive data exfiltration, or malicious code execution, while letting safe actions proceed without interruption.</li>
<li style="font-weight:400;">This directly addresses a practical developer workflow problem: Claude Code’s default mode requires frequent human approvals that prevent truly unattended long-running tasks, and auto mode allows developers to kick off extended jobs without babysitting the process.</li>
<li style="font-weight:400;">Anthropic is transparent about the limitations, noting the classifier may still allow some risky actions when user intent is ambiguous, and may occasionally block benign ones. They continue to recommend using it in isolated environments rather than treating it as a fully safe alternative.</li>
<li style="font-weight:400;">There is a small performance tradeoff to be aware of, as auto mode adds some overhead to token consumption, cost, and latency per tool call due to the classifier running before each action.</li>
</ul>
<h2>AWS</h2>
<p>41:21 <a href="https://aws.amazon.com/about-aws/whats-new/2026/03/bedrock-agentcore-runtime-shell-command/">Amazon Bedrock AgentCore Runtime now supports shell command </a><a href="https://aws.amazon.com/about-aws/whats-new/2026/03/bedrock-agentcore-runtime-shell-command/">execution</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/bedrock-agentcore/latest/devguide/agents-tools-runtime.html">Amazon Bedrock AgentCore Runtime</a> now includes InvokeAgentRuntimeCommand, an API that lets developers execute shell commands directly inside a running agent session, streaming output in real time over HTTP/2 and returning exit codes without custom container logic.</li>
<li style="font-weight:400;">The practical benefit here is that AI agents frequently need to run deterministic operations like tests, dependency installs, or git commands alongside LLM reasoning, and previously, developers had to build all that process management themselves inside their containers.</li>
<li style="font-weight:400;">Commands run in the same container, filesystem, and environment as the agent session and can execute concurrently with agent invocations without blocking, which simplifies architectures for coding agents, CI/CD automation, and similar workflows.</li>
<li style="font-weight:400;">The feature is available across 14 <a href="https://aws.amazon.com/about-aws/global-infrastructure/regions_az/">AWS regions</a>, including major US, European, and Asia Pacific locations, giving teams broad geographic coverage for latency-sensitive or data-residency-constrained workloads.</li>
<li style="font-weight:400;">Pricing details are not specified in the announcement, so teams evaluating this should check the AgentCore Runtime pricing page directly before building cost models around heavy command execution workloads.</li>
</ul>
<p>42:11  Ryan – I do get the advantages of this. Most of my use cases in GitHub Autopilot or Cloud Code it’s running Shell to do lots of things, especially executing tests, and so for CI-CD type workflows, you couldn’t do anything without it. I’m really curious how teams were working around this; people that were previously using Agent Core, because I bet that is ugly. But yeah, it’s going to be dangerous.”</p>
<p>42:56 <a href="https://aws.amazon.com/about-aws/whats-new/2026/03/amazon-inspector-agentless-ec2-scanning-windows/">Amazon Inspector expands agentless EC2 scanning and introduces </a><a href="https://aws.amazon.com/about-aws/whats-new/2026/03/amazon-inspector-agentless-ec2-scanning-windows/">Windows KB-based findings</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/inspector/">Amazon Inspector</a> now supports agentless <a href="https://aws.amazon.com/ec2/">EC2</a> scanning for a broader range of software, including <a href="https://wordpress.com/">WordPress</a>, <a href="https://httpd.apache.org/">Apache HTTP Server</a>, <a href="https://www.python.org/">Python</a> packages, and Ruby gems, plus Windows OS vulnerabilities, with no configuration changes required for existing customers.</li>
<li style="font-weight:400;">The new Windows KB-based findings consolidate multiple CVEs addressed by a single Microsoft patch into one finding, surfacing the highest CVSS score, EPSS score, and exploit availability, which reduces noise and makes remediation more straightforward.</li>
<li style="font-weight:400;">All existing CVE-based Windows OS findings will automatically transition to KB-based findings, meaning security teams will see fewer duplicate alerts and can map findings directly to specific Microsoft patches via included KB article links.</li>
<li style="font-weight:400;">The agentless approach lowers the operational overhead for security teams managing large EC2 fleets, particularly in environments where installing and maintaining agents is restricted or impractical.</li>
<li style="font-weight:400;">Both capabilities are available across all AWS Regions where Amazon Inspector is currently offered, and pricing follows the existing Inspector model based on instance scanning volume, so customers should review the Inspector pricing page for current rates.</li>
</ul>
<p>43:33  Justin – “I’m actually shocked this wasn’t already there, because CVE is really just the generic way that you would find these, but typically they’re always linked to a knowledge-based article which then typically links you to the patch, so I don’t know how people got from the CVE to the patch without this before, other than maybe the CVE mentions the KB articles.”</p>
<p>22:53 <a href="https://aws.amazon.com/about-aws/whats-new/2026/03/amazon-ecr-pull-through-cache-chainguard/">Amazon ECR now supports pull-through cache for Chainguard</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/ecr/">Amazon ECR</a> pull-through cache now supports <a href="https://www.chainguard.dev/">Chainguard</a> as an upstream registry source, allowing customers to automatically sync Chainguard container images into ECR without building custom synchronization workflows.</li>
<li style="font-weight:400;">Chainguard images are known for their minimal attack surface and security-focused builds, so pairing them with ECR’s native image scanning and lifecycle policies gives teams a more integrated security posture for their container supply chain.</li>
<li style="font-weight:400;">The practical benefit here is operational simplicity: teams using Chainguard images at scale no longer need separate tooling to keep images current, as ECR handles the sync automatically and frequently.</li>
<li style="font-weight:400;">Cached Chainguard images inherit standard ECR capabilities, including lifecycle policies for cost management and image scanning, which means customers get consistent governance across both their own images and upstream Chainguard images.</li>
<li style="font-weight:400;">The feature is available in all AWS regions where ECR pull-through cache is supported, and pricing follows standard ECR storage and data transfer rates with no additional charge specific to the Chainguard integration. Full details are in the ECR pull-through cache documentation <a href="http://docs.aws.amazon.com/AmazonECR/latest/userguide/pull-through-cache.html">here</a>. </li>
</ul>
<p>46:22  Matt – “It’s massive, but checks a box for your security team, right, that doesn’t want to understand how containers work. Just use this one, and you’ll have to worry about it. It’s like, but I can install anything I want on it. So is it actually going to help?”</p>
<p>47:57 <a href="https://www.geekwire.com/2026/aws-at-20-inside-the-rise-of-amazons-cloud-empire-and-whats-at-stake-in-the-ai-era/">AWS at 20*: Inside the rise of Amazon’s cloud empire, and what’s at stake </a><a href="https://www.geekwire.com/2026/aws-at-20-inside-the-rise-of-amazons-cloud-empire-and-whats-at-stake-in-the-ai-era/">in the AI era</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/">AWS</a> turns 20 this month, growing from 10 cents per compute hour in 2006 to nearly $129 billion in annual revenue, which would place it in the Fortune 500 top 40 as a standalone company. </li>
<li style="font-weight:400;">The article traces how <a href="https://aws.amazon.com/s3/">S3</a> and <a href="https://aws.amazon.com/ec2/">EC2</a> established the pay-per-use primitive model that directly undercut Oracle-style licensing and reshaped enterprise IT economics.</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/bedrock/">Bedrock</a> has become the fastest-growing service in AWS history, surpassing 100,000 customers and generating multi-billion dollar revenue with 60% quarter-over-quarter spending growth. AWS built it as a multi-model platform rather than pushing a single in-house option, following the same pattern it used with CPUs and GPUs by offering AMD, Intel, Graviton, Nvidia, and Trainium alongside each other.</li>
<li style="font-weight:400;"><a href="https://www.aboutamazon.com/news/aws/aws-project-rainier-ai-trainium-chips-compute-cluster">Project Rainier</a>, an AI compute cluster powered by over 500,000 <a href="https://aws.amazon.com/ai/machine-learning/trainium/">Trainium2</a> chips in Indiana, represents AWS attempting to reduce dependence on Nvidia by building its own silicon stack from chip to data center. </li>
<li style="font-weight:400;">The OpenAI partnership, worth up to $100 billion in cloud commitments over eight years, brings OpenAI workloads onto Trainium chips, making it the second major AI lab after Anthropic to commit to Amazon’s custom silicon.</li>
<li style="font-weight:400;">AWS still leads cloud revenue at over $116 billion annually, but Azure at $75 billion and Google Cloud at $50 billion annual run rates show the gap narrowing, particularly in AI workloads. </li>
<li style="font-weight:400;">Corey Quinn’s Cisco analogy is worth discussing: AWS could remain profitable and essential while becoming less central to where AI innovation actually happens.</li>
<li style="font-weight:400;">Jassy has publicly projected AWS could reach $600 billion in annual revenue by 2036 with AI as the driver, backing that with $200 billion in capital expenditure planned for this year alone, which would consume nearly all of Amazon’s operating cash flow.</li>
<li style="font-weight:400;">Happy Birthday </li>
</ul>
<p>49:37 <a href="https://aws.amazon.com/about-aws/whats-new/2026/03/aws-mcp-server-preview-enhanced-monitoring/">AWS MCP Server (Preview) now with enhanced monitoring and semantic </a><a href="https://aws.amazon.com/about-aws/whats-new/2026/03/aws-mcp-server-preview-enhanced-monitoring/">search capability</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/aws-mcp/">AWS MCP Server</a> in preview now automatically publishes <a href="https://docs.aws.amazon.com/aws-mcp/latest/userguide/monitoring-overview.html">metrics</a> to <a href="https://docs.aws.amazon.com/AmazonCloudWatch/latest/monitoring/WhatIsCloudWatch.html">CloudWatch</a> under the AWS-MCP namespace at no additional cost, covering invocation counts, success rates, client errors, server errors, and throttling for individual tools like the AWS API caller and Agent SOP retriever.</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/aws-mcp/latest/userguide/agent-sops.html">Agent SOPs</a> are pre-built, tested workflows that guide AI assistants through complex multi-step AWS tasks, and the documentation search tool now uses semantic similarity so agents can discover the right SOP through natural language queries rather than exact keyword matching.</li>
<li style="font-weight:400;">The CloudWatch integration addresses a previous gap where customers had no visibility into agent-driven changes, enabling teams to track usage patterns, identify permission issues, and configure alarms when error rates exceed defined thresholds.</li>
<li style="font-weight:400;">The service is currently available only in US East (N. Virginia) in preview, which is worth noting for teams with data residency requirements or those operating primarily in other regions.</li>
<li style="font-weight:400;">For listeners building AI-assisted infrastructure automation, this update provides a practical observability layer for MCP-based agents, which is increasingly relevant as teams adopt AI assistants for AWS operations tasks.</li>
</ul>
<p>50:26  Ryan – “Why did everything go offline? Now you can find out!” </p>
<h2>GCP</h2>
<p>50:59  <a href="https://cloud.google.com/blog/products/databases/cloudsql-read-pools-support-autoscaling/">CloudSQL read pools support autoscaling </a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/sql">Cloud SQL</a> read pools, now generally available for Enterprise Plus edition, let you provision up to 20 read replicas behind a single load-balanced endpoint for <a href="https://cloud.google.com/sql/docs/mysql/about-read-pools">MySQL</a> and <a href="https://cloud.google.com/sql/docs/postgres/about-read-pools">PostgreSQL</a>, removing the need to manually manage multiple replicas or reconfigure applications when nodes are added or removed.</li>
<li style="font-weight:400;">The new autoscaling feature dynamically adjusts node count based on CPU utilization or database connection thresholds, with users defining minimum and maximum node counts so the pool scales within those bounds automatically during traffic fluctuations.</li>
<li style="font-weight:400;">Pools with two or more nodes are backed by a 99.99% availability SLA that covers maintenance downtime, and configuration changes like VM type or database flag updates are applied across all nodes with near-zero downtime.</li>
<li style="font-weight:400;">From a cost perspective, autoscaling helps avoid over-provisioning by scaling in during low-traffic periods, meaning you pay only for nodes actively in use rather than maintaining a fixed fleet sized for peak load.</li>
<li style="font-weight:400;">Retail and other industries with variable workloads are a natural fit, and teams can get started via gcloud CLI, Terraform, or the REST API, with a 30-day free trial available at cloud.google.com/sql for hands-on access to Enterprise Plus features.</li>
<li style="font-weight:400;">Want to sign up for a free trial of Cloud SQL? You can do that <a href="https://docs.cloud.google.com/sql/docs/mysql/create-free-trial-instance">here</a>. </li>
</ul>
<p>52:29  Matt – “The feature here I actually like is that it autoscales reads… nothing I’ve seen will do auto scaling on the reads for SQL and scale it out horizontally in that way. Like, even Aurora, if you’re on the normal one, you build a read replica, you have to build each read replica, and then either route or round robin to those ones. So if it’s actually going to do automatic adding and removing based on capacity needs, that’s a pretty nice feature because it can save you a lot of money.” </p>
<p>53:35 <a href="https://blog.google/innovation-and-ai/models-and-research/google-labs/stitch-ai-ui-design/">Design UI using AI with Stitch from Google Labs</a></p>
<ul>
<li style="font-weight:400;"><a href="https://labs.google/">Google Labs</a> has evolved <a href="https://stitch.withgoogle.com/">Stitch</a> (stitch.withgoogle.com) into an AI-native design canvas that converts natural language descriptions into high-fidelity UI designs, targeting both professional designers and non-designers who want to move from concept to prototype quickly.</li>
<li style="font-weight:400;">The updated tool introduces an infinite canvas, a design agent that reasons across a project’s full history, and an Agent Manager for running multiple design directions in parallel, which addresses a common pain point of managing divergent design explorations.</li>
<li style="font-weight:400;"><a href="http://design.md">DESIGN.md</a> is a notable addition that lets users extract and export design systems as an agent-friendly markdown file, making it easier to apply consistent design rules across projects or share them with other tools without starting from scratch each time.</li>
<li style="font-weight:400;">Stitch connects to developer workflows through an <a href="https://stitch.withgoogle.com/docs/mcp/setup/">MCP server</a> and <a href="https://github.com/google-labs-code/stitch-sdk">SDK</a>, with export options to <a href="https://aistudio.google.com/">AI Studio</a> and <a href="https://antigravity.google/">Antigravity</a>, positioning it as a handoff layer between design and development rather than a standalone tool.</li>
<li style="font-weight:400;">Pricing details are not specified in the announcement, so listeners interested in using Stitch for production workflows should check the documentation at stitch.withgoogle.com for current access and cost information.</li>
</ul>

<p>55:20  Ryan – “I was developing something for my family, and it looks like you would expect, and so I can’t wait to try this out. And it was really impressive how fast, and how little feedback you gave it.” </p>
<h2>Azure</h2>
<p>56:12 <a href="https://blogs.microsoft.com/blog/2026/03/16/microsoft-at-nvidia-gtc-new-solutions-for-microsoft-foundry-azure-ai-infrastructure-and-physical-ai/">Microsoft at NVIDIA GTC: New solutions for Microsoft Foundry, Azure AI </a><a href="https://blogs.microsoft.com/blog/2026/03/16/microsoft-at-nvidia-gtc-new-solutions-for-microsoft-foundry-azure-ai-infrastructure-and-physical-ai/">infrastructure and Physical AI </a></p>
<ul>
<li style="font-weight:400;"><a href="https://aka.ms/FoundryAgentsGA-blog">Microsoft Foundry Agent Service and Observability in Foundry Control Plane</a> are now generally available, giving enterprise teams a unified platform to build, deploy, and monitor AI agents with end-to-end visibility into agent behavior across tools, data, and workflows.</li>
<li style="font-weight:400;">Azure is the first hyperscale cloud to power on <a href="https://x.com/satyanadella/status/2032515189086761005">NVIDIA Vera Rubin NVL72</a> systems in its labs, with rollout planned to liquid-cooled datacenters over the coming months, following deployment of hundreds of thousands of Grace Blackwell GPUs in under a year. </li>
<li style="font-weight:400;">This positions Azure as a target platform for inference-heavy and reasoning-based workloads at scale.</li>
<li style="font-weight:400;">NVIDIA Nemotron models are now available through Microsoft Foundry, and the Fireworks AI integration allows customers to fine-tune open-weight models into low-latency deployments that can be distributed to the edge. </li>
<li style="font-weight:400;">Pricing for these models is not specified in the announcement and would vary based on usage.</li>
<li style="font-weight:400;">Microsoft is extending <a href="https://blogs.microsoft.com/blog/2026/02/24/microsoft-sovereign-cloud-adds-governance-productivity-and-support-for-large-ai-models-securely-running-even-when-completely-disconnected/">NVIDIA Vera Rubin platform support to Azure Local</a>, allowing organizations in sovereign and regulated environments to run next-generation AI workloads while maintaining Azure-consistent governance through Azure Arc and Foundry Local.</li>
<li style="font-weight:400;">A new <a href="https://github.com/microsoft/physical-ai-toolchain">Physical AI Toolchain, available via a public GitHub repository</a>, integrates NVIDIA Physical AI Data Factory with Azure services, enabling developers to build robotics and physical AI workflows that connect physical assets, simulation environments, and cloud training into repeatable enterprise pipelines.</li>
</ul>
<p>57:38 Justin – “Skynet is VERY excited.” </p>
<p>59:06 <a href="https://www.theregister.com/2026/03/18/automatic_deployment_copilot/">Microsoft 365 pauses Copilot creep after admins cry foul</a></p>
<ul>
<li style="font-weight:400;">Microsoft has paused the automatic deployment of the <a href="https://www.microsoft.com/en-US/microsoft-365-copilot/download-copilot-app">Microsoft 365 Copilot app</a> to desktop users, halting a rollout that had already slipped twice from its original October 2025 target date. </li>
<li style="font-weight:400;">The pause has no specified end date, and existing installations remain unaffected.</li>
<li style="font-weight:400;">The core admin complaint was that the opt-out default model increased IT workload by forcing organizations to set policies on Microsoft’s timeline rather than their own. Admins who want to proceed with deployment can still do so manually through other available methods.</li>
<li style="font-weight:400;">European Economic Area customers were already excluded from this rollout, likely reflecting ongoing regulatory considerations around default software installations in that region.</li>
<li style="font-weight:400;">This pause aligns with broader reported changes to Microsoft’s approach of embedding Copilot across Windows 11 surfaces, suggesting some recalibration of how aggressively the assistant is pushed to end users. </li>
<li style="font-weight:400;">For IT decision-makers, the key takeaway is that centralized control over AI tool deployment remains a practical concern, and Microsoft’s willingness to halt the rollout signals that enterprise admin feedback carries weight in deployment decisions.</li>
</ul>
<p>59:45  Justin – “Don’t force your IT people to do things. That’s not good. They’re already overworked and stressed.” </p>
<p>1:00:46 <a href="https://www.microsoft.com/en-us/sql-server/blog/2026/03/18/advancing-agentic-ai-with-microsoft-databases-across-a-unified-data-estate/">Advancing agentic AI with Microsoft databases across a unified data </a><a href="https://www.microsoft.com/en-us/sql-server/blog/2026/03/18/advancing-agentic-ai-with-microsoft-databases-across-a-unified-data-estate/">estate </a></p>
<ul>
<li style="font-weight:400;">Microsoft announced a savings plan for databases at <a href="https://sqlcon.us/">SQLCon 2026</a>, offering up to 35% savings versus <a href="https://aka.ms/savings-plan-db">pay-as-you-go pricing</a> on a one-year hourly spend commitment, automatically applied across eligible Azure database services, including Azure SQL.</li>
<li style="font-weight:400;"><a href="https://aka.ms/ssms-2241-blog">GitHub Copilot is now generally available in SQL Server Management Studio 22</a>, bringing chat and T-SQL code assistance directly into <a href="https://learn.microsoft.com/en-us/ssms/sql-server-management-studio-ssms">SSMS</a> for developers and DBAs who already use Copilot in Visual Studio and VS Code.</li>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/azure-sql/database/service-tier-hyperscale?view=azuresql">Azure SQL Database Hyperscale</a> gained new public preview features, including a SQL MCP Server for connecting SQL data to <a href="https://www.youtube.com/watch?v=uTv-E-vz570&amp;list=PLLasX02E8BPBCP7KdYsjKKFFQUmNEUmE9&amp;index=2">AI agents</a>, larger 160 and 192 vCore options, and enhanced vector indexes with full insert, update, and delete support requiring no code changes.</li>
<li style="font-weight:400;"><a href="https://aka.ms/Fabric-databases-FabCon-SQLCon26">SQL database in Fabric</a> reached general availability for several enterprise security features, including SQL Auditing, Customer-Managed Keys, and Dynamic Data Masking, with workspace-level Private Link in preview, targeting customers with strict governance and compliance requirements.</li>
<li style="font-weight:400;">Microsoft introduced the <a href="https://aka.ms/database-hub">Database Hub</a> in Fabric, now in early <a href="https://aka.ms/database-hub">access</a>, providing a single management plane across <a href="https://azure.microsoft.com/en-us/products/azure-sql/database/">Azure SQL</a>, <a href="https://learn.microsoft.com/en-us/azure/cosmos-db/">Cosmos DB</a>, <a href="https://www.postgresql.org/">PostgreSQL</a>, <a href="https://azure.microsoft.com/en-us/products/mysql/">MySQL</a>, and <a href="https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/overview?view=sql-server-ver17">Arc-enabled SQL Server</a>, with agent-assisted monitoring that surfaces estate-wide signals and recommended actions. </li>
<li style="font-weight:400;">Interested in signing up for Database Hub? You can do that <a href="http://aka.ms/database-hub">here</a>. </li>
</ul>
<p>1:01:37  Matt – “There’s a lot of ‘things’ in this blog post; the biggest one for me is the savings plan for databases… It’s just built in there now. It really means you can get those savings; you don’t have to commit or be a hyperscaler.” </p>
<p>1:03:28 <a href="https://azure.microsoft.com/en-us/updates?id=558183">Generally Available: Versionless key support for transparent data </a><a href="https://azure.microsoft.com/en-us/updates?id=558183">encryption in Azure SQL Database </a></p>
<ul>
<li style="font-weight:400;"><a href="https://azure.microsoft.com/en-us/products/azure-sql/database/">Azure SQL Database</a> now supports versionless keys for transparent data encryption, meaning customers can point to a key in <a href="https://azure.microsoft.com/en-us/products/key-vault/">Azure Key Vault</a> without pinning to a specific version, and the database will automatically use the latest key version as it rotates.</li>
<li style="font-weight:400;">This reduces operational overhead for teams managing customer-managed keys, eliminating the manual step of updating TDE configurations each time a key is rotated in Azure Key Vault or Managed HSM.</li>
<li style="font-weight:400;">The practical benefit is improved reliability around key rotation workflows, since missed version updates previously could cause access disruptions to encrypted databases, a real risk in regulated industries with frequent rotation policies.</li>
<li style="font-weight:400;">This feature is generally available and integrates with existing Azure Key Vault and Managed HSM setups, so customers already using bring-your-own-key TDE can adopt versionless references without rebuilding their encryption architecture.</li>
<li style="font-weight:400;">No additional cost is associated with this feature beyond standard Azure Key Vault or Managed HSM pricing, making it a straightforward operational improvement for any Azure SQL Database customer using customer-managed keys.</li>
</ul>
<p>1:04:10  Justin – “There’s no additional cost for this, and thank god, because this is the dumbest feature I’ve ever heard of in my entire life. Why does it not just do it automatically?”  </p>
<p>1:06:24 <a href="https://visualstudiomagazine.com/articles/2026/03/13/microsoft-launches-azure-skills-plugin-to-give-ai-coding-agents-real-azure-expertise.aspx">Microsoft Launches Azure Skills Plugin to Give AI Coding Agents Real </a><a href="https://visualstudiomagazine.com/articles/2026/03/13/microsoft-launches-azure-skills-plugin-to-give-ai-coding-agents-real-azure-expertise.aspx">Azure Expertise</a></p>
<ul>
<li style="font-weight:400;">Microsoft released the <a href="https://devblogs.microsoft.com/all-things-azure/announcing-the-azure-skills-plugin/">Azure Skills Plugin</a>, available at aka.ms/azure-plugin, which bundles over 19 curated Azure workflow skills, the <a href="https://learn.microsoft.com/en-us/azure/developer/azure-mcp-server/get-started">Azure MCP Server</a> with 200+ tools across 40+ Azure services, and the <a href="https://learn.microsoft.com/en-us/azure/foundry/mcp/get-started?tabs=user">Foundry MCP Server</a> into a single install for AI coding agents. </li>
<li style="font-weight:400;">The goal is to move agents beyond generic code suggestions toward actual Azure deployment actions like provisioning, cost optimization, and live diagnostics.</li>
<li style="font-weight:400;">The skills layer is the core differentiator here, encoding decision trees and sequencing logic for real Azure workflows rather than simple prompt snippets. Key skills include azure-prepare for generating infrastructure code, azure-validate for pre-flight checks, azure-deploy for orchestrating through the <a href="https://learn.microsoft.com/en-us/azure/developer/azure-developer-cli/install-azd?tabs=winget-windows%2Cbrew-mac%2Cscript-linux&amp;pivots=os-windows">Azure Developer CLI</a>, and azure-diagnostics for troubleshooting using logs and KQL queries.</li>
<li style="font-weight:400;">The plugin is designed to be portable across agent hosts, including <a href="https://code.visualstudio.com/docs/copilot/overview">GitHub Copilot in VS Code</a>, <a href="https://github.com/features/copilot/cli/">Copilot CLI</a>, and Claude Code, with configuration handled automatically through a .mcp.json file and a .github/plugins/azure-skills folder. Teams using multiple agent tools do not need to maintain separate configurations for each.</li>
<li style="font-weight:400;">Microsoft is explicit that this setup requires real credentials and real Azure resources, recommending least-privilege access, explicit tool approvals, and skills sourced only from trusted repositories. This positions the agent as a supervised collaborator rather than an autonomous actor, which is a practical consideration for teams evaluating security posture.</li>
<li style="font-weight:400;">Prerequisites include Node.js 18 or later, Azure CLI authenticated via az login, and optionally the Azure Developer CLI for deployment workflows. No specific pricing is listed for the plugin itself, though costs will vary based on the underlying Azure services and resources the agent provisions during use.</li>
</ul>
<p>1:07:48  Matt – “I’m actually most excited for the KQL feature because writing KQL is like writing SQL, but harder, but also I’m terrible at both, so don’t judge that one statement. But if I can live, just tell it to search the logs in a certain way, because right now I just have this terrible workflow of Claude – this is what I’m looking for in KQL. Copy-paste, take the screenshot, put it back over here, copy-paste, and iterate through this very slow cycle. So if I can have it understand KQL, so much better.” </p>
<p>1:08:56 <a href="https://devops.com/azure-devops-remote-mcp-server-lands-in-microsoft-foundry-giving-ai-agents-direct-access-to-your-devops-data/">Azure DevOps Remote MCP Server Lands in Microsoft Foundry, Giving AI </a><a href="https://devops.com/azure-devops-remote-mcp-server-lands-in-microsoft-foundry-giving-ai-agents-direct-access-to-your-devops-data/">Agents Direct Access to Your DevOps Data</a></p>
<ul>
<li style="font-weight:400;">Microsoft launched the <a href="https://learn.microsoft.com/en-us/azure/devops/mcp-server/remote-mcp-server?view=azure-devops">Azure DevOps Remote MCP Server</a> in public preview on March 17, followed by its integration into <a href="https://ai.azure.com/">Microsoft Foundry</a> two days later. </li>
<li style="font-weight:400;">The server gives AI agents a hosted, authenticated connection to Azure DevOps data, including work items, pull requests, pipelines, repos, and wikis via a single URL endpoint at mcp.dev.azure.com.</li>
<li style="font-weight:400;">Authentication runs entirely through Microsoft Entra, meaning organizations apply their existing identity policies, conditional access rules, and permission boundaries to agent access without building separate integrations. Notably, only Entra-backed Azure DevOps organizations are supported, leaving MSA-backed and on-premises deployments without this option for now.</li>
<li style="font-weight:400;">Two access control headers stand out for enterprise use: X-MCP-Readonly restricts agents to read-only operations, and X-MCP-Toolsets lets teams scope which tool categories an agent can access. This shifts the governance conversation from whether agents should touch DevOps data to defining the specific conditions under which they can.</li>
<li style="font-weight:400;">The Foundry integration connects Azure DevOps data to Foundry’s full agent development lifecycle, including model access, orchestration, evaluation, and deployment. Teams can add the server through the Foundry tool catalog and control which specific operations each agent is permitted to perform.</li>
<li style="font-weight:400;">Current limitations worth noting include client support restricted to Visual Studio and VS Code without extra setup, while Claude Desktop, GitHub Copilot CLI, and ChatGPT require additional OAuth configuration in Entra before connecting. Microsoft has also indicated plans to eventually archive the local MCP Server in favor of this remote version, so teams on the local server should begin evaluating migration. No separate pricing has been announced beyond standard Azure DevOps and Foundry costs. </li>
</ul>
<h2>Oracle</h2>
<p>1:11:13 <a href="https://www.oracle.com/news/announcement/oracle-releases-java-26-2026-03-17/">Oracle Releases Java 26</a> </p>
<ul>
<li style="font-weight:400;"><a href="https://openjdk.org/projects/jdk/26/">Java 26</a> ships 10 JDK Enhancement Proposals covering AI integration, cryptography, and language simplification, including HTTP/3 support in the HTTP Client API and a fourth preview of primitive types in pattern matching. None of these are final features yet, with several JEPs still in preview or incubator status after multiple rounds.</li>
<li style="font-weight:400;">The Ahead-of-Time Object Caching feature from <a href="http://v">Project Leyden</a> is worth noting as it extends startup time improvements to work with any garbage collector, including ZGC, which addresses a practical pain point for cloud-native Java deployments where cold start latency matters.</li>
<li style="font-weight:400;">Oracle is launching the <a href="https://www.oracle.com/java/technologies/downloads/jvp/">Java Verified Portfolio</a>, a bundled support offering covering <a href="https://openjfx.io/">JavaFX</a>, <a href="http://v">Helidon</a>, and the <a href="https://code.visualstudio.com/docs/java/extensions">VS Code Java extension</a>, included free for Java SE subscribers and OCI customers running Java workloads. For everyone else, pricing is not explicitly stated beyond noting that many components remain free for a wide range of use cases.</li>
<li style="font-weight:400;"><a href="https://openjdk.org/jeps/504">The Applet API</a> removal in JEP 504 is notable mainly as a cleanup item, having been deprecated since JDK 17, and signals Oracle is willing to break legacy compatibility when features have been sufficiently warned about over multiple release cycles.</li>
<li style="font-weight:400;">Helidon is being proposed as an OpenJDK project and aligned to the Java release cadence, which tightens Oracle’s control over the microservices framework ecosystem while keeping it open source, a pattern Oracle has used with other technologies in its portfolio.</li>
</ul>
<p>1:11:21  Justin – “They brought AI to Java, and all is going to be lost.” </p>
<p>1:12:21 <a href="https://www.oracle.com/news/announcement/oracle-unveils-ai-database-agentic-innovations-for-business-data-2026-03-24/">Oracle Unveils AI Database Agentic Innovations for Business Data</a></p>
<ul>
<li style="font-weight:400;">Oracle announced a bundle of agentic AI capabilities for <a href="https://www.oracle.com/database/">Oracle AI Database</a> at its AI World Tour in London, centered on keeping AI workloads closer to the data rather than moving data to external AI systems. </li>
<li style="font-weight:400;">The headline additions include the <a href="https://blogs.oracle.com/database/announcing-oracle-autonomous-ai-vector-database-limited-availability">Autonomous AI Vector Database</a> in limited availability on free and low-cost developer tiers, a Private Agent Factory for no-code agent building, and a Unified Memory Core for storing agent context across multiple data types in a single engine.</li>
<li style="font-weight:400;">The security angle is notable here. <a href="http://v">Oracle Deep Data Security</a> and the <a href="https://www.oracle.com/database/private_ai_services_container/">Private AI Services Container</a> are positioned to address prompt injection and data leakage risks by enforcing least-privilege access at the database layer rather than in application code, which is a practical concern for enterprises deploying agents against sensitive business data.</li>
<li style="font-weight:400;"><a href="https://www.oracle.com/database/technologies/trusted-answer-search-downloads.html">Oracle Trusted Answer Search</a> takes a conservative approach to reducing hallucinations by matching user questions to pre-built reports via vector search rather than letting an LLM answer directly, which trades flexibility for determinism and may suit regulated industries but limits open-ended query use cases.</li>
<li style="font-weight:400;">The open standards additions, specifically Vectors on Ice for Apache Iceberg support and an Autonomous AI Database MCP Server, are worth noting because they reduce some of the lock-in concerns that typically follow Oracle announcements, though customers still need to be running Oracle AI Database to benefit.</li>
<li style="font-weight:400;">Pricing details are sparse in the announcement. The Autonomous AI Vector Database is available through the Oracle Cloud free tier or a low-cost developer tier, with a one-click upgrade path to full Autonomous AI Database, but Oracle has not published specific per-unit costs for the new agentic capabilities.</li>
</ul>

<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2412061/c1e-p8j8uw1n3da1rwod-xx7pkn17upwg-way3m1.mp3" length="136879947"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 348 of The Cloud Pod, where the weather is always cloudy! Justin, Ryan, and Matt are in the studio this week to bring you all the latest news in AI and Cloud, inclduing Strykers troubles, AWS’ birthday, Bedrock Agents, and Claude Code – plus so much more. Let’s get started! 
Titles we almost went with this week

 SOC 2 It to Me Delve Fires Back 
 Shell Yeah Bedrock Agents Just Got Command Line Powers
When Your SOC 2 Report Is Just Fan Fiction
 uv, Ruff, and ty Walk Into an OpenAI Acquisition
 Hash Field Expiration Is Here, and It’s No Redis Herring
 Stop Paying Full Price for Tokens You Already Bought
 Fake It Till You Audit It
 Cache Me If You Can CNCF Sandbox Edition
 Microsoft Learns Consent Matters in Copilot Rollout
 Microsoft’s Stinky Cloud Gets Federal Seal of Approval
 When Your Audit Trail Leads to a Blog Fight
 Ping Your AI Agent on Discord Like a Millennial
 Twenty Years of AWS and the Bill Never Stops
The LLM hack that feels a lot like Node Shift Left Package issues
 Claude Code Auto Mode Lets AI Work Unsupervised
 Stop Babysitting Your AI Claude Code Goes Solo
 Auto Mode Gives Claude Code the Keys to the Car
 Java comes to the coffee shop with AI

General News 
01:21 Customer Updates: Stryker Network Disruption 

Stryker confirmed a cyberattack on March 11, 2026, that disrupted their internal Microsoft corporate environment, affecting order processing, manufacturing, and shipping, but notably not their connected medical devices or cloud-hosted products.
The attack vector was specific to Stryker’s Microsoft environment, which meant products running on AWS (Vocera Edge, Vocera Ease) and Google Cloud Platform (care.ai) were architecturally isolated and unaffected, demonstrating a practical benefit of multi-cloud separation.
Stryker explicitly stated this was not ransomware or malware, and government agencies, including CISA, FBI, and the White House National Cyber Director, were engaged, with domain seizures linked to threat actors already executed.
The incident highlights how healthcare organizations can architect medical device and cloud product infrastructure to be independent of corporate IT environments, as every product from Mako to SurgiCount to LIFEPAK operated normally due to network segmentation.
Real-world patient impact was limited but present, with some personalized implant cases rescheduled due to shipping delays, underscoring that even contained corporate IT incidents can have downstream effects on physical supply chains.

02:30  Justin – “HugOps to the entire Stryker team; I couldn’t imagine having to rebuild my entire Windows estate at a company the size of Stryker in the middle of trying to do business and everything else.” 
05:00 ]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2412061/c1a-k5d5-5z3vq7p2u13o-1fr0ux.jpg"></itunes:image>
                                                                            <itunes:duration>01:10:59</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2412061/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[347: The CloudPod is Only Recording this Week “Because of AI”]]>
                </title>
                <pubDate>Thu, 26 Mar 2026 21:24:20 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2406600</guid>
                                    <link>https://tcpfm.castos.com/episodes/347-the-cloudpod-is-only-recording-this-week-because-of-ai</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 347 of The Cloud Pod, where the forecast is always cloudy! Justin, Jonathan, and Ryan are in the studio recording today, and thankfully, Jonathan hasn’t replaced us all with Skynet – yet. This week, we’re discussing how old our tools (and us) are (hint: it’s really old), whether or not the SaasApocalypse is upon us, and whether or not the business or AI is responsible for the latest round of layoffs. </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> S3 Bucket Names Finally Stop Being a Global Hunger Games</li>
<li> One Million Tokens Walk Into a Context Window</li>
<li> SLO Down and Smell the Reliability Metrics</li>
<li> CloudWatch Finally Watches Your Whole Cloud Organization</li>
<li> S3 Turns 20 and Still Buckets the Competition</li>
<li> Azure SRE Agent Goes GA So You Don’t Have To</li>
<li> Twenty Years of S3 and No Signs of Object Permanence</li>
<li> One Rule to Monitor Them All Across AWS</li>
<li>One Flag to Secure Them All on Cloud Run</li>
<li> SaaSpocalypse Now Atlassian Layoffs Hit the Jira</li>
<li> No More Bucket Name Bingo with S3 Regional Namespaces</li>
<li> A Picture Is Worth a Thousand Claude Tokens</li>
<li> One Command to Rule Your Autonomous AI Agents</li>
<li>AI Fixes Your Incidents Before Your Boss Notices</li>
<li> The CloudPod is only recording this week “Because of AI”</li>
<li> Amazon begs users to leave Simple DB with another migration tool</li>
</ul>
<h2>Follow Up</h2>
<p>00:54 <a href="https://www.geekwire.com/2026/microsofts-brief-in-anthropic-case-shows-new-alliance-and-willingness-to-challenge-trump-administration/">Microsoft’s brief in Anthropic case shows new alliance and willingness to </a><a href="https://www.geekwire.com/2026/microsofts-brief-in-anthropic-case-shows-new-alliance-and-willingness-to-challenge-trump-administration/">challenge Trump administration</a></p>
<ul>
<li style="font-weight:400;">Microsoft <a href="https://storage.courtlistener.com/recap/gov.uscourts.cand.465515/gov.uscourts.cand.465515.34.1.pdf">filed an amicus brief</a> in <a href="https://www.anthropic.com/">Anthropic’s</a> <a href="https://storage.courtlistener.com/recap/gov.uscourts.cand.465515/gov.uscourts.cand.465515.1.0_5.pdf">lawsuit</a> against the <a href="https://www.war.gov/">U.S. Department of War</a>, urging a federal judge to temporarily block the Pentagon’s designation of Anthropic as a supply chain risk, citing substantial costs to government contractors that rely on Anthropic models.</li>
<li style="font-weight:400;">The brief arrived one day after Microsoft launched <a href="https://www.microsoft.com/en-us/microsoft-365/blog/2026/03/09/copilot-cowork-a-new-way-of-getting-work-done/">Copilot Cowork</a>, built on Anthropic’s <a href="https://claude.ai/new">Claude</a>, and four months after Microsoft <a href="https://www.geekwire.com/2025/microsoft-to-invest-5b-in-anthropic-as-claude-maker-commits-30b-to-azure-in-new-nvidia-alliance/">committed up to $5 billion</a> in Anthropic as part of a deal requiring Anthropic to spend at least $30 billion on <a href="https://azure.microsoft.com/en-us/get-started/azure-portal/">Azure</a>, making the legal filing directly tied to concrete commercial dependencies.</li>
<li style="font-weight:400;">Microsoft highlighted a procedural inconsistency in the government’s approach: the Pentagon gave itself six months to transition off Anthropic’s models while making the supply chain designation effective immediately for contractors, creating an unequal compliance burden.</li>
<li style="font-weight:400;">Amazon, which has invested $8 billion in Anthropic, has not publicly responded to the lawsuit or the designation, creating a notable contrast in how two major cloud providers with similar financial exposure are handling the situation.</li>
<li style="font-weight:400;">OpenAI announced its own Pentagon deal on the same day the Anthropic designation was issued, and<a href="https://www.wired.com/story/openai-deepmind-employees-file-amicus..."></a></li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:00) - The Cloud Pod</li><li>(00:00:54) - Podcasters: 17 Hours Long</li><li>(00:01:09) - Microsoft's Amicus Brief in Anthropic Lawsuit</li><li>(00:06:19) - Claude Launches In-App Visualization Feature</li><li>(00:08:41) - Databricks Launches GENIE Code as a General Available Product</li><li>(00:11:09) - 1. Million Context Window</li><li>(00:17:31) - Code: Auto-Compaction</li><li>(00:19:20) - GPT 5.4 Mini and Nano: Smaller Models for</li><li>(00:22:19) - Amazon S3: 20 Years of Computing</li><li>(00:24:56) -  AWS S3: Regional Namespaces for General Purpose Bools</li><li>(00:27:30) - Amazon CloudWatch</li><li>(00:28:58) - Amazon SimpleDB now supports exporting domain data directly to S3</li><li>(00:31:14) - Amazon CloudWatch: EC2: Detailed Monitoring Enablement</li><li>(00:32:35) - Google Cloud's Sensitive Data Protection</li><li>(00:35:57) - Google Completing Acquisition of Wiz Cloud Security Platform</li><li>(00:40:24) - Google Cloud's Kubernetes Inference Gateway</li><li>(00:45:40) - Azure S3 Agent</li><li>(00:50:00) - Azure's Cloud Migration Agent and GitHub Copilot modernization agent</li><li>(00:53:45) - Microsoft Merges Copilot into a Unified Organization</li><li>(00:56:17) - Copilot: What's Next for the Service?</li><li>(00:57:56) - Week in the Cloud: Microsoft</li><li>(00:58:36) - Amazon AI Voice Service misconfiguration in Spanish</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 347 of The Cloud Pod, where the forecast is always cloudy! Justin, Jonathan, and Ryan are in the studio recording today, and thankfully, Jonathan hasn’t replaced us all with Skynet – yet. This week, we’re discussing how old our tools (and us) are (hint: it’s really old), whether or not the SaasApocalypse is upon us, and whether or not the business or AI is responsible for the latest round of layoffs. 
Titles we almost went with this week

 S3 Bucket Names Finally Stop Being a Global Hunger Games
 One Million Tokens Walk Into a Context Window
 SLO Down and Smell the Reliability Metrics
 CloudWatch Finally Watches Your Whole Cloud Organization
 S3 Turns 20 and Still Buckets the Competition
 Azure SRE Agent Goes GA So You Don’t Have To
 Twenty Years of S3 and No Signs of Object Permanence
 One Rule to Monitor Them All Across AWS
One Flag to Secure Them All on Cloud Run
 SaaSpocalypse Now Atlassian Layoffs Hit the Jira
 No More Bucket Name Bingo with S3 Regional Namespaces
 A Picture Is Worth a Thousand Claude Tokens
 One Command to Rule Your Autonomous AI Agents
AI Fixes Your Incidents Before Your Boss Notices
 The CloudPod is only recording this week “Because of AI”
 Amazon begs users to leave Simple DB with another migration tool

Follow Up
00:54 Microsoft’s brief in Anthropic case shows new alliance and willingness to challenge Trump administration

Microsoft filed an amicus brief in Anthropic’s lawsuit against the U.S. Department of War, urging a federal judge to temporarily block the Pentagon’s designation of Anthropic as a supply chain risk, citing substantial costs to government contractors that rely on Anthropic models.
The brief arrived one day after Microsoft launched Copilot Cowork, built on Anthropic’s Claude, and four months after Microsoft committed up to $5 billion in Anthropic as part of a deal requiring Anthropic to spend at least $30 billion on Azure, making the legal filing directly tied to concrete commercial dependencies.
Microsoft highlighted a procedural inconsistency in the government’s approach: the Pentagon gave itself six months to transition off Anthropic’s models while making the supply chain designation effective immediately for contractors, creating an unequal compliance burden.
Amazon, which has invested $8 billion in Anthropic, has not publicly responded to the lawsuit or the designation, creating a notable contrast in how two major cloud providers with similar financial exposure are handling the situation.
OpenAI announced its own Pentagon deal on the same day the Anthropic designation was issued, and]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[347: The CloudPod is Only Recording this Week “Because of AI”]]>
                </itunes:title>
                                    <itunes:episode>347</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 347 of The Cloud Pod, where the forecast is always cloudy! Justin, Jonathan, and Ryan are in the studio recording today, and thankfully, Jonathan hasn’t replaced us all with Skynet – yet. This week, we’re discussing how old our tools (and us) are (hint: it’s really old), whether or not the SaasApocalypse is upon us, and whether or not the business or AI is responsible for the latest round of layoffs. </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> S3 Bucket Names Finally Stop Being a Global Hunger Games</li>
<li> One Million Tokens Walk Into a Context Window</li>
<li> SLO Down and Smell the Reliability Metrics</li>
<li> CloudWatch Finally Watches Your Whole Cloud Organization</li>
<li> S3 Turns 20 and Still Buckets the Competition</li>
<li> Azure SRE Agent Goes GA So You Don’t Have To</li>
<li> Twenty Years of S3 and No Signs of Object Permanence</li>
<li> One Rule to Monitor Them All Across AWS</li>
<li>One Flag to Secure Them All on Cloud Run</li>
<li> SaaSpocalypse Now Atlassian Layoffs Hit the Jira</li>
<li> No More Bucket Name Bingo with S3 Regional Namespaces</li>
<li> A Picture Is Worth a Thousand Claude Tokens</li>
<li> One Command to Rule Your Autonomous AI Agents</li>
<li>AI Fixes Your Incidents Before Your Boss Notices</li>
<li> The CloudPod is only recording this week “Because of AI”</li>
<li> Amazon begs users to leave Simple DB with another migration tool</li>
</ul>
<h2>Follow Up</h2>
<p>00:54 <a href="https://www.geekwire.com/2026/microsofts-brief-in-anthropic-case-shows-new-alliance-and-willingness-to-challenge-trump-administration/">Microsoft’s brief in Anthropic case shows new alliance and willingness to </a><a href="https://www.geekwire.com/2026/microsofts-brief-in-anthropic-case-shows-new-alliance-and-willingness-to-challenge-trump-administration/">challenge Trump administration</a></p>
<ul>
<li style="font-weight:400;">Microsoft <a href="https://storage.courtlistener.com/recap/gov.uscourts.cand.465515/gov.uscourts.cand.465515.34.1.pdf">filed an amicus brief</a> in <a href="https://www.anthropic.com/">Anthropic’s</a> <a href="https://storage.courtlistener.com/recap/gov.uscourts.cand.465515/gov.uscourts.cand.465515.1.0_5.pdf">lawsuit</a> against the <a href="https://www.war.gov/">U.S. Department of War</a>, urging a federal judge to temporarily block the Pentagon’s designation of Anthropic as a supply chain risk, citing substantial costs to government contractors that rely on Anthropic models.</li>
<li style="font-weight:400;">The brief arrived one day after Microsoft launched <a href="https://www.microsoft.com/en-us/microsoft-365/blog/2026/03/09/copilot-cowork-a-new-way-of-getting-work-done/">Copilot Cowork</a>, built on Anthropic’s <a href="https://claude.ai/new">Claude</a>, and four months after Microsoft <a href="https://www.geekwire.com/2025/microsoft-to-invest-5b-in-anthropic-as-claude-maker-commits-30b-to-azure-in-new-nvidia-alliance/">committed up to $5 billion</a> in Anthropic as part of a deal requiring Anthropic to spend at least $30 billion on <a href="https://azure.microsoft.com/en-us/get-started/azure-portal/">Azure</a>, making the legal filing directly tied to concrete commercial dependencies.</li>
<li style="font-weight:400;">Microsoft highlighted a procedural inconsistency in the government’s approach: the Pentagon gave itself six months to transition off Anthropic’s models while making the supply chain designation effective immediately for contractors, creating an unequal compliance burden.</li>
<li style="font-weight:400;">Amazon, which has invested $8 billion in Anthropic, has not publicly responded to the lawsuit or the designation, creating a notable contrast in how two major cloud providers with similar financial exposure are handling the situation.</li>
<li style="font-weight:400;">OpenAI announced its own Pentagon deal on the same day the Anthropic designation was issued, and<a href="https://www.wired.com/story/openai-deepmind-employees-file-amicus-brief-anthropic-dod-lawsuit/"> 37 researchers from OpenAI and Google separately filed an amicus brief supporting Anthropic</a>, indicating the case is drawing broad attention across the AI and cloud industry with potential implications for how AI guardrails are treated in government contracts.</li>
</ul>
<p>01:37  Justin – “Oh, yeah, there’s a vested interest in the lawsuit which we did not mention last week, so I wanted to follow up on that, because that explains very clearly why Microsoft is throwing in with Anthropic on this.” </p>
<h2>General News </h2>
<p>02:37 <a href="https://go.theregister.com/feed/www.theregister.com/2026/03/11/atlassian_layoffs/">Atlassian to shed ten percent of staff, because of AI </a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.atlassian.com/">Atlassian</a> is cutting roughly 1,600 employees, about 10 percent of its workforce, <a href="https://www.atlassian.com/blog/announcements/atlassian-team-update-march-2026">citing</a> AI-driven changes to required skill sets and a need to self-fund further AI and enterprise sales investment.</li>
<li style="font-weight:400;">The company’s market cap has dropped from a peak of around 112 billion dollars in 2021 to approximately 20 billion dollars today, providing financial context for why cost restructuring is happening alongside the AI narrative.</li>
<li style="font-weight:400;">The SaaSpocalypse concept is worth discussing here, as Atlassian is among the SaaS vendors analysts flag as potentially vulnerable to organizations replacing traditional tools with AI-generated or vibe-coded alternatives.</li>
<li style="font-weight:400;">Atlassian points to 25 percent cloud revenue growth, 600 customers spending over 1 million dollars annually, and 5 million users on its <a href="https://www.atlassian.com/software/rovo">Rovo AI suite</a> as indicators that the business is still growing, which creates an interesting tension with the layoff announcement.</li>
<li style="font-weight:400;">For cloud practitioners, this is a concrete example of how AI adoption is beginning to visibly reshape headcount decisions at established SaaS vendors, not just startups, which has implications for how enterprises evaluate vendor stability and long-term support commitments.</li>
</ul>
<p>03:18  Justin – “I’ve seen Rovo, which is Atlassian’s AI suite, and if that’s the best they can do… I have fears for the long-term health and viability of Jira in general. I’m kind of over the whole let’s blame AI for our bad business decisions. That’s going to get old real quick.” </p>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>06:18 <a href="https://claude.com/blog/claude-builds-visuals">Claude builds interactive visuals right in your conversation</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/">Anthropic</a> has launched in beta a new inline <a href="https://claude.com/blog/claude-builds-visuals">visualization feature for Claude</a> that generates interactive charts, diagrams, and other visuals directly within chat conversations, available across all plan tiers at no additional cost.</li>
<li style="font-weight:400;">These visuals are distinct from Claude’s existing <a href="https://claude.ai/catalog/artifact">artifacts</a> system in a notable way: they are temporary and contextual, appearing inline rather than in a side panel, and they update or disappear as the conversation evolves rather than serving as persistent shareable documents.</li>
<li style="font-weight:400;">Claude determines autonomously when a visual would aid comprehension, but users can also prompt it directly with natural language requests like “draw this as a diagram” or “visualize how this might change over time,” and can request adjustments iteratively within the same conversation.</li>
<li style="font-weight:400;">The feature is part of a broader set of response format improvements Anthropic has been rolling out, including purpose-built layouts for recipes and weather queries, as well as direct in-conversation integrations with third-party tools like <a href="https://www.figma.com/">Figma</a>, <a href="https://www.canva.com/">Canva</a>, and <a href="https://slack.com/signin#/signin">Slack</a>.</li>
<li style="font-weight:400;">For developers and enterprise users, the practical implication is that Claude can now serve as a lightweight data visualization layer within workflows without requiring users to export data to separate charting tools, which could reduce friction in analytical and educational use cases.</li>
</ul>
<p>07:27  Ryan – “Kind of excited when Claude decides that the monkey making the queries needs bigger pictures because the text isn’t working out, so it’s like, I get you, Claude. I see what you’re doing.”</p>
<p>07:38  Jonathan – “Anthropic’s Claude: Now with crayons.” </p>
<p>08:50 <a href="https://www.databricks.com/blog/introducing-genie-code">Introducing Genie Code </a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.databricks.com/">Databricks</a> has launched <a href="https://www.databricks.com/blog/introducing-genie-code">Genie Code</a> as a generally available product, positioning it as an agentic AI system built specifically for data teams rather than general software development. </li>
<li style="font-weight:400;">It handles end-to-end tasks, including pipeline building, dashboard creation, ML model training, and production monitoring, directly within <a href="https://www.databricks.com/product/collaborative-notebooks">Databricks notebooks</a>, SQL editor, and <a href="https://www.databricks.com/blog/introducing-databricks-lakeflow">Lakeflow Pipelines</a>.</li>
<li style="font-weight:400;">The system claims to outperform a leading coding agent by more than 2x on real-world data science tasks, with the key differentiator being deep <a href="https://docs.databricks.com/aws/en/data-governance/unity-catalog/">Unity Catalog</a> integration that gives it access to data lineage, usage patterns, governance policies, and business semantics rather than just reading raw code.</li>
<li style="font-weight:400;">Genie Code routes tasks across multiple models automatically, selecting from frontier LLMs, open source models, or custom Databricks-hosted models depending on the job, removing the need for users to manually choose models for different tasks.</li>
<li style="font-weight:400;">A notable upcoming capability is background agents, which will proactively monitor Lakeflow pipelines and AI models, triage failures, handle routine Databricks Runtime upgrades, and auto-fix issues like schema mismatches in a sandboxed environment before alerting the team.</li>
<li style="font-weight:400;">The governance angle is worth discussing for enterprise cloud users: Genie Code enforces Unity Catalog access controls during all operations, meaning it only surfaces data assets a user is authorized to see and respects existing lineage rules when building pipelines, which addresses a common concern with agentic systems operating on sensitive production data.</li>
</ul>
<p>10:05  Ryan – “I don’t think it will kill Glue or any of the ETL things, but hopefully it will just do it for you, and then I don’t think I care anymore.”  </p>
<p>11:19 <a href="https://claude.com/blog/1m-context-ga">1M context is now generally available for Opus 4.6 and Sonnet 4.6</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/">Anthropic</a> has moved 1M context windows to general availability for <a href="https://claude.ai/new">Claude</a> <a href="https://www.anthropic.com/news/claude-opus-4-6">Opus 4.6</a> and <a href="https://www.anthropic.com/claude/sonnet">Sonnet 4.6</a>, with standard pricing applying across the full window and no long-context premium. </li>
<li style="font-weight:400;">Opus 4.6 is <a href="https://claude.com/pricing">priced</a> at $5/$25 per million input/output tokens, and Sonnet 4.6 at $3/$15, meaning a 900K-token request costs the same per-token rate as a 9K one.</li>
<li style="font-weight:400;">On the performance side, Opus 4.6 scores 78.3% on MRCR v2, a benchmark measuring recall and reasoning across long contexts, which Anthropic claims is the highest among frontier models at that context length.</li>
<li style="font-weight:400;">Practical use cases include loading entire codebases, thousands of pages of contracts, or full agent traces with tool calls and intermediate reasoning, eliminating the need for lossy summarization or manual context management that long-context workflows previously required.</li>
<li style="font-weight:400;">Claude Code users on <a href="https://claude.com/pricing/max">Max</a>, <a href="https://claude.com/pricing/team">Team</a>, and <a href="https://claude.com/pricing/enterprise">Enterprise</a> plans now get 1M context automatically with Opus 4.6, meaning fewer session compactions and more conversation history retained without consuming extra usage credits.</li>
<li style="font-weight:400;">The 1M context window is available natively on the Claude Platform and through <a href="https://aws.amazon.com/bedrock/">Amazon Bedrock</a>, <a href="https://cloud.google.com/vertex-ai">Google Cloud Vertex AI</a>, and <a href="https://azure.microsoft.com/en-us/products/ai-foundry/">Microsoft Foundry</a>, making it accessible across the major cloud provider ecosystems that developers are already using.</li>
</ul>
<p>19:46 <a href="https://openai.com/index/introducing-gpt-5-4-mini-and-nano">Introducing GPT-5.4 mini and nano</a></p>
<ul>
<li style="font-weight:400;">OpenAI released <a href="https://openai.com/index/introducing-gpt-5-4-mini-and-nano/">GPT-5.4 mini and nano</a>, two small models positioned for high-volume, latency-sensitive workloads. </li>
<li style="font-weight:400;">GPT-5.4 mini runs more than 2x faster than GPT-5 mini while approaching GPT-5.4 performance on benchmarks like SWE-Bench Pro and OSWorld-Verified.</li>
<li style="font-weight:400;">Pricing is notably lower than larger models: GPT-5.4 mini costs $0.75 per 1M input tokens and $4.50 per 1M output tokens, while GPT-5.4 nano comes in at $0.20 input and $1.25 output per 1M tokens, with a 400k context window on mini.</li>
<li style="font-weight:400;">The models are designed for multi-model orchestration patterns where a larger model like <a href="https://openai.com/index/introducing-gpt-5-4/">GPT-5.4</a> handles planning and coordination while GPT-5.4 mini subagents execute narrower parallel tasks, a pattern OpenAI has built directly into their <a href="https://openai.com/codex/">Codex</a> product.</li>
<li style="font-weight:400;">In Codex specifically, GPT-5.4 mini uses only 30% of the GPT-5.4 quota, giving developers a cost-effective path for simpler coding tasks like codebase navigation, targeted edits, and debugging loops without sacrificing too much capability.</li>
<li style="font-weight:400;">GPT-5.4 nano is API-only and recommended for classification, data extraction, ranking, and simpler subagent tasks, making it a practical option for cloud workloads where cost and throughput matter more than deep reasoning.</li>
</ul>
<p>21:00  Ryan – “I’m a fan of these little models for certain things; as part of that tuning, my agent definitions have gotten a lot more complex. A lot of times, I’m breaking out agent definitions so that I can specifically use one of the smaller models for certain types of tasks. Data extraction being a big one.” </p>
<h2>AWS</h2>
<p>22:53 <a href="https://aws.amazon.com/blogs/aws/twenty-years-of-amazon-s3-and-building-whats-next/">Twenty years of Amazon S3 and building what’s next </a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/s3/">S3</a> turns 20 years old this month, growing from 1 petabyte of capacity and 15 cents per gigabyte in 2006 to hundreds of exabytes storing over 500 trillion objects at just over 2 cents per gigabyte today, representing roughly an 85% price reduction over two decades.</li>
<li style="font-weight:400;">A notable engineering detail is that code written for S3 in 2006 still works today unchanged, with AWS maintaining complete API backward compatibility through multiple infrastructure generations, which is why the S3 API has become a de facto standard across the storage industry.</li>
<li style="font-weight:400;">On the technical side, AWS has spent 8 years progressively rewriting performance-critical S3 components in <a href="https://rust-lang.org/">Rust</a> for memory safety and performance, and uses formal methods with automated proofs to mathematically verify consistency in the index subsystem and cross-region replication.</li>
<li style="font-weight:400;">AWS is positioning S3 as a universal data foundation with three newer capabilities worth noting: S3 Tables for managed <a href="https://iceberg.apache.org/">Apache Iceberg</a> analytics, S3 Vectors for native vector storage supporting up to 2 billion vectors per index at sub-100ms latency, and S3 Metadata for centralized object cataloging, all priced at standard S3 cost structures rather than specialized database pricing.</li>
<li style="font-weight:400;">The maximum object size has grown from 5 GB to 50 TB, and AWS reports customers have collectively saved over $6 billion in storage costs through S3 Intelligent-Tiering compared to S3 Standard storage class pricing.</li>
</ul>
<p>24:08  Justin – “I am a big fan of the S3 vectors because we use it for Bolt.” </p>
<p>25:39 <a href="https://aws.amazon.com/blogs/aws/introducing-account-regional-namespaces-for-amazon-s3-general-purpose-buckets/">Introducing account regional namespaces for Amazon S3 general-purpose </a><a href="https://aws.amazon.com/blogs/aws/introducing-account-regional-namespaces-for-amazon-s3-general-purpose-buckets/">buckets</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/s3/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">AWS S3</a> now supports account regional namespaces for general-purpose buckets, where bucket names automatically include your account ID and region as a suffix, such as mybucket-123456789012-us-east-1-an. </li>
<li style="font-weight:400;">This solves the long-standing problem of bucket name collisions in the global namespace, particularly useful for large organizations managing buckets at scale across multiple regions.</li>
<li style="font-weight:400;">The feature integrates with <a href="https://aws.amazon.com/iam/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">IAM</a> and <a href="https://aws.amazon.com/organizations/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">AWS Organizations</a> service control policies via the new s3:x-amz-bucket-namespace condition key, allowing security teams to enforce that employees only create buckets within their account’s namespace. 
<ul>
<li style="font-weight:400;">This gives enterprises a straightforward governance mechanism to prevent naming conflicts and unauthorized bucket creation.</li>
</ul>
</li>
<li style="font-weight:400;">Existing global namespace buckets cannot be renamed to use the account regional namespace, so this is a forward-looking change for new bucket creation only. S3 table buckets, vector buckets, and directory buckets already operate in account-level or zonal namespaces, so this update brings general-purpose buckets in line with those patterns.</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/cloudformation/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">CloudFormation</a> support is included via the BucketNamespace property and pseudo parameters AWS::AccountId and AWS::Region, making it straightforward to update existing IaC templates. CLI and <a href="https://aws.amazon.com/sdk-for-python/">Boto3</a> support is also available using the x-amz-bucket-namespace header or BucketNamespace parameter.</li>
<li style="font-weight:400;">The feature is available across 37 AWS regions, including AWS China and GovCloud, at no additional cost, making it a low-friction adoption for teams looking to simplify bucket naming conventions without budget impact.</li>
</ul>
<p>27:17  Jonathan – “What’s really annoying is your account number is part of the public S3 bucket name! I wish a security person had been in the room there.” </p>
<p>28:17 <a href="https://aws.amazon.com/about-aws/whats-new/2026/03/cloudwatch-application-signals-adds-slo-capabilities/">Amazon CloudWatch Application Signals adds new SLO capabilities</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/cloudwatch/">Amazon CloudWatch</a> Application Signals now includes three new SLO capabilities: <a href="https://docs.aws.amazon.com/AmazonCloudWatch/latest/monitoring/CloudWatch-ServiceLevelObjectives.html#CloudWatch-ServiceLevelObjectives-Recommendations">SLO Recommendations</a>, Service-Level SLOs, and SLO Performance Report, addressing longstanding gaps in data-driven reliability management for AWS customers.</li>
<li style="font-weight:400;">SLO Recommendations analyzes 30 days of historical P99 latency and error rate data to suggest appropriate reliability targets, reducing the manual guesswork that previously led to misconfigured thresholds and alert fatigue.</li>
<li style="font-weight:400;">Service-Level SLOs give teams a consolidated view of reliability across all operations within a service, making it easier to align technical monitoring with business objectives without stitching together multiple dashboards.</li>
<li style="font-weight:400;">The SLO Performance Report adds calendar-aligned historical reporting at daily, weekly, and monthly intervals, which is useful for teams that need to present reliability data to stakeholders in business-friendly formats.</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/cloudwatch/pricing/">Pricing</a> is usage-based, tied to inbound and outbound application requests plus SLO charges, with each SLO generating 2 application signals per service level indicator metric period. The features are available in all regions where CloudWatch Application Signals is supported.</li>
</ul>
<p>29:11  Jonathan – “So instead of fixing your product, you just use a tool that tells you that you should turn down your commitments to your customers. Ok…” </p>
<p>29:57 <a href="https://aws.amazon.com/about-aws/whats-new/2026/03/amazon-simpledb-domain-export-to-amazon-s3/">Amazon SimpleDB now supports exporting domain data to Amazon S3</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/simpledb/">Amazon SimpleDB</a>, one of AWS’s oldest database services dating back to 2007, now supports exporting domain data directly to S3 in <a href="https://json.org">JSON</a> format, giving long-time users a practical path to migrate away from the service or archive data for compliance purposes.</li>
<li style="font-weight:400;">The export tool introduces three new APIs (StartDomainExport, GetExport, and ListExports) with background processing that avoids any performance impact on the running database, which matters for users who cannot afford downtime during data extraction.</li>
<li style="font-weight:400;">Cross-region and cross-account support, along with multiple encryption options, make this useful for organizations with strict data governance requirements who need to move SimpleDB data into modern storage or database systems.</li>
<li style="font-weight:400;">Rate limiting is set at 5 exports per domain and 25 per account within a 24-hour window, so teams with large numbers of domains should plan their migration timelines accordingly rather than assuming bulk exports can happen all at once.</li>
<li style="font-weight:400;">The tool itself is free to use, but standard S3 data transfer charges apply, so cost planning should account for data volume when scoping a migration or archival project.</li>
</ul>
<p>30:53  Justin – “SimpleDB gets a new feature!” </p>
<p>32:19 <a href="https://aws.amazon.com/about-aws/whats-new/2026/03/cloudwatch-org-enablement-ec2-metrics/">Amazon CloudWatch introduces organization-wide EC2 detailed monitoring </a><a href="https://aws.amazon.com/about-aws/whats-new/2026/03/cloudwatch-org-enablement-ec2-metrics/">enablement</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/cloudwatch/">CloudWatch</a> now supports organization-wide rules to automatically enable <a href="https://aws.amazon.com/ec2/">EC2 </a>detailed monitoring, shifting metrics collection from a per-instance manual task to a centralized policy-driven configuration across the entire AWS Organizations.</li>
<li style="font-weight:400;">Rules can be scoped to the full organization, specific accounts, or individual resources using tags, so teams can target environments like production workloads without enabling the feature universally and incurring unnecessary costs.</li>
<li style="font-weight:400;">The 1-minute interval metrics that detailed monitoring provides are particularly relevant for Auto Scaling groups, where faster data collection means scaling policies can respond more quickly to utilization changes rather than waiting for the default 5-minute interval.</li>
<li style="font-weight:400;">The feature covers both existing and newly launched instances within the rule scope, which closes a common gap where new instances spun up after policy creation would otherwise miss monitoring configuration.</li>
<li style="font-weight:400;">Detailed monitoring costs apply per instance per metric per month per CloudWatch pricing, so organizations should evaluate tag-based scoping carefully to avoid unexpected billing increases when rolling this out broadly.</li>
</ul>
<p>33:17  Ryan – “I mean, what’s wrong with the previous method of waiting until you  had an outage, not having the data, and THEN turning it on for your project?” </p>
<h2>GCP</h2>
<p>33:47 <a href="https://cloud.google.com/blog/products/identity-security/why-context-is-the-missing-link-in-ai-data-security/">Why context is the missing link in AI data security </a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/">Google Cloud’s</a> <a href="http://cloud.google.com/security/products/sensitive-data-protection">Sensitive Data Protection</a> is now generally available with new context classifiers for medical and finance data, plus image object detectors for faces and passports, moving beyond simple keyword matching to understand the semantic meaning of data.</li>
<li style="font-weight:400;">For AI training workflows on <a href="http://cloud.google.com/vertex-ai">Vertex AI</a>, SDP can scan unstructured image data using OCR and object detection to find sensitive content like credit card numbers or photo IDs, then generate redacted versions rather than discarding the data entirely, preserving training dataset quality.</li>
<li style="font-weight:400;">The context-aware approach addresses a practical problem with traditional regex-based detection: the same number sequence can be treated differently depending on surrounding words, so “order number” passes through while “wallet number” triggers financial context classification and redaction.</li>
<li style="font-weight:400;">SDP serves as the underlying engine for several other Google Cloud products, including <a href="http://cloud.google.com/security/products/model-armor">Model Armor</a>, <a href="http://cloud.google.com/security/products/security-command-center">Security Command Center</a>, and <a href="http://cloud.google.com/solutions/contact-center-ai-platform">Contact Center</a> as a Service, meaning improvements here propagate across those services automatically.</li>
<li style="font-weight:400;">Organizations in regulated industries like healthcare and finance are the most direct beneficiaries, as the tool helps ensure AI agents only access data appropriate to their function during both training and live user interactions. Pricing details are not specified in the announcement, so teams should check cloud.google.com/security/products/sensitive-data-protection for current rates.</li>
</ul>
<p>35:16  Ryan – “I don’t really think that’s usually where the sensitive data is. It can be, in some workloads, but probably not the majority, so there’s so many false positives, so I really like the idea that they’re having context be a part of that decision.”</p>
<p>37:16 <a href="https://cloud.google.com/blog/products/identity-security/google-completes-acquisition-of-wiz/">Welcoming Wiz to Google Cloud: Redefining security for the AI era</a></p>
<ul>
<li style="font-weight:400;">Google has completed its acquisition of <a href="https://www.wiz.io/">Wiz</a>, a cloud and AI security platform, which will retain its brand and continue supporting multicloud environments, including AWS, Azure, and Oracle Cloud Platform, alongside Google Cloud.</li>
<li style="font-weight:400;">Wiz connects code, cloud, and runtime into a single context, allowing security teams to map application architecture, permissions, data flows, and runtime behavior in real time to identify and prioritize exploitable attack paths before they reach production.</li>
<li style="font-weight:400;">The combined offering integrates Wiz’s cloud security platform with <a href="https://cloud.google.com/security/google-unified-security?e=48754805&amp;hl=en">Google Security Operations</a>, <a href="https://cloud.google.com/security/mandiant">Mandiant Consulting</a>, and <a href="https://cloud.google.com/security/products/threat-intelligence">Google Threat Intelligence</a> under the <a href="https://cloud.google.com/security/google-unified-security?e=48754805&amp;hl=en">Google Unified Security</a> umbrella, with <a href="https://gemini.google.com/">Gemini AI</a> assisting in threat hunting, remediation workflows, and audit documentation.</li>
<li style="font-weight:400;">A notable focus of the acquisition is AI-specific security, addressing threats that target AI models and those generated by AI systems, which is increasingly relevant as organizations deploy AI agents fed with business-critical data.</li>
<li style="font-weight:400;">Pricing details for the combined platform have not been announced, but Wiz products will remain available through existing partner channels, system integrators, and managed security service providers, suggesting continuity for current Wiz customers during the transition.</li>
</ul>
<p>38:16  Justin – “Typically on these acquisitions, it takes about a year for Google to figure out how to package them properly, and most likely they’ll want a separate contract for it anyways because that’s how all the integration acquisitions they’ve done are.”</p>
<p>39:22 <a href="https://cloud.google.com/blog/products/serverless/iap-integration-with-cloud-run/">IAP integration with Cloud Run</a></p>
<ul>
<li style="font-weight:400;">Google <a href="https://cloud.google.com/run?e=48754805&amp;hl=en">Cloud Run</a> now supports direct <a href="https://cloud.google.com/security/products/iap?e=48754805&amp;hl=en">Identity-Aware Proxy</a> integration in general availability, allowing developers to enable IAP authentication with a single UI click or the –iap flag in gcloud, eliminating the previous requirement to configure load balancers manually. IAP carries no additional cost beyond standard Cloud Run charges, with limited exceptions noted in the pricing docs.</li>
<li style="font-weight:400;">IAP on Cloud Run supports enterprise authentication features, including user and group identity policies, context-aware access controls based on IP, geolocation, and device status, and <a href="https://cloud.google.com/iap/docs/use-workforce-identity-federation">Workforce Identity Federation</a> for external identity providers. This makes it practical for organizations that need to secure internal web applications without building custom authentication layers.</li>
<li style="font-weight:400;">A separate change allows Cloud Run services to disable the default IAM invoker check by selecting “Allow Public access,” which resolves a long-standing friction point for teams trying to host public-facing applications while also enforcing <a href="https://cloud.google.com/resource-manager/docs/organization-policy/restricting-domains#console">Domain Restricted Sharing</a> org policies.</li>
<li style="font-weight:400;">The two features address different scenarios: IAP is the recommended path for internal business applications requiring user authentication, while the public access option suits public websites, store locators, or private microservices where network-level controls like <a href="https://cloud.google.com/security/products/armor">Cloud Armor</a> handle security instead.</li>
<li style="font-weight:400;">Real-world adoption examples include L’Oreal using IAP across their Google Cloud application portfolio and Bilt Rewards disabling IAM invoker checks on multi-regional Cloud Run services to simplify edge routing while relying on Cloud Armor for security enforcement.</li>
</ul>
<p>39:57  Ryan – This is a neat little feature. I don’t know how widely known it is, but it’s something that I’ve been using for a while.”  </p>
<p>42:09 <a href="https://cloud.google.com/blog/products/containers-kubernetes/multi-cluster-gke-inference-gateway-helps-scale-ai-workloads/">Multi-cluster GKE Inference Gateway helps scale AI workloads</a></p>
<ul>
<li style="font-weight:400;">Google Cloud has launched a preview of multi-cluster GKE Inference Gateway, which extends the existing <a href="https://cloud.google.com/kubernetes-engine/docs/concepts/gateway-api">GKE Gateway API</a> to enable model-aware load balancing for AI inference workloads across multiple GKE clusters and regions. </li>
<li style="font-weight:400;">This addresses practical limitations of single-cluster deployments like GPU/TPU capacity caps and regional availability risks.</li>
<li style="font-weight:400;">The system introduces two core Kubernetes custom resources, InferencePool and InferenceObjective, which group model-server backends and define routing priorities, respectively. </li>
<li style="font-weight:400;">This allows the gateway to intelligently multiplex latency-sensitive and lower-priority inference requests across a distributed fleet.</li>
<li style="font-weight:400;">A notable technical capability is the GCPBackendPolicy resource, which enables load balancing decisions based on real-time custom metrics such as KV cache utilization on model servers. </li>
<li style="font-weight:400;">This is more inference-specific than traditional request-count or latency-based routing approaches.</li>
<li style="font-weight:400;">The architecture uses a dedicated config cluster to manage a single Gateway configuration that routes traffic to multiple target clusters, simplifying operations for teams running globally distributed AI services. Supported use cases include disaster recovery, capacity bursting, and heterogeneous hardware utilization.</li>
<li style="font-weight:400;">Pricing for this feature is not separately detailed in the announcement, so costs would likely follow existing GKE and Cloud Load Balancing pricing structures. Teams evaluating this should factor in multi-cluster networking and potential cross-region data transfer costs alongside their GPU/TPU resource expenses.</li>
</ul>
<p>43:06  Ryan – “Simplify. Sure…” </p>
<p>44:35 <a href="https://blog.google/innovation-and-ai/technology/developers-tools/more-control-over-gemini-api-costs/">More transparency and control over Gemini API costs</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aistudio.google.com/prompts/new_chat">Google AI Studio</a> now supports <a href="https://ai.google.dev/gemini-api/docs/billing#project-spend-caps">Project Spend Caps</a>, letting developers set monthly dollar limits per project directly from the <a href="https://aistudio.google.com/spend?e=0">Spend</a> tab. </li>
<li style="font-weight:400;">There is a roughly 10-minute enforcement delay, so users remain responsible for any overages incurred during that window.</li>
<li style="font-weight:400;"><a href="https://ai.google.dev/gemini-api/docs/billing#about-billing">Usage Tiers</a> have been redesigned with lower spend qualifications, automatic tier upgrades based on payment history, and <a href="https://ai.google.dev/gemini-api/docs/billing#tier-spend-caps">system-defined billing account caps</a> that increase as you move to higher tiers. This reduces manual intervention for developers scaling their API usage over time.</li>
<li style="font-weight:400;">Three new dashboards have been added to Google AI Studio covering rate limits, costs, and usage. The <a href="https://aistudio.google.com/rate-limit">rate limit dashboard</a> tracks RPM, TPM, and RPD per project, while the <a href="https://aistudio.google.com/spend">cost dashboard</a> offers a daily breakdown filterable by model and time range going back up to a full month.</li>
<li style="font-weight:400;"><a href="https://aistudio.google.com/projects">Billing setup</a> can now be completed entirely within Google AI Studio, including linking billing profiles to projects, removing the previous need to navigate across multiple Google Cloud console windows. </li>
<li style="font-weight:400;">This consolidation is particularly useful for teams managing several projects under one billing account.</li>
<li style="font-weight:400;">Developers building with Imagen and <a href="https://aistudio.google.com/models/veo-3">Veo</a> now have dedicated usage graphs alongside standard request metrics, giving multimodal workloads the same observability previously available only for text-based Gemini API calls.</li>
</ul>
<p>45:13  Justin – “If you’ve ever tried to figure out who is using what models and what they’re doing with them and how much it costs, you know that this is all terrible – and this doesn’t actually improve it all that much.”  </p>
<h2>Azure</h2>
<p>47:35 <a href="https://azure.microsoft.com/en-us/updates?id=558321">Generally Available: Azure SRE Agent with new capabilities </a></p>
<ul>
<li style="font-weight:400;"><a href="https://azure.microsoft.com/en-us/products/sre-agent/">Azure SRE Agent</a> is now generally available as an AI-powered operations tool designed to help teams diagnose incidents faster and automate response workflows, to reduce downtime and manual operational work.</li>
<li style="font-weight:400;">The GA release introduces deep context gathering capabilities, meaning the agent can pull together relevant signals and telemetry during an incident rather than requiring engineers to manually correlate data across multiple tools.</li>
<li style="font-weight:400;">This fits naturally into teams already using <a href="https://azure.microsoft.com/en-us/products/monitor/?msclkid=07455af5e2aa197c895033368c798118#overview/&amp;ef_id=_k_07455af5e2aa197c895033368c798118_k_&amp;OCID=AIDcmmhvcv1sd6_SEM_k_07455af5e2aa197c895033368c798118&amp;utm_source=bing&amp;utm_medium=cpc&amp;utm_campaign=590417846&amp;utm_adgroup=1267738649779319&amp;utm_term=%7BKeywordId%7D&amp;utm_content=79233774313407">Azure Monitor</a>, <a href="https://learn.microsoft.com/en-us/azure/azure-monitor/app/usage?tabs=users">Application Insights</a>, and related observability tooling, as the agent is positioned to work within existing Azure operations workflows rather than requiring a separate platform.</li>
<li style="font-weight:400;">The primary target audience is operations and SRE teams managing production workloads on Azure who are looking to reduce the time between incident detection and resolution without adding headcount.</li>
<li style="font-weight:400;">Pricing details were not included in the announcement, so teams evaluating this should check the Azure pricing page directly <a href="http://azure.microsoft.com/pricing">here</a> before planning adoption, as AI-powered agent services on Azure typically carry consumption-based costs.</li>
</ul>
<p>48:25  Jonathan – “All right, so they run the services, which are going to have problems. And now they want me to pay for another service so that I can use that tool to troubleshoot the problems with the other tools that I’m already paying for. OK…” </p>
<p>55:59 <a href="https://azure.microsoft.com/en-us/blog/many-agents-one-team-scaling-modernization-on-azure/">Many agents, one team: Scaling modernization on Azure </a></p>
<ul>
<li style="font-weight:400;">Azure announced two new public preview offerings: the <a href="https://azure.microsoft.com/en-us/products/copilot">Azure Copilot</a> <a href="https://aka.ms/MigrationAgentLaunchBlog">migration agent</a> and the<a href="https://github.com/features/copilot"> GitHub Copilot</a> <a href="https://aka.ms/ghcp-modernization-agent/blog">modernization agent</a>, designed to automate discovery, assessment, planning, and deployment for organizations moving workloads to Azure. </li>
<li style="font-weight:400;">The migration agent targets servers, virtual machines, applications, and databases, while the modernization agent orchestrates code upgrades at scale across multiple applications simultaneously.</li>
<li style="font-weight:400;">The two agents are designed to work together, with GitHub Copilot scanning application code to produce assessment reports that Azure Copilot’s migration agent then ingests to inform cloud infrastructure planning. This integration aims to close the historical gap between developer-level code work and infrastructure decisions around landing zones, networking, and governance.</li>
<li style="font-weight:400;">Early customer results show a 70% reduction in total modernization effort using <a href="https://azure.microsoft.com/en-us/blog/accelerate-migration-and-modernization-with-agentic-ai/">GitHub Copilot modernization capabilities</a>, and Ahold Delhaize is cited as a customer that reduced complexity and accelerated delivery using these agentic workflows across discovery, assessment, and execution.</li>
<li style="font-weight:400;">Microsoft is pairing these agentic tools with a structured delivery program called <a href="https://aka.ms/Cloud-Accelerate-Factory">Cloud Accelerate Factory</a>, a no-cost benefit under Azure Accelerate where Microsoft experts work alongside customers from discovery through production. Pricing for the agents themselves is not specified in the announcement, so listeners should check Azure pricing pages directly for cost details.</li>
<li style="font-weight:400;">According to a Forrester Q1 2026 survey of 223 global IT leaders, 91% view application modernization as necessary for enabling AI in their business, which provides context for why Microsoft is investing in automating what has traditionally been a slow, manual planning process.</li>
</ul>
<p>52:32  Ryan – “I keep waiting for someone to tout the success of how they did it, they’ve migrated all their terrible legacy code into this new thing, and it all works – but I haven’t seen it…”  </p>
<p>53:28 <a href="https://techcommunity.microsoft.com/blog/azure-ai-foundry-blog/announcing-fireworks-ai-on-microsoft-foundry/4500950">Announcing Fireworks AI on Microsoft Foundry</a></p>
<ul>
<li style="font-weight:400;"><a href="http://ai.azure.com/">Microsoft Foundry</a> now integrates with <a href="https://fireworks.ai/">Fireworks AI’s</a> inference cloud, giving customers access to models like <a href="https://www.deepseek.com/en/">DeepSeek v3.2</a>, <a href="https://www.kimi.com/ai-models/kimi-k2-5">Kimi K2.5</a>, and OpenAI’s <a href="https://openai.com/index/introducing-gpt-oss/">gpt-oss-120b</a> through both pay-per-token and provisioned throughput deployment options. </li>
<li style="font-weight:400;">This is currently in public preview and requires an opt-in through the Azure portal’s Preview features panel.</li>
<li style="font-weight:400;">Pricing follows a per-million-token model for serverless deployments covering input, cached input, and output tokens, with US Data Zone availability across six regions, including East US and West US. </li>
<li style="font-weight:400;">Default quota limits start at either 250K or 25K tokens per minute, depending on subscription type, with additional quota available via a request form.</li>
<li style="font-weight:400;">A notable addition is custom model support, allowing teams who have fine-tuned models from families like <a href="https://huggingface.co/Qwen/Qwen3-14B">Qwen3-14B</a>, DeepSeek v3, or Kimi K2 to import and deploy those weights directly into Foundry projects. </li>
<li style="font-weight:400;">The <a href="https://learn.microsoft.com/en-us/azure/developer/azure-developer-cli/install-azd?tabs=winget-windows%2Cbrew-mac%2Cscript-linux&amp;pivots=os-windows">Azure Developer CLI</a> has been updated with an azd ai models create command to facilitate the weight transfer process.</li>
<li style="font-weight:400;">Fireworks-hosted models are distinct from Azure Direct models in that they skip <a href="https://azure.microsoft.com/en-us/products/ai-services/ai-content-safety/?ef_id=_k_5bba362a0ce41383df485fe9781229c2_k_&amp;OCID=AIDcmm22670kx6_SEM_k_5bba362a0ce41383df485fe9781229c2&amp;utm_adgroup=1271037185071562&amp;msclkid=5bba362a0ce41383df485fe9781229c2">Microsoft’s Responsible AI safety assessments</a>, so teams needing safety evaluations will need to use Foundry’s built-in risk and safety evaluator tools separately.</li>
<li style="font-weight:400;">Model retirement for serverless deployments comes with at least 30 days’ notice, and customers can extend usage past retirement dates by switching to provisioned throughput deployments, which use existing Global PTU quota and reservation commitments.</li>
</ul>
<p>54:30  Justin – “Sounds like it’s a cross-connect that they’ve done to Firework’s cloud basically, to provide this to you, so it’s sort of interesting.” </p>
<p>56:02 <a href="https://blogs.microsoft.com/blog/2026/03/17/announcing-copilot-leadership-update/">Announcing Copilot leadership update </a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.microsoft.com/en-us">Microsoft</a> is reorganizing its <a href="https://copilot.microsoft.com/">Copilot</a> efforts by merging consumer and commercial teams into a single unified org, structured around four pillars: Copilot experience, Copilot platform, Microsoft 365 apps, and AI models. </li>
<li style="font-weight:400;">Jacob Andreou will lead the combined Copilot experience as EVP, reporting directly to Satya Nadella.</li>
<li style="font-weight:400;">Mustafa Suleyman is shifting focus exclusively to what Microsoft calls its “superintelligence” effort, concentrating on frontier model development, enterprise-tuned model lineages, and reducing inference costs at scale over the next five years.</li>
<li style="font-weight:400;">The restructuring reflects a product direction where Copilot moves from individual features toward an integrated system connecting agents, apps, and workflows, with recent announcements like <a href="https://www.microsoft.com/en-us/microsoft-copilot/blog/2026/02/26/copilot-tasks-from-answers-to-actions/">Copilot Tasks</a>, <a href="https://www.microsoft.com/en-us/microsoft-365/blog/2026/03/09/copilot-cowork-a-new-way-of-getting-work-done/">Copilot Cowork</a>, and <a href="https://www.microsoft.com/en-us/microsoft-agent-365">Agent 365</a> representing early examples of this approach.</li>
<li style="font-weight:400;">For enterprise customers, the key practical implication is that commercial and consumer Copilot capabilities will converge, meaning IT and governance controls will need to account for a more unified product surface rather than separate consumer and business tracks.</li>
<li style="font-weight:400;">The Copilot Leadership Team now includes Suleyman, Andreou, Charles Lamanna, Perry Clarke, and Ryan Roslansky, signaling that Microsoft 365 app development and platform infrastructure will be tightly coordinated with model development rather than operating independently.</li>
</ul>
<p>57:23 Ryan – “Noticeably missing is Github’s Copilot…” </p>
<h2>After Show </h2>
<p>55:59 <a href="https://apnews.com/article/washington-dol-spanish-accent-ai-3a1b8438a5674c07242a8d48c057d5a3?ck_subscriber_id=512838477#463:%20Beanstalk%20AI:%20The%20Resurrection%20Nobody%20Asked%20For%20-%2021048004">Washington state hotline callers hear AI voice with Spanish accent </a></p>
<ul>
<li style="font-weight:400;">Washington state’s Department of Licensing accidentally routed Spanish-language callers to an AI voice speaking English with a Spanish accent for several months, a direct result of a misconfiguration by DOL staff using Amazon Web Services Polly.</li>
<li style="font-weight:400;">AP journalists were able to replicate the issue by selecting the AWS Polly voice named “Lucia,” which is designed to mimic Castilian Spanish, highlighting how easy it is to misconfigure AI voice services when teams lack familiarity with the underlying platform options.</li>
<li style="font-weight:400;">The incident is a practical reminder that deploying AI-driven customer service tools across multiple languages requires thorough testing and quality assurance, particularly for government agencies serving diverse populations with real accessibility needs.</li>
<li style="font-weight:400;">Amazon provided the platform but declined interview requests, raising a recurring question in cloud deployments about where vendor responsibility ends and customer configuration responsibility begins when things go wrong in production.</li>
<li style="font-weight:400;">The story went viral with around 2 million TikTok views, which illustrates how public-facing AI failures in government services can quickly become reputational issues, adding pressure on agencies to treat AI deployment with the same rigor as other critical infrastructure.</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2406600/c1e-dd5dfom7vwso5g77-5z38j729u7nq-6ajp3f.mp3" length="121214636"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 347 of The Cloud Pod, where the forecast is always cloudy! Justin, Jonathan, and Ryan are in the studio recording today, and thankfully, Jonathan hasn’t replaced us all with Skynet – yet. This week, we’re discussing how old our tools (and us) are (hint: it’s really old), whether or not the SaasApocalypse is upon us, and whether or not the business or AI is responsible for the latest round of layoffs. 
Titles we almost went with this week

 S3 Bucket Names Finally Stop Being a Global Hunger Games
 One Million Tokens Walk Into a Context Window
 SLO Down and Smell the Reliability Metrics
 CloudWatch Finally Watches Your Whole Cloud Organization
 S3 Turns 20 and Still Buckets the Competition
 Azure SRE Agent Goes GA So You Don’t Have To
 Twenty Years of S3 and No Signs of Object Permanence
 One Rule to Monitor Them All Across AWS
One Flag to Secure Them All on Cloud Run
 SaaSpocalypse Now Atlassian Layoffs Hit the Jira
 No More Bucket Name Bingo with S3 Regional Namespaces
 A Picture Is Worth a Thousand Claude Tokens
 One Command to Rule Your Autonomous AI Agents
AI Fixes Your Incidents Before Your Boss Notices
 The CloudPod is only recording this week “Because of AI”
 Amazon begs users to leave Simple DB with another migration tool

Follow Up
00:54 Microsoft’s brief in Anthropic case shows new alliance and willingness to challenge Trump administration

Microsoft filed an amicus brief in Anthropic’s lawsuit against the U.S. Department of War, urging a federal judge to temporarily block the Pentagon’s designation of Anthropic as a supply chain risk, citing substantial costs to government contractors that rely on Anthropic models.
The brief arrived one day after Microsoft launched Copilot Cowork, built on Anthropic’s Claude, and four months after Microsoft committed up to $5 billion in Anthropic as part of a deal requiring Anthropic to spend at least $30 billion on Azure, making the legal filing directly tied to concrete commercial dependencies.
Microsoft highlighted a procedural inconsistency in the government’s approach: the Pentagon gave itself six months to transition off Anthropic’s models while making the supply chain designation effective immediately for contractors, creating an unequal compliance burden.
Amazon, which has invested $8 billion in Anthropic, has not publicly responded to the lawsuit or the designation, creating a notable contrast in how two major cloud providers with similar financial exposure are handling the situation.
OpenAI announced its own Pentagon deal on the same day the Anthropic designation was issued, and]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2406600/c1a-k5d5-rk2v9d7df4zv-jwgcre.jpg"></itunes:image>
                                                                            <itunes:duration>01:02:32</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2406600/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[346: Zuckerberg Finally Finds His People, They Are All AI Agents]]>
                </title>
                <pubDate>Thu, 19 Mar 2026 21:16:37 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2401248</guid>
                                    <link>https://tcpfm.castos.com/episodes/346-zuckerberg-finally-finds-his-people-they-are-all-ai-agents</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 346 of The Cloud Pod, where the forecast is always cloudy! Hold on to your butts, because Justin, Ryan, and Matt are in the studio today, and they’re ready to bring you all the latest in Cloud and AI news, including the usual: Meta buying social networks, Amazon responding to outages, and OpenAI giving up another version of GPT. Let’s get into it! </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li>✍️ Cloudflare Spent $1100 to Rewrite Next.js in a Week</li>
<li> One Pipe to Rule All Your OpenTelemetry Data</li>
<li>☑️ Check Yourself Before Google Wrecks Your Cloud Config</li>
<li> Copilot Takes Jira Tickets So You Don't Have To</li>
<li>‍✈️ GitHub Copilot Agent Joins Your Jira Workflow Uninvited</li>
<li> When AI Agents Network, Meta Swipes Right on Moltbook</li>
<li>️ Sixty Controls Walk Into a Terraform Repository</li>
<li> One Security Console to Rule All Your Clouds</li>
<li> AI Ate My Lock-In, and I Feel Fine</li>
<li>⛅ Oracle Sees $90 Billion Future Cloudy With a Chance of GPUs</li>
<li> Your API Has Trust Issues, and We Can Prove It</li>
<li> Stop Running Three Pipelines Like a Telemetry Hoarder</li>
<li> From Database Dinosaur to AI Cash Cow</li>
<li>☠️ Meta: Target acquired; must kill Moltbook</li>
<li> Meta saw Moltbook and said, “WE MUST OWN IT AND KILL.”</li>
</ul>
<h2>Follow Up</h2>
<p>00:51 <a href="https://www.anthropic.com/news/where-stand-department-war">Where things stand with the Department of War </a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/">Anthropic</a> has been designated a supply chain risk to US national security by the <a href="https://www.war.gov/">Department of War</a>, a designation the company is challenging in court as legally unsound under <a href="https://uscode.house.gov/view.xhtml?req=granuleid:USC-prelim-title10-section3252&amp;num=0&amp;edition=prelim">10 USC 3252</a>.</li>
<li style="font-weight:400;">The practical scope of the designation is narrow, applying only to the use of <a href="https://claude.ai/new">Claude</a> in direct Department of War contracts, not to all customers that hold such contracts or to unrelated business with Anthropic.</li>
<li style="font-weight:400;">Anthropic has <a href="https://www.anthropic.com/news/statement-comments-secretary-war">stated</a> that it will continue to provide its models to the Department of War and the national security community at nominal cost, with ongoing engineering support, during any transition period and for as long as permitted.</li>
<li style="font-weight:400;">The company's two stated exceptions to military use involve fully autonomous weapons and mass domestic surveillance, and Anthropic has clarified these do not extend to operational decision-making, which it considers the military's domain.</li>
<li style="font-weight:400;">For cloud and enterprise customers, the key takeaway is that existing Claude deployments unrelated to Department of War contracts remain unaffected, though the legal dispute introduces uncertainty into federal procurement pipelines involving AI services.</li>
<li style="font-weight:400;">We will keep you updated on this in 12-18 months…</li>
</ul>
<h2>AI Is Going Great - Or How ML Makes Money </h2>
<p>01:21 <a href="https://openai.com/index/introducing-gpt-5-4">Introducing GPT-5.4</a></p>
<ul>
<li style="font-weight:400;">OpenAI released <a href="https://openai.com/index/introducing-gpt-5-4/">GPT-5.4</a> across ChatGPT, the API, and Codex, positioning it as their most capable reasoning model to date. It merges the coding strengths of <a href="https://openai.com/index/introducing-gpt-5-3-codex/">GPT-5.3-Codex</a> with general reasoning, professional knowledge work, and native computer-use capabilities in a single model.</li>
<li style="font-weight:400;">The computer-use capabilities are a notable technical step, with GPT-5.4 achieving a 75% success rate on OSWorld-Verified desktop navigation, surpassing the reported human benchmark of 72.4% and...</li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:00) - Week 2: Iran War and More</li><li>(00:01:26) - OpenAI GPT 5.4 Injection Into Chat</li><li>(00:04:26) - OpenAI ChatGpt for Excel 5.4</li><li>(00:06:07) - Carl</li><li>(00:08:39) - Shift Left: OpenAI's Codec Security in Research Preview</li><li>(00:12:16) - OpenAI Expands Into AI Security With PromptFu Acquisition</li><li>(00:14:46) - Code Review: Databricks Launches Cazale, an</li><li>(00:21:38) - Meta Superintelligence Labs Buys AI Agent Social Network</li><li>(00:25:12) - Cloudflare's Complete Re-write of Next JS</li><li>(00:31:00) - Cloudflare Launches OAuth Vulnerability Scanner</li><li>(00:33:30) - Amazon's AI-Caused Outages</li><li>(00:37:37) - Amazon OpenSearch and Neptune Analytics: 35% DB Savings Plan</li><li>(00:39:24) - Amazon Bedrock</li><li>(00:41:36) - Amazon Connect Health: A HIPAA-Eligible AI Agent</li><li>(00:46:45) - Amazon EC2: Copilot CLI to End</li><li>(00:49:18) - Amazon Route 53 Global Resolver Now Available</li><li>(00:51:33) - Amazon ECS: Automating GitHub Actions to Container Deployment</li><li>(00:53:59) - Google Cloud Security Checklist</li><li>(00:56:00) - Google's Autonomous Data Steward for Telecoms</li><li>(00:57:14) - Google Notebook LM now goes after YouTubers</li><li>(00:59:05) - Google's first natively multimodal embedding model</li><li>(01:01:07) - Google's Gemini in Docs, Sheets, and Drive</li><li>(01:03:08) - Postgres as a managed serverless database with Azure</li><li>(01:04:48) - Microsoft 365 Copilot to Integrate with Cowork</li><li>(01:10:02) - OCI Cost Anomaly Detection</li><li>(01:10:59) - Oracle Announces Q3 Earnings</li><li>(01:12:53) - Week in Cloud: The Cloud and AI</li><li>(01:13:32) - Xbox One: Project Helix</li><li>(01:17:55) - Xbox One vs. Playstation 4</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 346 of The Cloud Pod, where the forecast is always cloudy! Hold on to your butts, because Justin, Ryan, and Matt are in the studio today, and they’re ready to bring you all the latest in Cloud and AI news, including the usual: Meta buying social networks, Amazon responding to outages, and OpenAI giving up another version of GPT. Let’s get into it! 
Titles we almost went with this week

✍️ Cloudflare Spent $1100 to Rewrite Next.js in a Week
 One Pipe to Rule All Your OpenTelemetry Data
☑️ Check Yourself Before Google Wrecks Your Cloud Config
 Copilot Takes Jira Tickets So You Don't Have To
‍✈️ GitHub Copilot Agent Joins Your Jira Workflow Uninvited
 When AI Agents Network, Meta Swipes Right on Moltbook
️ Sixty Controls Walk Into a Terraform Repository
 One Security Console to Rule All Your Clouds
 AI Ate My Lock-In, and I Feel Fine
⛅ Oracle Sees $90 Billion Future Cloudy With a Chance of GPUs
 Your API Has Trust Issues, and We Can Prove It
 Stop Running Three Pipelines Like a Telemetry Hoarder
 From Database Dinosaur to AI Cash Cow
☠️ Meta: Target acquired; must kill Moltbook
 Meta saw Moltbook and said, “WE MUST OWN IT AND KILL.”

Follow Up
00:51 Where things stand with the Department of War 

Anthropic has been designated a supply chain risk to US national security by the Department of War, a designation the company is challenging in court as legally unsound under 10 USC 3252.
The practical scope of the designation is narrow, applying only to the use of Claude in direct Department of War contracts, not to all customers that hold such contracts or to unrelated business with Anthropic.
Anthropic has stated that it will continue to provide its models to the Department of War and the national security community at nominal cost, with ongoing engineering support, during any transition period and for as long as permitted.
The company's two stated exceptions to military use involve fully autonomous weapons and mass domestic surveillance, and Anthropic has clarified these do not extend to operational decision-making, which it considers the military's domain.
For cloud and enterprise customers, the key takeaway is that existing Claude deployments unrelated to Department of War contracts remain unaffected, though the legal dispute introduces uncertainty into federal procurement pipelines involving AI services.
We will keep you updated on this in 12-18 months…

AI Is Going Great - Or How ML Makes Money 
01:21 Introducing GPT-5.4

OpenAI released GPT-5.4 across ChatGPT, the API, and Codex, positioning it as their most capable reasoning model to date. It merges the coding strengths of GPT-5.3-Codex with general reasoning, professional knowledge work, and native computer-use capabilities in a single model.
The computer-use capabilities are a notable technical step, with GPT-5.4 achieving a 75% success rate on OSWorld-Verified desktop navigation, surpassing the reported human benchmark of 72.4% and...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[346: Zuckerberg Finally Finds His People, They Are All AI Agents]]>
                </itunes:title>
                                    <itunes:episode>346</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 346 of The Cloud Pod, where the forecast is always cloudy! Hold on to your butts, because Justin, Ryan, and Matt are in the studio today, and they’re ready to bring you all the latest in Cloud and AI news, including the usual: Meta buying social networks, Amazon responding to outages, and OpenAI giving up another version of GPT. Let’s get into it! </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li>✍️ Cloudflare Spent $1100 to Rewrite Next.js in a Week</li>
<li> One Pipe to Rule All Your OpenTelemetry Data</li>
<li>☑️ Check Yourself Before Google Wrecks Your Cloud Config</li>
<li> Copilot Takes Jira Tickets So You Don't Have To</li>
<li>‍✈️ GitHub Copilot Agent Joins Your Jira Workflow Uninvited</li>
<li> When AI Agents Network, Meta Swipes Right on Moltbook</li>
<li>️ Sixty Controls Walk Into a Terraform Repository</li>
<li> One Security Console to Rule All Your Clouds</li>
<li> AI Ate My Lock-In, and I Feel Fine</li>
<li>⛅ Oracle Sees $90 Billion Future Cloudy With a Chance of GPUs</li>
<li> Your API Has Trust Issues, and We Can Prove It</li>
<li> Stop Running Three Pipelines Like a Telemetry Hoarder</li>
<li> From Database Dinosaur to AI Cash Cow</li>
<li>☠️ Meta: Target acquired; must kill Moltbook</li>
<li> Meta saw Moltbook and said, “WE MUST OWN IT AND KILL.”</li>
</ul>
<h2>Follow Up</h2>
<p>00:51 <a href="https://www.anthropic.com/news/where-stand-department-war">Where things stand with the Department of War </a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/">Anthropic</a> has been designated a supply chain risk to US national security by the <a href="https://www.war.gov/">Department of War</a>, a designation the company is challenging in court as legally unsound under <a href="https://uscode.house.gov/view.xhtml?req=granuleid:USC-prelim-title10-section3252&amp;num=0&amp;edition=prelim">10 USC 3252</a>.</li>
<li style="font-weight:400;">The practical scope of the designation is narrow, applying only to the use of <a href="https://claude.ai/new">Claude</a> in direct Department of War contracts, not to all customers that hold such contracts or to unrelated business with Anthropic.</li>
<li style="font-weight:400;">Anthropic has <a href="https://www.anthropic.com/news/statement-comments-secretary-war">stated</a> that it will continue to provide its models to the Department of War and the national security community at nominal cost, with ongoing engineering support, during any transition period and for as long as permitted.</li>
<li style="font-weight:400;">The company's two stated exceptions to military use involve fully autonomous weapons and mass domestic surveillance, and Anthropic has clarified these do not extend to operational decision-making, which it considers the military's domain.</li>
<li style="font-weight:400;">For cloud and enterprise customers, the key takeaway is that existing Claude deployments unrelated to Department of War contracts remain unaffected, though the legal dispute introduces uncertainty into federal procurement pipelines involving AI services.</li>
<li style="font-weight:400;">We will keep you updated on this in 12-18 months…</li>
</ul>
<h2>AI Is Going Great - Or How ML Makes Money </h2>
<p>01:21 <a href="https://openai.com/index/introducing-gpt-5-4">Introducing GPT-5.4</a></p>
<ul>
<li style="font-weight:400;">OpenAI released <a href="https://openai.com/index/introducing-gpt-5-4/">GPT-5.4</a> across ChatGPT, the API, and Codex, positioning it as their most capable reasoning model to date. It merges the coding strengths of <a href="https://openai.com/index/introducing-gpt-5-3-codex/">GPT-5.3-Codex</a> with general reasoning, professional knowledge work, and native computer-use capabilities in a single model.</li>
<li style="font-weight:400;">The computer-use capabilities are a notable technical step, with GPT-5.4 achieving a 75% success rate on OSWorld-Verified desktop navigation, surpassing the reported human benchmark of 72.4% and up from GPT-5.2's 47.3%. </li>
<li style="font-weight:400;">This makes it the first general-purpose OpenAI model with native computer use built in, making it relevant for developers building agents that operate across web browsers and desktop software.</li>
<li style="font-weight:400;">Tool search is a practical efficiency improvement for agentic API workflows, dynamically loading tool definitions only when needed rather than stuffing all definitions into the prompt upfront. In testing against Scale's <a href="https://labs.scale.com/leaderboard/mcp_atlas">MCP Atlas benchmark</a> on 36 MCP servers, this reduced total token usage by 47% with no loss in accuracy, directly translating to lower API costs for tool-heavy applications.</li>
<li style="font-weight:400;">On the professional work side, GPT-5.4 scores 87.3% on an internal investment banking spreadsheet benchmark, up from 68.4% for GPT-5.2, and achieves 91% on <a href="https://www.harvey.ai/blog/introducing-biglaw-bench">BigLaw Bench</a> for legal document work. The ChatGPT for Excel add-in, launched alongside it, gives Enterprise customers a direct integration path.</li>
<li style="font-weight:400;">Pricing is higher per token than GPT-5.2 in the API, though OpenAI notes the model's token efficiency should offset costs for many workloads. </li>
<li style="font-weight:400;">Batch and Flex pricing remain available at half the standard rate, and Priority processing is available at 2x the standard rate for latency-sensitive use cases.</li>
</ul>
<p>02:19  Justin - “There’s also been a slew of every cloud provider in the world announcing Chat-GPT 5.4 is now available, and we will not be telling you about all of them, but assume that if you use a different model or different cloud, they probably have it.” </p>
<p>04:33 <a href="https://openai.com/index/chatgpt-for-excel">Introducing ChatGPT for Excel and new financial data integrations</a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/">OpenAI</a> launched <a href="https://chatgpt.com/apps/spreadsheets/?openaicom-did=ad04b478-4d65-49f6-a679-e4f8c7cd9645&amp;openaicom_referred=true">ChatGPT for Excel</a> in beta, an add-in powered by <a href="https://openai.com/index/introducing-gpt-5-4/">GPT-5.4</a> that lets users build, update, and analyze spreadsheet models using plain language descriptions. </li>
<li style="font-weight:400;">It preserves existing formulas and structure, asks permission before making changes, and links answers to specific cells for auditability. </li>
<li style="font-weight:400;">Available now for <a href="https://openai.com/business/chatgpt-pricing/">Business, Enterprise, Edu, Pro, and Plus</a> users in the US, Canada, and Australia.</li>
<li style="font-weight:400;">GPT-5.4 (also available as <a href="https://openai.com/index/gpt-5-4-thinking-system-card/">GPT-5.4 Thinking</a>) is now live in ChatGPT, Codex, and the API, with OpenAI noting it was specifically tuned on real-world finance workflows, including financial modeling, scenario analysis, data extraction, and long-form research.</li>
<li style="font-weight:400;">New financial data integrations bring Moody's, Dow Jones Factiva, MSCI, Third Bridge, MT Newswire, and others directly into ChatGPT workflows, with FactSet coming soon. </li>
<li style="font-weight:400;">Organizations can also connect proprietary data sources using Model Context Protocol (MCP), centralizing market, company, and internal data in a single interface.</li>
<li style="font-weight:400;">For enterprise deployments, the Excel add-in supports RBAC, SAML SSO, SCIM, audit logs, AES-256 encryption at rest, TLS 1.2+ in transit, and data residency controls. In Enterprise and Edu workspaces, the feature is off by default and requires admin enablement with custom roles and group permissions.</li>
<li style="font-weight:400;"><a href="https://workspace.google.com/marketplace/app/gpt_for_sheets_and_docs/677318054654">ChatGPT for Google Sheets</a> is listed as coming soon, signaling OpenAI is extending this spreadsheet integration beyond the Microsoft ecosystem.</li>
</ul>
<p>04:49  Justin - “If I were a betting man, I’d also say they’re going to have a PowerPoint version any day.” </p>
<p>06:13 <a href="https://www.databricks.com/blog/meet-karl-faster-agent-enterprise-knowledge-powered-custom-rl">Meet KARL: A Faster Agent for Enterprise Knowledge, powered by custom </a><a href="https://www.databricks.com/blog/meet-karl-faster-agent-enterprise-knowledge-powered-custom-rl">RL</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.databricks.com/">Databricks</a> introduced KARL (<a href="https://arxiv.org/abs/2603.05218">Knowledge Agent with Reinforcement Learning</a>), a custom model built using RL techniques to handle grounded reasoning tasks like document search, fact-finding, and multi-step reasoning across enterprise data sources.</li>
<li style="font-weight:400;">KARL was trained with a few thousand GPU hours using entirely synthetic data. In internal testing, it matched or outperformed Frontier's proprietary models on inference cost, latency, and response quality simultaneously.</li>
<li style="font-weight:400;">The core technical challenge KARL addresses is hard-to-verify tasks, where there is no single correct answer, making <a href="https://arxiv.org/abs/2505.17373">RL</a> reward signal design particularly difficult compared to domains like math or code, where correctness is easier to measure.</li>
<li style="font-weight:400;">Databricks is now offering a Custom RL private preview backed by <a href="https://docs.databricks.com/aws/en/compute/serverless/gpu">Serverless GPU Compute</a>, allowing enterprise customers to use the same RL pipeline that produced KARL to build domain-specific, cost-optimized versions of their own high-volume agents.</li>
<li style="font-weight:400;">For enterprises running AI agents at scale, this approach suggests that custom RL fine-tuning on smaller models can substantially reduce inference costs compared with relying on general-purpose frontier models, a practical consideration as agentic workload costs grow.</li>
<li style="font-weight:400;">Interested in checking out the preview? You can find more information on that <a href="https://forms.gle/YR171eqRupM43tVW9">here</a>. </li>
</ul>
<p>07:09  Ryan - “It's kind of a neat idea to provide sort of the pipeline there. I mean, I guess the big cloud providers are producing agent-building platforms and stuff; I wonder how much of this you can follow the path that they use for creating KARL and building your own domain-specific agent in the same way. I like the idea. Smaller model, less GPU.”</p>
<p>08:55  <a href="https://openai.com/index/codex-security-now-in-research-preview">Codex Security: now in research preview</a></p>
<ul>
<li style="font-weight:400;">OpenAI launched <a href="https://developers.openai.com/codex/security">Codex Security</a> in research preview, formerly known as <a href="https://openai.com/index/introducing-aardvark/">Aardvark</a>, and is now available to ChatGPT Pro, Enterprise, Business, and Edu customers via the <a href="https://developers.openai.com/codex/cloud">Codex web</a> with free usage for the first month. </li>
<li style="font-weight:400;">The tool functions as an agentic application security scanner that builds a project-specific threat model to identify and prioritize vulnerabilities with context-aware fixes.</li>
<li style="font-weight:400;">The performance metrics from the beta are notable: false positive rates dropped by over 50%, overreported severity findings fell by more than 90%, and noise was reduced by 84% in some repositories. </li>
<li style="font-weight:400;">Over the last 30 days, it scanned more than 1.2 million commits, surfacing 792 critical and 10,561 high-severity findings, with critical issues appearing in fewer than 0.1% of commits.</li>
<li style="font-weight:400;">The tool uses sandboxed validation environments to pressure-test findings before surfacing them and can generate working proofs of concept when configured with a project-specific runtime environment. It also learns from user feedback on finding severity to refine its threat model over time.</li>
<li style="font-weight:400;">Codex Security has already produced real-world results in open source, with 14 CVEs assigned across projects including <a href="https://github.com/openssh/openssh-portable/commit/c991273c18afc490313a9f282383eaf59d9c13b9">OpenSSH</a>, <a href="https://lists.gnupg.org/pipermail/gnutls-help/2025-July/004883.html">GnuTLS</a>, <a href="https://github.com/gogs/gogs/security/advisories/GHSA-p6x6-9mx6-26wj">GOGS</a>, PHP, and Chromium. </li>
<li style="font-weight:400;">OpenAI is also launching Codex for OSS, offering free ChatGPT Pro and Plus accounts, as well as Codex Security access for open-source maintainers.</li>
</ul>
<p>10:07  Ryan - “I wish AI wouldn’t generate all those vulnerabilities in code… but I do like that these tools are available.”  </p>
<p>12:40 <a href="https://openai.com/index/openai-to-acquire-promptfoo">OpenAI to acquire Promptfoo </a></p>
<ul>
<li style="font-weight:400;">OpenAI is acquiring <a href="https://www.promptfoo.dev/">Promptfoo</a>, an AI security platform used by over 25 percent of Fortune 500 companies, with plans to integrate its technology directly into <a href="https://openai.com/business/frontier/">OpenAI Frontier</a>, the company's enterprise platform for building AI agents.</li>
<li style="font-weight:400;">Promptfoo's core capabilities include automated red-teaming and security testing for LLM applications, targeting risks such as prompt injection, jailbreaks, data leaks, tool misuse, and out-of-policy agent behavior. </li>
<li style="font-weight:400;">These will become native features within Frontier rather than separate tools.</li>
<li style="font-weight:400;">The acquisition addresses a practical gap for enterprise AI deployments: systematic ways to test agent behavior before production, maintain audit trails, and meet governance and compliance requirements as AI agents connect to real data and business systems.</li>
<li style="font-weight:400;">Promptfoo also maintains a widely used <a href="https://github.com/promptfoo/promptfoo">open-source</a> CLI and library on GitHub, and OpenAI has stated it will continue developing the open-source project alongside the integrated enterprise capabilities, which is notable for developers already using those tools.</li>
<li style="font-weight:400;">For enterprises building on Frontier, this signals that security testing and evaluation are moving from optional add-ons to built-in requirements of the development workflow, with direct implications for how teams structure AI deployment pipelines and compliance documentation.</li>
</ul>
<p>13:36  Justin - “It's good that this company got bought, integrated into the models is a great stepping stone, and I look forward to seeing more red teaming agents, because I think that's an area companies really have underinvested, and with our new cyber warfare world, it's going to become more more important that you're doing more active red teaming.”</p>
<p>15:21 <a href="https://www.databricks.com/blog/introducing-kasal">Introducing Kasal </a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.databricks.com/">Databricks</a> released <a href="https://www.databricks.com/blog/introducing-kasal">Kasal</a>, an open-source visual platform for building multi-agent AI workflows without writing orchestration code. </li>
<li style="font-weight:400;">Users can drag and drop agents onto a canvas or describe workflows conversationally, and Kasal automatically generates the underlying CrewAI-based Python code.</li>
<li style="font-weight:400;">Kasal runs natively on Databricks Apps with built-in OBO authentication, SQLite or <a href="https://www.databricks.com/product/lakebase">Lakebase</a> persistence, and <a href="https://www.databricks.com/product/managed-mlflow">MLflow</a> tracing integration, meaning teams can move from visual design to production deployment with minimal additional configuration.</li>
<li style="font-weight:400;">The platform supports both sequential and hierarchical agent modes, in which hierarchical workflows include a manager agent coordinating specialized subagents, useful for tasks such as generating customer-specific sales presentations by combining product and customer data pipelines.</li>
<li style="font-weight:400;">Observability is handled at two layers: business users see execution timelines and workflow status in the Kasal frontend. At the same time, AI engineers can use MLflow tracing to debug LLM calls and agent behavior at a technical level.</li>
<li style="font-weight:400;">Workflows built in Kasal can be exported as Python code for further customization, and reusable plans can be registered in a shared catalog, giving teams a path from low-code prototyping to production-grade pipelines without being locked into the visual interface.</li>
</ul>
<p>15:48  Justin - “They didn’t mention security review; I just want to call that out.” </p>
<p>17:04 <a href="https://claude.com/blog/code-review">Code Review for Claude Code</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/">Anthropic</a> launched <a href="https://claude.com/plugins/code-review">Code Review</a> for <a href="https://claude.com/product/claude-code">Claude Code</a> in research preview for Team and Enterprise plans, using a multi-agent system that dispatches parallel agents to find bugs, filter false positives, and rank issues by severity, delivering results as a single summary comment plus inline annotations on each PR.</li>
<li style="font-weight:400;">Internal metrics show the system increased substantive review comments from 16% to 54% of PRs at Anthropic, with large PRs over 1,000 lines receiving findings 84% of the time, averaging 7.5 issues, and less than 1% of findings marked incorrect by engineers.</li>
<li style="font-weight:400;">Reviews scale dynamically with PR complexity, averaging around 20 minutes per review, and are billed at roughly $15 to $25 per review, making this notably more expensive than the existing open-source <a href="https://github.com/anthropics/claude-code-action">Claude Code GitHub Action</a>, which remains available as a lighter-weight alternative.</li>
<li style="font-weight:400;">A practical example from TrueNAS shows the system surfacing a pre-existing type mismatch bug in adjacent code that was silently wiping an encryption key cache on every sync, the kind of latent issue outside the direct changeset that human reviewers typically would not investigate.</li>
<li style="font-weight:400;">The system intentionally does not approve PRs, keeping humans in the decision loop. At the same time, admins on Team and Enterprise plans retain controls over spend and usage, positioning this as a depth-focused supplement to human review rather than a replacement.</li>
</ul>
<p>18:15  Justin - “The COST of the review is really the biggest thing…definitely something that is a factor in all of these things.”</p>
<p>22:24 <a href="https://arstechnica.com/ai/2026/03/meta-acquires-moltbook-the-ai-agent-social-network/">Meta acquires Moltbook, the AI agent social network</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.meta.com/en-gb/about/">Meta</a> <a href="https://www.axios.com/2026/03/10/meta-facebook-moltbook-agent-social-network">acquired</a> <a href="https://www.moltbook.com/">Moltbook</a>, an AI agent social network built as a Reddit-style platform where every participant is an AI agent run by a human, with no direct human membership. </li>
<li style="font-weight:400;">The founders will join Meta Superintelligence Labs, though deal terms were not disclosed.</li>
<li style="font-weight:400;">Meta specifically called out Moltbook's "always-on directory" approach for connecting agents as a novel development, suggesting the acquisition is focused on agent discovery and coordination infrastructure rather than the social network concept itself.</li>
<li style="font-weight:400;">Moltbook was built on <a href="https://openclaw.ai/">OpenClaw</a>, an LLM coding agent wrapper that enables prompting via WhatsApp and Discord and supports deep local system access through community plugins. </li>
<li style="font-weight:400;">OpenClaw's founder was separately hired by OpenAI in February, indicating both major AI labs are recruiting from the same open-source agent ecosystem.</li>
<li style="font-weight:400;">For developers and businesses, the acquisition signals that agent-to-agent communication protocols and persistent agent directories are becoming areas of serious investment, which could influence how cloud-based agentic workflows are designed going forward.</li>
<li style="font-weight:400;">A practical caveat worth noting: Moltbook lacked security controls to verify that all participants were actually AI agents, meaning some posts were likely written by humans posing as agents. This highlights that agent identity and authentication remain unsolved problems in agentic system design.</li>
</ul>
<p>22:39  Justin - “We didn't really talk about Moltbook because we didn't want to talk about OpenClaw extensively, but basically, OpenClaw is a terrible way that you can run AI agents in a fully unsafe manner that accesses all of your personal data, and one of the things you could do is add a skill that would basically have it randomly post things onto MoltBook, which could include your bank accounts or security things if you're not careful in your security. And Meta buying this is just sort of the classic; it's a social network, and it could take us down, let's just take it off the market and kill it.”</p>
<h2>Cloud Tools </h2>
<p>23:58 <a href="https://share.google/Kq5WSHtBWv8cJtduf">GitHub Copilot coding agent for Jira is now in public preview</a></p>
<ul>
<li style="font-weight:400;"><a href="https://github.com/features/copilot">GitHub Copilot</a> coding agent now <a href="https://github.blog/changelog/2026-03-05-github-copilot-coding-agent-for-jira-is-now-in-public-preview/">integrates directly</a> with <a href="https://www.atlassian.com/software/jira/free">Jira Cloud</a>, allowing teams to assign Jira issues to Copilot and receive AI-generated draft pull requests in their connected GitHub repositories without leaving their existing workflow.</li>
<li style="font-weight:400;">The agent works asynchronously and autonomously, analyzing issue descriptions and comments for context, implementing code changes, and posting status updates back in Jira, including asking clarifying questions when needed.</li>
<li style="font-weight:400;">This integration targets common, repetitive tasks such as bug fixes and documentation updates and respects existing pull request review and approval rules, so teams do not need to change their governance processes.</li>
<li style="font-weight:400;">Setup requires installing two marketplace apps, one from Atlassian and one from GitHub, and notably requires Jira Cloud with <a href="https://www.atlassian.com/software/rovo">Rovo</a> enabled alongside an active GitHub Copilot coding agent subscription, so there are meaningful prerequisite costs to consider.</li>
<li style="font-weight:400;">The integration supports GitHub Data Residency customers across supported regions, which is a practical consideration for teams with data sovereignty requirements.</li>
</ul>
<p>24:42  Ryan - “That’s interesting, because Rovo is Atlassian’s AI bot…I’m curious about why that’s required.”  </p>
<p>26:09 <a href="https://blog.pragmaticengineer.com/the-pulse-cloudflare-rewrites-next-js-as-ai-rewrites-commercial-open-source/">The Pulse: Cloudflare rewrites Next.js as AI rewrites commercial open </a><a href="https://blog.pragmaticengineer.com/the-pulse-cloudflare-rewrites-next-js-as-ai-rewrites-commercial-open-source/">source</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.cloudflare.com/">Cloudflare</a> released <a href="https://vinext.io/">vinext</a>, a rewrite of <a href="http://next.js">Next.js</a> that replaces <a href="https://vercel.com/">Vercel</a>'s proprietary Turbopack build system with the standard Vite build tool, allowing Next.js applications to deploy to Cloudflare Workers with a single command and producing client bundles that are reportedly up to 57% smaller.</li>
<li style="font-weight:400;">The project was completed by one engineer in one week, using approximately $1,100 in AI tokens via the <a href="https://opencode.ai/docs/agents/">OpenCode agent</a> and <a href="https://www.anthropic.com/news/claude-opus-4-5">Claude Opus 4.5</a>, reducing what would traditionally have taken years of engineering to days. However, the result is explicitly experimental and not yet battle-tested at scale.</li>
<li style="font-weight:400;">A key practical concern is that vinext covers 94% of the Next.js API surface, with roughly 67,000 lines of code, compared with Next.js's 194,000, meaning edge cases and security auditing remain outstanding before production use at any meaningful traffic level.</li>
<li style="font-weight:400;">Cloudflare also released a migration agent skill that integrates with tools like Claude Code, <a href="https://cursor.com/">Cursor</a>, and <a href="https://openai.com/codex/">Codex</a>, allowing developers to run a single command to migrate an existing Next.js project to vinext, with compatibility checks, dependency installation, and config generation handled automatically.</li>
<li style="font-weight:400;">The broader implication for cloud engineers is that comprehensive open-source test suites now serve as a blueprint for AI-assisted rewrites, which puts pressure on commercial open-source business models that rely on deployment lock-in rather than infrastructure, support, or community as their primary differentiators.</li>
</ul>
<p>27:31 Ryan - “I feel like it's an awful precedent, right? Like, the whole point of open source is community collaboration, and this is directly in the face of that. Like, why would you release something open source if someone's just going to use an AI agent to create their own fork of it?”</p>
<p>31:58 <a href="https://blog.cloudflare.com/vulnerability-scanner/">Active defense: introducing a stateful vulnerability scanner for APIs</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.cloudflare.com/">Cloudflare</a> launched a beta Web and <a href="https://www.cloudflare.com/learning/security/api/what-is-api-security/">API</a> Vulnerability Scanner focused initially on <a href="https://owasp.org/API-Security/editions/2023/en/0xa1-broken-object-level-authorization/">BOLA (Broken Object Level Authorization)</a>, which is the top threat in the OWASP API Top 10. </li>
<li style="font-weight:400;">Unlike WAF rules that catch syntax-based attacks, BOLA involves valid authenticated requests that violate business logic, making them invisible to traditional defenses.</li>
<li style="font-weight:400;">The scanner is stateful, meaning it builds an API call graph from your OpenAPI spec and chains requests together logically, creating resources as an owner and then attempting to access them as an attacker. This solves a core limitation of legacy DAST tools that evaluate each request in isolation and miss authorization flaws that span multiple API calls.</li>
<li style="font-weight:400;">To handle ambiguous or inconsistent <a href="https://www.openapis.org/what-is-openapi">OpenAPI schemas</a>, the scanner uses Cloudflare Workers AI, which runs <a href="https://developers.cloudflare.com/workers-ai/models/gpt-oss-120b/">OpenAI's gpt-oss-120b model</a> with <a href="https://platform.openai.com/docs/guides/structured-outputs">structured outputs</a> to infer data dependencies between endpoints automatically. This removes the manual configuration burden that typically slows DAST tool deployment.</li>
<li style="font-weight:400;">Credential security is handled by the <a href="https://developer.hashicorp.com/vault/docs/secrets/transit">HashiCorp Vault Transit Secret Engine</a>, where credentials are encrypted immediately upon submission and decrypted only by the specific Rust worker executing the test. This is a notable design choice, given that vulnerability scanners, by definition, need access to valid API credentials.</li>
<li style="font-weight:400;">The scanner is now available in open beta for API Shield customers via the API, allowing teams to trigger scans and pull results into CI/CD pipelines or security dashboards. </li>
<li style="font-weight:400;">Cloudflare plans to extend coverage to OWASP Web Top 10 threats like SQLi and XSS in future releases.</li>
</ul>
<p>33:22  Ryan - “This is super cool. This is the AI-enhanced security scanning I’ve been waiting for.” </p>
<h2>AWS</h2>
<p>34:43 <a href="https://www.cnbc.com/2026/03/10/amazon-plans-deep-dive-internal-meeting-address-ai-related-outages.html">Amazon plans 'deep dive' internal meeting to address outages</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.cnbc.com/quotes/AMZN/">Amazon</a>'s retail site experienced four Sev 1 outages in a single week, including a <a href="https://www.cnbc.com/2026/03/05/amazon-online-store-suffers-outage-for-some-users.html">six-hour checkout and account access failure</a> on March 5, prompting an internal deep-dive meeting led by SVP Dave Treadwell to review the availability posture.</li>
<li style="font-weight:400;">An internal document initially cited GenAI-assisted changes as a contributing factor to a trend of incidents since Q3. </li>
<li style="font-weight:400;">Still, that reference was removed before the meeting, and Amazon later clarified that only one incident involved AI and none involved AI-written code.</li>
<li style="font-weight:400;">Amazon is implementing new safeguards that require additional review of GenAI-assisted production changes, with Treadwell acknowledging that best practices for using generative AI in production environments have not yet been fully established.</li>
<li style="font-weight:400;">A separate AWS outage in December was linked to the <a href="https://www.cnbc.com/2025/07/14/aws-launches-kiro-ai-coding-program.html">Kiro AI coding tool</a>. However, Amazon attributed that incident to user error rather than the AI itself, highlighting an ongoing pattern of questions around AI tooling in production deployments.</li>
<li style="font-weight:400;">With Amazon projecting $200 billion in capital expenditures this year while simultaneously reducing its workforce by tens of thousands, the reliability of AI-assisted development workflows becomes a practical concern for any organization adopting similar tooling at scale.</li>
</ul>
<p>36:36  Ryan - “Hold on to your butts, but we’re going to see a lot more of this.” </p>
<p>39:00 <a href="https://aws.amazon.com/about-aws/whats-new/2026/03/dbsp-opensearch-service-neptune-analytics/">Database Savings Plans now supports Amazon OpenSearch Service and </a><a href="https://aws.amazon.com/about-aws/whats-new/2026/03/dbsp-opensearch-service-neptune-analytics/">Amazon Neptune Analytics</a></p>
<ul>
<li style="font-weight:400;">Database Savings Plans now cover <a href="https://aws.amazon.com/opensearch-service/">Amazon OpenSearch Service</a> and <a href="https://aws.amazon.com/neptune/">Amazon Neptune Analytics</a>, offering up to 35% savings with a one-year commitment and no upfront payment required.</li>
<li style="font-weight:400;">The plans apply automatically across serverless and provisioned instances regardless of engine, instance family, size, or region, so customers can switch instance types like moving from m7i.large.search to c8g.2xlarge.search without losing their discount.</li>
<li style="font-weight:400;">This expansion is useful for organizations running search or graph analytics workloads at scale, since Neptune Analytics and OpenSearch can carry substantial hourly costs that benefit from committed-use pricing.    </li>
<li style="font-weight:400;">Customers can use the Savings Plans Purchase Analyzer in the AWS Billing and Cost Management Console to model custom scenarios before committing, which reduces the guesswork in sizing a commitment.</li>
<li style="font-weight:400;">Available now in all AWS regions except China. </li>
<li style="font-weight:400;">Pricing details are available <a href="http://aws.amazon.com/savingsplans/database-pricing">here</a>.</li>
</ul>
<p>39:34  Justin - “Finally. Thank you.” </p>
<p>40:54 <a href="https://aws.amazon.com/about-aws/whats-new/2026/03/elastic-beanstalk-ai-analysis/">AWS Elastic Beanstalk now offers AI-powered environment analysis</a></p>
<ul>
<li style="font-weight:400;">A<a href="https://docs.aws.amazon.com/elasticbeanstalk/latest/dg/Welcome.html">WS Elastic Beanstalk</a> now integrates with <a href="https://aws.amazon.com/bedrock/">Amazon Bedrock</a> to provide AI-powered analysis of environment health issues, automatically collecting events, instance health data, and logs to generate step-by-step troubleshooting recommendations without manual log review.</li>
<li style="font-weight:400;">The feature is triggered from the Elastic Beanstalk console via an AI Analysis button when environment health reaches Warning, Degraded, or Severe status, and is also accessible programmatically through the existing <a href="https://docs.aws.amazon.com/elasticbeanstalk/latest/api/API_RequestEnvironmentInfo.html">RequestEnvironmentInfo</a> and <a href="https://docs.aws.amazon.com/elasticbeanstalk/latest/api/API_RetrieveEnvironmentInfo.html">RetrieveEnvironmentInfo</a> CLI and API operations.</li>
<li style="font-weight:400;">This is a practical addition for teams managing Beanstalk environments who want to reduce mean time to resolution, particularly useful for developers who may not have deep operational expertise in diagnosing platform-level issues.</li>
<li style="font-weight:400;">Availability is limited to regions where both Elastic Beanstalk and Amazon Bedrock are supported, so teams in regions without Bedrock coverage will not have access, and AWS has not published specific pricing details for this feature beyond standard Beanstalk and Bedrock usage costs.</li>
<li style="font-weight:400;">This continues a broader AWS pattern of embedding Bedrock-powered assistance into existing managed services, similar to features seen in other consoles, positioning AI-assisted operations as a standard capability rather than a standalone product.</li>
</ul>
<p>41:55  Matt - “I will say troubleshooting Beanstalk is a pain in the butt. It just says ‘degraded’ and you’re like ‘why’? And at one point, I had an issue with Beanstalk where it needed a specific CloudWatch put metric in order to do it; it got to the point I opened a support case, and asked AWS why it wasn't working. And they're like, here's this - buried 17 pages into… so I can definitely see it being useful.”</p>
<p>43:13 <a href="https://aws.amazon.com/about-aws/whats-new/2026/03/amazon-connect-health-agentic-ai-healthcare/">Introducing Amazon Connect Health, Agentic AI Built for Healthcare</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/health/connect-health/">Amazon Connect Health</a> is now generally available, offering five purpose-built AI agents targeting healthcare administrative workflows, including patient verification, appointment scheduling, ambient documentation, patient insights, and medical coding with ICD-10 and CPT code generation.</li>
<li style="font-weight:400;">The service is HIPAA-eligible and integrates natively with <a href="https://docs.aws.amazon.com/connect/latest/adminguide/what-is-amazon-connect.html">Amazon Connect</a>,  allowing contact center and point-of-care workflows to be configured in minutes rather than months, which is a notable deployment speed advantage for healthcare IT teams.</li>
<li style="font-weight:400;">The two GA agents (patient verification and ambient documentation) are ready for production use today, while appointment management, patient insights, and medical coding remain in preview, so organizations should plan adoption timelines accordingly.</li>
<li style="font-weight:400;">Point-of-care capabilities like ambient listening and medical coding are accessible via a unified SDK, letting developers embed these features directly into existing EHR systems rather than requiring a full platform migration.</li>
<li style="font-weight:400;">The service is currently limited to US East (N. Virginia) and US West (Oregon), and AWS has not published specific pricing details publicly, so healthcare organizations will need to engage AWS directly to understand cost structures before planning deployments.</li>
</ul>
<p>43:45  Justin - “This is a great example of a really purpose-built AI that has a specific use case, and I’d almost rather talk to the AI at any time of the day that can book my appointment rather than waiting for the office to open during the day when I’m busy.” </p>
<p>27:58 <a href="https://aws.amazon.com/about-aws/whats-new/2026/03/amazon-lightsail-openclaw/">Amazon Lightsail now offers OpenClaw, a private self-hosted AI assistant</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/lightsail/pricing/">Amazon Lightsail</a> now supports deploying <a href="https://openclaw.ai/">OpenClaw</a>, a self-hosted AI assistant that runs on your own Lightsail instance, giving users a private alternative to cloud-based AI services where data stays within their own infrastructure.</li>
<li style="font-weight:400;">The offering includes several built-in security features out of the box: sandboxed agent sessions, one-click HTTPS without manual TLS setup, device pairing authentication, and automatic configuration snapshots, reducing the typical operational overhead of self-hosting AI tools.</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/bedrock/">Amazon Bedrock</a> serves as the default model provider, which ties this directly into the broader AWS AI ecosystem, though users can swap models or connect to messaging platforms like Slack, Telegram, WhatsApp, and Discord for different workflows.</li>
<li style="font-weight:400;">Pricing follows standard Lightsail instance pricing rather than a separate AI-specific cost structure, which may make this appealing for small teams or developers who want predictable monthly costs; check the <a href="https://aws.amazon.com/lightsail/pricing/">Lightsail pricing</a> page at aws.amazon.com/lightsail/pricing for current instance rates.</li>
<li style="font-weight:400;">The feature is available across <a href="https://docs.aws.amazon.com/lightsail/latest/userguide/understanding-regions-and-availability-zones-in-amazon-lightsail.html">15 AWS Regions</a>, including US East, US West, Frankfurt, London, Tokyo, and Jakarta, and can be accessed directly from the Lightsail console with quick start documentation available for getting up and running quickly.</li>
</ul>
<p>44:46  Justin - “If you want to try it (OpenClaw) and you can’t get a Mac Mini because everyone is buying them for their OpenClaw implementations, Amazon Lightsail now supports (it).” </p>
<p>47:22 <a href="https://aws.amazon.com/about-aws/whats-new/2026/03/amazon-opensearch-ingestion-unified-ingestion-endpoint-opentelemetry">Amazon OpenSearch Ingestion now supports a unified ingestion endpoint </a><a href="https://aws.amazon.com/about-aws/whats-new/2026/03/amazon-opensearch-ingestion-unified-ingestion-endpoint-opentelemetry">for OpenTelemetry data</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/general/latest/gr/opensearch-service.html#opensearch-service-regions">Amazon OpenSearch Ingestion</a> now accepts logs, metrics, and traces through a single unified pipeline endpoint, eliminating the previous requirement to run three separate pipelines for each OpenTelemetry signal type.</li>
<li style="font-weight:400;">The consolidation reduces operational overhead around access control, monitoring, and lifecycle management, which translates to lower infrastructure costs for teams running observability at scale.</li>
<li style="font-weight:400;">A practical benefit is incremental OpenTelemetry adoption: teams can start with one signal type and add others later without reconfiguring the pipeline, lowering the barrier to getting started.</li>
<li style="font-weight:400;">Signal correlation becomes more straightforward when all three data types flow through a centralized pipeline, giving teams a more complete view of application health in one place.</li>
<li style="font-weight:400;">The unified endpoint is available now in all regions where Amazon OpenSearch Ingestion is supported, and customers can configure it through the AWS Management Console or CLI. </li>
<li style="font-weight:400;">Pricing follows existing OpenSearch Ingestion rates based on Ingestion OCUs, so no new cost model is introduced.</li>
</ul>
<p>47:54  Ryan - “I mean, at the ingestion layer? I don’t know. Because this is really at the logs- equivalent…”</p>
<p>48:27 <a href="https://aws.amazon.com/blogs/containers/announcing-the-end-of-support-for-the-aws-copilot-cli/">Announcing the end-of-support for the AWS Copilot CLI</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.github.io/copilot-cli/">AWS Copilot CLI</a> reaches end of support on June 12, 2026, meaning it will no longer receive new features or security updates, though it remains available as an open-source project on GitHub.</li>
<li style="font-weight:400;">AWS recommends two primary migration paths: <a href="https://docs.aws.amazon.com/AmazonECS/latest/developerguide/express-service-overview.html">Amazon ECS Express Mode</a> for teams wanting a fast, opinionated path to production with automatic ALB, TLS, and auto-scaling provisioning, and <a href="https://docs.aws.amazon.com/prescriptive-guidance/latest/aws-cdk-layers/layer-3.html">AWS CDK L3</a> constructs for teams needing fine-grained infrastructure control in familiar programming languages.</li>
<li style="font-weight:400;">ECS Express Mode is the closest functional replacement for Copilot's most common patterns, supporting shared Application Load Balancers across up to 25 services and eliminating the need to learn a custom manifest format.</li>
<li style="font-weight:400;">Teams migrating Worker Services, Backend Services, and Scheduled Jobs have specific CDK construct equivalents available, including QueueProcessingFargateService for SQS-based workloads and ScheduledFargateTask for cron-based jobs.</li>
<li style="font-weight:400;">Since Copilot uses standard CloudFormation under the hood, teams can also simply adopt the existing generated stacks and manage them directly, which represents the lowest-effort migration option for teams not ready to switch tooling.</li>
</ul>
<p>49:26  Justin - “ I mean, yeah, this is kind of the first step into a fully managed world of ECS, and I remember when it came out we talked about it and was like, well, this is nice, but we really want what became Amazon ECS Express, and so they kind of deprecated themselves in their own way with better solution.”</p>
<p>51:04 <a href="https://aws.amazon.com/about-aws/whats-new/2026/03/amazon-route-53-global-resolver/">Amazon Route 53 Global Resolver is now generally available</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/route53/global-resolver/">Amazon Route 53 Global Resolver</a> is now generally available across 30 AWS Regions, expanding from the 11-region <a href="https://aws.amazon.com/blogs/aws/introducing-amazon-route-53-global-resolver-for-secure-anycast-dns-resolution-preview/">preview shown at re:Invent 2025</a>, with support for both IPv4 and IPv6 DNS query traffic from any location.</li>
<li style="font-weight:400;">The service functions as an internet-reachable anycast DNS resolver, allowing authorized clients in an organization to resolve both public internet domains and private <a href="https://aws.amazon.com/route53/">Route 53</a> hosted zones without being tied to a specific network location.</li>
<li style="font-weight:400;">Security filtering is a core capability, blocking malicious domains, DNS tunneling, Domain Generation Algorithms, and now with GA, Dictionary DGA threats, alongside centralized query logging for visibility across the organization.</li>
<li style="font-weight:400;">This positions Global Resolver as a managed alternative to running your own DNS resolver infrastructure for distributed or remote workforces, reducing operational overhead while centralizing DNS policy enforcement.</li>
<li style="font-weight:400;">New customers get a 30-day free trial to evaluate the service, with pricing details available <a href="http://aws.amazon.com/route53/global-resolver">here</a>.</li>
</ul>
<p>51:57  Ryan - “I both love and hate this. Having operated a global Anycast resolver, I know how much of a pain it is, and so I wouldn't want to set another one up, and I would gladly pay Amazon to do that. However, I don't know that they're removing the annoying parts. And you add more abstraction, I wonder, troubleshooting failed queries; that's going to be really difficult. And you have a lot more control when you control the network for these things, and so I'm very dubious about this one. But if it just works, then it'll probably be worth it.”</p>
<p>53:29 <a href="https://aws.amazon.com/blogs/containers/automated-deployments-with-github-actions-for-amazon-ecs-express-mode/">Automated deployments with GitHub Actions for Amazon ECS Express </a><a href="https://aws.amazon.com/blogs/containers/automated-deployments-with-github-actions-for-amazon-ecs-express-mode/">Mode</a></p>
<ul>
<li style="font-weight:400;">AWS published a walkthrough for connecting GitHub Actions to <a href="https://docs.aws.amazon.com/AmazonECS/latest/developerguide/express-service-overview.html">Amazon ECS Express Mode</a>, automating the full pipeline from code commit to container deployment, including image builds, ECR pushes, and service updates without manual coordination.</li>
<li style="font-weight:400;">The integration uses <a href="https://docs.aws.amazon.com/IAM/latest/UserGuide/id_roles_providers_create_oidc.html">OIDC for authentication</a> instead of stored AWS credentials, meaning <a href="https://github.com/marketplace/actions/amazon-ecs-deploy-express-service-action-for-github-actions">GitHub Actions</a> receives temporary credentials that expire after each workflow run, which reduces the risk surface compared to long-lived access keys sitting in repository secrets.</li>
<li style="font-weight:400;">ECS Express Mode handles the infrastructure heavy lifting automatically, provisioning an <a href="https://aws.amazon.com/elasticloadbalancing/application-load-balancer/">ALB</a>, target groups, health checks, auto scaling based on CPU, and security groups, so teams get a production-ready stack from a minimal workflow configuration.</li>
<li style="font-weight:400;">Image tagging uses the first 7 characters of the git commit SHA, giving teams precise version traceability and a straightforward path to rollback by referencing a specific immutable image in ECS deployment history.</li>
<li style="font-weight:400;">Costs are usage-based, covering <a href="https://docs.aws.amazon.com/AmazonECS/latest/developerguide/AWS_Fargate.html">ECS Fargate</a> tasks, ECR storage, and data transfer, with no GitHub Actions charges for public repositories. The estimated setup time is 20 to 30 minutes, making this a relatively low-friction starting point for teams not yet running automated container deployments.</li>
</ul>
<h2>GCP</h2>
<p>55:59 <a href="https://cloud.google.com/blog/products/identity-security/introducing-the-google-cloud-recommended-security-checklist/">Introducing the Google Cloud recommended security checklist</a></p>
<ul>
<li style="font-weight:400;">Google Cloud published a <a href="http://github.com/GoogleCloudPlatform/ociso-solutions">recommended security checklist</a> at <a href="http://docs.cloud.google.com/docs/security/gcmvsp">docs.cloud.google.com/docs/security/gcmvsp</a>, featuring <a href="https://docs.cloud.google.com/docs/security/gcmvsp">60 curated controls</a> across six domains, including authentication, data protection, and network security, organized into Basic, Intermediate, and Advanced tiers.</li>
<li style="font-weight:400;">The checklist is directly motivated by data from the <a href="https://cloud.google.com/resources/content/cloud-threat-horizons-report-h2-2025">2025 Google Cloud Threat Horizons Report</a>, which found that weak credentials and misconfigurations account for nearly 76% of cloud compromise (that’s a BIG number), making these controls particularly relevant for organizations assessing their baseline posture.</li>
<li style="font-weight:400;">A companion <a href="https://github.com/GoogleCloudPlatform/ociso-solutions/tree/main/gcmvsp">Terraform repository on GitHub</a> provides deployable code for the controls, moving the checklist beyond documentation into something teams can act on immediately and consistently.</li>
<li style="font-weight:400;">The checklist is free to use and aligns with the open <a href="https://mvsp.dev/">Minimum Viable Secure Product</a> framework, meaning organizations can cross-reference it against existing compliance or vendor-neutral security standards they may already be tracking.</li>
<li style="font-weight:400;">Early access customers reported being able to identify and activate critical controls in a single session, which suggests this is a practical tool for teams that need to establish or audit a security baseline without extensive prior GCP expertise.</li>
</ul>
<p>56:52  Ryan - “So, your mileage may vary. Some of the code that they have in the solution requires really, really high privileges to run in your GCP environment, so it's one of those things where you might not be able to get that far with it unless you're administering the cloud directly. But it's definitely, I think, a lot of really good, useful things that you can then take… anything that allows people to focus on what they care about is pretty great.”</p>
<p>58:06 <a href="https://cloud.google.com/blog/topics/telecommunications/new-agents-for-the-autonomous-network-operations-framework/">New agents for the Autonomous Network Operations framework</a></p>
<ul>
<li style="font-weight:400;">Google Cloud expanded its <a href="https://cloud.google.com/blog/topics/telecommunications/the-autonomous-network-operations-framework-for-csps">Autonomous Network Operations framework</a> with two new components: the Autonomous Data Steward and the Core Network VoLTE Agent, both built on Gemini and targeted at telecom operators managing complex network infrastructure.</li>
<li style="font-weight:400;">The Autonomous Data Steward addresses a core scaling problem by using a zero-copy architecture with <a href="https://cloud.google.com/dataplex">Dataplex Universal Catalog</a> to store metadata pointers instead of duplicating datasets, reducing storage costs by up to 70% while enabling real-time data access across previously siloed domains like RAN, Core, and Probes.</li>
<li style="font-weight:400;">The VoLTE Agent builds on the Data Steward foundation to monitor voice quality metrics like Call Setup Success Rates and Mean Opinion Scores, correlate SIP and Diameter signaling data for root cause analysis, and recommend corrective actions like call routing adjustments without requiring manual intervention.</li>
<li style="font-weight:400;">New Zealand telecom provider One NZ is already deploying the VoLTE Agent in production, which gives this announcement a concrete, real-world validation point rather than remaining purely a proof-of-concept offering.</li>
<li style="font-weight:400;">Google and Future Connections have open-sourced the core methodologies behind these agents, allowing telecom operators to build and customize their own agentic workflows; interested parties need to contact their Google Account Team for early access, and pricing is not publicly listed.</li>
</ul>
<p>58:39  Justin - “This is all a lot of stuff for TelCo’s, but it’s cool, if you’re into geeky TelCo things, check it out.” </p>
<p>59:24 <a href="https://blog.google/innovation-and-ai/products/notebooklm/generate-your-own-cinematic-video-overviews-in-notebooklm/">NotebookLM adds Cinematic Video Overviews</a></p>
<ul>
<li style="font-weight:400;"><a href="https://blog.google/innovation-and-ai/products/notebooklm/">NotebookLM</a>'s Cinematic <a href="https://blog.google/innovation-and-ai/models-and-research/google-labs/notebooklm-video-overviews-studio-upgrades/">Video Overviews</a> moves beyond static narrated slides to generate fluid animations and detailed visuals from user-provided sources, using a combination of <a href="https://deepmind.google/models/gemini/">Gemini 3</a> and <a href="https://aistudio.google.com/models/veo-3">Veo 3</a> models working together.</li>
<li style="font-weight:400;">Gemini functions as a creative director in this pipeline, handling narrative structure, visual style selection, format decisions, and self-refinement passes to maintain consistency across the generated video.</li>
<li style="font-weight:400;">This is a consumer-facing AI feature rather than a direct GCP infrastructure offering, but it demonstrates practical multi-model orchestration that GCP customers building their own AI pipelines may find instructive.</li>
<li style="font-weight:400;">Availability is currently limited to English-language users on web and mobile who subscribe to <a href="https://gemini.google/us/subscriptions/?hl=en">Google AI Ultra</a>, which is priced at $249.99 per month, and is restricted to users 18 and older.</li>
<li style="font-weight:400;">The primary use cases center on education and knowledge synthesis, where users can transform documents, research, or other sources into video summaries, which could be relevant for training content, internal documentation, or learning platforms built on Google's ecosystem.</li>
</ul>
<p>1:00:21  Justin - “A little bit pricey to replace all the YouTubers, but coming soon.” </p>
<p>1:01:14 <a href="https://blog.google/innovation-and-ai/technology/developers-tools/gemini-embedding-2/">Gemini Embedding 2: Our first natively multimodal embedding model</a></p>
<ul>
<li style="font-weight:400;">Gemini Embedding 2 is now in Public Preview via the <a href="https://ai.google.dev/gemini-api/docs/embeddings">Gemini API</a> and <a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/models/gemini/embedding-2">Vertex AI</a>, marking Google's first natively multimodal embedding model built on the Gemini architecture. It maps text, images, video up to 120 seconds, audio, and PDFs into a single unified embedding space across 100-plus languages.</li>
<li style="font-weight:400;">A notable technical detail is that audio is embedded natively without requiring intermediate transcription, which removes a common pipeline step that previously added latency and potential accuracy loss in multimodal workflows.</li>
<li style="font-weight:400;">The model uses Matryoshka Representation Learning to support flexible output dimensions scaling down from a default of 3072, with recommended sizes of 3072, 1536, and 768. </li>
<li style="font-weight:400;">This lets developers trade off retrieval quality against storage and compute costs depending on their use case.</li>
<li style="font-weight:400;">Interleaved multimodal input, such as combining an image and text in a single request, allows the model to capture relationships between media types rather than treating each modality independently. </li>
<li style="font-weight:400;">This is particularly relevant for RAG pipelines, semantic search, and data clustering applications.</li>
<li style="font-weight:400;">Integration is available through <a href="https://docs.langchain.com/oss/python/integrations/text_embedding/google_generative_ai">LangChain</a>, <a href="https://developers.llamaindex.ai/python/framework/integrations/embeddings/google_genai/">LlamaIndex</a>, <a href="https://haystack.deepset.ai/integrations/google-genai">Haystack</a>, Weaviate, QDrant, ChromaDB, and Vertex AI Vector Search, meaning teams can adopt this model without significant changes to existing tooling. </li>
<li style="font-weight:400;">Pricing details are not specified in the announcement, so listeners should check the Vertex AI pricing page directly before planning production workloads.</li>
<li style="font-weight:400;">Interested in checking out that demo? Find it <a href="https://findmemedia.lmm.ai/">here</a>. </li>
</ul>
<p>1:02:29  Ryan - “I go back and forth on these multimodal, because I feel like there's so much bloat and we use the wrong model for so many use cases, and I feel like the multimodal is a really good way to do that. So it is interesting, I just haven't seen a use case where I would see a whole lot of benefit of being able to sort of use the multimodal model to get an answer out of an LLM that I wouldn't be able to get using other tools.”</p>
<p>1:03:28 <a href="https://blog.google/products-and-platforms/products/workspace/gemini-workspace-updates-march-2026/">Google shares Gemini updates to Docs, Sheets, Slides and Drive</a></p>
<ul>
<li style="font-weight:400;">Google is rolling out beta updates to Gemini across Docs, Sheets, Slides, and Drive that allow the assistant to pull context from a user's own files, emails, calendar, and the web when generating or editing content. </li>
<li style="font-weight:400;">This cross-source grounding is the core technical shift here, moving Gemini from a generic assistant to one that works with personal data.</li>
<li style="font-weight:400;">In Docs, new features include style matching across a document and format matching against a reference file, so Gemini can populate a travel itinerary template using flight and hotel details pulled directly from a user's Gmail inbox. This kind of structured extraction from unstructured personal data is worth noting for enterprise use cases.</li>
<li style="font-weight:400;">Sheets gets a "Fill with Gemini" capability that lets users drag down a column and have Gemini populate cells with real-time web data or summarized content, similar to how a formula works but using natural language and live search results. </li>
<li style="font-weight:400;">This could be useful for research-heavy workflows like competitive analysis or application tracking.</li>
<li style="font-weight:400;">Drive gains an AI Overview feature in search results that summarizes relevant file contents with citations before a user even opens a document, plus a new Ask Gemini panel for querying across documents, emails, and calendar simultaneously.</li>
<li style="font-weight:400;">Availability is limited to <a href="https://one.google.com/intl/en/about/google-ai-plans/">Google AI Ultra</a> and Pro subscribers at google.com/intl/en/about/google-ai-plans, with English-only support globally for Docs, Sheets, and Slides, and U.S.-only for Drive. Workspace business customers have a separate path through the <a href="https://workspace.google.com/blog/product-announcements/reimagining-content-creation">Google Workspace blog</a>.</li>
</ul>
<p>1:04:21  Justin - “So if you’re in the Google workspaces places, you’ve not got basically what Copilot gave you, but better.” </p>
<h2>Azure</h2>
<p>1:05:29 <a href="https://www.databricks.com/blog/azure-databricks-lakebase-generally-available">Azure Databricks Lakebase is Generally Available</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/databricks/introduction/">Azure Databricks</a> <a href="https://www.databricks.com/product/lakebase">Lakebase</a> is now generally available as a managed, serverless Postgres offering that stores operational data directly in lakehouse storage, eliminating the need for ETL pipelines between transactional systems and analytics workloads.</li>
<li style="font-weight:400;">The service separates compute from storage and scales to zero when idle, with usage-based pricing meaning customers pay only for compute consumed. Specific pricing details are not published in the announcement, so listeners should check the Azure Databricks pricing page for current rates.</li>
<li style="font-weight:400;">Lakebase integrates with <a href="https://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/">Unity Catalog</a>, giving teams a single governance layer covering operational, analytical, and AI workloads with consistent access control, lineage tracking, and auditing across the entire Databricks data estate.</li>
<li style="font-weight:400;">Developers get instant zero-copy branching and point-in-time recovery, allowing teams to test schema changes or debug against production data without affecting live users or requiring duplicate infrastructure.</li>
<li style="font-weight:400;">The service supports standard Postgres tooling, including pgAdmin, DBeaver, pgvector for AI search, and PostGIS for geospatial use cases, and integrates with <a href="https://www.microsoft.com/en-us/security/business/identity-access/microsoft-entra-id">Microsoft Entra ID</a> and Azure networking, making it a practical option for teams already invested in the Azure ecosystem.</li>
<li style="font-weight:400;">Cool. Glad to have another database available.</li>
</ul>
<p>1:07:17 <a href="https://www.microsoft.com/en-us/microsoft-365/blog/2026/03/09/copilot-cowork-a-new-way-of-getting-work-done/">Copilot Cowork: A new way of getting work done</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.microsoft.com/en-us/microsoft-365/blog/2026/03/09/copilot-cowork-a-new-way-of-getting-work-done/">Copilot Cowork</a> is a new <a href="https://www.microsoft.com/en-gb/microsoft-365">Microsoft 365</a> feature that moves Copilot beyond answering questions into actually executing multi-step work tasks, such as rescheduling calendar conflicts, building meeting packets, and coordinating product launch assets across Outlook, Teams, and Excel.</li>
<li style="font-weight:400;">The feature is powered by <a href="https://techcommunity.microsoft.com/blog/microsoft365copilotblog/a-closer-look-at-work-iq/4499789">Work IQ</a>, which pulls signals from across Microsoft 365 apps to give Copilot contextual understanding of your work before taking action, with user-controlled checkpoints to approve, pause, or modify tasks before changes are applied.</li>
<li style="font-weight:400;">A notable technical detail is that Cowork integrates <a href="https://claude.ai/new">Claude</a> from <a href="https://www.anthropic.com/">Anthropic</a> alongside Microsoft's own models, reflecting a multi-model approach where Copilot selects the most appropriate model for a given task rather than relying on a single provider.</li>
<li style="font-weight:400;">Enterprise governance is built in by default, with identity, permissions, and compliance policies applied automatically, and all actions running in a sandboxed cloud environment that keeps tasks progressing safely across devices.</li>
<li style="font-weight:400;">Cowork is currently in Research Preview with limited customers and will expand to the Frontier program in late March 2026, with no public pricing details announced yet, so organizations interested in early access should check the Frontier <a>here</a>.</li>
</ul>
<p>57:31 <a href="https://blogs.microsoft.com/blog/2026/03/09/introducing-the-first-frontier-suite-built-on-intelligence-trust/">Introducing the First Frontier Suite built on Intelligence + Trust </a></p>
<ul>
<li style="font-weight:400;">Microsoft announced <a href="https://microsoftpartners.microsoft.com/abs/Blog/?title=Introducing%20Microsoft%20365%20E7:%20The%20Frontier%20Suite">Microsoft 365 E7: The Frontier Suite</a>, available May 1 at $99 per user, bundling <a href="https://www.microsoft.com/en-us/microsoft-365/enterprise/e5">Microsoft 365 E5</a>, <a href="https://m365.cloud.microsoft/">Microsoft 365 Copilot</a>, and the new <a href="https://www.microsoft.com/en-us/microsoft-agent-365">Agent 365</a> into a single SKU that includes <a href="https://microsoft.github.io/EntraSuite-Training/">Entra Suite</a>, <a href="https://www.microsoft.com/en-gb/microsoft-365/microsoft-defender-for-individuals">Defender</a>, <a href="https://intune.microsoft.com">Intune</a>, and Purview capabilities.</li>
<li style="font-weight:400;">Agent 365, also generally available May 1 at $15 per user, functions as a control plane for AI agents, giving IT and security teams a single interface to observe, govern, and secure agents across the organization. </li>
<li style="font-weight:400;">Microsoft reports visibility into over 500,000 internal agents as Customer Zero, generating 65,000 daily responses in the past 28 days.</li>
<li style="font-weight:400;">Wave 3 of Microsoft 365 Copilot introduces model diversity by adding <a href="https://claude.ai/">Anthropic Claude</a> to mainline chat alongside <a href="https://openai.com/">OpenAI models</a>, and includes a research preview of Copilot Cowork for long-running multi-step tasks built in collaboration with Anthropic.</li>
<li style="font-weight:400;">The concept of Work IQ is central to this announcement, positioning Microsoft 365 Copilot as differentiated from generic model-plus-connector solutions by embedding organizational context about how people work, who they work with, and what content they use.</li>
<li style="font-weight:400;">Adoption metrics cited include paid Copilot seats growing over 160% year over year, daily active usage up ten times, and the number of customers deploying more than 35,000 seats tripling year over year, with 90% of Fortune 500 companies now using Copilot in some capacity.</li>
</ul>
<p>1:10:54  Ryan - “This is interesting; I know, in evaluations and talking to people from different companies, when they were rolling this out originally - I think it was something like 30 or 50 bucks a user, no one wanted to pay that price. And there was a minimum number of users. So it was a large amount of money.” </p>
<h2>Oracle </h2>
<p>1:12:29 <a href="https://blogs.oracle.com/cloud-infrastructure/introducing-ocis-cost-anomaly-detection">Introducing OCI’s Cost Anomaly Detection</a></p>
<ul>
<li style="font-weight:400;">Oracle launched <a href="https://docs.oracle.com/en-us/iaas/Content/Billing/Concepts/costanomalydetectionoverview.htm">OCI Cost Anomaly Detection</a> as a no-cost feature that uses machine learning to monitor daily cloud spend across all services and regions, alerting users when costs deviate from forecasted baselines. </li>
<li style="font-weight:400;">This is a welcome addition, given that most cloud providers offer similar capabilities, with AWS and Azure having had comparable tools for some time.</li>
<li style="font-weight:400;">The ML model accounts for daily, weekly, yearly, and holiday seasonality patterns, and users can provide feedback to improve accuracy and reduce false positives. </li>
<li style="font-weight:400;">Custom <a href="https://docs.oracle.com/en-us/iaas/Content/Billing/Concepts/costmonitors.htm">cost monitors</a> can be scoped by compartment or tags, which gives teams reasonable flexibility for environment or application-level tracking.</li>
<li style="font-weight:400;">Alert thresholds can be configured as absolute dollar amounts or percentage variances, which helps reduce alert noise by focusing notifications on anomalies that actually exceed meaningful cost boundaries. This is a practical design choice that avoids the common problem of alert fatigue in cost monitoring tools.</li>
<li style="font-weight:400;">Default monitors are automatically created at the tenancy, service, and region level, meaning customers get baseline coverage without any configuration, though teams with complex multi-compartment environments will likely need to invest time building custom monitors to get a genuinely useful signal.</li>
<li style="font-weight:400;">The feature is free, which removes the awkward situation of paying for a tool designed to help you avoid overspending, though the real value depends on how accurately the forecasting model performs in practice, something Oracle has not provided specific benchmark data on in this announcement.</li>
</ul>
<p>1:12:42  Justin - “This has been at every other cloud forever, so…” </p>
<p>1:13:24 <a href="https://www.oracle.com/news/announcement/q3fy26-earnings-release-2026-03-10/">Oracle Announces Fiscal Year 2026 Third Quarter Financial Results</a></p>
<ul>
<li style="font-weight:400;">Yeah, we know. They report at weird times. </li>
<li style="font-weight:400;">Oracle reported Q3 fiscal 2026 total revenue of $17.2 billion, up 22% year-over-year, with cloud revenue specifically hitting $8.9 billion, a 44% increase, marking the first quarter in over 15 years where both organic revenue and non-GAAP EPS grew at 20% or more simultaneously.</li>
<li style="font-weight:400;">The Remaining Performance Obligations figure of $553 billion, up 325% from last year, is the headline number worth scrutinizing, as Oracle notes most of this growth comes from large-scale AI contracts funded either through customer prepayments for GPU purchases or customer-supplied hardware, which is a notably different model than traditional cloud commitments.</li>
<li style="font-weight:400;">Oracle raised $30 billion in debt and equity financing within days of announcing a $50 billion capital raise program, with the proceeds tied to funding infrastructure for AI training and inferencing capacity, and the company is projecting $50 billion in capital expenditures for fiscal year 2026.</li>
<li style="font-weight:400;">Oracle is openly stating it has restructured product development teams into smaller groups due to AI code generation tools, framing this as a cost reduction and productivity improvement for SaaS development, though the workforce implications of building more software with fewer people deserve attention.</li>
<li style="font-weight:400;">The company raised fiscal year 2027 total revenue guidance to $90 billion, up from prior estimates, while maintaining fiscal year 2026 guidance of $67 billion, suggesting Oracle is betting heavily that AI infrastructure demand will remain supply-constrained and that its cloud positioning will capture a meaningful share of that spending.</li>
</ul>
<p>1:14:47  Justin - “That’s a pretty good bet, so I get it. I also think Oracle is kind of lucking into the multi-cloud…because people are having to adopt Oracle cloud to get the capacity they need.”  </p>
<h2>After Show </h2>
<p>57:31 <a href="https://www.geekwire.com/2026/xbox-surprise-microsoft-reveals-next-generation-project-helix-console/">Xbox surprise: Microsoft reveals 'Project Helix' as the codename of its next </a><a href="https://www.geekwire.com/2026/xbox-surprise-microsoft-reveals-next-generation-project-helix-console/">console </a></p>
<ul>
<li style="font-weight:400;">Microsoft revealed the codename Project Helix for its next-generation Xbox console, confirmed by new Xbox CEO Asha Sharma, who recently replaced Phil Spencer after his 38-year tenure at Microsoft.</li>
<li style="font-weight:400;">The announcement is notable given persistent industry speculation that Microsoft might exit the console hardware business entirely, suggesting the gaming division intends to continue through at least one more console generation.</li>
<li style="font-weight:400;">Project Helix is described as leading in performance and supporting both Xbox and PC games, continuing the cross-platform compatibility direction Microsoft has pursued in recent years.</li>
<li style="font-weight:400;">A current RAM shortage driven by AI data center demand is affecting the broader hardware supply chain, potentially pushing the console's release beyond the initially rumored late-2027 window, which is a direct example of how AI infrastructure buildout creates ripple effects across other tech sectors.</li>
<li style="font-weight:400;">For cloud professionals, this is worth watching because Xbox hardware increasingly ties into Microsoft's cloud gaming and Game Pass ecosystem, meaning console generation transitions have implications for Azure-based gaming services and infrastructure planning.</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2401248/c1e-8m9mbv9d37uvqvrn-0v9zn93jf7kv-1e23se.mp3" length="151721731"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 346 of The Cloud Pod, where the forecast is always cloudy! Hold on to your butts, because Justin, Ryan, and Matt are in the studio today, and they’re ready to bring you all the latest in Cloud and AI news, including the usual: Meta buying social networks, Amazon responding to outages, and OpenAI giving up another version of GPT. Let’s get into it! 
Titles we almost went with this week

✍️ Cloudflare Spent $1100 to Rewrite Next.js in a Week
 One Pipe to Rule All Your OpenTelemetry Data
☑️ Check Yourself Before Google Wrecks Your Cloud Config
 Copilot Takes Jira Tickets So You Don't Have To
‍✈️ GitHub Copilot Agent Joins Your Jira Workflow Uninvited
 When AI Agents Network, Meta Swipes Right on Moltbook
️ Sixty Controls Walk Into a Terraform Repository
 One Security Console to Rule All Your Clouds
 AI Ate My Lock-In, and I Feel Fine
⛅ Oracle Sees $90 Billion Future Cloudy With a Chance of GPUs
 Your API Has Trust Issues, and We Can Prove It
 Stop Running Three Pipelines Like a Telemetry Hoarder
 From Database Dinosaur to AI Cash Cow
☠️ Meta: Target acquired; must kill Moltbook
 Meta saw Moltbook and said, “WE MUST OWN IT AND KILL.”

Follow Up
00:51 Where things stand with the Department of War 

Anthropic has been designated a supply chain risk to US national security by the Department of War, a designation the company is challenging in court as legally unsound under 10 USC 3252.
The practical scope of the designation is narrow, applying only to the use of Claude in direct Department of War contracts, not to all customers that hold such contracts or to unrelated business with Anthropic.
Anthropic has stated that it will continue to provide its models to the Department of War and the national security community at nominal cost, with ongoing engineering support, during any transition period and for as long as permitted.
The company's two stated exceptions to military use involve fully autonomous weapons and mass domestic surveillance, and Anthropic has clarified these do not extend to operational decision-making, which it considers the military's domain.
For cloud and enterprise customers, the key takeaway is that existing Claude deployments unrelated to Department of War contracts remain unaffected, though the legal dispute introduces uncertainty into federal procurement pipelines involving AI services.
We will keep you updated on this in 12-18 months…

AI Is Going Great - Or How ML Makes Money 
01:21 Introducing GPT-5.4

OpenAI released GPT-5.4 across ChatGPT, the API, and Codex, positioning it as their most capable reasoning model to date. It merges the coding strengths of GPT-5.3-Codex with general reasoning, professional knowledge work, and native computer-use capabilities in a single model.
The computer-use capabilities are a notable technical step, with GPT-5.4 achieving a 75% success rate on OSWorld-Verified desktop navigation, surpassing the reported human benchmark of 72.4% and...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2401248/c1a-k5d5-dm1gwj61ar92-fjgulh.jpg"></itunes:image>
                                                                            <itunes:duration>01:18:38</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2401248/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[345: Damn It… my excuse is now gone for Disaster Recovery]]>
                </title>
                <pubDate>Thu, 12 Mar 2026 00:43:12 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2391099</guid>
                                    <link>https://tcpfm.castos.com/episodes/345-damn-it-my-excuse-is-now-gone-for-disaster-recovery</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 345 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Matt are in the studio this week and are ready to bring you all the latest in cloud and AI news, including what’s going on between Anthropic, the DOD, and OpenAI, what the war means for Middle East data centers (Spoiler – I hope you have a good Disaster Recovery plan), and Transit Gateway pricing changes that are enough to make a grown man cry. And don’t bother waiting: Matt has completely forgotten almost two years of “bye everybody” and now claims full amnesia as to what his outtro is. Oh well. Let’s get into today’s show. </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> Claude Learned to Use a Computer Better Than Your Dad **OpenAI</li>
<li> Amazon and OpenAI’s $138 Billion AI Bromance</li>
<li> When Two AZs Go Dark the Cloud Gets Crispy</li>
<li> Fifty Billion Reasons AWS Loves OpenAI Now **Anthropic</li>
<li> Azure Still Wins Even When AWS Thinks It Did</li>
<li> Fire, Water, and a Multi-AZ Assumption Goes Up in Smoke</li>
<li> Claude Refuses to Go Full Skynet for the Pentagon</li>
<li> GPT-5.3 Instant Finally Stops Lecturing You</li>
<li> No Killer Robots Without Human Approval Please</li>
<li> Terraform Finally Sees Your Forgotten Cloud Resources</li>
<li> Stage Before You Rage Deploy Azure Firewall</li>
<li> CrowdStrike to Zscaler AWS Wants Your Security Tab</li>
<li> One Hub to Rule Your API Sprawl</li>
<li> Transit Gateway Attachments Just Got Surprisingly Expensive</li>
<li> Azure Container Registry Finally Has Room for Your AI Hoarding</li>
<li> Bedrock Gets a Roommate OpenAI Moves In</li>
<li> Azure Firewall Gets a Safety on the Trigger</li>
<li> Stop Writing Scripts, Just Import the Dang Infrastructure</li>
<li> Audit Your APIs Before March 2026 Bites You</li>
<li> Damn it… my excuse not to DR is gone</li>
<li> I’m Epically Furious about DR</li>
</ul>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>03:34 <a href="https://www.anthropic.com/news/acquires-vercept">Anthropic acquires Vercept to advance Claude’s computer use capabilities </a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/">Anthropic</a> acquired <a href="https://vercept.com/">Vercept</a>, a team specializing in AI perception and interaction, to strengthen Claude’s computer use capabilities. </li>
<li style="font-weight:400;">The Vercept founders, including Ross Girshick, bring deep expertise in how AI systems visually interpret and interact with software interfaces.</li>
<li style="font-weight:400;"><a href="https://www.anthropic.com/claude/sonnet">Claude Sonnet 4.6</a> shows substantial improvement in computer use benchmarks, jumping from under 15% on the <a href="https://os-world.github.io/">OSWorld evaluation</a> in late 2024 to 72.5% today. </li>
<li style="font-weight:400;">The model is now approaching human-level performance on tasks like navigating spreadsheets and completing multi-tab web forms.</li>
<li style="font-weight:400;">Computer use enables Claude to operate inside live applications the way a human would, handling multi-step workflows across tools that cannot be automated through code alone. </li>
<li style="font-weight:400;">This is relevant for enterprise use cases involving document processing, browser-based workflows, and cross-application task management.</li>
<li style="font-weight:400;">This is Anthropic’s second acquisition in a short period, following the purchase of Bun, which was tied to the Claude Code milestone. The pattern suggests Anthropic is actively acquiring specialized engineering teams rather than just technology assets.</li>
<li style="font-weight:400;">For developers and businesses building agentic workflows on Claude, the improved computer use performance means more reliable automation of complex, real-world software tasks without requiring custom integrations or APIs for every application involved.</li>
</ul>
<p>05:18  Justin – “It seems like every day I have to upda...</p>
<h3>Chapters</h3>
<ul><li>(00:00:00) - Foreign, Where the forecast is always cloudy</li><li>(00:01:04) - Let's Talk Cloud</li><li>(00:01:37) - Anthropic Expands Cloud's Computer Use Capabilities</li><li>(00:07:17) - How to Write a Review in AI-Native</li><li>(00:10:07) - Alexis Skill Creator Update & Cloud Agent</li><li>(00:12:18) - Anthropic Banished from Supporting the Military</li><li>(00:18:49) - Google AI's Gemini 3.1 & Nanobana</li><li>(00:20:07) - Chat GPT 5.3: More Alikes, Less Dist</li><li>(00:24:08) - Comments on the AI News</li><li>(00:24:58) - Amazon AWS: Drone Strike in the Middle East Affects Infrastructure</li><li>(00:30:52) - Azure vs. Google: The Distributed Data Center</li><li>(00:34:06) - OpenAI Expands Cloud Deal to $100 Million</li><li>(00:39:51) - Amazon Security Hub Extended: Full Stack Enterprise Security with Curated Partner</li><li>(00:41:57) - Amazon's Encryption Controls: Starting Soon</li><li>(00:44:17) - Amazon Cloud: Natural Language to Cedar Compliance</li><li>(00:47:13) - API Specs: Combat Specs Sprawl</li><li>(00:48:39) - Google Cloud's Polyglot Storage approach for Chatbot</li><li>(00:52:53) - Matt's Azure Quotation</li><li>(00:53:21) - Azure Monitor Pipeline: New Public Preview</li><li>(00:55:30) - Microsoft Azure Local Disconnected and Large Model Support</li><li>(00:58:39) - Azure Functions for Linux: Best Practices for Self-signed Cert</li><li>(01:00:54) - Microsoft Azure Confidential VMs: Learning the Names</li><li>(01:04:44) - Azure firewall policy: Two-Phase Draft and Deploy</li><li>(01:05:51) - Azure container registry: 100 terabytes of storage</li><li>(01:09:00) - Azure 2.8 Resource Limits</li><li>(01:09:52) - This Week in the Cloud: OpenAI and the Trump Administration</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 345 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Matt are in the studio this week and are ready to bring you all the latest in cloud and AI news, including what’s going on between Anthropic, the DOD, and OpenAI, what the war means for Middle East data centers (Spoiler – I hope you have a good Disaster Recovery plan), and Transit Gateway pricing changes that are enough to make a grown man cry. And don’t bother waiting: Matt has completely forgotten almost two years of “bye everybody” and now claims full amnesia as to what his outtro is. Oh well. Let’s get into today’s show. 
Titles we almost went with this week

 Claude Learned to Use a Computer Better Than Your Dad **OpenAI
 Amazon and OpenAI’s $138 Billion AI Bromance
 When Two AZs Go Dark the Cloud Gets Crispy
 Fifty Billion Reasons AWS Loves OpenAI Now **Anthropic
 Azure Still Wins Even When AWS Thinks It Did
 Fire, Water, and a Multi-AZ Assumption Goes Up in Smoke
 Claude Refuses to Go Full Skynet for the Pentagon
 GPT-5.3 Instant Finally Stops Lecturing You
 No Killer Robots Without Human Approval Please
 Terraform Finally Sees Your Forgotten Cloud Resources
 Stage Before You Rage Deploy Azure Firewall
 CrowdStrike to Zscaler AWS Wants Your Security Tab
 One Hub to Rule Your API Sprawl
 Transit Gateway Attachments Just Got Surprisingly Expensive
 Azure Container Registry Finally Has Room for Your AI Hoarding
 Bedrock Gets a Roommate OpenAI Moves In
 Azure Firewall Gets a Safety on the Trigger
 Stop Writing Scripts, Just Import the Dang Infrastructure
 Audit Your APIs Before March 2026 Bites You
 Damn it… my excuse not to DR is gone
 I’m Epically Furious about DR

AI Is Going Great – Or How ML Makes Money 
03:34 Anthropic acquires Vercept to advance Claude’s computer use capabilities 

Anthropic acquired Vercept, a team specializing in AI perception and interaction, to strengthen Claude’s computer use capabilities. 
The Vercept founders, including Ross Girshick, bring deep expertise in how AI systems visually interpret and interact with software interfaces.
Claude Sonnet 4.6 shows substantial improvement in computer use benchmarks, jumping from under 15% on the OSWorld evaluation in late 2024 to 72.5% today. 
The model is now approaching human-level performance on tasks like navigating spreadsheets and completing multi-tab web forms.
Computer use enables Claude to operate inside live applications the way a human would, handling multi-step workflows across tools that cannot be automated through code alone. 
This is relevant for enterprise use cases involving document processing, browser-based workflows, and cross-application task management.
This is Anthropic’s second acquisition in a short period, following the purchase of Bun, which was tied to the Claude Code milestone. The pattern suggests Anthropic is actively acquiring specialized engineering teams rather than just technology assets.
For developers and businesses building agentic workflows on Claude, the improved computer use performance means more reliable automation of complex, real-world software tasks without requiring custom integrations or APIs for every application involved.

05:18  Justin – “It seems like every day I have to upda...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[345: Damn It… my excuse is now gone for Disaster Recovery]]>
                </itunes:title>
                                    <itunes:episode>345</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 345 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Matt are in the studio this week and are ready to bring you all the latest in cloud and AI news, including what’s going on between Anthropic, the DOD, and OpenAI, what the war means for Middle East data centers (Spoiler – I hope you have a good Disaster Recovery plan), and Transit Gateway pricing changes that are enough to make a grown man cry. And don’t bother waiting: Matt has completely forgotten almost two years of “bye everybody” and now claims full amnesia as to what his outtro is. Oh well. Let’s get into today’s show. </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> Claude Learned to Use a Computer Better Than Your Dad **OpenAI</li>
<li> Amazon and OpenAI’s $138 Billion AI Bromance</li>
<li> When Two AZs Go Dark the Cloud Gets Crispy</li>
<li> Fifty Billion Reasons AWS Loves OpenAI Now **Anthropic</li>
<li> Azure Still Wins Even When AWS Thinks It Did</li>
<li> Fire, Water, and a Multi-AZ Assumption Goes Up in Smoke</li>
<li> Claude Refuses to Go Full Skynet for the Pentagon</li>
<li> GPT-5.3 Instant Finally Stops Lecturing You</li>
<li> No Killer Robots Without Human Approval Please</li>
<li> Terraform Finally Sees Your Forgotten Cloud Resources</li>
<li> Stage Before You Rage Deploy Azure Firewall</li>
<li> CrowdStrike to Zscaler AWS Wants Your Security Tab</li>
<li> One Hub to Rule Your API Sprawl</li>
<li> Transit Gateway Attachments Just Got Surprisingly Expensive</li>
<li> Azure Container Registry Finally Has Room for Your AI Hoarding</li>
<li> Bedrock Gets a Roommate OpenAI Moves In</li>
<li> Azure Firewall Gets a Safety on the Trigger</li>
<li> Stop Writing Scripts, Just Import the Dang Infrastructure</li>
<li> Audit Your APIs Before March 2026 Bites You</li>
<li> Damn it… my excuse not to DR is gone</li>
<li> I’m Epically Furious about DR</li>
</ul>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>03:34 <a href="https://www.anthropic.com/news/acquires-vercept">Anthropic acquires Vercept to advance Claude’s computer use capabilities </a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/">Anthropic</a> acquired <a href="https://vercept.com/">Vercept</a>, a team specializing in AI perception and interaction, to strengthen Claude’s computer use capabilities. </li>
<li style="font-weight:400;">The Vercept founders, including Ross Girshick, bring deep expertise in how AI systems visually interpret and interact with software interfaces.</li>
<li style="font-weight:400;"><a href="https://www.anthropic.com/claude/sonnet">Claude Sonnet 4.6</a> shows substantial improvement in computer use benchmarks, jumping from under 15% on the <a href="https://os-world.github.io/">OSWorld evaluation</a> in late 2024 to 72.5% today. </li>
<li style="font-weight:400;">The model is now approaching human-level performance on tasks like navigating spreadsheets and completing multi-tab web forms.</li>
<li style="font-weight:400;">Computer use enables Claude to operate inside live applications the way a human would, handling multi-step workflows across tools that cannot be automated through code alone. </li>
<li style="font-weight:400;">This is relevant for enterprise use cases involving document processing, browser-based workflows, and cross-application task management.</li>
<li style="font-weight:400;">This is Anthropic’s second acquisition in a short period, following the purchase of Bun, which was tied to the Claude Code milestone. The pattern suggests Anthropic is actively acquiring specialized engineering teams rather than just technology assets.</li>
<li style="font-weight:400;">For developers and businesses building agentic workflows on Claude, the improved computer use performance means more reliable automation of complex, real-world software tasks without requiring custom integrations or APIs for every application involved.</li>
</ul>
<p>05:18  Justin – “It seems like every day I have to update Claude Code because they released a new feature or a new capability.” </p>
<p>12:34 <a href="https://claude.com/blog/improving-skill-creator-test-measure-and-refine-agent-skills">Improving skill-creator: Test, measure, and refine Agent Skills </a></p>
<ul>
<li style="font-weight:400;">Anthropic has updated its skill-creator tool for Claude Agent Skills, now available on <a href="http://claude.ai">Claude.ai</a>, <a href="https://claude.com/product/cowork">Cowork</a>, and as a <a href="https://github.com/anthropics/claude-plugins-official/tree/main/plugins/skill-creator">plugin for Claude Code</a>. </li>
<li style="font-weight:400;">The update brings software development practices like testing, benchmarking, and iterative refinement to skill authoring without requiring users to write code.</li>
<li style="font-weight:400;">The core addition is an eval framework that lets skill authors define test prompts, describe expected outputs, and verify skill behavior across model updates. </li>
<li style="font-weight:400;">A practical example given is the PDF skill fix, where evals isolated a positioning failure on non-fillable forms and guided a targeted fix.</li>
<li style="font-weight:400;">A new benchmark mode tracks eval pass rate, elapsed time, and token usage, and can be integrated into CI systems or local dashboards. Multi-agent parallel eval execution is also included to reduce test time and prevent context bleed between runs.</li>
<li style="font-weight:400;">Comparator agents enable A/B testing between two skill versions or between a skill and no skill, with blind judging to reduce bias in assessing whether a change improves output quality.</li>
<li style="font-weight:400;">Anthropic notes that as base-model capabilities improve, some capability-uptake skills may become unnecessary, and the eval framework is positioned as a step toward skills being defined by natural-language descriptions of desired outcomes rather than detailed implementation instructions.</li>
</ul>
<p>13:54  Justin – “For things that are actually in pipelines or agentic capabilities where you want things to be specific, this is great.” </p>
<p>14:35 <a href="https://www.anthropic.com/news/statement-comments-secretary-war">Statement on the comments from Secretary of War Pete Hegseth</a></p>
<ul>
<li style="font-weight:400;">Anthropic has publicly refused to allow Claude to be used for mass domestic surveillance of Americans or fully autonomous weapons, citing concerns about current AI reliability and civil liberties. </li>
<li style="font-weight:400;">These <a href="https://www.anthropic.com/news/statement-department-of-war">two exceptions</a> led to a breakdown in negotiations with the Department of War after months of discussions.</li>
<li style="font-weight:400;">The Department of War is moving to designate Anthropic as a supply chain risk under <a href="https://uscode.house.gov/view.xhtml?req=granuleid:USC-prelim-title10-section3252&amp;num=0&amp;edition=prelim">10 USC 3252</a>, a designation Anthropic states would be the first time applied to a US adversary. Anthropic has indicated it will challenge any such designation in court.</li>
<li style="font-weight:400;">From a practical standpoint, the legal scope of a supply chain risk designation is narrow. It would only affect the use of Claude on Department of War contract work, leaving commercial API customers, Claude.ai users, and non-DoW contractor use cases completely unaffected.</li>
<li style="font-weight:400;">This situation raises a broader question for cloud and AI vendors about the terms under which they can negotiate acceptable use policies with government customers. </li>
<li style="font-weight:400;">The outcome could set a precedent for how American companies handle government contracts that conflict with their own usage restrictions.</li>
<li style="font-weight:400;">Anthropic notes it has been deployed in US government classified networks since June 2024, making this dispute notable for the AI industry as more frontier model providers pursue federal contracts through programs like FedRAMP and classified cloud environments.</li>
</ul>
<p><a href="https://www.anthropic.com/news/statement-department-of-war">Statement from Dario Amodei on our discussions with the Department of </a><a href="https://www.anthropic.com/news/statement-department-of-war">War </a></p>
<ul>
<li style="font-weight:400;">Anthropic has publicly refused the Department of War’s requests to remove two specific safeguards from Claude: restrictions on mass domestic surveillance use cases and on fully autonomous weapons systems. </li>
<li style="font-weight:400;">This is notable because Anthropic was already the <a href="https://www.anthropic.com/news/expanding-access-to-claude-for-government">first frontier AI company</a> to deploy <a href="https://www.anthropic.com/news/claude-gov-models-for-u-s-national-security-customers">models</a> in US classified networks, <a href="https://www.axios.com/2024/11/14/anthropic-claude-nuclear-information-safety">National Laboratories</a>, and custom national security configurations.</li>
<li style="font-weight:400;">The Department of War has threatened to label Anthropic a “supply chain risk,” a designation previously reserved for US adversaries, and to invoke the Defense Production Act to force removal of these safeguards. Anthropic notes that these two threats are <a href="https://www.politico.com/news/2026/02/26/incoherent-hegseths-anthropic-ultimatum-confounds-ai-policymakers-00800135?utm_content=topic/politics&amp;utm_source=flipboard">contradictory</a> since one frames Claude as a security risk while the other frames it as essential to national security.</li>
<li style="font-weight:400;">The autonomous weapons position has a specific technical basis: Anthropic states current frontier AI systems <a href="https://www.darioamodei.com/essay/the-adolescence-of-technology">lack sufficient reliability</a> for fully autonomous target selection and engagement, and they offered to collaborate with the Department on R&amp;D to improve reliability, an offer that was not accepted.</li>
<li style="font-weight:400;">For cloud and enterprise listeners, this situation establishes a precedent in which an AI provider publicly declines government contract terms on safety grounds rather than on commercial grounds, with direct implications for how AI vendors structure acceptable use policies in high-stakes government and defense cloud deployments.</li>
<li style="font-weight:400;">Anthropic has indicated it will support a smooth transition to another provider if offboarded, signaling that continuity planning for AI-dependent military operations is now a real operational consideration for defense cloud infrastructure teams.</li>
</ul>
<p><a href="https://openai.com/index/our-agreement-with-the-department-of-war">Our agreement with the Department of War </a></p>
<ul>
<li style="font-weight:400;">OpenAI signed a classified AI deployment agreement with the Pentagon using a cloud-only architecture, meaning models run on OpenAI infrastructure rather than on edge devices or government-controlled hardware, which is central to how they enforce their safety constraints.</li>
<li style="font-weight:400;">The agreement includes three stated red lines: no mass domestic surveillance, no directing autonomous weapons systems, and no automated high-stakes decisions without human approval. </li>
<li style="font-weight:400;">OpenAI retains full control of the safety stack and has cleared personnel embedded with the deployment.</li>
<li style="font-weight:400;">The cloud-only deployment model is the key technical differentiator here. By keeping models off edge devices, OpenAI argues it can run and update classifiers independently to verify red lines are not crossed, which would not be possible with on-premise or edge deployments.</li>
<li style="font-weight:400;">The contract language locks in current surveillance and autonomous weapons laws as the standard, meaning even if those laws or DoD policies change in the future, usage must still comply with the standards in place at signing. This is a notable contractual mechanism for maintaining guardrails over time.</li>
<li style="font-weight:400;">OpenAI requested that the same contract terms be made available to all AI labs, including Anthropic, framing this as an attempt to establish a consistent baseline for how the government engages with frontier AI providers on classified work.</li>
</ul>
<p>21:04  Justin – “The precedent that could be set, potentially, that the government can declare any vendor they want to a supply chain risk feels like it’s gonna violate several amendments to the Constitution…” </p>
<p>New Model Section</p>
<p>21:38 <a href="https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-1-flash-lite/">Gemini 3.1 Flash Lite: Our most cost-effective AI model yet</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.google.com/">Google</a> launched <a href="https://ai.google.dev/gemini-api/docs/models/gemini-3.1-flash-lite-preview">Gemini 3.1 Flash-Lite</a> in preview, available through the <a href="https://ai.google.dev/gemini-api/docs">Gemini API</a> in <a href="https://aistudio.google.com/prompts/new_chat?model=gemini-3.1-flash-lite-preview">Google AI Studio</a> and <a href="https://console.cloud.google.com/vertex-ai/studio/multimodal?mode=prompt&amp;model=gemini-3.1-flash-lite-preview">Vertex AI</a>, priced at $0.25 per million input tokens and $1.50 per million output tokens, positioning it as a cost-focused option for high-volume workloads.</li>
<li style="font-weight:400;">Compared to 2.5 Flash, the new model delivers 2.5x faster Time to First Answer Token and 45% higher output speed according to <a href="https://artificialanalysis.ai/">Artificial Analysis benchmarks</a>, while scoring 86.9% on GPQA Diamond and 76.8% on MMMU Pro.</li>
<li style="font-weight:400;">The model includes configurable thinking levels, letting developers dial reasoning depth up or down depending on task complexity, which is useful for balancing cost and quality across different workload types.</li>
<li style="font-weight:400;">Practical use cases highlighted include high-volume content moderation, translation, UI generation, and real-time dashboard creation, with early adopters like Latitude, Cartwheel, and Whering already using it in production.</li>
<li style="font-weight:400;">For GCP customers running inference at scale, the combination of low per-token pricing and higher throughput speed makes this a practical option to evaluate against existing model choices in Vertex AI pipelines.</li>
</ul>
<p>22:09 <a href="https://arstechnica.com/ai/2026/02/google-releases-nano-banana-2-ai-image-generator-promises-pro-results-with-flash-speed/">Google reveals Nano Banana 2 AI image model, coming to Gemini today</a></p>
<ul>
<li style="font-weight:400;">Google has released <a href="https://blog.google/innovation-and-ai/technology/ai/nano-banana-2/">Nano Banana 2</a>, technically named <a href="https://ai.google.dev/gemini-api/docs/models/gemini-3.1-flash-image-preview">Gemini 3.1 Flash Image</a>, which replaces both the standard and Pro variants of the previous Nano Banana model across Gemini, AI Studio, Vertex AI, and Flow simultaneously.</li>
<li style="font-weight:400;">The model draws on <a href="https://ai.google.dev/gemini-api/docs/models">Gemini 3.1 LLM</a> web knowledge to improve object fidelity and infographic accuracy, and Google claims it delivers text rendering quality comparable to the previous Pro tier at Flash-tier speeds.</li>
<li style="font-weight:400;">For developers building multi-character or complex scene workflows, the model supports consistent rendering of up to five characters and up to 14 distinct objects per workflow, with expanded output options ranging from 512px square to 4K widescreen.</li>
<li style="font-weight:400;">The full replacement of prior Nano Banana variants means GCP customers on Vertex AI have no migration choice here, so teams relying on the previous Pro model for production workloads should validate outputs against the new model promptly.</li>
<li style="font-weight:400;">Pricing details were not disclosed in the announcement, so Vertex AI customers should check the Vertex AI pricing page directly for updated image generation costs tied to the Gemini 3.1 Flash Image model.</li>
</ul>
<p>22:32  Justin – “I’m excited to plug this one into our show cover generator; I’ve been using Nano Banana 1, and if you’ve checked out our show covers lately, you’ve noticed they’ve become fun cartoons based on our show titles.”  </p>
<p>22:54 <a href="https://openai.com/index/gpt-5-3-instant">GPT-5.3 Instant: Smoother, more useful everyday conversations </a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/">OpenAI</a> released <a href="https://openai.com/index/gpt-5-3-instant/">GPT-5.3 Instant</a> as the new default model in <a href="https://chatgpt.com/">ChatGPT</a>, available to all users today and to developers via the API as gpt-5.3-chat-latest, with GPT-5.2 Instant remaining available for paid users until June 3, 2026.</li>
<li style="font-weight:400;">The update targets conversational quality issues that benchmarks typically miss, specifically reducing unnecessary refusals, moralizing preambles, and overly cautious responses that users flagged as frustrating in GPT-5.2 Instant.</li>
<li style="font-weight:400;">Hallucination rates show measurable improvement: 26.8% reduction in high-stakes domains like medicine, law, and finance when using web search, and 19.7% reduction using internal knowledge only, based on OpenAI’s internal evaluations.</li>
<li style="font-weight:400;">Web search integration is notably improved, with the model now balancing retrieved results against its own reasoning rather than defaulting to link lists, producing more synthesized and immediately usable answers.</li>
<li style="font-weight:400;">Developers should note this is a drop-in update to the existing model endpoint, meaning applications using gpt-5.3-chat-latest will automatically get the improved behavior, which could affect any downstream applications that relied on the previous refusal or response patterns.</li>
</ul>
<p>25:07  Matt – “Testing the models before you roll them out into production. One of the things… how do you actually test these models and prove they’re working? And a lot of customers and questionnaires all require measurable statistics.” </p>
<h2>AWS </h2>
<p>27:58 <a href="https://health.aws.amazon.com/health/status">Amazon DC Impacted in Operation Epic Fury </a></p>
<ul>
<li style="font-weight:400;">Two simultaneous outages hit <a href="https://aws.amazon.com/about-aws/global-infrastructure/regions_az/">AWS Middle East regions</a> on March 1-2, with ME-CENTRAL-1 (UAE) suffering physical fire damage to a data center that knocked out two of three availability zones, and ME-SOUTH-1 (Bahrain) experiencing a localized single-AZ power failure.</li>
<li style="font-weight:400;">The UAE incident demonstrated a critical edge case where S3, normally resilient to single-AZ loss, began failing for ingest and egress once a second AZ went down, highlighting that multi-AZ redundancy assumptions break down when two zones are simultaneously unavailable.</li>
<li style="font-weight:400;">Recovery timelines extended beyond 24 hours in both regions due to the need for physical facility repairs, cooling system restoration, and coordination with local authorities, underscoring that some failure modes fall outside software-level remediation.</li>
<li style="font-weight:400;">AWS recommended customers failover to EU regions for ME-CENTRAL-1 workloads, restore from EBS snapshots in unaffected regions, and use the allow-reassociation flag to migrate Elastic IPs to healthy AZs, which are standard DR playbook steps that many customers may not have pre-tested.</li>
<li style="font-weight:400;">This incident is a practical reminder that multi-AZ deployments alone are insufficient for high-availability requirements in smaller regions with fewer AZs, and that cross-region DR plans with tested failover procedures are necessary for critical workloads.</li>
<li style="font-weight:400;">Directly from Status Page: Due to the ongoing conflict in the Middle East, both affected regions have experienced physical impacts to infrastructure as a result of drone strikes. In the UAE, two of our facilities were directly struck, while in Bahrain, a drone strike in close proximity to one of our facilities caused physical impacts to our infrastructure. Finally, even as we work to restore these facilities, the ongoing conflict in the region means that the broader operating environment in the Middle East remains unpredictable. We recommend that customers with workloads running in the Middle East consider taking action now to back up data and potentially migrate your workloads to alternate AWS Regions</li>
</ul>
<p>29:38  Justin – “This is a real big deal because as our show title said tonight… DR is going to become a real big deal now. If you’re in the business where you need to host data for other customers across the globe, your job just got a lot harder.” </p>
<p>37:26 <a href="https://www.geekwire.com/2026/amazon-invests-50b-in-openai-deepens-aws-partnership-with-expanded-100b-cloud-deal/">Amazon invests $50B in OpenAI, deepens AWS partnership with expanded </a><a href="https://www.geekwire.com/2026/amazon-invests-50b-in-openai-deepens-aws-partnership-with-expanded-100b-cloud-deal/">$100B cloud deal </a></p>
<ul>
<li style="font-weight:400;">Amazon is making a $50 billion investment in OpenAI as <a href="https://openai.com/index/scaling-ai-for-everyone/">part of a $110 billion funding round that also includes SoftBank and NVIDIA</a>, valuing OpenAI at $730 billion pre-money. </li>
<li style="font-weight:400;">Separately, OpenAI and AWS are expanding their existing cloud agreement by $100 billion over eight years, which analysts estimate could add roughly $17 billion annually to AWS revenue.</li>
<li style="font-weight:400;">A key technical component of the deal is OpenAI committing to consume 2 gigawatts of capacity on Amazon’s Trainium chips, giving AWS a high-profile validation of its in-house AI silicon at a scale that helps justify <a href="https://www.geekwire.com/2026/aws-growth-hits-3-year-high-custom-chips-top-10b-as-200b-capex-plan-rattles-investors/">Amazon’s $200 billion capital expenditure plan for 2026</a>.</li>
<li style="font-weight:400;">AWS and OpenAI will co-create a Stateful Runtime Environment delivered through <a href="https://aws.amazon.com/bedrock/">Amazon Bedrock</a>, allowing enterprise customers to build AI agents that retain context and handle complex multi-step tasks, with AWS serving as the exclusive third-party cloud distribution provider for <a href="https://openai.com/business/frontier/">OpenAI Frontier</a>.</li>
<li style="font-weight:400;">Microsoft retains exclusivity over stateless OpenAI API calls, meaning simple one-and-done AI requests still route through Azure, while Amazon is positioning AWS as the infrastructure layer for stateful, context-aware, and agent-based workloads where the compute intensity and revenue potential are substantially higher.</li>
<li style="font-weight:400;">Amazon also maintains its existing partnership with Anthropic, meaning AWS customers now have access to models from two of the leading AI labs, which broadens the options available through Bedrock without requiring customers to commit to a single model provider.</li>
</ul>
<p>41:29  Justin – “I am more and more convinced every day that we are in an AI bubble. I do not see how they’re going to generate the revenues required to cover the capital investments that all of these cloud providers are making.” </p>
<p>43:18 <a href="https://aws.amazon.com/blogs/aws/aws-security-hub-extended-o%EF%AC%80ers-full-stack-enterprise-security-with-curated-partner-solutions/">AWS Security Hub Extended oﬀers full-stack enterprise security with </a><a href="https://aws.amazon.com/blogs/aws/aws-security-hub-extended-o%EF%AC%80ers-full-stack-enterprise-security-with-curated-partner-solutions/">curated partner solutions</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/security-hub/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">AWS Security Hub</a> Extended is a new plan that bundles curated third-party security tools from partners like <a href="https://www.crowdstrike.com/en-us/">CrowdStrike</a>, <a href="http://www.okta.com">Okta</a>, <a href="https://www.splunk.com/">Splunk</a>, <a href="https://www.zscaler.com/">Zscaler</a>, and <a href="https://www.proofpoint.com/us">Proofpoint</a> directly into the Security Hub console, covering endpoint, identity, email, network, and cloud security in one place.</li>
<li style="font-weight:400;">AWS acts as the seller of record for all partner solutions, meaning customers get a single consolidated bill, pre-negotiated pay-as-you-go pricing, and no long-term commitments, which removes the overhead of managing separate vendor contracts.</li>
<li style="font-weight:400;">All security findings from both AWS native services and partner tools are normalized using the <a href="https://github.com/ocsf">Open Cybersecurity Schema Framework (OCSF)</a> and automatically aggregated in Security Hub, making cross-environment threat correlation more straightforward.</li>
<li style="font-weight:400;">Enterprise Support customers get unified Level 1 support across all participating solutions, which reduces the friction of figuring out which vendor to contact when issues span multiple tools.</li>
<li style="font-weight:400;">The Extended plan is generally available now across all commercial AWS regions where <a href="https://console.aws.amazon.com/securityhub/v2/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">Security Hub</a> is supported, with both consumption-based and flat-rate pricing options available at aws.amazon.com/security-hub/pricing.</li>
</ul>
<p>44:11  Justin – “Thank you, Amazon. It’s only taken you 10 years to get to this point – because this is cool. Build partnerships with your security vendors, standardize the inputs, and make connections for those things so they all connect together, and if I can do all that through my cloud vendor, who I already have commitments with? I think that’s fantastic.” </p>
<p>Quick Hits</p>
<p>45:41 <a href="https://aws.amazon.com/about-aws/whats-new/2026/03/vpc-encryption-controls-pricing/">AWS announces pricing for VPC Encryption Controls</a> </p>
<ul>
<li style="font-weight:400;">Just pricing BUT CRAZY</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/vpc/latest/userguide/vpc-encryption-controls.html?refid=d8ec3b19-0f37-4f8c-8c12-189f913e205c">VPC Encryption Controls</a> exits free preview on March 1, 2026, introducing a fixed hourly charge per non-empty VPC with the feature enabled in either monitor or enforce mode, with no charge for empty VPCs.</li>
<li style="font-weight:400;">The feature offers two operational modes: monitor mode audits for unencrypted traffic flows, while enforce mode actively blocks resources that would allow unencrypted traffic within or across VPCs in a region.</li>
<li style="font-weight:400;">A notable billing consideration is that enabling encryption support on a Transit Gateway triggers standard VPC Encryption Controls charges for all attached VPCs, regardless of their individual encryption mode setting, even if those VPCs are empty.</li>
<li style="font-weight:400;">For compliance-focused organizations, this feature provides a centralized mechanism to audit and enforce encryption-in-transit across VPC traffic flows, which is a common requirement in regulated industries like finance and healthcare.</li>
<li style="font-weight:400;">Customers should audit how many non-empty VPCs they plan to enable this on before March 1, 2026, and pay close attention to Transit Gateway attachment costs, as those charges can accumulate across a large number of attached VPCs. Detailed regional pricing is available on the<a href="https://aws.amazon.com/vpc/pricing/?refid=d8ec3b19-0f37-4f8c-8c12-189f913e205c"> VPC pricing page</a>.</li>
</ul>
<p>46:00  Matt – “Go cry a little bit.” </p>
<p>48:03 <a href="https://aws.amazon.com/about-aws/whats-new/2026/03/policy-amazon-bedrock-agentcore-generally-available/">Policy in Amazon Bedrock AgentCore is now generally available</a></p>
<ul>
<li style="font-weight:400;">Policy in <a href="https://aws.amazon.com/bedrock/agentcore/">Amazon Bedrock AgentCore</a> is now generally available, giving security and compliance teams a way to define and enforce tool access rules for AI agents without touching agent code, which is a meaningful separation of concerns for enterprise governance.</li>
<li style="font-weight:400;">The natural language to <a href="https://aws.amazon.com/about-aws/whats-new/2023/05/cedar-open-source-language-access-control/">Cedar</a> conversion is a practical feature, letting non-developers author policies that automatically translate to the AWS open-source policy language, lowering the barrier for ops and compliance teams to participate in agent governance.</li>
<li style="font-weight:400;">The <a href="https://docs.aws.amazon.com/bedrock-agentcore/latest/devguide/gateway.html">AgentCore Gateway</a> acts as an inline policy enforcement point, intercepting agent-tool traffic and evaluating each request before allowing or denying access, which mirrors familiar patterns from API gateway and service mesh architectures.</li>
<li style="font-weight:400;">The feature is available across <a href="https://docs.aws.amazon.com/bedrock-agentcore/latest/devguide/agentcore-regions.html?refid=d8ec3b19-0f37-4f8c-8c12-189f913e205c">13 AWS regions</a> at launch, including major US, European, and Asia Pacific regions, giving organizations with data residency requirements reasonable coverage from day one.</li>
<li style="font-weight:400;">Pricing details are not specified in the announcement, so teams evaluating this for production workloads should review the AgentCore pricing page and documentation at docs.aws.amazon.com/bedrock-agentcore/latest/devguide/policy.html before planning deployments.</li>
</ul>
<p>49:27  Ryan – “I like the Cedar natural language processing, but I wonder how practical it is to write policies that allow agent-to-agent and tool communication.” </p>
<h2>GCP</h2>
<p>57:07  <a href="https://cloud.google.com/blog/products/api-management/combat-api-sprawl-using-apigee-api-hub/">Combat API sprawl using Apigee API hub</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.cloud.google.com/apigee/docs/apihub/what-is-api-hub">Apigee API hub</a> now integrates directly with <a href="https://docs.cloud.google.com/api-gateway/docs">API Gateway</a> to automatically synchronize API definitions, OpenAPI specs, and gateway configurations in near real-time, giving platform teams a single control plane for APIs spread across multiple gateways and platforms.</li>
<li style="font-weight:400;">The new specification <a href="https://docs.cloud.google.com/apigee/docs/apihub/spec-boost">boost add-on</a>, currently in public preview, uses AI to scan API specs for gaps like missing usage examples or undefined error codes, then generates an enhanced parallel version labeled specboost-draft without overwriting the original, so teams can compare before adopting.</li>
<li style="font-weight:400;">The core problem being addressed is that incomplete or undocumented APIs cause AI agents to fail at function calling or miss APIs entirely, so centralizing and enriching specs directly improves agent reliability in agentic workflows.</li>
<li style="font-weight:400;">Both features are available now, with API Gateway users seeing onboarding prompts directly in the console. </li>
<li style="font-weight:400;">Pricing details for the spec boost add-on are not specified in the announcement, so teams should check the Add-on management section of the API hub for current cost information.</li>
<li style="font-weight:400;">Organizations running legacy specless proxies with no documentation stand to benefit most immediately, as the spec boost add-on can generate documentation for APIs that currently have none, making them visible to both developers and automated tools.</li>
</ul>
<p>52:08  Matt – “Any undocumented API is always a problem, whether you’re using it or one team uses something they don’t know, or a client finds that should be a dark API that is public, and that always becomes a problem. So, a way to centralize that and kind of help address API sprawl in general is a great thing and will make people’s lives so much better.”</p>
<p>52:41 <a href="https://cloud.google.com/blog/topics/developers-practitioners/improve-chatbot-memory-using-google-cloud/">Improve chatbot memory using Google Cloud </a></p>
<ul>
<li style="font-weight:400;">Google Cloud’s polyglot storage approach for chatbot memory combines <a href="https://cloud.google.com/memorystore/docs/redis/redis-overview">Memorystore for Redis</a>, <a href="https://cloud.google.com/bigtable">Cloud Bigtable</a>, and <a href="https://cloud.google.com/bigquery">BigQuery</a> to handle short, mid, and long-term conversation history, respectively, addressing a common scaling challenge for conversational AI applications.</li>
<li style="font-weight:400;">Memorystore for Redis handles the hot layer with sub-millisecond latency using Redis Lists and RPUSH commands, while Bigtable serves as the durable mid-term store using a user_id#session_id#reverse_timestamp key pattern to enable efficient range scans across millions of simultaneous sessions.</li>
<li style="font-weight:400;">Bigtable’s garbage collection policies allow teams to retain only recent data, such as the last 60 days, in the high-performance tier, while older data flows asynchronously to BigQuery via Pub/Sub and Dataflow for archival and analytics without impacting live application performance.</li>
<li style="font-weight:400;">Cloud Storage handles unstructured multimedia artifacts using a URI pointer strategy with signed URLs, keeping the primary databases lean while maintaining secure, time-limited access to files generated or uploaded during conversations.</li>
<li style="font-weight:400;">This architecture is relevant to any team building production-scale agentic applications on <a href="https://cloud.google.com/products/agent-builder">Vertex AI Agent Builder</a>, particularly in industries like customer service, healthcare, and financial services, where maintaining accurate long-term conversation context is a compliance or user experience requirement. Pricing varies across each component based on storage volume and query usage.</li>
<li style="font-weight:400;">Ryan loves this almost as much as he loves The Eagles.</li>
</ul>
<p>Quick Hits</p>
<p>55:42 <a href="https://cloud.google.com/blog/products/databases/spanner-columnar-engine-in-preview/">Spanner columnar engine in preview </a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.cloud.google.com/spanner/docs/columnar-engine">Spanner columnar engine</a> is now in preview, adding columnar storage alongside traditional row-based storage to enable analytical query acceleration of up to 200x on live operational data without impacting transactional workloads. </li>
<li style="font-weight:400;">This addresses the longstanding trade-off between OLTP and analytical performance in a single horizontally scalable system.</li>
<li style="font-weight:400;">The engine uses vectorized execution to process data in batches rather than row-by-row, and Spanner automatically routes large-scan analytical queries to the columnar representation. </li>
<li style="font-weight:400;">A new major compaction API also lets users manually trigger the conversion of existing data into columnar format.</li>
<li style="font-weight:400;">A key use case is reverse ETL from <a href="https://docs.cloud.google.com/datastream/docs/destination-blmt">Iceberg lakehouses</a>, where processed analytical data from <a href="https://medium.com/google-cloud/spanner-better-with-bigquery-streaming-insights-faster-federated-queries-with-iceberg-and-04e1299dd831">BigQuery</a>, <a href="https://www.databricks.com/">Databricks</a>, <a href="https://www.snowflake.com/en/">Snowflake</a>, or <a href="https://www.oracle.com/autonomous-database/autonomous-ai-lakehouse/">Oracle Autonomous AI Lakehouse</a> gets loaded into Spanner for sub-second, high-concurrency serving. This targets scenarios like real-time dashboards, AI agent features, and user-facing applications that need low-latency access to precomputed insights.</li>
<li style="font-weight:400;">The BigQuery integration is notably bidirectional, supporting federated queries via external datasets, <a href="https://docs.cloud.google.com/bigquery/docs/export-to-spanner">reverse ETL</a> pushes from BigLake Iceberg tables into Spanner, and live CDC streaming from Spanner back into BigQuery and BigLake Iceberg via <a href="https://docs.cloud.google.com/datastream/docs/sources-spanner">Datastream</a>. <a href="https://docs.oracle.com/en/database/goldengate/core/26/release-notes/new-features.html#OGGRN-GUID-F48FEF44-A714-4216-8BA0-4A7B9A220CBA">Oracle GoldenGate 26ai</a> also now supports direct replication into Spanner.</li>
<li style="font-weight:400;">The feature is available in preview and can be enabled on existing Spanner tables via a DDL change, with benchmark queries available on GitHub. </li>
<li style="font-weight:400;">Pricing follows standard Spanner node pricing, with no separate cost structure announced for the columnar engine specifically.</li>
</ul>
<p>55:52  Justin – “If you don’t know anything about columnar databases, you don’t know how cool that is.” </p>
<h2>Azure</h2>
<p>57:31 <a href="https://techcommunity.microsoft.com/blog/azureobservabilityblog/announcing-new-public-preview-capabilities-in-azure-monitor-pipeline/4488904">Announcing new public preview capabilities in Azure Monitor pipeline</a></p>
<ul>
<li style="font-weight:400;">Azure Monitor pipeline now supports <a href="https://techcommunity.microsoft.com/blog/azureobservabilityblog/announcing-new-public-preview-capabilities-in-azure-monitor-pipeline/4488904#community-4488904-tls">TLS and mutual TLS</a> for TCP-based ingestion endpoints in public preview, allowing teams to encrypt data in transit and enforce mutual authentication without relying on external proxies or custom gateways. </li>
<li style="font-weight:400;">This is particularly relevant for regulated environments and edge deployments where plain TCP ingestion no longer meets security requirements.</li>
<li style="font-weight:400;">The new execution placement configuration gives Kubernetes users direct control over how pipeline instances are scheduled across nodes, addressing practical problems like port exhaustion, multi-tenant isolation, and availability zone distribution. </li>
<li style="font-weight:400;">Notably, if the cluster cannot satisfy placement rules, the pipeline simply will not deploy, making failures predictable rather than silent.</li>
<li style="font-weight:400;">Data transformations allow teams to filter, aggregate, and normalize telemetry before it reaches <a href="https://azure.microsoft.com/en-us/products/monitor/">Azure Monitor</a>, including converting raw syslog or CEF messages into standardized schemas using KQL templates. This addresses the cost and complexity of ingesting high-volume noisy data and cleaning it up after the fact.</li>
<li style="font-weight:400;">All three capabilities are in public preview today and target organizations running Azure Monitor pipeline on on-premises infrastructure, edge locations, and large Kubernetes clusters. </li>
<li style="font-weight:400;">Pricing is not separately detailed for these features, so costs would follow existing Azure Monitor ingestion and data processing rates, which vary by volume.</li>
</ul>
<p>58:38  Matt – “It’s their ETL pipeline service… that’s kind of why this is a big deal.” </p>
<p>59:43 <a href="http://aka.ms/MicrosoftSovereignCloudDisconnectedBlog">Microsoft Sovereign Cloud adds governance, productivity, and support for </a><a href="http://aka.ms/MicrosoftSovereignCloudDisconnectedBlog">large AI models securely running even when completely disconnected</a></p>
<ul>
<li style="font-weight:400;">Microsoft has expanded its <a href="http://www.microsoft.com/sovereignty">Sovereign Cloud</a> offering with three new capabilities targeting organizations that need to operate in fully disconnected environments: <a href="https://learn.microsoft.com/en-us/azure/azure-local/manage/disconnected-operations-overview?view=azloc-2602">Azure Local disconnected operations</a>, <a href="https://techcommunity.microsoft.com/blog/AzureArcBlog/microsoft-365-local-is-generally-available/4470170">Microsoft 365 Local disconnected</a>, and large model support in Foundry Local. </li>
<li style="font-weight:400;">These are aimed at government, defense, and regulated industries where external connectivity may be intentionally restricted or prohibited.</li>
<li style="font-weight:400;">Azure Local disconnected operations allow organizations to run infrastructure with Azure governance and policy controls without any cloud connectivity, meaning management and workload execution stay entirely within customer-operated environments. This is now generally available worldwide, though pricing is not publicly listed and would depend on hardware and licensing configurations.</li>
<li style="font-weight:400;">Microsoft 365 Local disconnected brings <a href="https://learn.microsoft.com/en-us/exchange/exchange-server">Exchange Server</a>, <a href="https://learn.microsoft.com/en-us/sharepoint/getting-started">SharePoint Server</a>, and <a href="https://www.microsoft.com/en-us/microsoft-365/previous-versions/skype-for-business-online">Skype for Business Server</a> into the sovereign private cloud boundary, with Microsoft committing support for these workloads through at least 2035. This extends productivity capabilities to teams operating in air-gapped or isolated environments without requiring a cloud connection.</li>
<li style="font-weight:400;">Foundry Local now supports large multimodal AI models running on-premises using NVIDIA GPU infrastructure, enabling local inferencing entirely within customer-controlled data boundaries. This moves beyond the small model support Foundry Local previously offered and is currently available to qualified customers rather than broadly.</li>
<li style="font-weight:400;">The overall architecture is designed to span connected, hybrid, and fully disconnected modes under a consistent governance model, which reduces the operational complexity of managing separate toolsets for different connectivity scenarios. </li>
<li style="font-weight:400;">Organizations considering this stack should evaluate hardware requirements carefully, given the GPU dependencies for AI inferencing workloads.</li>
</ul>
<p>57:25 <a href="https://techcommunity.microsoft.com/blog/appsonazureblog/best-practice-using-self-signed-certificates-with-java-on-azure-functions-linux/4496900">Best Practice: Using Self-Signed Certificates with Java on Azure Functions </a></p>
<p>Winner of the dumbest feature of the week: </p>
<ul>
<li style="font-weight:400;">Java developers on <a href="https://learn.microsoft.com/en-us/azure/azure-functions/functions-create-container-registry?tabs=acr%2Cbash&amp;pivots=programming-language-csharp">Azure Functions Linux</a> who connect to services secured by self-signed certificates frequently encounter SSL handshake errors because the JVM only trusts well-known Certificate Authorities by default. The recommended fix is creating a custom truststore in the persistent /home directory and pointing the JVM to it via JAVA_OPTS application settings.</li>
<li style="font-weight:400;">The core reason to use /home for the truststore rather than system JVM directories is that the Linux Functions file system is ephemeral, meaning any changes outside /home are wiped on restart, scaling, or platform updates. Storing the keystore at a path like /home/site/wwwroot/my-truststore.jks ensures it survives those events.</li>
<li style="font-weight:400;">One practical deployment gotcha worth noting is that ZipDeploy or Run From Package configurations can overwrite /home/site/wwwroot contents during code deployments, so storing the .jks file in a separate directory like /home/my-certs/ is a safer long-term choice.</li>
<li style="font-weight:400;">Azure Functions Linux behaves differently from Azure App Service Linux in a notable way: App Service startup scripts often auto-import platform-managed certificates into the JVM keystore, but Functions does not, meaning OS-level tools like curl may succeed while Java code still throws handshake errors.</li>
<li style="font-weight:400;">For teams that prefer not to manage server-side keystore files, two code-based alternatives exist: loading an Azure-managed certificate from /var/ssl/certs via custom SSLContext code, or bundling a locally built JKS file inside the application JAR. Both require application code changes, which adds maintenance overhead compared to the JAVA_OPTS approach.</li>
</ul>
<p>1:03:46  Justin – “This is just a way for you to troubleshoot certificates even worse than you were troubleshooting it before.” </p>
<p>Quick Hits</p>
<p>1:05:19 <a href="https://techcommunity.microsoft.com/blog/azureconfidentialcomputingblog/announcing-general-availability-of-azure-intel%C2%AE-tdx-confidential-vms/4495693">Announcing general availability of Azure Intel® TDX confidential VMs </a></p>
<ul>
<li style="font-weight:400;">Azure has moved its Intel TDX confidential VMs to general availability, using 5th Gen Intel Xeon processors to provide hardware-enforced isolation that protects data while in use, which addresses a longstanding barrier for organizations running sensitive workloads in the cloud. Notably, existing applications can be deployed without any code changes.</li>
<li style="font-weight:400;">The new VM series (DCesv6, DCedsv6, ECesv6, ECedsv6) introduces NVMe local SSD support as a first for Azure confidential VMs, delivering roughly 5x more throughput and about 16% lower latency compared to the previous SCSI generation, with IO latency reduced by approximately 27 microseconds.</li>
<li style="font-weight:400;">These VMs are the first in Azure confidential compute to use the open-source OpenHCL paravisor, which increases transparency and allows customers to cryptographically verify workload integrity rather than simply trusting the cloud operator. </li>
<li style="font-weight:400;">The open-source component is available at github.com/microsoft/openvmm.</li>
<li style="font-weight:400;">Intel AMX acceleration is built in, making these VMs suited for confidential AI workloads such as protecting model weights and running cross-organization AI pipelines without exposing underlying data. </li>
<li style="font-weight:400;">Azure Boost support adds up to 205k IOPS, 4 GB/s remote storage throughput, and 40 Gbps network bandwidth.</li>
<li style="font-weight:400;">General availability is currently limited to the West US and West US 3 regions, with support for Windows Server 2025 and Ubuntu 22.04 and 24.04. Pricing is not specified in the announcement, and customers can request preview access in additional regions at aka.ms/acc/v6preview.</li>
</ul>
<p>1:10:10 <a href="https://azure.microsoft.com/en-us/updates?id=558072">Generally Available: Draft &amp; Deploy on Azure Firewall</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/firewall-manager/policy-overview">Azure Firewall Polic</a>y now supports a two-phase Draft and Deploy workflow, meaning teams can stage policy changes before committing them, which reduces the risk of unintended disruptions during updates.</li>
<li style="font-weight:400;">Previously, any policy change triggered a full firewall deployment, which could cause delays and service interruptions. </li>
<li style="font-weight:400;">This feature separates the authoring phase from the deployment phase, giving teams more control over when changes go live.</li>
<li style="font-weight:400;">The feature is particularly useful for organizations with strict change management processes, as it allows multiple edits to be batched and reviewed before a single deployment is executed, rather than deploying each change individually.</li>
<li style="font-weight:400;">This is now generally available, so production workloads can rely on it. Azure Firewall Policy pricing remains consumption-based, and customers should check the Azure Firewall pricing page at azure.microsoft.com for current rates, as costs vary by policy tier and region.</li>
<li style="font-weight:400;">Teams managing complex or high-traffic environments will benefit most, since reducing the frequency of full deployments directly translates to fewer maintenance windows and more predictable firewall behavior.</li>
</ul>
<p>1:10:27 <a href="https://techcommunity.microsoft.com/blog/appsonazureblog/azure-container-registry-premium-sku-now-supports-100-tib-storage/4497651">Azure Container Registry Premium SKU Now Supports 100 TiB Storage</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aka.ms/acr/skus">Azure Container Registry</a> Premium SKU now supports up to 100 TiB of storage, a 2.5x increase from the previous 40 TiB cap, with no configuration changes required for existing registries to benefit automatically.</li>
<li style="font-weight:400;">The increase directly addresses a real operational pain point where enterprises were splitting workloads across multiple registries just to stay under limits, adding complexity to access control and networking that had nothing to do with actual business requirements.</li>
<li style="font-weight:400;">AI and ML workloads are a clear driver here, as teams storing large model artifacts, training outputs, and inference containers were consuming registry capacity faster than anticipated, alongside normal container workload growth.</li>
<li style="font-weight:400;">Microsoft also improved geo-replication data sync speeds for new replicas and added a storage consumption view in the Azure Portal Monitoring tab, two improvements that had been customer requests for some time.</li>
<li style="font-weight:400;">The 100 TiB limit is exclusive to Premium SKU, so teams on Basic or Standard tiers will need to upgrade to access it, though Premium also includes geo-replication, private endpoints, and enhanced throughput. </li>
<li style="font-weight:400;"><a href="https://azure.microsoft.com/en-us/pricing/details/container-registry/">Pricing</a> details for Premium SKU storage are available at the Azure Container Registry pricing page.</li>
</ul>
<p>1:10:47 Ryan – “So now instead of two windows container images you can store FOUR.” </p>
<p>1:13:37 <a href="https://techcommunity.microsoft.com/blog/integrationsonazureblog/new-azure-api-management-service-limits/4497574">New Azure API management service limits </a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/api-management/api-management-key-concepts">Azure API Management</a> is rolling out updated resource limits starting March 2026, aligning classic tier limits with v2 tier limits across entities like API operations, tags, products, and subscriptions. This affects all service tiers in a phased rollout over several months.</li>
<li style="font-weight:400;">Existing classic tier customers whose usage exceeds the new limits will be grandfathered in, with their limits set 10% above observed usage at the time the new limits take effect. </li>
<li style="font-weight:400;">New services and those under the new thresholds will be subject to the updated limits immediately.</li>
<li style="font-weight:400;">Limit increase requests will only be considered for Standard, Standard v2, Premium, and Premium v2 tiers, with Premium customers receiving priority. Requests are evaluated case by case and are not guaranteed, so teams relying on high resource counts should audit their usage now.</li>
<li style="font-weight:400;">Before requesting a limit increase, Microsoft recommends reviewing the Manage Resources Within Limits documentation at learn.microsoft.com, as some increases can introduce latency or affect service capacity. </li>
<li style="font-weight:400;">This is a practical reminder that limits exist to protect shared infrastructure performance, not just to restrict usage.</li>
<li style="font-weight:400;">Pricing for API Management tiers varies, with the Developer tier starting around $0 for testing and the Premium tier running substantially higher for production workloads. Customers on lower tiers, like Consumption or Developer, cannot request limit increases, so production workload planning should account for tier selection early.</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2391099/c1e-9202f2oknqh421p0-25096n60cd5n-efnuoq.mp3" length="137146524"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 345 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Matt are in the studio this week and are ready to bring you all the latest in cloud and AI news, including what’s going on between Anthropic, the DOD, and OpenAI, what the war means for Middle East data centers (Spoiler – I hope you have a good Disaster Recovery plan), and Transit Gateway pricing changes that are enough to make a grown man cry. And don’t bother waiting: Matt has completely forgotten almost two years of “bye everybody” and now claims full amnesia as to what his outtro is. Oh well. Let’s get into today’s show. 
Titles we almost went with this week

 Claude Learned to Use a Computer Better Than Your Dad **OpenAI
 Amazon and OpenAI’s $138 Billion AI Bromance
 When Two AZs Go Dark the Cloud Gets Crispy
 Fifty Billion Reasons AWS Loves OpenAI Now **Anthropic
 Azure Still Wins Even When AWS Thinks It Did
 Fire, Water, and a Multi-AZ Assumption Goes Up in Smoke
 Claude Refuses to Go Full Skynet for the Pentagon
 GPT-5.3 Instant Finally Stops Lecturing You
 No Killer Robots Without Human Approval Please
 Terraform Finally Sees Your Forgotten Cloud Resources
 Stage Before You Rage Deploy Azure Firewall
 CrowdStrike to Zscaler AWS Wants Your Security Tab
 One Hub to Rule Your API Sprawl
 Transit Gateway Attachments Just Got Surprisingly Expensive
 Azure Container Registry Finally Has Room for Your AI Hoarding
 Bedrock Gets a Roommate OpenAI Moves In
 Azure Firewall Gets a Safety on the Trigger
 Stop Writing Scripts, Just Import the Dang Infrastructure
 Audit Your APIs Before March 2026 Bites You
 Damn it… my excuse not to DR is gone
 I’m Epically Furious about DR

AI Is Going Great – Or How ML Makes Money 
03:34 Anthropic acquires Vercept to advance Claude’s computer use capabilities 

Anthropic acquired Vercept, a team specializing in AI perception and interaction, to strengthen Claude’s computer use capabilities. 
The Vercept founders, including Ross Girshick, bring deep expertise in how AI systems visually interpret and interact with software interfaces.
Claude Sonnet 4.6 shows substantial improvement in computer use benchmarks, jumping from under 15% on the OSWorld evaluation in late 2024 to 72.5% today. 
The model is now approaching human-level performance on tasks like navigating spreadsheets and completing multi-tab web forms.
Computer use enables Claude to operate inside live applications the way a human would, handling multi-step workflows across tools that cannot be automated through code alone. 
This is relevant for enterprise use cases involving document processing, browser-based workflows, and cross-application task management.
This is Anthropic’s second acquisition in a short period, following the purchase of Bun, which was tied to the Claude Code milestone. The pattern suggests Anthropic is actively acquiring specialized engineering teams rather than just technology assets.
For developers and businesses building agentic workflows on Claude, the improved computer use performance means more reliable automation of complex, real-world software tasks without requiring custom integrations or APIs for every application involved.

05:18  Justin – “It seems like every day I have to upda...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2391099/c1a-k5d5-6z9m6343h577-a6gkgf.jpg"></itunes:image>
                                                                            <itunes:duration>01:11:08</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2391099/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[344: Amazon’s Coding Bot Bites the Hand That Runs It]]>
                </title>
                <pubDate>Wed, 04 Mar 2026 20:27:09 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2382918</guid>
                                    <link>https://tcpfm.castos.com/episodes/344-amazons-coding-bot-bites-the-hand-that-runs-it</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 344 of The Cloud Pod, where the forecast is always cloudy! Justin is out of the office at a World of Warcraft Tournament (not really), and Ryan is pursuing his lifelong dream of becoming a roadie for The Eagles (maybe?), so it’s Jonathan and Matt holding down the fort this week, and they’ve got a ton of cloud news for you! From security to AI assistants, we’ve got all the news you need. Let’s get started! </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> Zero Bus, All Gas, No Kafka Brakes</li>
<li> AI Coding Bot Bites the Hand That Runs It</li>
<li> When Your Robot Developer Goes Rogue on AWS</li>
<li> Kubernetes VPA Finally Stops Evicting Your Database Pods</li>
<li> Google Trains 100 Million People, Still No One Reads the Docs </li>
<li> MCP Walks Into a Bar Not Enterprise Ready Yet</li>
<li> No More Pod Evictions Kubernetes 1.35 Scales In Place</li>
<li> No Keys No Drama Just IAM and Cloud SQL</li>
<li>One Agent to Rule Them All in Kubernetes</li>
<li> IAM Tired of Writing Policies Manually</li>
<li> When Your AI Coding Tool Has Delete Permissions</li>
<li> One Dashboard to Rule All Your GPU Clusters</li>
<li> Serverless Reservations Prove Nothing Is Truly Free Range</li>
<li> Kiro Takes the Wheel on AWS IAM Policies</li>
<li> Stop Blaming Backups for Your Bad Architecture</li>
<li> AI Agent Goes Rogue, Takes AWS Down With It</li>
<li> Everything is Bigger in Texas Except the Water Usage</li>
<li>OpenAI launches the college basketball of Inference. Pro service – low cost</li>
</ul>
<h2>General News </h2>
<p>1:05 <a href="https://blog.cloudflare.com/code-mode-mcp/">Code Mode: give agents an entire API in 1,000 tokens</a></p>
<ul>
<li style="font-weight:400;"><a href="https://blog.cloudflare.com/">Cloudflare</a>‘s Code Mode <a href="https://www.cloudflare.com/learning/ai/what-is-model-context-protocol-mcp/">MCP</a> server reduces token consumption by 99.9% compared to a traditional MCP implementation, exposing the entire <a href="https://developers.cloudflare.com/api/">Cloudflare API</a> (over 2,500 endpoints) through just two tools, search() and execute(), using roughly 1,000 tokens versus 1.17 million for a conventional approach.</li>
<li style="font-weight:400;">The architecture works by having the AI agent write JavaScript code against a typed OpenAPI spec representation, rather than loading tool definitions into context, with code executing inside a sandboxed V8 isolate (<a href="https://developers.cloudflare.com/workers/runtime-apis/bindings/worker-loader/">Dynamic Worker</a>) that restricts file system access, environment variables, and external fetches by default.</li>
<li style="font-weight:400;">This approach addresses a fundamental constraint in agentic AI systems: adding more tools to give agents broader capabilities directly competes with the available context space for the task at hand.</li>
</ul>
<p>01:41  Jonathan- “It’s good. I’m not sure I could imagine 2 ½ thousand MCP tool definitions in a context window and still actually use it for anything.”   </p>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>03:58 <a href="https://techcrunch.com/2026/02/15/openclaw-creator-peter-steinberger-joins-openai/">OpenClaw creator Peter Steinberger joins OpenAI</a></p>
<ul>
<li style="font-weight:400;">Peter Steinberger, creator of <a href="https://techcrunch.com/2026/01/27/everything-you-need-to-know-about-viral-personal-ai-assistant-clawdbot-now-moltbot/">viral AI assistant</a> <a href="https://openclaw.ai/">OpenClaw</a> (formerly Clawdbot/Moltbot), has joined <a href="https://techcrunch.com/tag/openai/">OpenAI</a> to lead development of next-generation personal agents. </li>
<li style="font-weight:400;">OpenClaw gained attention for its ability to perform real-world tasks like calendar management, flight booking, and autonomous social network participation.</li>
<li style="font-weight:400;">OpenAI will maintain OpenClaw as an open source project through a foundation structure, allo...</li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:07) - The Cloud Pod</li><li>(00:01:00) - Cloudflare, OpenAI: The AI Assistant</li><li>(00:10:04) - Cobalt vs. COBOL</li><li>(00:17:16) - Databricks Connect: Single Sync vs. Kafka</li><li>(00:20:06) - ChatGPT Pro Lite at $100 a month</li><li>(00:21:53) - Packer Adds SBOM Vulnerability Scanning</li><li>(00:24:17) - Kubernetes 1.35: Auto-Scale Pod Storage</li><li>(00:29:22) - Amazon's AI Coding Bot Causes AWS Outage</li><li>(00:33:49) - Amazon IAM policy Autopilot</li><li>(00:34:28) - AWS IAM Policy Autopilot: Will It Increase Security</li><li>(00:38:57) - Amazon Expands Serverless to AI-generated 'TikTok</li><li>(00:39:44) - Google Cloud Expands MCP Server Coverage to Azure, Cloud,</li><li>(00:46:40) - Google's $15 Billion Investment in AI Infrastructure</li><li>(00:49:18) - Microsoft Azure Completing 100% Renewable Energy</li><li>(00:54:32) - More Flexible Quotations on AWS</li><li>(00:56:23) - Microsoft Sovereignty Cloud: When IT's Connected, Connected</li><li>(00:58:08) - Kouser Command Center: Unified Operations Platform for AI</li><li>(01:00:01) - AMD Instinct Mi 350X GPUs coming to DigitalOcean</li><li>(01:00:43) - Week in Cloud: Chatting With Just Us</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 344 of The Cloud Pod, where the forecast is always cloudy! Justin is out of the office at a World of Warcraft Tournament (not really), and Ryan is pursuing his lifelong dream of becoming a roadie for The Eagles (maybe?), so it’s Jonathan and Matt holding down the fort this week, and they’ve got a ton of cloud news for you! From security to AI assistants, we’ve got all the news you need. Let’s get started! 
Titles we almost went with this week

 Zero Bus, All Gas, No Kafka Brakes
 AI Coding Bot Bites the Hand That Runs It
 When Your Robot Developer Goes Rogue on AWS
 Kubernetes VPA Finally Stops Evicting Your Database Pods
 Google Trains 100 Million People, Still No One Reads the Docs 
 MCP Walks Into a Bar Not Enterprise Ready Yet
 No More Pod Evictions Kubernetes 1.35 Scales In Place
 No Keys No Drama Just IAM and Cloud SQL
One Agent to Rule Them All in Kubernetes
 IAM Tired of Writing Policies Manually
 When Your AI Coding Tool Has Delete Permissions
 One Dashboard to Rule All Your GPU Clusters
 Serverless Reservations Prove Nothing Is Truly Free Range
 Kiro Takes the Wheel on AWS IAM Policies
 Stop Blaming Backups for Your Bad Architecture
 AI Agent Goes Rogue, Takes AWS Down With It
 Everything is Bigger in Texas Except the Water Usage
OpenAI launches the college basketball of Inference. Pro service – low cost

General News 
1:05 Code Mode: give agents an entire API in 1,000 tokens

Cloudflare‘s Code Mode MCP server reduces token consumption by 99.9% compared to a traditional MCP implementation, exposing the entire Cloudflare API (over 2,500 endpoints) through just two tools, search() and execute(), using roughly 1,000 tokens versus 1.17 million for a conventional approach.
The architecture works by having the AI agent write JavaScript code against a typed OpenAPI spec representation, rather than loading tool definitions into context, with code executing inside a sandboxed V8 isolate (Dynamic Worker) that restricts file system access, environment variables, and external fetches by default.
This approach addresses a fundamental constraint in agentic AI systems: adding more tools to give agents broader capabilities directly competes with the available context space for the task at hand.

01:41  Jonathan- “It’s good. I’m not sure I could imagine 2 ½ thousand MCP tool definitions in a context window and still actually use it for anything.”   
AI Is Going Great – Or How ML Makes Money 
03:58 OpenClaw creator Peter Steinberger joins OpenAI

Peter Steinberger, creator of viral AI assistant OpenClaw (formerly Clawdbot/Moltbot), has joined OpenAI to lead development of next-generation personal agents. 
OpenClaw gained attention for its ability to perform real-world tasks like calendar management, flight booking, and autonomous social network participation.
OpenAI will maintain OpenClaw as an open source project through a foundation structure, allo...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[344: Amazon’s Coding Bot Bites the Hand That Runs It]]>
                </itunes:title>
                                    <itunes:episode>344</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 344 of The Cloud Pod, where the forecast is always cloudy! Justin is out of the office at a World of Warcraft Tournament (not really), and Ryan is pursuing his lifelong dream of becoming a roadie for The Eagles (maybe?), so it’s Jonathan and Matt holding down the fort this week, and they’ve got a ton of cloud news for you! From security to AI assistants, we’ve got all the news you need. Let’s get started! </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> Zero Bus, All Gas, No Kafka Brakes</li>
<li> AI Coding Bot Bites the Hand That Runs It</li>
<li> When Your Robot Developer Goes Rogue on AWS</li>
<li> Kubernetes VPA Finally Stops Evicting Your Database Pods</li>
<li> Google Trains 100 Million People, Still No One Reads the Docs </li>
<li> MCP Walks Into a Bar Not Enterprise Ready Yet</li>
<li> No More Pod Evictions Kubernetes 1.35 Scales In Place</li>
<li> No Keys No Drama Just IAM and Cloud SQL</li>
<li>One Agent to Rule Them All in Kubernetes</li>
<li> IAM Tired of Writing Policies Manually</li>
<li> When Your AI Coding Tool Has Delete Permissions</li>
<li> One Dashboard to Rule All Your GPU Clusters</li>
<li> Serverless Reservations Prove Nothing Is Truly Free Range</li>
<li> Kiro Takes the Wheel on AWS IAM Policies</li>
<li> Stop Blaming Backups for Your Bad Architecture</li>
<li> AI Agent Goes Rogue, Takes AWS Down With It</li>
<li> Everything is Bigger in Texas Except the Water Usage</li>
<li>OpenAI launches the college basketball of Inference. Pro service – low cost</li>
</ul>
<h2>General News </h2>
<p>1:05 <a href="https://blog.cloudflare.com/code-mode-mcp/">Code Mode: give agents an entire API in 1,000 tokens</a></p>
<ul>
<li style="font-weight:400;"><a href="https://blog.cloudflare.com/">Cloudflare</a>‘s Code Mode <a href="https://www.cloudflare.com/learning/ai/what-is-model-context-protocol-mcp/">MCP</a> server reduces token consumption by 99.9% compared to a traditional MCP implementation, exposing the entire <a href="https://developers.cloudflare.com/api/">Cloudflare API</a> (over 2,500 endpoints) through just two tools, search() and execute(), using roughly 1,000 tokens versus 1.17 million for a conventional approach.</li>
<li style="font-weight:400;">The architecture works by having the AI agent write JavaScript code against a typed OpenAPI spec representation, rather than loading tool definitions into context, with code executing inside a sandboxed V8 isolate (<a href="https://developers.cloudflare.com/workers/runtime-apis/bindings/worker-loader/">Dynamic Worker</a>) that restricts file system access, environment variables, and external fetches by default.</li>
<li style="font-weight:400;">This approach addresses a fundamental constraint in agentic AI systems: adding more tools to give agents broader capabilities directly competes with the available context space for the task at hand.</li>
</ul>
<p>01:41  Jonathan- “It’s good. I’m not sure I could imagine 2 ½ thousand MCP tool definitions in a context window and still actually use it for anything.”   </p>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>03:58 <a href="https://techcrunch.com/2026/02/15/openclaw-creator-peter-steinberger-joins-openai/">OpenClaw creator Peter Steinberger joins OpenAI</a></p>
<ul>
<li style="font-weight:400;">Peter Steinberger, creator of <a href="https://techcrunch.com/2026/01/27/everything-you-need-to-know-about-viral-personal-ai-assistant-clawdbot-now-moltbot/">viral AI assistant</a> <a href="https://openclaw.ai/">OpenClaw</a> (formerly Clawdbot/Moltbot), has joined <a href="https://techcrunch.com/tag/openai/">OpenAI</a> to lead development of next-generation personal agents. </li>
<li style="font-weight:400;">OpenClaw gained attention for its ability to perform real-world tasks like calendar management, flight booking, and autonomous social network participation.</li>
<li style="font-weight:400;">OpenAI will maintain OpenClaw as an open source project through a foundation structure, allowing the community to continue development while Steinberger focuses on building similar capabilities into OpenAI’s product suite. </li>
<li style="font-weight:400;">This acquisition-to-open-source model differs from typical tech company acquisitions, where projects are absorbed or shut down.</li>
<li style="font-weight:400;"><a href="https://steipete.me/posts/2026/openclaw">The move</a> signals OpenAI’s strategic focus on agentic AI systems that can execute multi-step tasks autonomously rather than just responding to prompts. Steinberger’s experience building practical automation workflows could accelerate OpenAI’s development of agent capabilities that compete with offerings from Anthropic, Google, and Microsoft.</li>
<li style="font-weight:400;">For developers, this represents a shift in how <a href="https://techcrunch.com/2026/01/30/openclaws-ai-assistants-are-now-building-their-own-social-network/">personal AI assistants</a> may be deployed, moving from standalone applications to integrated agent frameworks within larger platforms. </li>
<li style="font-weight:400;">The open source continuation of OpenClaw provides a reference implementation for building task-oriented AI systems.</li>
</ul>
<p>04:19 Matt – “This is kind of where I see Anthriopic Cowork slowly going to, being your personal assistant, and having this be your ability to manage your real-world tasks. It’s great, and if they can build that into OpenAI, then it becomes a lot more of a personal assistant than just a general tool that you’re using.” </p>
<p>09:11 <a href="https://www.anthropic.com/news/claude-code-security">Making frontier cybersecurity capabilities available to defenders</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/">Anthropic</a> launched <a href="https://claude.com/solutions/claude-code-security">Claude Code Security</a> in a limited research preview for Enterprise and Team customers, with free expedited access for open-source maintainers. </li>
<li style="font-weight:400;">Unlike traditional static analysis tools that match known vulnerability patterns, it reasons through code contextually, the way a human security researcher would, catching logic flaws and access control issues <a href="https://red.anthropic.com/2026/zero-days/">that rule-based tools miss</a>.</li>
<li style="font-weight:400;">The tool uses a multi-stage verification process where Claude re-examines its own findings to filter false positives, assigns severity ratings, and provides confidence scores. </li>
<li style="font-weight:400;">Critically, no patches are applied without human approval, keeping developers in the decision loop.</li>
<li style="font-weight:400;">For cloud and enterprise teams, this integrates directly into Claude Code on the web, meaning security review happens within existing developer workflows rather than requiring separate tooling. The dashboard surfaces validated findings alongside suggested patches for team review.</li>
<li style="font-weight:400;">Want to request access? You can do that <a href="https://claude.com/contact-sales/security">here</a>. </li>
</ul>
<p>09:35 <a href="https://claude.com/blog/preview-review-and-merge-with-claude-code">Preview, review, and merge with Claude Code</a></p>
<ul>
<li style="font-weight:400;"><a href="https://code.claude.com/docs/en/desktop">Claude Code on desktop</a> now closes the full development loop by adding live app preview, inline code review, and GitHub PR monitoring in a single interface, reducing the need to switch between tools during development.</li>
<li style="font-weight:400;">The new auto-fix and auto-merge features allow Claude to monitor PRs in the background, automatically attempt to fix CI failures, and merge PRs once all checks pass, letting developers move on to new tasks without manually tracking PR status.</li>
<li style="font-weight:400;">The inline code review feature via the Review Code button lets Claude examine local diffs and leave comments directly in the desktop diff view before any code leaves the machine, functioning as an automated pre-push review step.</li>
<li style="font-weight:400;">Session portability is now built in, allowing developers to start a session in the CLI using /desktop to bring context into the desktop app, or push local sessions to the web or <a href="https://claude.com/download">Claude mobile app</a> using the Continue with Claude Code on the web button.</li>
<li style="font-weight:400;">These updates are available now to all users and represent a shift toward agentic, background-running development workflows where the AI continues working on tasks like CI remediation while the developer focuses elsewhere.</li>
</ul>
<p>11:20 Jonathan – “It’s a very human way of going back and self-reflecting on the work that you’ve just done.”  </p>
<p>18:08 <a href="https://www.databricks.com/blog/announcing-general-availability-zerobus-ingest-part-lakeflow-connect">Announcing General Availability of Zerobus Ingest, part of Lakeflow </a><a href="https://www.databricks.com/blog/announcing-general-availability-zerobus-ingest-part-lakeflow-connect">Connect</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.databricks.com/">Databricks</a> has announced General Availability of <a href="https://www.databricks.com/product/data-engineering/lakeflow-connect/zerobus-ingest">Zerobus Ingest</a>, part of <a href="https://www.databricks.com/product/data-engineering/lakeflow-connect">Lakeflow Connect</a>, a serverless streaming service that pushes data directly into Delta tables without intermediate message buses like <a href="https://kafka.apache.org/">Kafka</a>. </li>
<li style="font-weight:400;">It supports thousands of concurrent connections and achieves over 10GB per second of aggregate throughput with data landing in under 5 seconds.</li>
<li style="font-weight:400;">The core architectural difference is a single-sink design versus Kafka’s multi-sink approach, reducing a traditional five-system streaming stack down to two components. </li>
<li style="font-weight:400;">This eliminates dedicated compute and storage for the message bus itself, along with the engineering overhead to manage it, at a fraction of the cost per gigabyte compared to self-managed Kafka.</li>
<li style="font-weight:400;">Developers can integrate via <a href="https://docs.databricks.com/aws/en/ingestion/zerobus-ingest">gRPC, REST APIs,</a> or language-specific SDKs, and every write is automatically governed through Unity Catalog for lineage tracking and access control. </li>
<li style="font-weight:400;">This means streaming data gets the same governance treatment as the rest of the lakehouse from the moment it arrives.</li>
<li style="font-weight:400;">Real-world deployments include Toyota using it to detect factory overheating conditions in minutes rather than hours, and Joby Aviation reducing aircraft telemetry resolution latency from days to minutes. </li>
<li style="font-weight:400;">Both cases highlight manufacturing and IoT as strong use cases where low-latency ingestion has a direct operational impact.</li>
<li style="font-weight:400;">Zerobus Ingest is now GA on AWS and Azure, with Google Cloud support coming soon, priced under the <a href="https://docs.databricks.com/aws/en/jobs/run-serverless-jobs">Lakeflow Jobs Serverless SKU</a> with a 6-month promotional pricing period currently active.</li>
</ul>
<p>20:05 Jonathan – “I’m not a fan of Kafka in general, but I am a fan of doing things at massive scale, so it’s kind of cool.” </p>
<p>07:27 <a href="https://www.testingcatalog.com/openai-prepares-new-chatgpt-pro-lite-tier-priced-at-100-monthly/">OpenAI prepares new ChatGPT Pro Lite tier at $100 monthly</a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/">OpenAI</a> appears to be preparing a<a href="https://www.testingcatalog.com/tag/chatgpt/"> ChatGPT</a> Pro Lite tier at $100 per month, slotting between the existing Plus plan at $20 and the full Pro plan at $200, based on findings from engineer <a href="https://x.com/btibor91?ref=testingcatalog.com">Tibor Blaho</a>, who has a consistent track record of uncovering unreleased features.</li>
<li style="font-weight:400;">The new tier would address a notable pricing gap for users who regularly hit Plus rate limits but cannot justify the full Pro cost, with freelancers, researchers, and developers as the likely target audience.</li>
<li style="font-weight:400;">The plan may be structured around compute-heavy use cases, including Codex and persistent agentic workloads, where background-running agents carry substantially higher infrastructure costs than standard chat interactions.</li>
<li style="font-weight:400;">OpenAI recently hired Peter Steinberger, creator of the open-source agent framework <a href="https://openclaw.ai/?ref=testingcatalog.com">OpenClaw</a>, and has signaled a multi-agent direction for ChatGPT, suggesting the Pro Lite tier could serve as an entry point for always-on agentic capabilities rather than just increased chat limits.</li>
<li style="font-weight:400;">No release date or confirmed feature set exists yet, but the addition of a mid-tier option would create competitive pressure on Google, which currently lacks an equivalent individual plan at this price point.</li>
</ul>
<p>21:56  Matt – “I just think they needed a different naming convention.” </p>
<h2>Cloud Tools </h2>
<p>23:11 <a href="https://www.hashicorp.com/blog/hcp-packer-adds-sbom-vulnerability-scanning">HCP Packer adds SBOM vulnerability scanning</a></p>
<ul>
<li style="font-weight:400;">HCP Packer now includes SBOM vulnerability scanning in public beta, allowing platform teams to scan software bills of materials against <a href="https://www.cve.org/">MITRE’s CVE database</a> and classify findings by severity directly within the artifact registry.</li>
<li style="font-weight:400;">The feature builds on last year’s <a href="https://www.hashicorp.com/en/blog/hcp-packer-provides-further-artifact-visibility-with-sbom-storage">SBOM storage capabilities</a>, which are now generally available, meaning teams can generate, store, and now actively scan SBOMs for known vulnerabilities in a single workflow.</li>
<li style="font-weight:400;">This addresses a supply chain security gap by surfacing vulnerability data at the image level, covering <a href="https://en.wikipedia.org/wiki/Amazon_Machine_Image">AMIs</a>, Docker containers, and virtual machines before they reach production environments.</li>
<li style="font-weight:400;">Teams can see which specific package versions are affected and when vulnerabilities were detected, giving them the information needed to prioritize remediation without leaving the HCP Packer interface.</li>
<li style="font-weight:400;">The feature is available in public beta at no cost through the <a href="https://www.hashicorp.com/en/products/packer?utm_source=hashicorp&amp;utm_medium=internal-blog&amp;utm_campaign=26Q1_WW_PRAC_RISK_sbom-scanning-beta-blog_PRODUCT-PAGE&amp;utm_content=end-blog-cta&amp;utm_offer=product-page">free HCP Packer tier</a>, making it accessible for teams looking to add CVE scanning to their image management process without additional tooling.</li>
</ul>
<p>24:15 Jonathan – “It’s only as current as the time you built it though…” </p>
<p>25:43 <a href="https://thenewstack.io/kubernetes-vpa-inplace-resize/">Why Kubernetes 1.35 is a game-changer for stateful workload scaling</a> </p>
<ul>
<li style="font-weight:400;"><a href="https://kubernetes.io/blog/2025/12/17/kubernetes-v1-35-release/">Kubernetes 1.35</a> brings two notable autoscaling milestones: <a href="https://kubernetes.io/blog/2025/05/16/kubernetes-v1-33-in-place-pod-resize-beta/">In-Place Pod Resize</a> graduating to GA and <a href="https://kubernetes.io/docs/concepts/workloads/autoscaling/vertical-pod-autoscale/">Vertical Pod Autoscaler’s</a> InPlaceOrRecreate update mode reaching beta, allowing VPA to adjust CPU and memory on running pods without evicting them.</li>
<li style="font-weight:400;">The practical benefit for stateful workloads is substantial. </li>
<li style="font-weight:400;">Previously, VPA had to evict and recreate pods to apply new resource requests, which caused disruption for databases, caches, and other restart-sensitive applications. In-place resizing preserves the pod UID, container ID, and restart count throughout the adjustment.</li>
<li style="font-weight:400;">VPA operates in three stages worth understanding: a recommendation-only mode for passive observation, an InPlaceOrRecreate mode that attempts live resizing first and falls back to eviction only when node resources are insufficient, and configurable policies using minAllowed and maxAllowed to bound what VPA can actually set.</li>
<li style="font-weight:400;">VPA controllers are not bundled with Kubernetes itself. </li>
<li style="font-weight:400;">Engineers need to clone the kubernetes/autoscaler repository and run the vpa-up.sh script to deploy the Recommender, Updater, and Admission Controller components alongside the mutating </li>
</ul>
<p>26:09 Jonathan – “I think the practical benefit for stable workloads are fairly substantial, if you’re one of those crazy people who like to run databases or SQL server on Kubernetes (like Cody) because previously those pods would be evicted and new resources requested, which would obviously cause disruption, stale caches, and other issues.” </p>
<h2>AWS </h2>
<p>31:20 <a href="https://www.ft.com/content/00c282de-ed14-4acd-a948-bc8d6bdb339d">Amazon service was taken down by AI coding bot</a></p>
<ul>
<li style="font-weight:400;">Listener note: paywall article</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/publicsector/transform-devops-practice-with-kiro-ai-powered-agents/">Amazon’s Kiro AI</a> coding tool caused a 13-hour outage of an AWS cost exploration service in December after engineers granted it broad permissions, and it autonomously decided to delete and recreate the environment rather than patch it. </li>
<li style="font-weight:400;">A second outage involved <a href="https://aws.amazon.com/q/developer/">Amazon Q Developer</a>, though Amazon says neither event impacted core customer-facing AWS services.</li>
<li style="font-weight:400;">Amazon’s official position is that both incidents were user error stemming from improper access controls, not failures of the AI tools themselves. </li>
<li style="font-weight:400;"><a href="https://kiro.dev/">Kiro</a> is designed to request authorization before acting, but the engineer involved had been granted broader permissions than intended, bypassing that safeguard.</li>
<li style="font-weight:400;">The incidents highlight a practical risk with agentic AI tools in production environments: when an AI agent is given the same permissions as a human operator without requiring peer review, it can take destructive autonomous actions that a second set of eyes might have caught. AWS has since added mandatory peer review and staff training as corrective measures.</li>
<li style="font-weight:400;">AWS is pushing for 80 percent of its developers to use AI coding tools at least once weekly, which means these tools are being adopted at scale internally before the risk patterns are fully understood. </li>
<li style="font-weight:400;">Listeners running their own AI agents in production should treat permission scoping and human-in-the-loop approval gates as non-optional controls, not optional defaults.</li>
<li style="font-weight:400;">Kiro launched in July 2025 and is positioned as a specification-driven coding assistant meant to go beyond simple vibe coding. </li>
<li style="font-weight:400;">The December incident was limited to mainland China, and the second incident had no customer-facing impact, but the pattern of two production disruptions in a few months is worth tracking as agentic tools become more common in enterprise workflows.</li>
</ul>
<p>33:24  Matt – “…if you’re letting the AI tool start to do things inside of production environments, that’s where you need to watch it, and you need to probably have it be a little bit more specific, so the human needs to kind of be watching what’s going on and peer reviewing it.” </p>
<p>35:49 <a href="https://www.geekwire.com/2026/amazon-pushes-back-on-financial-times-report-blaming-ai-coding-tools-for-aws-outages/">Amazon pushes back on Financial Times report blaming AI coding tools for </a><a href="https://www.geekwire.com/2026/amazon-pushes-back-on-financial-times-report-blaming-ai-coding-tools-for-aws-outages/">AWS outages </a></p>
<ul>
<li style="font-weight:400;">Amazon issued a public rebuttal to a<a href="https://www.ft.com/content/00c282de-ed14-4acd-a948-bc8d6bdb339d"> Financial Times report</a> claiming its Kiro AI coding tool caused multiple AWS outages, acknowledging one limited incident in December but attributing it to a misconfigured access control role rather than a flaw in the AI tool itself.</li>
<li style="font-weight:400;">The confirmed disruption affected only <a href="https://aws.amazon.com/aws-cost-management/aws-cost-explorer/">AWS Cost Explorer</a> in a single China region for roughly 13 hours, with no customer inquiries received, and did not touch core services like compute, storage, or databases.</li>
<li style="font-weight:400;">Amazon’s core defense is that the issue was user error, not AI error, noting that a misconfigured role could result from any developer tool or manual action, AI-powered or not.</li>
<li style="font-weight:400;">In response to the incident, AWS has added safeguards, including mandatory peer review for production access, which is a practical governance consideration for any organization deploying agentic AI tools in production environments.</li>
<li style="font-weight:400;">The broader takeaway for AWS customers is that agentic AI tools capable of autonomous actions, like deleting and recreating environments, require clear human oversight policies and access control guardrails before being used in production systems.</li>
</ul>
<p>37:00 <a href="https://aws.amazon.com/about-aws/whats-new/2026/02/aws-iam-policy-autopilot-kiro-power/">AWS IAM Policy Autopilot is now available as a Kiro Power</a></p>
<ul>
<li style="font-weight:400;"><a href="https://github.com/awslabs/iam-policy-autopilot">AWS IAM Policy Autopilot</a>, an open source static code analysis tool launched at <a href="https://www.aboutamazon.com/news/aws/aws-re-invent-2025-ai-news-updates">re:Invent 2025</a>, is now available as a <a href="https://kiro.dev/powers/">Kiro Power</a>, allowing developers to generate baseline IAM policies directly within the <a href="https://kiro-ide.net/download">Kiro IDE</a> without manual policy writing.</li>
<li style="font-weight:400;">The integration uses a one-click installation model that removes the need for manual MCP server configuration, streamlining how developers access policy generation tools during AI-assisted development workflows.</li>
<li style="font-weight:400;">Key use cases include rapid prototyping of AWS applications, baseline policy creation for new projects, and keeping developers in their coding environment rather than switching to the IAM console or documentation.</li>
<li style="font-weight:400;">This fits into the broader trend of embedding security and permissions tooling earlier in the development cycle, helping teams start with least-privilege policies that can be refined over time rather than retrofitting permissions after the fact.</li>
<li style="font-weight:400;">The tool is open source and available on GitHub at github.com/awslabs/iam-policy-autopilot, with no additional cost mentioned beyond standard Kiro and AWS service usage, making it accessible for teams already using the Kiro IDE.</li>
</ul>
<p>38:18  Jonathan – “I’m really on the fence about this. Because on one hand, I know the pain, especially with things like deployment policies…and just trying to figure out every permission that has to be added so that Terraform can just do a deployment – it becomes very complicated. At the same time, if you have a machine that looks at your code and says ‘this is the policy you need for it,’ I don’t think that’s any security at all unless there’s another check at the end.” </p>
<p>-Honorable Mentions- </p>
<p>41:52  <a href="https://aws.amazon.com/about-aws/whats-new/2026/02/amazon-redshift-serverless-three-year-reservations/">Amazon Redshift Serverless introduces 3-year Serverless Reservations</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/redshift/redshift-serverless/">Amazon Redshift Serverless</a> now offers 3-year Serverless Reservations, providing up to 45% cost savings compared to standard on-demand <a href="https://aws.amazon.com/redshift/pricing/">RPU pricing</a> while maintaining the serverless model’s flexibility.</li>
<li style="font-weight:400;">The reservations are managed at the AWS payer account level and can be shared across multiple AWS accounts, making this useful for organizations running Redshift Serverless workloads across linked accounts.</li>
<li style="font-weight:400;">-stop</li>
<li style="font-weight:400;">Billing runs 24/7 on an hourly basis, metered per second, meaning you pay for reserved RPUs continuously, regardless of actual usage, so this option makes most sense for consistently active workloads rather than sporadic ones.</li>
<li style="font-weight:400;">Any RPU consumption beyond the reserved amount falls back to standard on-demand rates, so customers need to size their reservations carefully to avoid negating the savings.</li>
<li style="font-weight:400;">Reservations can be purchased through the Redshift console or via the create-reservation API and are available in all regions where Redshift Serverless is currently supported.</li>
<li style="font-weight:400;">More information is available on the Amazon Redshift Management Guide, which you can find <a href="https://docs.aws.amazon.com/redshift/latest/mgmt/serverless-billing-reserved.html">here</a>. </li>
</ul>
<p>42:03  <a href="https://www.theinformation.com/briefings/amazon-says-will-spend-12-billion-louisiana-data-centers">Amazon Says It Will Spend $12 billion On Louisiana Data Centers </a></p>
<ul>
<li style="font-weight:400;">Amazon has announced a $12 billion investment in data center campuses in Louisiana, aimed at expanding infrastructure capacity for AI and cloud computing workloads.</li>
<li style="font-weight:400;">A notable aspect of the deal is Amazon’s commitment to covering its own power costs directly, working with regional utility <a href="https://www.swepco.com/">Southwestern Electric Power Company</a> to avoid passing energy expenses onto local consumers.</li>
<li style="font-weight:400;">Amazon is pairing the infrastructure investment with solar energy projects in Louisiana, which aligns with its broader sustainability commitments and addresses concerns about grid strain from large-scale data center operations.</li>
<li style="font-weight:400;">This announcement reflects a broader industry trend where cloud providers are proactively addressing public and political concerns about data center energy consumption, following a similar commitment from Microsoft last month regarding higher electricity rate payments.</li>
<li style="font-weight:400;">For AWS customers, this expansion signals continued investment in US-based infrastructure capacity, which could translate to improved regional availability and lower latency for workloads in the southern United States over time.</li>
</ul>
<p>42:18  <a href="https://aws.amazon.com/about-aws/whats-new/2026/02/aws-elemental-inference-generally-avail/">Announcing AWS Elemental Inference</a></p>
<ul>
<li style="font-weight:400;">AWS Elemental Inference is a fully managed AI service that automatically generates vertical video crops and highlight clips from live and on-demand broadcasts in parallel with encoding, targeting broadcasters who need to distribute content across TikTok, Instagram Reels, YouTube Shorts, and similar platforms without dedicated production staff.</li>
<li style="font-weight:400;">The service uses an agentic AI approach with no prompts or human-in-the-loop intervention required, handling both vertical video cropping and metadata-based highlight detection automatically, which reduces the manual workflow overhead typically associated with multi-platform content distribution.</li>
<li style="font-weight:400;">Beta testing with large media companies showed 34% or more cost savings on AI-powered live video workflows compared to using multiple point solutions, making this a notable consolidation option for media organizations already using AWS Elemental encoding services.</li>
<li style="font-weight:400;">A practical sports broadcasting use case is highlighted where highlight clips can be identified and distributed to social platforms during live games rather than hours after the fact, addressing a real operational gap in live content workflows.</li>
<li style="font-weight:400;">The service is available in <a href="https://aws.amazon.com/about-aws/global-infrastructure/regional-product-services/">four regions</a> at launch: US East N. Virginia, US West Oregon, Asia Pacific Mumbai, and Europe Ireland. </li>
<li style="font-weight:400;">Pricing details are not specified in the announcement, so listeners should check the AWS Elemental Inference documentation at docs.aws.amazon.com/elemental-inference for current pricing information.</li>
</ul>
<h2>GCP</h2>
<p>57:25  <a href="https://cloud.google.com/blog/products/databases/managed-mcp-servers-for-google-cloud-databases/">Managed MCP servers for Google Cloud databases</a></p>
<ul>
<li style="font-weight:400;">Google Cloud expanded its <a href="https://cloud.google.com/blog/products/ai-machine-learning/announcing-official-mcp-support-for-google-services">managed MCP server support</a> to cover <a href="https://cloud.google.com/products/alloydb">AlloyDB</a>, <a href="https://cloud.google.com/spanner">Spanner</a>, <a href="https://cloud.google.com/sql">Cloud SQL</a>, <a href="https://cloud.google.com/bigtable">Bigtable</a>, and <a href="https://cloud.google.com/products/firestore">Firestore</a>, allowing AI agents to interact with these databases through natural language without requiring infrastructure deployment or complex configuration.</li>
<li style="font-weight:400;">The security model relies entirely on IAM for authentication rather than shared keys, and all agent actions are logged in Cloud Audit Logs, which addresses a practical concern for teams worried about giving AI agents access to production databases.</li>
<li style="font-weight:400;">A new <a href="https://developers.googleblog.com/introducing-the-developer-knowledge-api-and-mcp-server/">Developer Knowledge MCP server</a> connects IDEs directly to Google’s official documentation, letting agents reference best practices in real time during tasks like database migrations or app development troubleshooting.</li>
<li style="font-weight:400;">Because these servers follow the open MCP standard, they work with third-party clients like Anthropic’s <a href="https://claude.ai/new">Claude</a> in addition to <a href="https://gemini.google.com/">Gemini</a>, which broadens the practical appeal beyond teams already committed to Google’s AI tooling.</li>
<li style="font-weight:400;">Google has signaled plans to extend managed MCP support to Looker, Memorystore, Pub/Sub, Kafka, and migration services in the coming months, suggesting this is an ongoing buildout rather than a one-time release. </li>
<li style="font-weight:400;">Pricing is not separately listed for MCP access and likely falls under existing database service costs.</li>
</ul>
<p>44:12 Matt – “Anything that makes databases easier, I’m all for.” </p>
<p>45:12 <a href="https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-1-pro/">Gemini 3.1 Pro: Announcing our latest Gemini AI model</a></p>
<ul>
<li style="font-weight:400;"><a href="https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-1-pro/">Gemini 3.1 Pro</a> is now available in preview for developers via <a href="https://aistudio.google.com/">Google AI Studio</a>, <a href="https://geminicli.com/">Gemini CLI</a>, <a href="https://cloud.google.com/vertex-ai">Vertex AI</a>, and <a href="https://developer.android.com/studio">Android Studio</a>, with enterprise access through Vertex AI and Gemini Enterprise. Pricing details have not been publicly announced for the preview period.</li>
<li style="font-weight:400;">The model scores 77.1% on the ARC-AGI-2 benchmark, which tests reasoning on novel logic patterns, representing more than double the score of the previous Gemini 3 Pro model. </li>
<li style="font-weight:400;">This positions it as a stronger option for complex problem-solving tasks compared to its predecessor.</li>
<li style="font-weight:400;">Practical use cases highlighted include generating animated SVGs from text prompts, building live data dashboards by connecting to public APIs, and prototyping interactive 3D interfaces with hand-tracking and generative audio. These examples suggest the model is particularly suited for developers working on data visualization and creative coding projects.</li>
<li style="font-weight:400;">Consumer access is rolling out through the Gemini app and <a href="https://notebooklm.google/">NotebookLM</a>, but the 3.1 Pro tier is restricted to <a href="https://gemini.google/us/subscriptions/?hl=en">Google AI Pro and Ultra plan</a> subscribers. This tiered access model means free-tier users will not have access during the preview phase.</li>
<li style="font-weight:400;">Google notes the model is still in preview while they validate performance for agentic workflows before a general availability release. GCP customers evaluating it for production use should factor in that capabilities and pricing may shift before the full release.</li>
</ul>
<p>46:23  Matt – “It’s just amazing to me how fast these models are improving. This one is saying it scored a 77%, where models a year ago where 40 and 50%. Seeing how fast everything is moving is insane.” </p>
<p>47:36  <a href="https://cloud.google.com/blog/products/networking/understanding-the-firefly-clock-synchronization-protocol/">Understanding the Firefly clock synchronization protocol</a></p>
<ul>
<li style="font-weight:400;">Google’s <a href="https://dl.acm.org/doi/10.1145/3718958.3750502">Firefly</a> is a software-based clock synchronization protocol that achieves sub-10-nanosecond NIC-to-NIC synchronization across data center hardware, without requiring specialized or expensive dedicated timing equipment.</li>
<li style="font-weight:400;">The protocol uses a distributed consensus algorithm built on random graphs rather than a traditional hierarchical time server model, which improves convergence speed, scalability, and resilience to network path asymmetries.</li>
<li style="font-weight:400;">Firefly decouples internal synchronization from external UTC synchronization, meaning external time server jitter does not degrade the precision of clock alignment within the data center fabric itself.</li>
<li style="font-weight:400;">Financial services workloads are a primary beneficiary, as regulatory requirements mandate sub-100 microsecond external UTC synchronization and sub-10 nanosecond internal synchronization, both of which Firefly meets on standard cloud infrastructure.</li>
<li style="font-weight:400;">Beyond finance, the protocol has practical implications for distributed database consistency, ML workload coordination, and fine-grained network telemetry, potentially enabling workloads that previously required on-premises dedicated hardware to run on cloud infrastructure instead. No specific pricing details were provided in the announcement.</li>
</ul>
<p>48:52 Jonathan – “The fact that you need to guarantee sub-hundred microsynchronization for financial systems is crazy.” </p>
<p>-Honorable Mentions- </p>
<p>50:32  <a href="https://cloud.google.com/blog/products/infrastructure/america-india-connect-infrastructure-connects-four-continents/">America-India Connect infrastructure connects four continents</a></p>
<ul>
<li style="font-weight:400;">Google is investing<a href="https://blog.google/intl/en-in/company-news/our-first-ai-hub-in-india-powered-by-a-15-billion-investment/"> $15 billion in AI infrastructure in India</a> and launching America-India Connect, a multi-continent subsea cable initiative that establishes new fiber-optic routes connecting the United States, India, Singapore, South Africa, and Australia. </li>
<li style="font-weight:400;">The project creates Visakhapatnam as a new international subsea gateway on India’s east coast, adding network diversity beyond existing Mumbai and Chennai landing points.</li>
<li style="font-weight:400;">The infrastructure combines multiple subsea cable systems, including <a href="https://cloud.google.com/blog/products/infrastructure/introducing-equiano-a-subsea-cable-from-portugal-to-south-africa?e=48754805">Equiano</a>, <a href="https://cloud.google.com/blog/products/infrastructure/introducing-the-nuvem-subsea-cable?e=48754805">Nuvem</a>, <a href="https://cloud.google.com/blog/products/infrastructure/bosun-australia-connect-initiative-for-indo-pacific-connectivity?e=48754805">Bosun</a>, <a href="https://cloud.google.com/blog/products/infrastructure/honomoana-and-tabua-subsea-cables-connect-south-pacific?e=48754805">Tabua</a>, <a href="https://cloud.google.com/blog/products/infrastructure/talaylink-subsea-cable-to-connect-australia-and-thailand">TalayLink</a>, and <a href="https://cloud.google.com/blog/products/infrastructure/honomoana-and-tabua-subsea-cables-connect-south-pacific?e=48754805">Honomoana</a>, to create redundant high-capacity routes between American coasts and India through both <a href="https://cloud.google.com/blog/products/infrastructure/investing-in-connectivity-and-growth-for-africa?e=48754805">African</a> and <a href="https://cloud.google.com/blog/products/infrastructure/honomoana-and-tabua-subsea-cables-connect-south-pacific?e=48754805">Pacific</a> paths. </li>
<li style="font-weight:400;">This approach provides network resilience for over 1 billion people in India while improving connectivity across the Southern Hemisphere.</li>
<li style="font-weight:400;">Google Cloud is serving as the primary cloud infrastructure provider for India’s <a href="https://igotkarmayogi.gov.in/">iGOT</a> Karmayogi platform, which delivers training to over 20 million public servants across 800+ districts. </li>
<li style="font-weight:400;">The platform will use AI to digitize legacy training content and enable access in 18+ Indian languages, supporting the government’s Mission Karmayogi initiative for civil service modernization.</li>
<li style="font-weight:400;">The announcement positions these subsea cables as critical infrastructure to prevent an AI divide, with documented evidence that subsea cable connectivity improves internet affordability and reliability while driving productivity and economic growth. </li>
<li style="font-weight:400;">The initiative builds on Google’s existing infrastructure investments in Africa, Australia, and the Pacific region.</li>
<li style="font-weight:400;">Added this one just for you, Justin. </li>
</ul>
<p>52:20  <a href="https://blog.google/innovation-and-ai/infrastructure-and-cloud/global-network/data-center-wilbarger-county/">Wilbarger County data center</a></p>
<ul>
<li style="font-weight:400;">Google is building a new data center in Wilbarger County, Texas, expanding its existing infrastructure footprint in the state. </li>
<li style="font-weight:400;">This is primarily an infrastructure capacity announcement rather than a new GCP service or feature.</li>
<li style="font-weight:400;">The facility will use air-cooling technology instead of traditional water cooling, limiting water consumption to only essential campus operations like kitchens. This is a notable operational choice given ongoing concerns about data center water usage in drought-prone regions.</li>
<li style="font-weight:400;">Google has contracted to add more than 7,800 MW of net-new energy generation and capacity to the Texas electricity grid, with the Wilbarger facility co-located alongside new clean power developed in partnership with <a href="https://www.aes.com/newsroom/aes-announces-landmark-agreements-google-texas">AES</a>.</li>
<li style="font-weight:400;">Google announced a <a href="https://blog.google/company-news/inside-google/company-announcements/google-american-innovation-texas/">$30 million Energy Impact Fund</a> in November to support energy affordability, school weatherization, and energy workforce development across Texas. Details on the fund are available <a href="http://blog.google/company-news/inside-google/company-announcements/google-american-innovation-texas.">here</a>. </li>
<li style="font-weight:400;">For GCP customers, additional Texas-based infrastructure generally signals potential improvements in latency and redundancy for workloads serving the south-central US region, though Google has not announced specific new GCP regions or zones tied to this facility.</li>
</ul>
<p>52:55  <a href="https://blog.google/innovation-and-ai/products/gemini-app/lyria-3/">Use Lyria 3 to create music tracks in the Gemini app</a></p>
<ul>
<li style="font-weight:400;"><a href="https://deepmind.google/">Google DeepMind’s</a> <a href="https://deepmind.google/models/lyria/">Lyria 3</a> model is now available in beta within the Gemini app, letting users <a href="https://gemini.google/overview/music-generation/?utm_source=gemini&amp;utm_medium=web&amp;utm_campaign=lyria_marketing_keyword">generate 30-second music tracks</a> with lyrics, custom cover art, and style controls from text prompts or uploaded photos and videos. </li>
<li style="font-weight:400;">This is available to users 18 and older in 8 languages, with higher usage limits for Google AI Plus, Pro, and Ultra subscribers.</li>
<li style="font-weight:400;">Lyria 3 improves on previous versions by auto-generating lyrics from prompts, offering more control over style, vocals, and tempo, and producing more musically complex outputs without requiring users to provide their own creative assets.</li>
<li style="font-weight:400;">All generated tracks are embedded with SynthID, Google DeepMind’s imperceptible watermark, and the Gemini app now extends its AI content verification to audio files, allowing users to upload audio and check whether it was generated by Google AI.</li>
<li style="font-weight:400;">The feature is also rolling out to YouTube creators via <a href="https://support.google.com/youtube/answer/14151606?hl=en">Dream Track</a> for Shorts soundtracks, connecting Lyria 3 to a broader content creation workflow beyond the Gemini app itself.</li>
<li style="font-weight:400;">On the responsible AI side, Google states Lyria 3 was trained with copyright and partner agreements in mind, artist-specific prompts are treated as stylistic inspiration rather than direct mimicry, and output filters check against existing content, though Google acknowledges this approach is not guaranteed to catch all issues.</li>
</ul>
<h2>Azure</h2>
<p>57:25 <a href="https://blogs.microsoft.com/blog/2026/02/18/a-milestone-achievement-in-our-journey-to-carbon-negative/">A milestone achievement in our journey to carbon negative</a></p>
<ul>
<li style="font-weight:400;">Microsoft has achieved its 2025 goal of matching 100 percent of global electricity consumption with renewable energy, contracting 40 gigawatts of new renewable capacity across 26 countries since 2020. </li>
<li style="font-weight:400;">This represents enough energy to power approximately <a href="https://www.epa.gov/green-power-markets/green-power-equivalency-calculator-calculations-and-references">10 million US homes</a>, with 19 GW currently online and the remainder coming online over the next five years.</li>
<li style="font-weight:400;">The renewable energy procurement has reduced Microsoft’s reported Scope 2 carbon emissions by an estimated 25 million tons and mobilized billions in private investment through over 400 contracts with 95 utilities and developers. This directly impacts Azure datacenter operations globally, supporting the infrastructure that runs customer workloads while advancing toward the company’s 2030 carbon negative commitment.</li>
<li style="font-weight:400;">Microsoft is expanding beyond renewable energy to include nuclear power and other carbon-free technologies, including a <a href="https://www.helionenergy.com/articles/helion-announces-worlds-first-fusion-ppa-with-microsoft/">50 MW fusion project with Helion in Washington state</a> and restarting the 835 MW <a href="https://www.constellationenergy.com/newsroom/2024/Constellation-to-Launch-Crane-Clean-Energy-Center-Restoring-Jobs-and-Carbon-Free-Power-to-The-Grid.html">Crane Clean Energy Center</a> in Pennsylvania with Constellation Energy. The Climate Innovation Fund has allocated $806 million to 67 investees, with 38 percent directed toward energy systems innovation.</li>
<li style="font-weight:400;">The company is deploying AI-driven tools to accelerate clean energy deployment, including collaborations with <a href="https://inl.gov/news-release/idaho-national-laboratory-collaborates-with-microsoft-to-streamline-nuclear-licensing/">Idaho National Laboratory</a> for nuclear licensing and the <a href="https://www.microsoft.com/en/customers/story/18851-midcontinent-independent-system-operator-microsoft-purview?msockid=106ba26386196a493e91b1ba82196cb9">Midcontinent Independent System Operator</a> for grid optimization. </li>
<li style="font-weight:400;">These tools aim to streamline the design, permitting, and deployment of new power technologies to expand grid capacity more efficiently.</li>
<li style="font-weight:400;">Azure customers benefit indirectly through more sustainable cloud infrastructure, though Microsoft notes the shift to an all-of-the-above decarbonization strategy recognizes that rising electricity demand from datacenters, AI workloads, and digital services requires diverse carbon-free energy sources beyond renewables alone.</li>
</ul>
<p>55:58 <a href="https://azure.microsoft.com/en-us/updates?id=556008">Generally Available: Quota and deployment troubleshooting tools for Azure </a><a href="https://azure.microsoft.com/en-us/updates?id=556008">Functions Flex Consumption </a></p>
<ul>
<li style="font-weight:400;">Azure Functions Flex Consumption now has generally available quota and deployment troubleshooting tools built directly into the platform, giving developers clearer visibility into quota limits and constraints without needing to dig through documentation or support tickets.</li>
<li style="font-weight:400;">The quota troubleshooting experience surfaces Flex Consumption-specific limits in context, which is useful for teams hitting scaling walls and trying to understand why deployments are behaving unexpectedly.</li>
<li style="font-weight:400;">This is a quality-of-life improvement aimed at developers and platform engineers who use Flex Consumption for its per-execution billing model and fast scaling, helping reduce time spent diagnosing deployment failures.</li>
<li style="font-weight:400;">Pricing for Flex Consumption remains consumption-based, so there is no additional cost for these troubleshooting tools themselves. More details are available at the Azure updates page <a href="https://github.com/Azure-Samples/azure-functions-flex-consumption-samples">here</a>. </li>
<li style="font-weight:400;">Teams already invested in Azure Functions should note this reduces reliance on external monitoring or support escalations for common quota-related issues, keeping troubleshooting within the Azure portal workflow.</li>
</ul>
<p>56:32  Matt – “This is a great quality of life improvement because you can see why things are breaking when you’re using flexible consumption.” </p>
<p>-Honorable Mentions-</p>
<p>1:01:07 <a href="https://techcommunity.microsoft.com/blog/microsoftsentinelblog/public-preview-announcement-empower-real-time-security-with-microsoft-sentinel%E2%80%99s/4483884">Public Preview Announcement: Empower Real-Time Security with </a><a href="https://techcommunity.microsoft.com/blog/microsoftsentinelblog/public-preview-announcement-empower-real-time-security-with-microsoft-sentinel%E2%80%99s/4483884">Microsoft Sentinel’s CCF Push Feature | Microsoft Community Hub</a></p>
<ul>
<li style="font-weight:400;"><a href="https://marketplace.microsoft.com/en-us/product/azure-applications/keepersecurity.keeper-security-integration?tab=Overview">Microsoft Sentinel’s</a> CCF Push feature, now in public preview, allows security data providers to send logs directly to a Sentinel workspace without the traditional setup overhead of manually configuring Data Collection Endpoints, Data Collection Rules, Entra app registrations, and RBAC assignments. Pressing Deploy handles all resource provisioning automatically.</li>
<li style="font-weight:400;">The feature is built on Sentinel’s Log Ingestion API, which supports high-throughput data ingestion, pre-ingestion data transformation, and direct targeting of system tables, making it more flexible than the older polling-based connector model.</li>
<li style="font-weight:400;">For partners and ISVs building Sentinel integrations, CCF Push reduces time to market by consolidating connector deployment through the Content Hub as a single interface, rather than requiring customers to configure multiple Azure resources independently.</li>
<li style="font-weight:400;">Early adopters include security vendors like <a href="https://marketplace.microsoft.com/en-us/product/azure-applications/391c3d87-edc8-4f72-a719-825c022b8eb4.azure-sentinel-solution-obsidian-activity-threat?tab=Overview">Obsidian Security</a> and <a href="https://marketplace.microsoft.com/en-us/product/azure-applications/varonis.azure-sentinel-solution-varonispurview?tab=Overview">Varonis</a>, suggesting the feature is already being validated in real-world security workflows. </li>
<li style="font-weight:400;">Developers can reference the MS Learn documentation <a href="http://learn.microsoft.com/azure/sentinel/create-push-codeless-connector">here</a> to get started.</li>
<li style="font-weight:400;">No specific pricing details were provided in the announcement, but since CCF Push feeds data into Sentinel workspaces, standard Sentinel and Log Analytics ingestion costs would apply. </li>
<li style="font-weight:400;">Organizations evaluating this feature should factor in their existing Sentinel pricing tier when estimating costs.</li>
</ul>
<p>1:01:24 <a href="https://blogs.microsoft.com/blog/2026/02/24/microsoft-sovereign-cloud-adds-governance-productivity-and-support-for-large-ai-models-securely-running-even-when-completely-disconnected/">Microsoft Sovereign Cloud adds governance, productivity and support for </a><a href="https://blogs.microsoft.com/blog/2026/02/24/microsoft-sovereign-cloud-adds-governance-productivity-and-support-for-large-ai-models-securely-running-even-when-completely-disconnected/">large AI models securely running even when completely disconnected</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/azure-local/manage/disconnected-operations-overview?view=azloc-2602">Azure Local disconnected operations</a> are now generally available, allowing organizations to run mission-critical infrastructure with full Azure governance and policy enforcement even when completely isolated from cloud connectivity. This targets government, defense, and regulated industries where external dependencies are either unacceptable or prohibited.</li>
<li style="font-weight:400;"><a href="https://techcommunity.microsoft.com/blog/AzureArcBlog/microsoft-365-local-is-generally-available/4470170">Microsoft 365 Local disconnected</a> brings <a href="https://learn.microsoft.com/en-us/exchange/exchange-server">Exchange Server</a>, <a href="https://learn.microsoft.com/en-us/sharepoint/getting-started">SharePoint Server</a>, and <a href="https://www.microsoft.com/en-us/microsoft-365/previous-versions/skype-for-business-online">Skype for Business Server</a> into fully air-gapped sovereign environments running on Azure Local, with Microsoft committing support for these workloads through at least 2035. </li>
<li style="font-weight:400;">This keeps productivity tools available under the same governance boundary as infrastructure workloads.</li>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/foundry-local/get-started">Foundry Local</a> now supports large multimodal AI models running on-premises hardware, including NVIDIA GPUs, within fully disconnected sovereign environments. This extends local AI inferencing capabilities beyond the smaller models Foundry Local previously supported, with Microsoft providing deployment, update, and operational health support.</li>
<li style="font-weight:400;">The three components together form a full-stack sovereign private cloud covering infrastructure, productivity, and AI inferencing, all manageable through consistent Azure governance tooling regardless of connectivity state. </li>
<li style="font-weight:400;">Pricing is not publicly listed and appears to vary based on deployment scale and customer qualification, so organizations should contact Microsoft directly for specifics.</li>
<li style="font-weight:400;">Target customers include public sector agencies, classified environments, and regulated industries in regions where data residency and operational autonomy are legal or contractual requirements. </li>
<li style="font-weight:400;">Azure Local is disconnected, and Microsoft 365 Local is available worldwide, while large model support on Foundry Local is currently limited to qualified customers.</li>
</ul>
<h2>Emerging Clouds </h2>
<p>1:03:04 <a href="https://www.crusoe.ai/resources/blog/introducing-crusoe-command-center">Introducing Command Center:  The unified operations platform  for AI </a><a href="https://www.crusoe.ai/resources/blog/introducing-crusoe-command-center">workloads</a></p>
<ul>
<li style="font-weight:400;"><a href="https://crusoe.ai/cloud/command-center">Crusoe Command Center</a> is a unified operations platform that consolidates GPU cluster monitoring, orchestration, and support into a single interface, addressing the common problem of engineers context-switching between fragmented dashboards during AI training runs.</li>
<li style="font-weight:400;">The platform integrates with <a href="https://docs.crusoecloud.com/orchestration/cmk/overview/index.html">Crusoe Managed Kubernetes</a> and supports Managed Slurm, allowing long-running multi-week training jobs to operate continuously across large GPU clusters without manual intervention.</li>
<li style="font-weight:400;">AutoClusters is a key component that automatically detects GPU performance degradation, evicts compromised nodes, and replaces them with healthy instances from a reserve pool, reducing the need for around-the-clock manual oversight.</li>
<li style="font-weight:400;">On the observability side, Command Center supports multiple access methods, including a UI, Grafana via PromQL API, and a Prometheus endpoint, while a Telemetry Relay feature streams infrastructure metrics directly to external tools to reduce data silos.</li>
<li style="font-weight:400;">The <a href="https://github.com/crusoecloud/crusoe-watch-agent/blob/main/README.md">Crusoe Watch Agent</a>, paired with Telemetry Relay, extends visibility to custom application-level metrics, allowing teams to correlate workload performance with underlying GPU health data for more precise troubleshooting.</li>
</ul>
<p>1:04:04  Matt – “The whole stack here is what I kind of find nice. The smaller clouds are trying to attack that whole vertical a lot more, where they’re giving you that depth all the way down, so if you are training your own model, you get the CPU, you get the GPU, you can see that whole stack of what’s going on, and really start to fine-tune.”     </p>
<p>1:05:09 <a href="https://www.digitalocean.com/blog/now-available-amd-instinct-mi350x-gpus">Expanding our Agentic Inference Cloud: Introducing GPU Droplets </a><a href="https://www.digitalocean.com/blog/now-available-amd-instinct-mi350x-gpus">Powered by AMD Instinct MI350X GPUs</a></p>
<ul>
<li style="font-weight:400;">DigitalOcean is adding <a href="https://www.amd.com/en/products/accelerators/instinct/mi350/mi350x.html">AMD Instinct MI350X</a> GPUs to its <a href="https://www.digitalocean.com/products/gradient/gpu-droplets?utm_campaign=pmax_nb_gpu_competitorkeywords_usa_en&amp;utm_adgroup=&amp;utm_keyword=&amp;utm_matchtype=&amp;utm_adposition=&amp;utm_creative=&amp;utm_placement=&amp;utm_device=c&amp;utm_locationi=&amp;utm_location=9031971&amp;utm_term=&amp;utm_source=google&amp;utm_medium=cpc&amp;gad_source=1&amp;gclid=CjwKCAjw7pO_BhAlEiwA4pMQvJQk7YIRizEN9HzSdpzBRw21cS6QGjohJo_pAUfBRADZ8vOldcU9WxoC56QQAvD_BwE">GPU Droplets</a> lineup, built on the CDNA 4 architecture and optimized for inference workloads, including prefill phase compute, low-latency token generation, and larger context windows.</li>
<li style="font-weight:400;">The platform has demonstrated measurable results with existing customers, including a 2x increase in production request throughput and 50% reduction in inference costs for <a href="http://character.ai">Character.AI</a>, giving potential adopters concrete performance benchmarks to evaluate.</li>
<li style="font-weight:400;">DigitalOcean is positioning these offerings toward AI-native companies and developers who need enterprise features like HIPAA eligibility and SOC 2 compliance without the complexity of larger cloud providers, with provisioning available in a few clicks.</li>
<li style="font-weight:400;">The GPUs are currently available in the Atlanta datacenter, with AMD Instinct MI355X GPUs planned for next quarter, which will introduce liquid-cooled rack infrastructure to support larger models and datasets.</li>
<li style="font-weight:400;">For smaller businesses and developers, the predictable usage-based pricing and simplified deployment model represent a meaningful alternative to the more complex pricing and configuration requirements typical of hyperscaler GPU offerings.</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2382918/c1e-7nknsv3m59bwj41v-rk2qm0wkfwon-ykry8x.mp3" length="118647282"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 344 of The Cloud Pod, where the forecast is always cloudy! Justin is out of the office at a World of Warcraft Tournament (not really), and Ryan is pursuing his lifelong dream of becoming a roadie for The Eagles (maybe?), so it’s Jonathan and Matt holding down the fort this week, and they’ve got a ton of cloud news for you! From security to AI assistants, we’ve got all the news you need. Let’s get started! 
Titles we almost went with this week

 Zero Bus, All Gas, No Kafka Brakes
 AI Coding Bot Bites the Hand That Runs It
 When Your Robot Developer Goes Rogue on AWS
 Kubernetes VPA Finally Stops Evicting Your Database Pods
 Google Trains 100 Million People, Still No One Reads the Docs 
 MCP Walks Into a Bar Not Enterprise Ready Yet
 No More Pod Evictions Kubernetes 1.35 Scales In Place
 No Keys No Drama Just IAM and Cloud SQL
One Agent to Rule Them All in Kubernetes
 IAM Tired of Writing Policies Manually
 When Your AI Coding Tool Has Delete Permissions
 One Dashboard to Rule All Your GPU Clusters
 Serverless Reservations Prove Nothing Is Truly Free Range
 Kiro Takes the Wheel on AWS IAM Policies
 Stop Blaming Backups for Your Bad Architecture
 AI Agent Goes Rogue, Takes AWS Down With It
 Everything is Bigger in Texas Except the Water Usage
OpenAI launches the college basketball of Inference. Pro service – low cost

General News 
1:05 Code Mode: give agents an entire API in 1,000 tokens

Cloudflare‘s Code Mode MCP server reduces token consumption by 99.9% compared to a traditional MCP implementation, exposing the entire Cloudflare API (over 2,500 endpoints) through just two tools, search() and execute(), using roughly 1,000 tokens versus 1.17 million for a conventional approach.
The architecture works by having the AI agent write JavaScript code against a typed OpenAPI spec representation, rather than loading tool definitions into context, with code executing inside a sandboxed V8 isolate (Dynamic Worker) that restricts file system access, environment variables, and external fetches by default.
This approach addresses a fundamental constraint in agentic AI systems: adding more tools to give agents broader capabilities directly competes with the available context space for the task at hand.

01:41  Jonathan- “It’s good. I’m not sure I could imagine 2 ½ thousand MCP tool definitions in a context window and still actually use it for anything.”   
AI Is Going Great – Or How ML Makes Money 
03:58 OpenClaw creator Peter Steinberger joins OpenAI

Peter Steinberger, creator of viral AI assistant OpenClaw (formerly Clawdbot/Moltbot), has joined OpenAI to lead development of next-generation personal agents. 
OpenClaw gained attention for its ability to perform real-world tasks like calendar management, flight booking, and autonomous social network participation.
OpenAI will maintain OpenClaw as an open source project through a foundation structure, allo...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2382918/c1a-k5d5-0v9wm0n5i6vo-m1wdoh.jpg"></itunes:image>
                                                                            <itunes:duration>01:01:30</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2382918/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[343: AWS CloudWatch Finally Hits Snooze]]>
                </title>
                <pubDate>Wed, 25 Feb 2026 23:20:15 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2374104</guid>
                                    <link>https://tcpfm.castos.com/episodes/aws-cloudwatch-finally-hits-snooze</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 343 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Matt are in the studio this week bringing you all the latest in Cloud and AI news, including some of the smaller clouds like Cloudflare and Crusoe Cloud, as well as announcements from the big guys like Google’s Gemini DeepThink, Anthropic’s big pay day, and Microsoft’s Notepad problem. We’ve got all this plus Matt screwing up his outro AGAIN, so let’s get started! </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> Chrome’s WebMCP Protocol: Teaching AI Agents to Stop Doom-Scrolling the DOM and Actually Get Work Done</li>
<li>Claude Enterprise Self-Service: Because Sometimes You Just Want to Buy AI Without Small Talk</li>
<li> AWS EC2 Goes Inception Mode: Now You Can Virtualize Your Virtualization Without Going Broke</li>
<li> Amazon EC2 Nested Virtualization: Because Your Virtual Machine Was Lonely and Needed Its Own Virtual Machine</li>
<li> CloudWatch Alarm Mute Rules: Because Your Deployment Doesn’t Need a Standing    Ovation at 3 AM</li>
<li> Anthropic’s $380 Billion Valuation Proves AI Funding Has Gone Claude Nine </li>
<li>AWS EC2 Nested Virtualization Finally Escapes the Expensive Hardware Jail</li>
<li> Cloudflare Teaches AI Agents the Magic Words: Accept text/markdown and Save 13,000 Tokens</li>
<li> Crusoe Cloud’s MCP Server: Teaching AI Assistants to Stop Asking for the Manager and Just Fix Your Infrastructure</li>
<li> Azure’s New Agentic Copilot: Because Manually Clicking Through Dashboards Was So 2023</li>
<li> Chrome’s WebMCP Gives AI Agents a GPS for Websites Because Apparently They’ve Been Lost in the HTML This Whole Time </li>
<li> Anthropic Cuts Out the Middleman: Claude Enterprise Now Available Without the Enterprise Sales Dance</li>
<li> AWS Gives CloudWatch the Silent Treatment: New Mute Rules Let Alarms Sleep Through Maintenance Windows</li>
<li> AWS CloudWatch Hits Snooze: Mute Rules End On-Call Nightmares</li>
<li> AWS Gives CloudWatch the Silent Treatment</li>
</ul>
<h2>General News </h2>
<p>00:45 <a href="https://share.google/Q7OyXxsFn3ixLjUqh">Bloat Risk? Microsoft’s Notepad Upgrade Also Introduced a Vulnerability | </a><a href="https://share.google/Q7OyXxsFn3ixLjUqh">PCMag</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.microsoft.com/en-us">Microsoft’s</a> recent <a href="https://apps.microsoft.com/detail/9msmlrh6lzf3?hl=en-US&amp;gl=US">Notepad</a> modernization introduced <a href="https://www.cve.org/CVERecord?id=CVE-2026-20841">CVE-2026-20841</a>, a vulnerability in the new <a href="https://www.pcmag.com/encyclopedia/term/markdown">Markdown support</a> feature that allows malicious links in files to execute remote code. </li>
<li style="font-weight:400;">The flaw has been patched in the February 2026 security updates, but it highlights the security trade-offs when adding features to historically simple applications.</li>
<li style="font-weight:400;">The vulnerability exploits Notepad’s Markdown rendering capability, which Microsoft <a href="https://blogs.windows.com/windows-insider/2025/05/30/text-formatting-in-notepad-begin-rolling-out-to-windows-insiders/">added in May</a> to support lightweight markup language formatting. When Notepad opens a specially crafted Markdown file, embedded malicious links can trigger unverified protocols that load and execute remote files on the system.</li>
<li style="font-weight:400;">This incident raises questions about feature bloat in core Windows utilities, particularly as Microsoft continues adding network-dependent capabilities like AI-powered text writing to Notepad. Security researchers are debating whether basic text editors should have network functionality at all, given the expanded attack surface.</li>
<li style="font-weight:400;">The vulnerability demonstrates how modernization efforts can introduce security risks in previously low-risk applications. </li>
<li style="font-weight:400;">Organizations using Windows need to ensure t...</li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:00) - AWS Cloudwatch Finally Learns to Hit Snooze</li><li>(00:00:48) - Microsoft's Notepad Vulnerability</li><li>(00:03:09) - WebMCP: The Standardization of AI Agents</li><li>(00:07:17) - AI Completes 4% of GitHub Commits</li><li>(00:09:32) - Cloud for Enterprise: Anthropic's Dominance</li><li>(00:14:54) - Sonnet 4.6 Available for Cloud</li><li>(00:16:34) - Coding Productivity: The Shift</li><li>(00:25:37) - MacBook Pro: Should You Upgrade to the M5?</li><li>(00:29:13) - Mac Studio: The M3 Ultra vs. Nvidia DGX Spark</li><li>(00:31:00) - Alibaba Launches New Large Language Model</li><li>(00:32:03) - Sea Dance Studio Launches Sealed Dance 2.0</li><li>(00:35:27) - AMD EPYC, HPC 8A Instances Now Available on</li><li>(00:38:05) - Kafka: Native AWS API for Topic Management</li><li>(00:41:08) - Amazon Bedrock now supports six new Open Weight Models</li><li>(00:50:32) - AWS: Supports nested virtualization on bare metal EC2 instances</li><li>(00:52:24) - Amazon SageMaker Inference for Custom Nova Models now available</li><li>(00:54:45) - Google's DeepThink Update to Gemini 3</li><li>(00:57:32) - BigQuery: Cross-Region Queries in Preview</li><li>(01:00:52) - Microsoft's Azure Copilot: Automating Cloud Operations</li><li>(01:03:22) - Azure now offers instant access to incremental snapshots for premium SSD,</li><li>(01:04:42) - Crystal Cloud and the AI-first Hypercloud</li><li>(01:05:16) - Cloud Infrastructure Management: Bringing AI Agents to the Cloud</li><li>(01:07:21) - Cloudflare to automatically convert HTML to Markdown for AI Agents</li><li>(01:10:52) - Week in Cloud: The Cloud and AI</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 343 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Matt are in the studio this week bringing you all the latest in Cloud and AI news, including some of the smaller clouds like Cloudflare and Crusoe Cloud, as well as announcements from the big guys like Google’s Gemini DeepThink, Anthropic’s big pay day, and Microsoft’s Notepad problem. We’ve got all this plus Matt screwing up his outro AGAIN, so let’s get started! 
Titles we almost went with this week

 Chrome’s WebMCP Protocol: Teaching AI Agents to Stop Doom-Scrolling the DOM and Actually Get Work Done
Claude Enterprise Self-Service: Because Sometimes You Just Want to Buy AI Without Small Talk
 AWS EC2 Goes Inception Mode: Now You Can Virtualize Your Virtualization Without Going Broke
 Amazon EC2 Nested Virtualization: Because Your Virtual Machine Was Lonely and Needed Its Own Virtual Machine
 CloudWatch Alarm Mute Rules: Because Your Deployment Doesn’t Need a Standing    Ovation at 3 AM
 Anthropic’s $380 Billion Valuation Proves AI Funding Has Gone Claude Nine 
AWS EC2 Nested Virtualization Finally Escapes the Expensive Hardware Jail
 Cloudflare Teaches AI Agents the Magic Words: Accept text/markdown and Save 13,000 Tokens
 Crusoe Cloud’s MCP Server: Teaching AI Assistants to Stop Asking for the Manager and Just Fix Your Infrastructure
 Azure’s New Agentic Copilot: Because Manually Clicking Through Dashboards Was So 2023
 Chrome’s WebMCP Gives AI Agents a GPS for Websites Because Apparently They’ve Been Lost in the HTML This Whole Time 
 Anthropic Cuts Out the Middleman: Claude Enterprise Now Available Without the Enterprise Sales Dance
 AWS Gives CloudWatch the Silent Treatment: New Mute Rules Let Alarms Sleep Through Maintenance Windows
 AWS CloudWatch Hits Snooze: Mute Rules End On-Call Nightmares
 AWS Gives CloudWatch the Silent Treatment

General News 
00:45 Bloat Risk? Microsoft’s Notepad Upgrade Also Introduced a Vulnerability | PCMag

Microsoft’s recent Notepad modernization introduced CVE-2026-20841, a vulnerability in the new Markdown support feature that allows malicious links in files to execute remote code. 
The flaw has been patched in the February 2026 security updates, but it highlights the security trade-offs when adding features to historically simple applications.
The vulnerability exploits Notepad’s Markdown rendering capability, which Microsoft added in May to support lightweight markup language formatting. When Notepad opens a specially crafted Markdown file, embedded malicious links can trigger unverified protocols that load and execute remote files on the system.
This incident raises questions about feature bloat in core Windows utilities, particularly as Microsoft continues adding network-dependent capabilities like AI-powered text writing to Notepad. Security researchers are debating whether basic text editors should have network functionality at all, given the expanded attack surface.
The vulnerability demonstrates how modernization efforts can introduce security risks in previously low-risk applications. 
Organizations using Windows need to ensure t...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[343: AWS CloudWatch Finally Hits Snooze]]>
                </itunes:title>
                                    <itunes:episode>343</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 343 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Matt are in the studio this week bringing you all the latest in Cloud and AI news, including some of the smaller clouds like Cloudflare and Crusoe Cloud, as well as announcements from the big guys like Google’s Gemini DeepThink, Anthropic’s big pay day, and Microsoft’s Notepad problem. We’ve got all this plus Matt screwing up his outro AGAIN, so let’s get started! </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> Chrome’s WebMCP Protocol: Teaching AI Agents to Stop Doom-Scrolling the DOM and Actually Get Work Done</li>
<li>Claude Enterprise Self-Service: Because Sometimes You Just Want to Buy AI Without Small Talk</li>
<li> AWS EC2 Goes Inception Mode: Now You Can Virtualize Your Virtualization Without Going Broke</li>
<li> Amazon EC2 Nested Virtualization: Because Your Virtual Machine Was Lonely and Needed Its Own Virtual Machine</li>
<li> CloudWatch Alarm Mute Rules: Because Your Deployment Doesn’t Need a Standing    Ovation at 3 AM</li>
<li> Anthropic’s $380 Billion Valuation Proves AI Funding Has Gone Claude Nine </li>
<li>AWS EC2 Nested Virtualization Finally Escapes the Expensive Hardware Jail</li>
<li> Cloudflare Teaches AI Agents the Magic Words: Accept text/markdown and Save 13,000 Tokens</li>
<li> Crusoe Cloud’s MCP Server: Teaching AI Assistants to Stop Asking for the Manager and Just Fix Your Infrastructure</li>
<li> Azure’s New Agentic Copilot: Because Manually Clicking Through Dashboards Was So 2023</li>
<li> Chrome’s WebMCP Gives AI Agents a GPS for Websites Because Apparently They’ve Been Lost in the HTML This Whole Time </li>
<li> Anthropic Cuts Out the Middleman: Claude Enterprise Now Available Without the Enterprise Sales Dance</li>
<li> AWS Gives CloudWatch the Silent Treatment: New Mute Rules Let Alarms Sleep Through Maintenance Windows</li>
<li> AWS CloudWatch Hits Snooze: Mute Rules End On-Call Nightmares</li>
<li> AWS Gives CloudWatch the Silent Treatment</li>
</ul>
<h2>General News </h2>
<p>00:45 <a href="https://share.google/Q7OyXxsFn3ixLjUqh">Bloat Risk? Microsoft’s Notepad Upgrade Also Introduced a Vulnerability | </a><a href="https://share.google/Q7OyXxsFn3ixLjUqh">PCMag</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.microsoft.com/en-us">Microsoft’s</a> recent <a href="https://apps.microsoft.com/detail/9msmlrh6lzf3?hl=en-US&amp;gl=US">Notepad</a> modernization introduced <a href="https://www.cve.org/CVERecord?id=CVE-2026-20841">CVE-2026-20841</a>, a vulnerability in the new <a href="https://www.pcmag.com/encyclopedia/term/markdown">Markdown support</a> feature that allows malicious links in files to execute remote code. </li>
<li style="font-weight:400;">The flaw has been patched in the February 2026 security updates, but it highlights the security trade-offs when adding features to historically simple applications.</li>
<li style="font-weight:400;">The vulnerability exploits Notepad’s Markdown rendering capability, which Microsoft <a href="https://blogs.windows.com/windows-insider/2025/05/30/text-formatting-in-notepad-begin-rolling-out-to-windows-insiders/">added in May</a> to support lightweight markup language formatting. When Notepad opens a specially crafted Markdown file, embedded malicious links can trigger unverified protocols that load and execute remote files on the system.</li>
<li style="font-weight:400;">This incident raises questions about feature bloat in core Windows utilities, particularly as Microsoft continues adding network-dependent capabilities like AI-powered text writing to Notepad. Security researchers are debating whether basic text editors should have network functionality at all, given the expanded attack surface.</li>
<li style="font-weight:400;">The vulnerability demonstrates how modernization efforts can introduce security risks in previously low-risk applications. </li>
<li style="font-weight:400;">Organizations using Windows need to ensure their systems receive the <a href="https://msrc.microsoft.com/update-guide/releaseNote/2026-Feb">February 2026 security updates</a> to address this specific flaw in Notepad’s Markdown implementation.</li>
</ul>
<p>02:04  Matt – “I’m just confused why they didn’t use Copilot on their pull request in order to identify this as a potential bug. I feel like it should have found it. Just sayin’…”  </p>
<p>03:13 <a href="https://developer.chrome.com/blog/webmcp-epp">WebMCP is available for early preview</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.google.com/chrome/">Chrome</a> is introducing <a href="https://webmcp.link/">WebMCP</a>, a standardized protocol that lets websites expose structured tools and actions directly to AI agents, eliminating the need for agents to parse raw HTML and DOM elements. </li>
<li style="font-weight:400;">This addresses a key reliability problem in agentic workflows where AI agents currently struggle with inconsistent web interactions.</li>
<li style="font-weight:400;">The protocol offers two interaction modes: a declarative API for simple HTML form-based actions and an imperative API for complex JavaScript-driven workflows. This dual approach lets websites define exactly how agents should interact with features like booking systems, support ticket forms, and checkout processes.</li>
<li style="font-weight:400;">Early use cases focus on high-value transactional workflows, including e-commerce product configuration, travel booking with complex filtering requirements, and automated customer support ticket creation with technical details. These scenarios benefit most from structured interactions versus unreliable DOM manipulation.</li>
<li style="font-weight:400;">The early preview program requires sign-up for access to documentation and demos, indicating this is still in experimental stages. </li>
<li style="font-weight:400;">Developers interested in making their sites agent-ready will need to implement these new APIs to participate in the agentic web ecosystem Chrome is building.</li>
<li style="font-weight:400;">This represents Chrome’s attempt to standardize how AI agents interact with websites before the market fragments with competing approaches. Sites that adopt WebMCP early may gain advantages as browser-based AI agents become more prevalent.</li>
<li style="font-weight:400;">Interested in signing up for the preview? You can do that <a href="https://developer.chrome.com/docs/ai/join-epp">here</a>. </li>
</ul>
<p>04:41  Ryan – “It makes a lot of sense why they want to standardize on a specific protocol, but I can’t help but feel like this is the beginning of the end of human interaction; where you’re going to have an AI agent-to-agent protocol.” </p>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>07:27 <a href="https://www.anthropic.com/news/anthropic-raises-30-billion-series-g-funding-380-billion-post-money-valuation">Anthropic raises $30 billion in Series G funding at $380 billion post-money </a><a href="https://www.anthropic.com/news/anthropic-raises-30-billion-series-g-funding-380-billion-post-money-valuation">valuation \ Anthropic</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/">Anthropic</a> closed a $30 billion Series G at a $380 billion post-money valuation, reaching $14 billion in run-rate revenue with 10x annual growth for three consecutive years. </li>
<li style="font-weight:400;">The company now serves eight of the Fortune 10, with over 500 customers spending more than $1 million annually.</li>
<li style="font-weight:400;"><a href="https://claude.com/product/claude-code">Claude Code</a>, made generally available in May 2025, has grown to $2.5 billion in run-rate revenue and now accounts for 4% of all public GitHub commits worldwide. Business subscriptions quadrupled since early 2026, with enterprise customers representing over half of Claude Code’s revenue.</li>
<li style="font-weight:400;"><a href="https://www.anthropic.com/news/claude-opus-4-6">Opus 4.6</a> launched last week as the latest model release, leading the <a href="https://artificialanalysis.ai/evaluations/gdpval-aa">GDPval-AA</a> benchmark for economically valuable knowledge work in finance and legal domains. The model powers agents capable of generating professional documents, spreadsheets, and presentations autonomously.</li>
<li style="font-weight:400;">Anthropic expanded its product portfolio in January with over thirty launches, including <a href="https://claude.com/product/cowork">Cowork</a>, which extends Claude Code capabilities to broader knowledge work with eleven open-source plugins for specialized roles. </li>
<li style="font-weight:400;">Claude for Enterprise is now HIPAA-compliant and available for healthcare and life sciences organizations.</li>
<li style="font-weight:400;">Claude remains the only frontier AI model available across all three major cloud platforms through <a href="https://aws.amazon.com/bedrock/">AWS Bedrock</a>, <a href="https://cloud.google.com/vertex-ai">Google Cloud Vertex AI</a>, and <a href="https://azure.microsoft.com/en-us/products/ai-foundry/">Microsoft Azure Foundry</a>. </li>
<li style="font-weight:400;">The company trains on diversified hardware, including AWS Trainium, Google TPUs, and NVIDIA GPUs, to optimize workload performance and resilience.</li>
</ul>
<p>08:10 Matt – “Those numbers are insane. I just want to make sure we’re all clear about that.” </p>
<p>15:16 <a href="https://www.anthropic.com/news/claude-sonnet-4-6">Introducing Sonnet 4.6 \ Anthropic</a></p>
<ul>
<li style="font-weight:400;"><a href="https://claude.ai/redirect/website.v1.e8e96b2d-c64a-48ec-8bed-03d7c78fd037">Claude Sonnet 4.6</a> is now generally available across all <a href="https://claude.com/pricing">Claude plans</a>, API, and major cloud platforms at the same <a href="https://claude.com/pricing#api">pricing</a> as <a href="https://claude.com/resources/tutorials/getting-the-most-out-of-sonnet-4-5-in-claude-ai">Sonnet 4.5</a> ($3/$15 per million tokens), with a 1M token context window in beta. </li>
<li style="font-weight:400;">The model now serves as the default for Free and Pro plan users, bringing Opus-class performance to a mid-tier price point.</li>
<li style="font-weight:400;">Computer use capabilities have improved substantially, with Sonnet 4.6 scoring 94% on insurance benchmarks and showing human-level performance on tasks like navigating complex spreadsheets and multi-step web forms. </li>
<li style="font-weight:400;">The model demonstrates better resistance to prompt injection attacks compared to Sonnet 4.5 and performs similarly to <a href="https://www.anthropic.com/news/claude-opus-4-6">Opus 4.6</a> on <a href="https://anthropic.com/claude-sonnet-4-6-system-card">safety evaluations</a>.</li>
<li style="font-weight:400;">Coding performance has advanced significantly, with early users preferring Sonnet 4.6 over Sonnet 4.5 roughly 70% of the time and even choosing it over Opus 4.5 59% of the time. </li>
<li style="font-weight:400;">Users report better instruction following, less overengineering, fewer hallucinations, and more consistent follow-through on multi-step tasks, with one customer reporting an 80.2% score on SWE-bench Verified.</li>
<li style="font-weight:400;">Several features have reached general availability on the <a href="https://platform.claude.com/docs/en/test-and-evaluate/strengthen-guardrails/mitigate-jailbreaks">API</a>, including code execution, memory, programmatic tool calling, tool search, and tool use examples. </li>
<li style="font-weight:400;">Web search and fetch tools now automatically write and execute code to filter search results, improving response quality and token efficiency.</li>
<li style="font-weight:400;">The model supports both adaptive thinking and extended thinking modes, with context compaction in beta that automatically summarizes older context as conversations approach limits. </li>
<li style="font-weight:400;"><a href="https://claude.com/claude-in-excel">Claude in Excel</a> now supports MCP connectors, allowing users to pull data from external sources like S&amp;P Global, LSEG, and PitchBook directly within spreadsheets. </li>
</ul>
<p>17:42 Ryan – “I haven’t played with Sonnet because it’s just released, but playing around with Opus, you can see that it’s another major improvement in these steps, and it is pretty fantastic to use.”</p>
<p>19:44 <a href="https://writing.nikunjk.com/p/token-anxiety">Token Anxiety – by Nikunj Kothari – Balancing Act</a></p>
<ul>
<li style="font-weight:400;">This article describes a cultural shift in San Francisco’s tech scene where developers are prioritizing AI agent management over social activities, with people leaving parties early to check on overnight code generation and spending weekends running 12-hour build sessions with AI assistants like Claude and Codex.</li>
<li style="font-weight:400;">The piece highlights how AI coding tools have created a new productivity anxiety where developers feel compelled to keep agents running continuously, even during sleep, to maximize output and stay competitive as new model capabilities and context windows are released weekly.</li>
<li style="font-weight:400;">Developers are adopting new vocabulary around AI models, discussing them like sommeliers evaluate wine and using animal training metaphors like keeping Claude on a tight leash for code review while giving it more slack for creative work.</li>
<li style="font-weight:400;">The constant stream of benchmark improvements and new AI capabilities is creating pressure to continuously optimize workflows, as each advancement makes previous methods feel outdated and multiplies the sense that competitors are already leveraging these improvements.</li>
<li style="font-weight:400;">This represents a broader shift in developer culture where traditional leisure activities are being replaced by AI-assisted building, with the primary social metric changing from what you accomplished to how many agents you have running in parallel.</li>
</ul>
<p>24:25 Ryan – “I still don’t know how everyone has these overnight workloads; I guess I don’t trust AI at all; I’m not going to let it run unsupervised.”  </p>
<p>31:48 <a href="https://www.theinformation.com/briefings/alibaba-launches-new-llm-chinas-ai-battle-heats">Alibaba Launches New LLM as China’s AI Battle Heats Up </a></p>
<ul>
<li style="font-weight:400;"><a href="https://qwen.ai/blog?id=qwen3.5">Qwen 3.5</a> is out. No industry freakouts (like with DeepSeek) so far</li>
</ul>
<p>33:06 <a href="https://seed.bytedance.com/en/blog/official-launch-of-seedance-2-0">Seed News – ByteDance Seed Team</a></p>
<ul>
<li style="font-weight:400;"><a href="https://seed.bytedance.com/en/">ByteDance</a> officially launched <a href="https://seed.bytedance.com/en/seed2">Seedance 2.0</a>, a next-generation video creation model with a unified multimodal audio-video architecture supporting text, image, audio, and video inputs. </li>
<li style="font-weight:400;">The model can process up to 9 images, 3 video clips, 3 audio clips, and natural language instructions simultaneously for comprehensive content referencing and editing.</li>
<li style="font-weight:400;">The model delivers substantial improvements in complex motion rendering and physical accuracy, particularly excelling at multi-subject interactions like competitive figure skating with synchronized movements, mid-air spins, and precise landings that follow real-world physics. </li>
<li style="font-weight:400;">Industry evaluations show Seedance 2.0 achieves leading performance in motion stability, instruction following, and visual aesthetics compared to competing models.</li>
<li style="font-weight:400;">Seedance 2.0 introduces dual-channel stereo audio generation with multi-track parallel output for background music, ambient effects, and voiceovers synchronized to visual rhythm. </li>
<li style="font-weight:400;">The model supports 15-second high-quality multi-shot audio-video output suitable for commercial advertising, film VFX, game animations, and explainer videos.</li>
<li style="font-weight:400;">New video editing capabilities allow targeted modifications to specific clips, characters, actions, and storylines, plus video extension functionality for generating continuous shots based on user prompts. </li>
<li style="font-weight:400;">The model demonstrates improved instruction-following for complex scripts and open-ended prompts while maintaining subject consistency across extended sequences.</li>
<li style="font-weight:400;">The unified multimodal architecture enables professional-grade content creation workflows where users can reference composition, motion, camera movement, visual effects, and audio elements from input assets, significantly lowering barriers to industrial-level video production without requiring specialized technical expertise.</li>
<li style="font-weight:400;"><a href="https://www.instagram.com/reel/DUm4zSvEn76/">https://www.instagram.com/reel/DUm4zSvEn76/</a> – John Wick cat video as mentioned. </li>
</ul>
<p>34:53 Justin – “I’m surprised Hollywood stock didn’t crash today over this; very very impressive. Crazily so.” </p>
<h2>AWS </h2>
<p>36:47 <a href="https://aws.amazon.com/about-aws/whats-new/2026/02/aws-m8azn-instances-generally-available">Announcing new Amazon EC2 general purpose M8azn instances</a></p>
<ul>
<li style="font-weight:400;">AWS launches <a href="https://aws.amazon.com/about-aws/whats-new/2026/02/aws-m8azn-instances-generally-available/">M8azn instances</a> powered by fifth-generation <a href="https://www.amd.com/en/products/processors/server/epyc/9005-series.html">AMD EPYC Turin processors</a> running at 5GHz, the highest CPU frequency available in the cloud. These general-purpose instances deliver 2x compute performance over M5zn and 24% better performance than M8a instances, with 4.3x higher memory bandwidth and 10x larger L3 cache.</li>
<li style="font-weight:400;">The instances target latency-sensitive workloads like high-frequency trading, real-time financial analytics, and simulation modeling for automotive and aerospace industries. </li>
<li style="font-weight:400;">Built on sixth-generation Nitro Cards, they provide 2x networking throughput and 3x EBS throughput compared to M5zn instances.</li>
<li style="font-weight:400;">M8azn instances come in nine sizes from 2 to 96 vCPUs with up to 384 GiB memory at a 4:1 memory-to-vCPU ratio, including two bare metal variants. Available in US East Virginia, US West Oregon, Tokyo, and Frankfurt regions through On-Demand, Spot, and Savings Plans pricing models.</li>
<li style="font-weight:400;">The high-frequency positioning fills a specific niche for workloads requiring maximum single-threaded performance rather than just core count.</li>
<li style="font-weight:400;">This complements AWS’s broader M8a lineup by offering customers a choice between standard frequency instances and these premium high-frequency variants for specialized use cases.</li>
</ul>
<p>37:03 <a href="https://aws.amazon.com/about-aws/whats-new/2026/02/amazon-ec2-c8i-m8i-and-r8i-instances-on-aws-outposts/">Announcing Amazon EC2 C8i, M8i, and R8i instances on </a><a href="https://aws.amazon.com/about-aws/whats-new/2026/02/amazon-ec2-c8i-m8i-and-r8i-instances-on-aws-outposts/">second-generation AWS Outposts racks</a></p>
<ul>
<li style="font-weight:400;">AWS is bringing C8i, M8i, and R8i instances to second-generation <a href="https://aws.amazon.com/outposts/rack/">Outposts racks</a>, delivering 20% better performance and 2.5x more memory bandwidth compared to the previous C7i, M7i, and R7i generation. These instances also provide 20% more compute capacity within the same physical rack space and power consumption, improving density for on-premises deployments.</li>
<li style="font-weight:400;">The new instances run on custom <a href="https://www.intel.com/content/www/us/en/products/details/processors/xeon.html">Intel Xeon 6 processors</a> exclusive to AWS and target workloads that need enhanced on-premises performance, including large databases, memory-intensive applications, real-time analytics, high-performance video encoding, and CPU-based ML inference. </li>
<li style="font-weight:400;">This addresses the gap for customers who need cloud-class compute but must keep workloads on-premises due to latency, data residency, or regulatory requirements.</li>
<li style="font-weight:400;">Second-generation Outposts racks continue AWS’s hybrid cloud strategy by extending the latest EC2 instance types to customer data centers with the same APIs and tooling as the public cloud. </li>
<li style="font-weight:400;">The availability varies by region, so customers should check the Outposts rack FAQs page for current country and territory support before planning deployments.</li>
<li style="font-weight:400;">The performance improvements come primarily from the memory bandwidth increase and processor generation upgrade, which should benefit database operations, in-memory caching, and data-intensive applications that previously hit memory bottlenecks on Outposts. </li>
<li style="font-weight:400;">The power and space efficiency gains matter for customers with constrained data center capacity or energy budgets.</li>
</ul>
<p>37:08 <a href="https://aws.amazon.com/blogs/aws/amazon-ec2-hpc8a-instances-powered-by-5th-gen-amd-epyc-processors-are-now-available/">Amazon EC2 Hpc8a Instances powered by 5th Gen AMD EPYC processors </a><a href="https://aws.amazon.com/blogs/aws/amazon-ec2-hpc8a-instances-powered-by-5th-gen-amd-epyc-processors-are-now-available/">are now available</a></p>
<ul>
<li style="font-weight:400;">AWS launches Hpc8a instances powered by 5th Gen AMD EPYC processors, delivering 40% higher performance and 42% greater memory bandwidth than the previous <a href="https://aws.amazon.com/blogs/aws/new-amazon-ec2-hpc7a-instances-powered-by-4th-gen-amd-epyc-processors-optimized-for-high-performance-computing/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">Hpc7a</a> generation, while offering up to 25% better price-performance for tightly coupled HPC workloads like computational fluid dynamics and weather modeling.</li>
<li style="font-weight:400;">The instances come in a single 96xlarge size with 192 cores, 768 GiB memory, and 300 Gbps <a href="https://aws.amazon.com/hpc/efa/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">Elastic Fabric Adapter</a> networking, featuring customizable core counts at launch and sixth-generation AWS Nitro cards for offloaded virtualization functions. Simultaneous Multithreading is disabled by default to optimize HPC performance.</li>
<li style="font-weight:400;">Available now in US East Ohio and Europe Stockholm regions, with support for <a href="https://aws.amazon.com/hpc/parallelcluster/">AWS ParallelCluster</a>, <a href="https://aws.amazon.com/pcs/4">AWS Parallel Computing Service</a>, and <a href="https://docs.aws.amazon.com/fsx/latest/LustreGuide/what-is.html">Amazon FSx for Lustre</a> integration to simplify cluster management and provide sub-millisecond storage latencies. Customers can purchase as On-Demand Instances or through Savings Plans, with specific pricing available on the <a href="https://aws.amazon.com/ec2/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">EC2</a> pricing page.</li>
<li style="font-weight:400;">The 1:4 core-to-memory ratio and high core density target compute-intensive simulation workloads requiring rapid time-to-results, including crash simulations and high-resolution weather modeling within tight operational windows. The customizable core count feature allows right-sizing based on specific HPC workload requirements without paying for unused capacity.</li>
</ul>
<p>39:20  Ryan – “I’m sure they use a subcontractor for actual maintenance, things. But I’m sure that you have to give them access and manage them just like you would any other remote hands for your data center.”</p>
<p>39:37 <a href="https://aws.amazon.com/about-aws/whats-new/2026/02/amazon-msk-kafka-topics-public-apis/">MSK simplifies Kafka topic management with new APIs and console </a><a href="https://aws.amazon.com/about-aws/whats-new/2026/02/amazon-msk-kafka-topics-public-apis/">integration</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/msk/latest/developerguide/getting-started.html">Amazon MSK</a> now provides native AWS APIs for <a href="https://kafka.apache.org/">Kafka</a> topic management, eliminating the need to set up and maintain separate Kafka admin clients. The three new APIs (CreateTopic, UpdateTopic, and DeleteTopic) work alongside existing ListTopics and DescribeTopic APIs through AWS CLI, SDKs, and CloudFormation, letting teams manage topics using standard AWS tooling and IAM permissions.</li>
<li style="font-weight:400;">The MSK console now consolidates all topic operations in one interface with guided defaults for creating and updating topics. Users can configure properties like replication factor, partition count, retention policies, and cleanup settings while viewing comprehensive partition-level metrics and configuration details directly in the console.</li>
<li style="font-weight:400;">These capabilities are available at no additional cost for MSK provisioned clusters running Kafka version 3.6 and above across all regions where MSK is offered. Organizations need to configure appropriate IAM permissions to use the new APIs, with setup instructions available in the MSK Developer Guide.</li>
<li style="font-weight:400;">The update addresses a common operational pain point where teams previously had to maintain separate Kafka admin tooling outside the AWS ecosystem. This integration brings Kafka topic management into standard AWS workflows, improving consistency with existing infrastructure-as-code practices and centralized access control through IAM.</li>
</ul>
<p>40:47  Ryan – “I suspect this has more to do with Kafka than AWS because Kafka is notoriously hard to administer, so in a lot of cases there’s just not the ability…so I’m really happy to see this.”</p>
<p>42:40 <a href="https://aws.amazon.com/about-aws/whats-new/2026/02/amazon-bedrock-adds-support-six-open-weights-models">Amazon Bedrock adds support for six fully-managed open weights models</a></p>
<ul>
<li style="font-weight:400;"><a href="https://console.aws.amazon.com/bedrock/">Amazon Bedrock</a> now supports six new open weights models, including <a href="https://www.deepseek.com/en/">DeepSeek V3.2</a>, <a href="https://www.minimax.io/news/minimax-m21">MiniMax M2.1</a>, <a href="https://z.ai/blog/glm-4.7">GLM 4.7</a>, <a href="https://docs.z.ai/guides/llm/glm-4.7">GLM 4.7 Flash</a>, <a href="https://www.kimi.com/ai-models/kimi-k2-5">Kimi K2.5</a>, and <a href="https://qwen.ai/blog?id=qwen3">Qwen3</a> Coder Next, providing frontier-class performance at lower inference costs than proprietary alternatives. </li>
<li style="font-weight:400;">These models cover different enterprise needs from advanced reasoning and agentic tasks to autonomous coding with large output windows and lightweight production deployments.</li>
<li style="font-weight:400;">The models run on Project Mantle, a new distributed inference engine that accelerates model onboarding to Bedrock while providing serverless inference with quality of service controls and automated capacity management. Project Mantle includes native OpenAI API compatibility, allowing customers to switch from OpenAI endpoints without code changes.</li>
<li style="font-weight:400;">The addition of these open weights models gives AWS customers more flexibility in model selection based on specific workload requirements and cost constraints. </li>
<li style="font-weight:400;">DeepSeek V3.2 and Kimi K2.5 handle complex reasoning tasks, while GLM 4.7 and MiniMax 2.1 support coding workflows with extended context windows, and Qwen3 Coder Next and GLM 4.7 Flash offer cost-efficient options for high-volume production use.</li>
<li style="font-weight:400;">Project Mantle’s unified capacity pools and higher default quotas address common scaling challenges customers face when deploying large language models. </li>
<li style="font-weight:400;">The serverless architecture eliminates infrastructure management overhead, while the automated capacity management helps prevent quota limitations during peak usage periods.</li>
</ul>
<p>44:05  Matt – “I like how they made it all compatible with OpenAI. It’s kind of like S3 compatibility; I feel like we’re slowly kind of coming to a standard, which means you can go play with it and see which model makes sense.”</p>
<p>46:02 <a href="https://aws.amazon.com/about-aws/whats-new/2026/02/amazon-eks-auto-mode-enhanced-logging">Amazon EKS Auto Mode Announces Enhanced Logging for its Managed </a><a href="https://aws.amazon.com/about-aws/whats-new/2026/02/amazon-eks-auto-mode-enhanced-logging">Kubernetes Capabilities</a></p>
<ul>
<li style="font-weight:400;">EKS Auto Mode now integrates with CloudWatch Vended Logs to automatically collect logs from its managed Kubernetes capabilities, including compute autoscaling, block storage, load balancing, and pod networking. </li>
<li style="font-weight:400;">This gives customers centralized visibility into Auto Mode’s infrastructure management operations without manual configuration.</li>
<li style="font-weight:400;">The integration uses CloudWatch Vended Logs, which provides lower pricing than standard CloudWatch Logs while maintaining built-in AWS authentication and authorization. </li>
<li style="font-weight:400;">Customers can route logs to CloudWatch Logs, S3, or <a href="https://docs.aws.amazon.com/firehose/latest/dev/what-is-this-service.html">Kinesis Data Firehose</a>, depending on their retention and analysis requirements, with standard destination charges applying.</li>
<li style="font-weight:400;">Each Auto Mode capability can be configured independently as a log delivery source through CloudWatch APIs or the AWS Console. </li>
<li style="font-weight:400;">This granular control allows teams to monitor specific components like the Karpenter-based autoscaler or VPC CNI networking without collecting unnecessary log data.</li>
<li style="font-weight:400;">The feature addresses a common operational challenge where Auto Mode’s automated infrastructure management previously operated as a black box. DevOps teams can now troubleshoot issues like pod scheduling failures, storage provisioning problems, or load balancer configuration errors by examining the actual logs from Auto Mode’s control plane operations.</li>
<li style="font-weight:400;">Available immediately in all regions where EKS Auto Mode operates, this logging capability helps bridge the observability gap between customer workloads and AWS-managed Kubernetes infrastructure components.</li>
</ul>
<p>47:05  Justin – “All I have to say is, some lovely CloudWatch PM just made their bonus this year by turning this one, as this is a lot of logging context that you now need to parse and pay for.” </p>
<p>49:26 <a href="https://aws.amazon.com/about-aws/whats-new/2026/02/amazon-cloudwatch-alarm-muting-rules">AWS CloudWatch Alarm Mute Rules eliminate alert fatigue</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/AmazonCloudWatch/latest/monitoring/alarm-mute-rules.html">CloudWatch Alarm Mute Rules</a> let you temporarily silence alarm notifications during planned maintenance windows, deployments, or off-hours without disabling the underlying monitoring. </li>
<li style="font-weight:400;">The feature supports up to 100 alarms per rule with one-time or recurring schedules, and automatically triggers any suppressed actions once the mute period ends if the alarm state persists.</li>
<li style="font-weight:400;">This addresses a common operational pain point where teams either ignore alerts during maintenance windows or use risky script-based workarounds that can be forgotten and leave monitoring disabled. </li>
<li style="font-weight:400;">The native integration eliminates the need for custom automation to manage notification states during planned activities.</li>
<li style="font-weight:400;">The feature is available today across all AWS regions that support CloudWatch alarms at no additional cost beyond standard CloudWatch pricing. </li>
<li style="font-weight:400;">Configuration is done through the <a href="https://console.aws.amazon.com/cloudwatch/">CloudWatch console</a> or API, with support for all alarm states, including OK, ALARM, and INSUFFICIENT_DATA.</li>
<li style="font-weight:400;">Primary use cases include silencing non-critical alerts during scheduled deployments, muting development environment alarms outside business hours, and suppressing known issues during maintenance windows. </li>
<li style="font-weight:400;">This helps reduce alert fatigue while maintaining full visibility into system state and metrics collection.</li>
<li style="font-weight:400;">The automatic re-triggering of muted actions ensures teams don’t miss persistent issues that started during a mute window, providing a safety mechanism that manual notification management typically lacks.</li>
</ul>
<p>50:49  Ryan – “This is much nicer. Basically, set it for ignore for an hour and then have it kick back in. Glad to see this, but strange that it took this long.” </p>
<p>52:48 <a href="https://aws.amazon.com/about-aws/whats-new/2026/02/amazon-ec2-nested-virtualization-on-virtual">Amazon EC2 supports nested virtualization on virtual Amazon EC2 </a><a href="https://aws.amazon.com/about-aws/whats-new/2026/02/amazon-ec2-nested-virtualization-on-virtual">instances</a></p>
<ul>
<li style="font-weight:400;">AWS now supports nested virtualization on standard EC2 instances, not just bare metal, allowing customers to run KVM or Hyper-V hypervisors inside virtual machines. This expands flexibility for development and testing scenarios that previously required more expensive bare metal instances.</li>
<li style="font-weight:400;">The feature launches on the latest generation C8i, M8i, and R8i instance families across all commercial AWS regions. </li>
<li style="font-weight:400;">Customers can now run mobile app emulators, automotive hardware simulators, and Windows Subsystem for Linux on Windows workstations directly on virtual instances.</li>
<li style="font-weight:400;">This capability addresses a long-standing limitation where nested virtualization required bare metal instances, which carry higher costs and longer provisioning times compared to standard virtual instances. </li>
<li style="font-weight:400;">The change makes nested environments more accessible for development teams and testing workflows.</li>
<li style="font-weight:400;">Common use cases include software vendors who need to test their products across multiple operating systems, automotive companies simulating vehicle hardware environments, and mobile developers running Android or iOS emulators at scale. </li>
<li style="font-weight:400;">These workloads can now run on more cost-effective instance types with faster deployment.</li>
<li style="font-weight:400;">The feature requires enabling hardware virtualization extensions in the instance configuration, with full documentation available in the EC2 user guide. Pricing follows standard EC2 rates for the C8i, M8i, and R8i instance families without additional charges for the nested virtualization capability itself.</li>
</ul>
<p>54:13  Ryan – “These kinds of announcements are usually preceded or quickly followed with Nitro…and it’s neat. It’s neat how they isolate the hardware layer to match these workloads.” </p>
<p>54:50 <a href="https://aws.amazon.com/blogs/aws/announcing-amazon-sagemaker-inference-for-custom-amazon-nova-models/">Announcing Amazon SageMaker Inference for custom Amazon Nova </a><a href="https://aws.amazon.com/blogs/aws/announcing-amazon-sagemaker-inference-for-custom-amazon-nova-models/">models</a></p>
<ul>
<li style="font-weight:400;">AWS now lets customers deploy <a href="https://aws.amazon.com/blogs/aws/announcing-amazon-nova-customization-in-amazon-sagemaker-ai/">custom-trained Amazon Nova models on SageMaker Inference</a> with production-grade controls over instance types, auto-scaling, context length, and concurrency settings. </li>
<li style="font-weight:400;">This addresses customer requests for the same deployment flexibility they get with open-weight models, enabling full-rank customized <a href="https://aws.amazon.com/nova/">Nova</a> Micro, Nova Lite, and Nova 2 Lite models trained via <a href="https://aws.amazon.com/sagemaker/ai/deploy/">SageMaker</a> Training Jobs or HyperPod.</li>
<li style="font-weight:400;">The service reduces inference costs by supporting more cost-effective <a href="https://aws.amazon.com/ec2">EC2</a> <a href="https://aws.amazon.com/ec2/instance-types/g5/">G5</a> and <a href="https://aws.amazon.com/ec2/instance-types/g6/">G6</a> instances instead of requiring <a href="https://aws.amazon.com/ec2/instance-types/p5/">P5 instances</a>, with auto-scaling based on 5-minute usage patterns and configurable inference parameters. </li>
<li style="font-weight:400;">Customers pay only for compute instances used with per-hour billing and no minimum commitments, following standard SageMaker pricing.</li>
<li style="font-weight:400;">Deployment works through <a href="https://console.aws.amazon.com/sagemaker?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">SageMaker Studio</a> UI or <a href="https://docs.aws.amazon.com/sagemaker/latest/dg/api-and-sdk-reference-overview.html">SDK</a>, supporting both real-time streaming and asynchronous batch inference modes. 
The service includes advanced configuration options for context length up to 8000 tokens, max concurrency settings, and inference parameters like temperature and top-p for optimizing latency-cost-accuracy tradeoffs.</li>
<li style="font-weight:400;">Currently available in US East N. Virginia and the US West Oregon regions, with support for Nova models with reasoning capabilities. </li>
<li style="font-weight:400;">Instance type requirements vary by model size, with Nova Micro supporting g5.12xlarge and up, Nova Lite requiring g5.48xlarge minimum, and Nova 2 Lite needing p5.48xlarge instances.</li>
</ul>
<p>56:47  Ryan – “It’s not an open-source model, and so it is kind of crazy that Nova offers that customization.” </p>
<h2>GCP</h2>
<p>57:25  <a href="https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-deep-think/">Gemini 3 Deep Think: AI model update designed for science</a></p>
<ul>
<li style="font-weight:400;">Google has released a major update to <a href="https://blog.google/products-and-platforms/products/gemini/gemini-3/#gemini-3-deep-think">Gemini 3 Deep Think</a>, a specialized reasoning mode designed for complex scientific and engineering problems where data is messy or incomplete, and solutions aren’t straightforward. </li>
<li style="font-weight:400;">The model achieved notable benchmark results, including 48.4% on Humanity’s Last Exam, 84.6% on ARC-AGI-2, and gold medal performance on the 2025 International <a href="https://deepmind.google/blog/advanced-version-of-gemini-with-deep-think-officially-achieves-gold-medal-standard-at-the-international-mathematical-olympiad/">Math</a>, Physics, and Chemistry Olympiads.</li>
<li style="font-weight:400;">Early adopters are using Deep Think for practical applications like identifying logical flaws in peer-reviewed mathematics papers, optimizing semiconductor crystal growth fabrication methods, and converting sketches into 3D-printable files with generated code. </li>
<li style="font-weight:400;">The model combines deep scientific knowledge with engineering utility to move beyond theoretical work into applied research.</li>
<li style="font-weight:400;">The updated Deep Think is available now to Google AI Ultra subscribers through the Gemini app, with pricing following the existing Ultra subscription model. </li>
<li style="font-weight:400;">For the first time, Google is offering API access through an <a href="https://forms.gle/eEF5natXTQimPhYH9">early access program</a> for select researchers, engineers, and enterprises who can apply through a Google form.</li>
<li style="font-weight:400;">The release targets scientific research institutions and engineering teams working on complex problems in physics, chemistry, materials science, and advanced mathematics, where traditional AI models struggle with ambiguous requirements. </li>
<li style="font-weight:400;">Deep Think’s ability to work with incomplete data and generate executable code for physical modeling makes it particularly relevant for R&amp;D workflows.</li>
</ul>
<p>1:00:19 <a href="https://cloud.google.com/blog/products/data-analytics/new-global-queries-in-bigquery-span-data-from-multiple-regions/">New global queries in BigQuery span data from multiple regions</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/bigquery">BigQuery</a> global queries now allow users to run a single SQL statement across datasets stored in multiple geographic regions without requiring <a href="https://en.wikipedia.org/wiki/Extract,_transform,_load">ETL</a> pipelines or data replication. </li>
<li style="font-weight:400;">The feature automatically handles cross-region data movement in the background while respecting existing security controls like <a href="https://docs.cloud.google.com/vpc-service-controls/docs/overview">VPC Service Controls</a> and requiring explicit opt-in at both the project and user level.</li>
<li style="font-weight:400;">The primary use case targets multinational organizations that need to analyze distributed data for compliance or performance reasons, such as joining US customer data with European transaction logs and Asian operational data in one query. </li>
<li style="font-weight:400;">EssilorLuxottica is using this to perform cross-region aggregated analysis while maintaining data residency requirements for security and compliance. (DOES IT THOUGH?) </li>
<li style="font-weight:400;">Users maintain control over where queries execute and can specify the processing location to meet data residency requirements, though cross-region data transfers will incur additional egress costs that organizations need to factor into their analytics budgets. </li>
<li style="font-weight:400;">The feature is currently in preview with documentation available <a href="http://cloud.google.com/bigquery/docs/global-queries">here</a>.</li>
<li style="font-weight:400;">This addresses a longstanding limitation in cloud data warehousing, where geographic data distribution required complex engineering solutions, now replaced by standard SQL queries that any authorized analyst can run directly from the BigQuery console. The feature respects governance controls by default and prevents accidental data movement through required permissions and explicit enablement.</li>
</ul>
<p>1:01:36  Matt – “I feel l ike it is compliant… if you’re running local and you’re not collecting anything that could be confidential. So it depends on how your lawyer at your company interprets it.” </p>
<h2>Azure</h2>
<p>1:03:47 <a href="https://azure.microsoft.com/en-us/blog/agentic-cloud-operations-a-new-way-to-run-the-cloud/">Agentic cloud operations and Azure Copilot for AI‑driven workloads</a></p>
<ul>
<li style="font-weight:400;">Microsoft introduces <a href="https://techcommunity.microsoft.com/blog/AzureInfrastructureBlog/ushering-in-the-era-of-agentic-cloud-operations-with-azure-copilot/4469664">agentic cloud operations through Azure Copilot</a>, which uses AI agents to automate and coordinate cloud management tasks across the full infrastructure lifecycle. Instead of adding another dashboard, Azure Copilot provides a unified interface accessible through natural language, chat, console, or CLI that connects directly to a customer’s actual Azure environment, including subscriptions, resources, and policies.</li>
<li style="font-weight:400;">Azure Copilot includes six specialized agents that handle migration discovery and dependency mapping, deployment with infrastructure-as-code generation, continuous observability across the full stack, cost and performance optimization with carbon impact analysis, resiliency management including ransomware protection, and troubleshooting with root cause diagnosis. </li>
<li style="font-weight:400;">These agents work as a connected system rather than isolated tools, correlating signals and taking action within existing RBAC and policy controls.</li>
<li style="font-weight:400;">The service maintains governance through built-in oversight features, including Bring Your Own Storage for conversation history, which keeps operational data within the customer’s Azure environment for compliance and sovereignty requirements. </li>
<li style="font-weight:400;">All agent-initiated actions are reviewable, traceable, and auditable while respecting existing security policies and role-based access controls.</li>
<li style="font-weight:400;">Target customers are organizations running modern applications and AI workloads at scale, where traditional manual operations cannot keep pace with rapid deployment cycles and infrastructure changes. </li>
<li style="font-weight:400;">The approach addresses environments where workloads move from experimentation to production in weeks and where telemetry streams continuously from every layer of the stack.</li>
<li style="font-weight:400;">Pricing details were not disclosed in the announcement, though the service builds on existing Azure Copilot capabilities introduced at Microsoft Ignite. Organizations can access resources and get started at azure.microsoft.com/products/copilot.</li>
</ul>
<p>1:05:39  Matt – “Also, a developer actually understanding what they want and telling you what they want and actually being useful? I would love to see too, because how many times have we built something, deployed it, day before the release – we actually need these 16 other things that we didn’t tell you about that we manually did in our dev environment, which is why it’s working… and the release is tomorrow. Good luck. Why is it not done yet?”</p>
<p>1:06:18 <a href="https://azure.microsoft.com/en-us/updates?id=545784">General Availability: Instant access support for incremental snapshots of </a><a href="https://azure.microsoft.com/en-us/updates?id=545784">Azure Premium SSD v2 and Ultra Disk</a></p>
<ul>
<li style="font-weight:400;">Azure now offers instant access to incremental snapshots for <a href="https://learn.microsoft.com/en-us/azure/virtual-machines/disks-deploy-premium-v2?tabs=azure-cli#regional-availability">Premium SSD v2</a> and <a href="https://learn.microsoft.com/en-us/azure/virtual-machines/disks-enable-ultra-ssd?tabs=azure-portal#ga-scope-and-limitations">Ultra Disk storage</a>, eliminating the previous wait time when restoring disks from snapshots. </li>
<li style="font-weight:400;">This addresses a significant operational pain point for customers running high-performance workloads that require rapid disaster recovery or quick environment provisioning.</li>
<li style="font-weight:400;">The feature specifically targets enterprise customers using Azure’s highest-tier storage options, Premium SSD v2 and Ultra Disk, which are typically deployed for mission-critical databases, SAP HANA, and other latency-sensitive applications. </li>
<li style="font-weight:400;">Previously, customers had to wait for snapshot data to fully hydrate before using restored disks, creating delays in recovery scenarios.</li>
<li style="font-weight:400;">Incremental snapshots only capture changes since the last snapshot, reducing storage costs and backup windows compared to full snapshots. </li>
<li style="font-weight:400;">With instant access now available, customers can immediately mount and use restored disks while background hydration completes, improving recovery time objectives for business continuity planning.</li>
<li style="font-weight:400;">This capability brings Premium SSD v2 and Ultra Disk snapshot functionality closer to parity with standard Azure managed disk snapshots. </li>
<li style="font-weight:400;">The feature is now generally available across Azure regions where Premium SSD v2 and Ultra Disk are supported, though specific pricing for snapshot storage follows existing Azure snapshot pricing models based on stored data volume.</li>
</ul>
<p>1:06:25  Justin – “Welcome to what Amazon and Google have been doing for quite a while, so thanks, Azure! </p>
<h2>Emerging Clouds </h2>
<p>1:08:16 <a href="https://www.crusoe.ai/resources/blog/introducing-the-crusoe-cloud-mcp-server">Introducing the Crusoe Cloud MCP server</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.crusoe.ai/cloud">Crusoe Cloud</a> released an MCP server that connects AI coding assistants like Claude Code and Cursor directly to cloud infrastructure, but unlike typical API wrappers, it returns filtered responses designed specifically for LLM consumption to avoid flooding context windows with unnecessary data. </li>
<li style="font-weight:400;">The server includes composite tools like get_resource_relationships that map entire infrastructure topologies in a single call by fetching 11 resource types in parallel and resolving cross-references, something that doesn’t exist in their CLI or any single API endpoint.</li>
<li style="font-weight:400;">The cluster_health_check tool provides pre-analyzed node-level health metrics organized by InfiniBand pod placement, returning structured summaries with problem nodes flagged rather than raw metric time series that would require additional processing. </li>
<li style="font-weight:400;">This approach addresses a key limitation of AI agents working with cloud infrastructure: most MCP implementations just wrap CLI commands and return the same JSON a human would see, forcing the AI to parse through irrelevant metadata and empty fields.</li>
<li style="font-weight:400;">The implementation reflects a broader trend of cloud providers releasing MCP servers, but Crusoe’s focus on response filtering and burst-heavy access patterns specific to AI agents suggests infrastructure management tools are being redesigned around LLM capabilities rather than human interaction patterns. For developers already using AI coding assistants, this enables natural language infrastructure queries and troubleshooting without manual scripting or console navigation.</li>
</ul>
<p>1:10:16  Ryan – “This is gonna be chaos.” </p>
<p>1:10:21 <a href="https://blog.cloudflare.com/markdown-for-agents/">Introducing Markdown for Agents</a></p>
<ul>
<li style="font-weight:400;">Cloudflare now automatically converts HTML to <a href="https://en.wikipedia.org/wiki/Markdown">markdown</a> for AI agents using content negotiation headers, reducing token usage by up to 80 percent. </li>
<li style="font-weight:400;">When agents request pages with Accept: text/markdown, Cloudflare’s network performs real-time conversion at the edge, eliminating the need for downstream processing and reducing costs for AI systems.</li>
<li style="font-weight:400;">The feature addresses a fundamental inefficiency where AI agents waste tokens parsing HTML markup, navigation elements, and styling that have no semantic value. </li>
<li style="font-weight:400;">A simple heading that costs 3 tokens in markdown can consume 12-15 tokens in HTML, and this blog post example shows 16,180 tokens in HTML versus 3,150 in markdown.</li>
<li style="font-weight:400;">Cloudflare includes an x-markdown-tokens header with converted responses to help developers calculate context window sizes and chunking strategies. The service also automatically adds <a href="https://contentsignals.org/">Content-Signal</a> headers indicating the content can be used for AI training, search results, and agentic use, integrating with their Content Signals framework from Birthday Week.</li>
<li style="font-weight:400;">The feature is available in beta at no cost for Pro, Business, and Enterprise plans, with Cloudflare already enabling it on their own blog and developer documentation. </li>
<li style="font-weight:400;">Popular coding agents like Claude Code and OpenCode already send the appropriate accept headers, positioning this as infrastructure for the shift from traditional SEO to AI-driven content discovery.</li>
<li style="font-weight:400;">Cloudflare Radar now tracks content type distribution for AI bot traffic, allowing analysis of how different agents consume web content over time. This data is accessible through public APIs and shows early adoption patterns like OAI-Searchbot requesting markdown content.</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2374104/c1e-60w0c7w00gfj21d4-47o3ko8zc52r-r7ltqs.mp3" length="137900235"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 343 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Matt are in the studio this week bringing you all the latest in Cloud and AI news, including some of the smaller clouds like Cloudflare and Crusoe Cloud, as well as announcements from the big guys like Google’s Gemini DeepThink, Anthropic’s big pay day, and Microsoft’s Notepad problem. We’ve got all this plus Matt screwing up his outro AGAIN, so let’s get started! 
Titles we almost went with this week

 Chrome’s WebMCP Protocol: Teaching AI Agents to Stop Doom-Scrolling the DOM and Actually Get Work Done
Claude Enterprise Self-Service: Because Sometimes You Just Want to Buy AI Without Small Talk
 AWS EC2 Goes Inception Mode: Now You Can Virtualize Your Virtualization Without Going Broke
 Amazon EC2 Nested Virtualization: Because Your Virtual Machine Was Lonely and Needed Its Own Virtual Machine
 CloudWatch Alarm Mute Rules: Because Your Deployment Doesn’t Need a Standing    Ovation at 3 AM
 Anthropic’s $380 Billion Valuation Proves AI Funding Has Gone Claude Nine 
AWS EC2 Nested Virtualization Finally Escapes the Expensive Hardware Jail
 Cloudflare Teaches AI Agents the Magic Words: Accept text/markdown and Save 13,000 Tokens
 Crusoe Cloud’s MCP Server: Teaching AI Assistants to Stop Asking for the Manager and Just Fix Your Infrastructure
 Azure’s New Agentic Copilot: Because Manually Clicking Through Dashboards Was So 2023
 Chrome’s WebMCP Gives AI Agents a GPS for Websites Because Apparently They’ve Been Lost in the HTML This Whole Time 
 Anthropic Cuts Out the Middleman: Claude Enterprise Now Available Without the Enterprise Sales Dance
 AWS Gives CloudWatch the Silent Treatment: New Mute Rules Let Alarms Sleep Through Maintenance Windows
 AWS CloudWatch Hits Snooze: Mute Rules End On-Call Nightmares
 AWS Gives CloudWatch the Silent Treatment

General News 
00:45 Bloat Risk? Microsoft’s Notepad Upgrade Also Introduced a Vulnerability | PCMag

Microsoft’s recent Notepad modernization introduced CVE-2026-20841, a vulnerability in the new Markdown support feature that allows malicious links in files to execute remote code. 
The flaw has been patched in the February 2026 security updates, but it highlights the security trade-offs when adding features to historically simple applications.
The vulnerability exploits Notepad’s Markdown rendering capability, which Microsoft added in May to support lightweight markup language formatting. When Notepad opens a specially crafted Markdown file, embedded malicious links can trigger unverified protocols that load and execute remote files on the system.
This incident raises questions about feature bloat in core Windows utilities, particularly as Microsoft continues adding network-dependent capabilities like AI-powered text writing to Notepad. Security researchers are debating whether basic text editors should have network functionality at all, given the expanded attack surface.
The vulnerability demonstrates how modernization efforts can introduce security risks in previously low-risk applications. 
Organizations using Windows need to ensure t...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2374104/c1a-k5d5-ww7rx4dvadpd-rwdbof.jpg"></itunes:image>
                                                                            <itunes:duration>01:11:34</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2374104/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[342: Eight Minutes to Midnight: When AI Helps Hackers Speed Run Your AWS Account]]>
                </title>
                <pubDate>Wed, 18 Feb 2026 19:37:13 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2367231</guid>
                                    <link>https://tcpfm.castos.com/episodes/342-eight-minutes-to-midnight-when-ai-helps-hackers-speed-run-your-aws-account</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 342 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Matt are in the studio today to bring you all the latest in cloud and AI news this week. How do you feel about ads? How do you feel about ads while using AI? We’ve got options! We’ve got a round-up of tech Super Bowl ads, AI ads, Earnings reports (who frankly need the ad revenue), and a plethora of Opus 4.6 announcements, plus more. Let’s get started! </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> ChatGPT Goes Full Mad Men: Your AI Assistant Now Comes With Commercial Breaks</li>
<li> Heroku’s New Feature: No New Features</li>
<li> AWS Gives EC2 Instances a Storage Growth Spurt: 22.8TB of Local NVMe Now Available</li>
<li> Identity Crisis Averted: IAM Identity Center Learns to Replicate Itself</li>
<li> JSON Schema Enforcement: Because Your LLM Needs Structure in Its Life</li>
<li> From Zero to Admin in 480 Seconds: A Serbian Speedrun Story</li>
<li> From Proof of Concept to Proof of Claw: DigitalOcean Tames AI Agent Infrastructure</li>
<li> Azure’s Growth Hits the Clouds: Microsoft’s 39% Increase Still Not Enough for Wall Street</li>
<li> One Lake to Rule Them All: Microsoft and Snowflake Finally Stop Fighting Over Your Data</li>
<li> Free Lunch Officially Over: ChatGPT Learns That Servers Cost Money</li>
<li> Claude Won’t Sell You Anything (Except Maybe Peace of Mind)</li>
<li> IAM Identity Center Goes Multi-Regional: Because One Region to Rule Them All Wasn’t Enough</li>
<li> Databricks Takes the Base Out of Database with Lakebase GA</li>
<li> I’m a Chrome Tab hoarder</li>
</ul>
<h2>General News </h2>
<p>01:30 Superbowl Ads of Note</p>
<ul>
<li style="font-weight:400;">OpenAI: <a href="https://www.youtube.com/watch?v=aCN9iCXNJqQ">https://www.youtube.com/watch?v=aCN9iCXNJqQ</a></li>
<li style="font-weight:400;">Microsoft CoPilot: <a href="https://www.youtube.com/watch?v=Ndj9Jk-tGKo">https://www.youtube.com/watch?v=Ndj9Jk-tGKo</a></li>
<li style="font-weight:400;">Base44?: <a href="https://www.youtube.com/watch?v=iKEUWtqvsis">https://www.youtube.com/watch?v=iKEUWtqvsis</a> </li>
<li style="font-weight:400;">Gemini: <a href="https://www.youtube.com/watch?v=Z1yGy9fELtE">https://www.youtube.com/watch?v=Z1yGy9fELtE</a></li>
<li style="font-weight:400;">Anthropic: <a href="https://www.youtube.com/watch?v=gmnjDLwZckA">https://www.youtube.com/watch?v=gmnjDLwZckA</a> </li>
<li style="font-weight:400;"><a href="http://ai.com">ai.com</a>: <a href="https://www.youtube.com/watch?v=n7I-D4YXbzg&amp;t=3s">https://www.youtube.com/watch?v=n7I-D4YXbzg&amp;t=3s</a></li>
</ul>
<p>16:35 Justin -If you ever want to knowif there’s a bubble, spending dumb money on the Super Bowl on an ad that makes no sense is probably your number one clue.” </p>
<p>16:53 It’s Earnings Time!</p>
<p><a href="https://www.cnbc.com/2026/01/28/microsoft-msft-q2-earnings-report-2026.html">Microsoft (MSFT) Q2 earnings report 2026</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.cnbc.com/quotes/MSFT/">Microsoft</a> Q2 2026 earnings show <a href="https://portal.azure.com/">Azure</a> cloud growth slowing to 39% from 40% in the prior quarter, missing analyst expectations of 39.4% and causing shares to drop 7% in after-hours trading. </li>
<li style="font-weight:400;">The company’s gross margin hit a three-year low at 68% due to substantial AI infrastructure investments totaling $37.5 billion in capital expenditures, up 66% year over year.</li>
<li style="font-weight:400;"><a href="https://www.cnbc.com/2025/10/28/open-ai-for-profit-microsoft.html">OpenAI</a> now represents 45% of Microsoft’s $625 billion remaining commercial performance obligation after the company committed to a $250 billion cloud services deal during the quarter. </li>
<li style="font-weight:400;">This concentration raises questions about revenue dependence on a single customer, though Microsoft maintains that the remaining backlog is still larger and more diversified than most compet...</li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:00) - Cloud Pod: Speed Run Your AWS Account</li><li>(00:01:27) - Super Bowl LI Commercials</li><li>(00:02:08) - The Super Bowl Commercials</li><li>(00:04:40) - 15 Dumb Apps Built With No Code</li><li>(00:06:30) - Top 10 Ads Using AI</li><li>(00:07:40) - OpenAI vs Anthropic: The Chat</li><li>(00:12:42) - A AI Startup Spends $70 Million On A Dumb Ad</li><li>(00:15:50) - Microsoft Earnings: Down 7%</li><li>(00:19:19) - Google Cloud Earnings Beat Estimates</li><li>(00:21:39) - Amazon's 200 Billion Investment Plan for Cloud Infrastructure</li><li>(00:28:04) - Heroku to Become a Sustaining Engineering Model</li><li>(00:31:32) -  AWS Security: The Last Minute Attack</li><li>(00:35:28) - Cloud Business Model: How ML Makes Money</li><li>(00:44:15) - OpenAI GPT5.3 Codex</li><li>(00:46:21) - Facebook Testing Adverts on Free and Go Tier Users</li><li>(00:47:18) - Claude Opus 4.6 on Cloud, More</li><li>(00:48:17) - Snowflake and Databricks: Supervisor Agent</li><li>(00:49:38) - HashiCorp Launches Agent Skills Pack</li><li>(00:53:22) - Amazon's New massively big C8ID and R8ID Inst</li><li>(00:55:49) - AWS IAM Identity Center: Multi Region Replication</li><li>(00:59:00) -  JSON Schema Compliance in Bedrock</li><li>(01:00:35) - Amazon Redshift: Automatic Optimization now in place</li><li>(01:02:05) - Google Cloud: Developer Knowledge API & MCP Server</li><li>(01:06:27) - Bolt 2.8 in Python vs. Google Docs</li><li>(01:08:49) - Google Cloud Expands Sovereign Cloud Portfolio</li><li>(01:09:44) - Google's Gemini Enterprise Agent Ready Program</li><li>(01:11:43) - Charlie Bell Retires as EVP of Security and Focus on Quality</li><li>(01:17:40) - Azure Database for PostgreSQL at Ignite 2019</li><li>(01:19:55) - Microsoft OneLake & Snowflake: Bi directional Iceberg Tables</li><li>(01:21:58) - Azure Container Storage 2.10: Native elastic SAN Integration with</li><li>(01:23:22) - SQLCon 2018</li><li>(01:24:51) - This Week in the Cloud: Earnings</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 342 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Matt are in the studio today to bring you all the latest in cloud and AI news this week. How do you feel about ads? How do you feel about ads while using AI? We’ve got options! We’ve got a round-up of tech Super Bowl ads, AI ads, Earnings reports (who frankly need the ad revenue), and a plethora of Opus 4.6 announcements, plus more. Let’s get started! 
Titles we almost went with this week

 ChatGPT Goes Full Mad Men: Your AI Assistant Now Comes With Commercial Breaks
 Heroku’s New Feature: No New Features
 AWS Gives EC2 Instances a Storage Growth Spurt: 22.8TB of Local NVMe Now Available
 Identity Crisis Averted: IAM Identity Center Learns to Replicate Itself
 JSON Schema Enforcement: Because Your LLM Needs Structure in Its Life
 From Zero to Admin in 480 Seconds: A Serbian Speedrun Story
 From Proof of Concept to Proof of Claw: DigitalOcean Tames AI Agent Infrastructure
 Azure’s Growth Hits the Clouds: Microsoft’s 39% Increase Still Not Enough for Wall Street
 One Lake to Rule Them All: Microsoft and Snowflake Finally Stop Fighting Over Your Data
 Free Lunch Officially Over: ChatGPT Learns That Servers Cost Money
 Claude Won’t Sell You Anything (Except Maybe Peace of Mind)
 IAM Identity Center Goes Multi-Regional: Because One Region to Rule Them All Wasn’t Enough
 Databricks Takes the Base Out of Database with Lakebase GA
 I’m a Chrome Tab hoarder

General News 
01:30 Superbowl Ads of Note

OpenAI: https://www.youtube.com/watch?v=aCN9iCXNJqQ
Microsoft CoPilot: https://www.youtube.com/watch?v=Ndj9Jk-tGKo
Base44?: https://www.youtube.com/watch?v=iKEUWtqvsis 
Gemini: https://www.youtube.com/watch?v=Z1yGy9fELtE
Anthropic: https://www.youtube.com/watch?v=gmnjDLwZckA 
ai.com: https://www.youtube.com/watch?v=n7I-D4YXbzg&t=3s

16:35 Justin -If you ever want to knowif there’s a bubble, spending dumb money on the Super Bowl on an ad that makes no sense is probably your number one clue.” 
16:53 It’s Earnings Time!
Microsoft (MSFT) Q2 earnings report 2026

Microsoft Q2 2026 earnings show Azure cloud growth slowing to 39% from 40% in the prior quarter, missing analyst expectations of 39.4% and causing shares to drop 7% in after-hours trading. 
The company’s gross margin hit a three-year low at 68% due to substantial AI infrastructure investments totaling $37.5 billion in capital expenditures, up 66% year over year.
OpenAI now represents 45% of Microsoft’s $625 billion remaining commercial performance obligation after the company committed to a $250 billion cloud services deal during the quarter. 
This concentration raises questions about revenue dependence on a single customer, though Microsoft maintains that the remaining backlog is still larger and more diversified than most compet...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[342: Eight Minutes to Midnight: When AI Helps Hackers Speed Run Your AWS Account]]>
                </itunes:title>
                                    <itunes:episode>342</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 342 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Matt are in the studio today to bring you all the latest in cloud and AI news this week. How do you feel about ads? How do you feel about ads while using AI? We’ve got options! We’ve got a round-up of tech Super Bowl ads, AI ads, Earnings reports (who frankly need the ad revenue), and a plethora of Opus 4.6 announcements, plus more. Let’s get started! </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> ChatGPT Goes Full Mad Men: Your AI Assistant Now Comes With Commercial Breaks</li>
<li> Heroku’s New Feature: No New Features</li>
<li> AWS Gives EC2 Instances a Storage Growth Spurt: 22.8TB of Local NVMe Now Available</li>
<li> Identity Crisis Averted: IAM Identity Center Learns to Replicate Itself</li>
<li> JSON Schema Enforcement: Because Your LLM Needs Structure in Its Life</li>
<li> From Zero to Admin in 480 Seconds: A Serbian Speedrun Story</li>
<li> From Proof of Concept to Proof of Claw: DigitalOcean Tames AI Agent Infrastructure</li>
<li> Azure’s Growth Hits the Clouds: Microsoft’s 39% Increase Still Not Enough for Wall Street</li>
<li> One Lake to Rule Them All: Microsoft and Snowflake Finally Stop Fighting Over Your Data</li>
<li> Free Lunch Officially Over: ChatGPT Learns That Servers Cost Money</li>
<li> Claude Won’t Sell You Anything (Except Maybe Peace of Mind)</li>
<li> IAM Identity Center Goes Multi-Regional: Because One Region to Rule Them All Wasn’t Enough</li>
<li> Databricks Takes the Base Out of Database with Lakebase GA</li>
<li> I’m a Chrome Tab hoarder</li>
</ul>
<h2>General News </h2>
<p>01:30 Superbowl Ads of Note</p>
<ul>
<li style="font-weight:400;">OpenAI: <a href="https://www.youtube.com/watch?v=aCN9iCXNJqQ">https://www.youtube.com/watch?v=aCN9iCXNJqQ</a></li>
<li style="font-weight:400;">Microsoft CoPilot: <a href="https://www.youtube.com/watch?v=Ndj9Jk-tGKo">https://www.youtube.com/watch?v=Ndj9Jk-tGKo</a></li>
<li style="font-weight:400;">Base44?: <a href="https://www.youtube.com/watch?v=iKEUWtqvsis">https://www.youtube.com/watch?v=iKEUWtqvsis</a> </li>
<li style="font-weight:400;">Gemini: <a href="https://www.youtube.com/watch?v=Z1yGy9fELtE">https://www.youtube.com/watch?v=Z1yGy9fELtE</a></li>
<li style="font-weight:400;">Anthropic: <a href="https://www.youtube.com/watch?v=gmnjDLwZckA">https://www.youtube.com/watch?v=gmnjDLwZckA</a> </li>
<li style="font-weight:400;"><a href="http://ai.com">ai.com</a>: <a href="https://www.youtube.com/watch?v=n7I-D4YXbzg&amp;t=3s">https://www.youtube.com/watch?v=n7I-D4YXbzg&amp;t=3s</a></li>
</ul>
<p>16:35 Justin -If you ever want to knowif there’s a bubble, spending dumb money on the Super Bowl on an ad that makes no sense is probably your number one clue.” </p>
<p>16:53 It’s Earnings Time!</p>
<p><a href="https://www.cnbc.com/2026/01/28/microsoft-msft-q2-earnings-report-2026.html">Microsoft (MSFT) Q2 earnings report 2026</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.cnbc.com/quotes/MSFT/">Microsoft</a> Q2 2026 earnings show <a href="https://portal.azure.com/">Azure</a> cloud growth slowing to 39% from 40% in the prior quarter, missing analyst expectations of 39.4% and causing shares to drop 7% in after-hours trading. </li>
<li style="font-weight:400;">The company’s gross margin hit a three-year low at 68% due to substantial AI infrastructure investments totaling $37.5 billion in capital expenditures, up 66% year over year.</li>
<li style="font-weight:400;"><a href="https://www.cnbc.com/2025/10/28/open-ai-for-profit-microsoft.html">OpenAI</a> now represents 45% of Microsoft’s $625 billion remaining commercial performance obligation after the company committed to a $250 billion cloud services deal during the quarter. </li>
<li style="font-weight:400;">This concentration raises questions about revenue dependence on a single customer, though Microsoft maintains that the remaining backlog is still larger and more diversified than most competitors, with 28% growth.</li>
<li style="font-weight:400;"><a href="https://m365.cloud.microsoft/">Microsoft 365 Copilo</a>t adoption reached 15 million seats out of 450 million total paid commercial seats, representing only 3.3% penetration. </li>
<li style="font-weight:400;">The company plans to <a href="https://www.cnbc.com/2025/12/04/microsoft-will-raise-prices-of-commercial-office-bundles-in-july-.html">raise prices</a> on commercial Office subscriptions in July to help offset AI infrastructure costs and improve margins, while Q3 guidance projects Azure growth of 37-38% at constant currency.</li>
<li style="font-weight:400;">The <a href="https://www.microsoft.com/en-us/Investor/earnings/FY-2026-Q1/more-personal-computing-performance">More Personal Computing</a> segment declined 3%, with gaming revenue down 9.5% due to an unspecified impairment charge, reflecting ongoing challenges in the <a href="https://www.xbox.com/">Xbox</a> division. </li>
<li style="font-weight:400;">Microsoft added nearly one gigawatt of data center capacity in the quarter alone, but continues to face supply constraints that cannot keep pace with customer demand for AI services. </li>
</ul>
<p>20:27 <a href="https://www.cnbc.com/2026/02/04/alphabet-googl-q4-2025-earnings.html">Alphabet (GOOGL) Q4 2025 earnings</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.cnbc.com/quotes/GOOG/">Alphabet</a> plans to spend between $175 billion and $185 billion on capital expenditures in 2026, more than double its 2025 spending, primarily targeting AI compute capacity for <a href="https://deepmind.google/">DeepMind</a> and meeting cloud customer demand. </li>
<li style="font-weight:400;">This represents one of the largest infrastructure investments in tech history and signals the scale of resources required to compete in enterprise AI.</li>
<li style="font-weight:400;"><a href="https://cloud.google.com/">Google Cloud</a> revenue grew 48% year-over-year to $17.66 billion and beat analyst expectations, with backlog reaching $240 billion after increasing 55% sequentially. </li>
<li style="font-weight:400;">The cloud division’s performance demonstrates strong enterprise adoption of Google’s AI services and positions it as a more competitive alternative to AWS and Azure.</li>
<li style="font-weight:400;"><a href="https://gemini.google.com/">Gemini AI</a> now has 750 million monthly active users, up from 650 million last quarter, while Google reduced Gemini serving costs by 78% throughout 2025 through model optimizations and efficiency improvements. 
<ul>
<li style="font-weight:400;">This cost reduction is critical for maintaining profitability as AI services scale to hundreds of millions of users.</li>
</ul>
</li>
<li style="font-weight:400;"><a href="http://youtube.com">YouTube</a> advertising revenue of $11.38 billion missed analyst expectations of $11.84 billion, which Alphabet attributed to difficult year-over-year comparisons against strong US election spending in Q4 2024. 
<ul>
<li style="font-weight:400;">This shortfall highlights how political advertising cycles create volatility in digital ad revenue forecasting.</li>
</ul>
</li>
<li style="font-weight:400;"><a href="https://waymo.com/">Waymo</a> recorded a $2.1 billion stock-based compensation charge following its $16 billion valuation fundraising round, contributing to Other Bets losses exceeding $3.6 billion despite serving 15 million autonomous rides across six US markets. 
<ul>
<li style="font-weight:400;">The charge reflects the high cost of retaining talent in competitive autonomous vehicle development.</li>
</ul>
</li>
</ul>
<p>22:05  Justin – “Gemini adoption must be ramping up much faster than I realized, because the fact that Microsoft was missing on earnings, and they’re the OpenAI provider for the most part… makes me question how well OpenAI is actually doing.”   </p>
<p>22:50 <a href="https://www.cnbc.com/2026/02/05/aws-q4-earnings-report-2025.html">AWS Q4 earnings report 2025</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.cnbc.com/quotes/AMZN/">AWS</a> Q4 2025 revenue reached $35.58 billion with 24% year-over-year growth, maintaining its market leadership position, while operating margins improved to 35%. </li>
<li style="font-weight:400;">The cloud unit now represents 17% of <a href="https://www.cnbc.com/2026/02/05/amazon-amzn-q4-earnings-report-2025.html">Amazon’s total revenue</a> but generates the majority of the company’s profits at $12.47 billion in operating income.</li>
<li style="font-weight:400;">Amazon plans to invest $200 billion in capital expenditures for 2026, primarily for AWS infrastructure, which significantly exceeds analyst expectations of $148.86 billion. </li>
<li style="font-weight:400;">The company added 4 gigawatts of computing capacity in 2025 and plans to double that by the end of 2027, with most investment directed toward AI workloads rather than traditional cloud services.</li>
<li style="font-weight:400;">AWS growth rate of 24% trails competitors Google Cloud at 48% and Azure at 39%, suggesting potential market share shifts in AI-driven cloud services. Both competitors are reporting stronger growth attributed to artificial intelligence workloads, which may indicate AWS is losing ground in the AI infrastructure race despite its overall market leadership.</li>
<li style="font-weight:400;">The company secured a $38 billion spending commitment from OpenAI and launched <a href="https://www.cnbc.com/2025/12/02/amazon-nova-forge-lets-clients-customize-ai-models-for-100000-a-year.html">Nova Forge</a> for advanced AI model customization at $100,000 annually. </li>
<li style="font-weight:400;">These moves demonstrate AWS’s strategy to compete in the generative AI training market, though the pricing and approach differ from competitors’ offerings.</li>
<li style="font-weight:400;">Capital expenditure guidance reveals that non-AI workloads are growing faster than anticipated, requiring additional infrastructure investment beyond AI capacity. </li>
<li style="font-weight:400;">This indicates traditional cloud computing demand remains strong and may be underestimated in current market analysis focused primarily on AI growth.</li>
</ul>
<p>25:11 Capex Growth By Quarter </p>
<p>24:14  Justin – “They also took a major write-off on Amazon Fresh, because they’re shutting that down as well. So just bad, bad all the way around for Amazon.”  </p>
<p>29:23 <a href="https://www.heroku.com/blog/an-update-on-heroku/">An Update on Heroku</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.heroku.com/">Heroku</a> is moving to a sustaining engineering model, meaning no new features will be developed while the platform continues to receive security patches, stability updates, and operational support. </li>
<li style="font-weight:400;">This represents a shift from active development to maintenance mode for the 15-year-old platform-as-a-service.</li>
<li style="font-weight:400;">Existing customers can continue using Heroku with no changes to pricing, billing, or service levels, and all core functionality, including applications, pipelines, teams, and add-ons, remains fully operational. </li>
<li style="font-weight:400;">Credit card-based accounts remain available for both current and new customers through the dashboard.</li>
<li style="font-weight:400;">Salesforce is ending new Enterprise Account contracts while honoring existing enterprise subscriptions and support agreements through their renewal periods. This signals a strategic pivot away from enterprise sales expansion while maintaining commitments to current large customers.</li>
<li style="font-weight:400;">The parent company is redirecting engineering resources toward enterprise AI capabilities rather than continuing platform-as-a-service innovation. This follows a pattern of <a href="https://www.salesforce.com/">Salesforce</a> deprioritizing Heroku since the acquisition, including the 2022 elimination of free tiers and reduced feature velocity in recent years.</li>
<li style="font-weight:400;">Developers relying on Heroku for production workloads should evaluate long-term platform viability given the maintenance-only status, though no immediate migration is required. </li>
<li style="font-weight:400;">The announcement provides clarity for capacity planning but raises questions about the platform’s competitiveness as cloud-native alternatives continue advancing.</li>
</ul>
<p>31:32  Matt – “It’s a great platform as a service, and I’m sad to see it go, because there’s a lot of companies I’ve worked with in the past that have started there because it was just so easy. The problem for them, at least back in the day, was scaling and supporting and having a lot of other features, which meant I helped a lot of customers move from Heroku to AWS to gain other aspects of the platform that they needed. So it doesn’t really surprise me, but it was a good starting point for a lot of companies.”  </p>
<p>35:58 <a href="https://www.sysdig.com/blog/ai-assisted-cloud-intrusion-achieves-admin-access-in-8-minutes">AI-assisted cloud intrusion achieves admin access in 8 minutes | Sysdig</a></p>
<ul>
<li style="font-weight:400;">An attacker achieved full AWS administrative access in just 8 minutes by exploiting credentials found in public S3 buckets, then used Lambda code injection to escalate privileges. </li>
<li style="font-weight:400;">The attack shows strong evidence of LLM assistance, including Serbian-language code comments, hallucinated AWS account IDs, and references to non-existent GitHub repositories.</li>
<li style="font-weight:400;">The threat actor compromised 19 different AWS principals through role chaining and cross-account access attempts, making detection difficult by distributing operations across multiple identities. They specifically targeted AI infrastructure by invoking 9 different Bedrock models and attempting to launch expensive GPU instances (p5.48xlarge and p4d.24xlarge) for potential model training or compute resale.</li>
<li style="font-weight:400;">The attack demonstrates how AI tools are accelerating offensive operations, with the attacker completing reconnaissance, privilege escalation, and resource abuse in under two hours. </li>
<li style="font-weight:400;">Organizations should implement least-privilege IAM policies, restrict Lambda UpdateFunctionCode permissions, and enable <a href="https://aws.amazon.com/bedrock/">Bedrock</a> model invocation logging to detect similar attacks.</li>
<li style="font-weight:400;">Critical security gaps included overly permissive Lambda execution roles with administrative access and the ReadOnlyAccess policy on the compromised user, which enabled extensive reconnaissance across all AWS services. </li>
<li style="font-weight:400;">The attacker also attempted to deploy a Terraform-based backdoor that would create a publicly accessible Lambda function for generating persistent Bedrock credentials.</li>
<li style="font-weight:400;">The use of IP rotation, <a href="https://docs.aws.amazon.com/IAM/latest/UserGuide/id_roles.html#id_roles_terms-and-concepts">role chaining</a>, and distributed operations across multiple principals shows sophisticated evasion techniques. </li>
<li style="font-weight:400;">Detection requires behavioral analytics that can identify patterns like rapid enumeration across services, unusual Bedrock model invocations, and Lambda code modifications rather than relying on single-event alerts.</li>
</ul>
<p>34:24  Ryan – “These are the types of examples I use when trying to talk to people about least privileged development and how, even in your lower environments where you think you’re safe, and you’re trying to develop things it’s really not okay to start not using least privileged access because there’s very creative ways in which you can do privilege escalation – this lambda attack is a very good example. And now it’s going to be so easy because AI will just do it for you, and this really demonstrates it.”</p>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>37:09 <a href="https://www.anthropic.com/news/claude-is-a-space-to-think">Claude is a space to think | Anthropic \ Anthropic</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/">Anthropic</a> commits to keeping <a href="https://claude.ai/new">Claude</a> ad-free, stating that advertising would be incompatible with Claude’s role as a trusted assistant for work and deep thinking. </li>
<li style="font-weight:400;">The company will continue its subscription and enterprise-based revenue model rather than introducing sponsored content or product placements in conversations.</li>
<li style="font-weight:400;">Analysis of Claude conversations shows a substantial portion involves sensitive personal topics or complex technical work where ads would be inappropriate. Anthropic argues that AI conversations differ from search or social media because users share more context, and the open-ended format makes them more susceptible to commercial influence.</li>
<li style="font-weight:400;">The company identifies specific risks with ad-supported AI models, including unpredictable behavior changes when advertising incentives are introduced. For example, a user asking about sleep problems might receive recommendations influenced by commercial motives rather than purely helpful advice, making it difficult to distinguish genuine assistance from monetization attempts.</li>
<li style="font-weight:400;">Anthropic will support commerce through user-initiated interactions like agentic commerce, where Claude handles purchases on behalf of users, and third-party tool integrations with services like <a href="https://www.figma.com/">Figma</a> and <a href="https://asana.com/?noredirect">Asana</a>. </li>
<li style="font-weight:400;">The key distinction is that these features are triggered by user requests rather than advertiser interests.</li>
<li style="font-weight:400;">The decision has clear tradeoffs for business model scalability compared to ad-supported competitors. </li>
<li style="font-weight:400;">Anthropic is addressing access through educational partnerships in 60+ countries, nonprofit discounts, and maintaining frontier-level intelligence in free tiers rather than monetizing user attention.</li>
</ul>
<p>37:22 <a href="https://www.anthropic.com/news/claude-opus-4-6">Claude Opus 4.6 \ Anthropic</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/news/claude-opus-4-6">Claude Opus 4.6</a> is now generally available with a 1M token context window in beta, marking the first time an Opus-class model has offered this extended context capability. </li>
<li style="font-weight:400;">The model maintains $5/$25 per million token pricing, with premium pricing of $10/$37.50 for prompts exceeding 200k tokens.</li>
<li style="font-weight:400;">The model introduces adaptive thinking and four effort levels (low, medium, high, max) that let developers control how deeply Claude reasons through problems, balancing intelligence against speed and cost. Context compaction automatically summarizes older conversation history when approaching limits, enabling longer-running agentic tasks without hitting context windows.</li>
<li style="font-weight:400;">Opus 4.6 achieves state-of-the-art performance on <a href="https://www.tbench.ai/news/announcement-2-0">Terminal-Bench 2.0</a> for agentic coding and outperforms GPT-5.2 by 144 Elo points on <a href="https://artificialanalysis.ai/evaluations/gdpval-aa">GDPval-AA</a>, an evaluation of economically valuable knowledge work tasks. </li>
<li style="font-weight:400;">On the 8-needle 1M variant of <a href="https://huggingface.co/datasets/openai/mrcr">MRCR v2</a>, it scores 76% compared to Sonnet 4.5’s 18.5%, demonstrating substantially improved long-context retrieval without degradation.</li>
<li style="font-weight:400;">New product features include <a href="https://code.claude.com/docs/en/agent-teams">agent teams</a> in Claude Code that work in parallel and coordinate autonomously, plus <a href="https://claude.com/claude-in-powerpoint">Claude in PowerPoint</a> (research preview) and upgraded <a href="https://claude.com/claude-in-excel">Claude in Excel</a> for handling multi-step data processing and presentation tasks. The model also supports 128k output tokens and US-only inference at 1.1x pricing for compliance-sensitive workloads.</li>
<li style="font-weight:400;">Safety evaluations show Opus 4.6 maintains alignment comparable to its predecessor while exhibiting the lowest over-refusal rate of any recent Claude model. </li>
<li style="font-weight:400;">Anthropic developed six new cybersecurity probes to monitor potential misuse given the model’s enhanced security capabilities, and is using the model to find and patch vulnerabilities in open-source software.</li>
</ul>
<p>34:24  Ryan – “One of the things that I’m constantly dabbling with is the context windows, and so I’m not so sure the context compaction works the way it’s advertised, because every time I go through a process like that, you lose so much.” </p>
<p>43:18 <a href="https://openai.com/index/introducing-openai-frontier">Introducing OpenAI Frontier | OpenAI</a></p>
<ul>
<li style="font-weight:400;">OpenAI launches <a href="https://openai.com/business/frontier/">Frontier</a>, an enterprise platform for building, deploying, and managing AI agents across existing infrastructure without requiring replatforming. </li>
<li style="font-weight:400;">The platform provides agents with shared business context by connecting siloed data warehouses, CRM systems, and internal applications, plus includes identity management, permissions, and governance controls for regulated environments.</li>
<li style="font-weight:400;">Frontier includes an agent execution environment where AI coworkers can reason over data, work with files, run code, and use tools while building memory from past interactions to improve performance. </li>
<li style="font-weight:400;">The platform works across local environments, enterprise cloud infrastructure, and OpenAI-hosted runtimes, with built-in evaluation and optimization capabilities to help agents learn what good performance looks like over time.</li>
<li style="font-weight:400;">OpenAI pairs Forward Deployed Engineers with customer teams to help develop best practices for production agent deployments, creating a feedback loop between business problems, deployment, and OpenAI Research. Early adopters include HP, Intuit, Oracle, State Farm, Thermo Fisher, and Uber, with existing customers like BBVA, Cisco, and T-Mobile piloting the platform.</li>
<li style="font-weight:400;">The platform uses open standards to integrate with existing systems and applications, allowing third-party agent apps to access shared business context without lengthy custom integrations. OpenAI is working with Frontier Partners including Abridge, Clay, Ambience, Decagon, Harvey, and Sierra, to design and support enterprise AI solutions on the platform.</li>
<li style="font-weight:400;">Frontier is currently available to a limited set of customers with broader availability planned over the next few months. </li>
<li style="font-weight:400;">OpenAI cites customer results, including a manufacturer reducing production optimization from six weeks to one day and a hardware company cutting test failure debugging from four hours to minutes.</li>
</ul>
<p>44:35  Ryan – “I think they’re extremely late to the market with this. AWS was too early, and they botched it. Gemini seems to be in the sweet spot, and OpenAI – it’s still not ready yet.</p>
<p>46:28 <a href="https://openai.com/index/introducing-gpt-5-3-codex">Introducing GPT-5.3-Codex | OpenAI</a></p>
<ul>
<li style="font-weight:400;">OpenAI released GPT-5.3-Codex, their most capable agentic coding model that combines the frontier coding performance of GPT-5.2-Codex with the reasoning capabilities of GPT-5.2, while running 25% faster. </li>
<li style="font-weight:400;">The model achieves state-of-the-art results on SWE-Bench Pro and Terminal-Bench 2.0 benchmarks, using fewer tokens than previous models, and can autonomously iterate on complex projects over millions of tokens spanning days.</li>
<li style="font-weight:400;">GPT-5.3-Codex represents the first self-improving model at OpenAI, where the Codex team used early versions to debug its own training, manage deployment, and diagnose test results. </li>
<li style="font-weight:400;">Internal teams report their work has fundamentally changed in the past two months, with researchers using Codex to monitor training runs, engineers using it to optimize harnesses and scale GPU clusters, and data scientists building custom pipelines and visualizations in under three minutes.</li>
<li style="font-weight:400;">The model extends beyond code generation to full computer operation, showing strong performance on OSWorld (visual desktop environment tasks) and matching GPT-5.2 on <a href="https://openai.com/index/gdpval/">GDPval</a>, which measures knowledge work across 44 occupations, including presentations, spreadsheets, and other professional deliverables. </li>
<li style="font-weight:400;">The <a href="https://openai.com/index/introducing-the-codex-app/">Codex app</a> now provides real-time updates and interactive steering, allowing users to direct and supervise multiple agents working in parallel.</li>
<li style="font-weight:400;">OpenAI classifies GPT-5.3-Codex as having <a href="https://openai.com/index/gpt-5-3-codex-system-card/">high capability for cybersecurity</a> under their <a href="https://openai.com/index/updating-our-preparedness-framework/">Preparedness Framework</a>, marking the first model directly trained to identify software vulnerabilities. </li>
<li style="font-weight:400;">They are deploying <a href="https://openai.com/index/trusted-access-for-cyber/">Trusted Access for Cyber</a>, expanding the <a href="https://openai.com/index/introducing-aardvark/">Aardvark</a> security research agent beta, and committing 10 million dollars in API credits through their <a href="https://openai.com/index/openai-cybersecurity-grant-program/">Cybersecurity Grant Program</a> for open source and critical infrastructure defense.</li>
<li style="font-weight:400;">GPT-5.3-Codex is available now with paid ChatGPT plans across the Codex app, CLI, IDE extension, and web, with API access coming soon. </li>
<li style="font-weight:400;">The model was co-designed for and trained on NVIDIA GB200 NVL72 systems, with infrastructure improvements delivering the 25% speed increase for all Codex users.</li>
</ul>
<p>47:48  Ryan – “I’m surprised this is the first self-improving model.” </p>
<p>48:43 <a href="https://openai.com/index/testing-ads-in-chatgpt/">Testing ads in ChatGPT | OpenAI</a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/">OpenAI</a> is launching ads in <a href="https://chatgpt.com/">ChatGPT</a> for free and Go <a href="https://chatgpt.com/pricing">tier users</a> in the US, while Plus, Pro, Business, Enterprise, and Education subscribers remain ad-free. Users can opt out of ads on the free tier in exchange for reduced daily message limits.</li>
<li style="font-weight:400;">Ads are contextually matched to conversation topics and chat history but do not influence ChatGPT responses, which remain independent. Advertisers receive only aggregate performance metrics like views and clicks, with no access to individual chats, memories, or personal details.</li>
<li style="font-weight:400;">The ad program excludes users under 18 and blocks ads near sensitive topics, including health, mental health, and politics. Users can dismiss ads, provide feedback, delete ad data with one tap, and manage personalization settings at any time.</li>
<li style="font-weight:400;">OpenAI positions this as infrastructure funding to maintain free tier performance and quality while supporting development of more powerful features. </li>
<li style="font-weight:400;">The company plans to expand ad formats, objectives, and buying models over time based on test results and user feedback.</li>
</ul>
<p>49:45 <a href="https://www.snowflake.com/content/snowflake-site/global/en/blog/claude-opus-4-6-snowflake-cortex-ai">Announcing Claude Opus 4.6 on Snowflake Cortex AI</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.snowflake.com/en/product/features/cortex/">Snowflake Cortex AI</a> now offers <a href="https://www.anthropic.com/news/claude-opus-4-6">Claude Opus 4.6</a>, Anthropic’s most capable model, providing enhanced reasoning and complex task handling directly within Snowflake’s data platform. </li>
<li style="font-weight:400;">This integration allows enterprises to process sensitive data without moving it outside their Snowflake environment, maintaining data governance and security controls.</li>
<li style="font-weight:400;">Claude Opus 4.6 delivers improved performance on coding tasks, mathematical reasoning, and multilingual capabilities compared to previous versions. The model excels at nuanced instructions and can handle sophisticated analysis workflows while operating on structured and unstructured data within Snowflake.</li>
<li style="font-weight:400;">Cortex AI’s serverless architecture means customers pay only for actual model usage without managing infrastructure or dealing with capacity planning. </li>
<li style="font-weight:400;">The integration supports both SQL and Python interfaces, enabling data teams to build AI applications using familiar tools and existing Snowflake data pipelines.</li>
<li style="font-weight:400;">Organizations can now combine Claude Opus 4.6 with Snowflake’s data clean rooms and governance features for compliant AI deployments in regulated industries. </li>
<li style="font-weight:400;">This addresses enterprise concerns about data residency and privacy while enabling advanced AI capabilities on proprietary datasets.</li>
</ul>
<p>49:57  Justin – “And just because we’re already 50 minutes into this, I will tell you we’re also getting Claude Opus 4.6 on multiple other providers, including Bedrock, Kiro, Vertex AI, and we’re getting it on Azure, in the Moicrosift Foundry App, as well as some of the smaller cloud providers, like DataBricks and DigitalOcean.” </p>
<p>50:45 <a href="https://www.databricks.com/blog/agent-bricks-supervisor-agent-now-ga-orchestrate-enterprise-agents">Agent Bricks Supervisor Agent is Now GA: Orchestrate Enterprise Agents | </a><a href="https://www.databricks.com/blog/agent-bricks-supervisor-agent-now-ga-orchestrate-enterprise-agents">Databricks Blog</a></p>
<ul>
<li style="font-weight:400;">Databricks Agent Bricks Supervisor Agent is now Generally Available, providing a managed orchestration layer that coordinates multiple specialized agents through Unity Catalog governance. </li>
<li style="font-weight:400;">The supervisor uses dynamic routing to analyze user intent and delegate tasks between <a href="https://docs.databricks.com/aws/en/genie/">Genie Spaces</a> for structured data queries, Knowledge Assistant agents for unstructured data, and MCP servers for tool execution.</li>
<li style="font-weight:400;">The platform implements On-Behalf-Of authentication where the supervisor acts as a transparent proxy, validating every data fetch and tool execution against the end user’s existing Unity Catalog permissions. </li>
<li style="font-weight:400;">This eliminates the common security gap where agents access data through broad service accounts that users themselves aren’t authorized to see.</li>
<li style="font-weight:400;"><a href="https://www.databricks.com/blog/agent-learning-human-feedback-alhf-databricks-knowledge-assistant-case-study">Agent Learning on Human Feedback</a> is built directly into the Supervisor Agent, allowing teams to add questions and guidelines that improve routing decisions and response quality over time. </li>
<li style="font-weight:400;">Franklin Templeton reports reducing fund analysis tasks from days to seconds while maintaining compliance, and Zapier uses ALHF to refine orchestration between different Genie spaces without hard-coding routing logic.</li>
<li style="font-weight:400;">The system addresses enterprise agent sprawl, where teams toggle between dozens of specialized bots and duplicate work by creating agents that already exist. </li>
<li style="font-weight:400;">Supervisor Agent provides a single entry point that reasons about intent and coordinates specialized agents while maintaining full MLflow experiment tracking for measurable performance monitoring.</li>
</ul>
<p>51:40 Ryan – “It just goes to show you, depending on who your provider is, this is the type of platform you’re going to need, right? So if you already are using a whole bunch of AI execution on Snowflake, or if you’re only using it on OpenAI’s platform, you’re just going to need to sign on to the platform that’s already there.”</p>
<h2>Cloud Tools</h2>
<p>52:09 <a href="https://www.hashicorp.com/en/blog/introducing-hashicorp-agent-skills">Introducing HashiCorp Agent Skills</a></p>
<ul>
<li style="font-weight:400;">HashiCorp launches <a href="https://github.com/hashicorp/agent-skills">Agent Skills</a>, an <a href="https://agentskills.io/home">open-standard</a> repository that packages domain expertise into portable instructions for AI assistants working with Terraform and Packer. </li>
<li style="font-weight:400;">These skills provide AI tools like Claude with specialized HashiCorp product knowledge, schema definitions, and best practices to reduce hallucinations and ensure code follows proper conventions.</li>
<li style="font-weight:400;">The initial skills pack addresses common DevOps challenges, including building and maintaining Terraform providers, generating style-compliant Terraform code, refactoring monolithic configurations into modules, and creating machine images with Packer across AWS, Azure, and Windows. </li>
<li style="font-weight:400;">HashiCorp partnered with <a href="https://tessl.io/">Tessl</a> to evaluate skill effectiveness using review and task-based evaluations against <a href="https://platform.claude.com/docs/en/agents-and-tools/agent-skills/best-practices">Anthropic’s best practices</a>.</li>
<li style="font-weight:400;">Agent Skills differ from Model Context Protocol (MCP) as complementary technologies – MCP is the data pipe connecting information to AI, while Agent Skills are the knowledge textbooks. Installation takes seconds using npx, Tessl CLI, or Claude Code’s plugin marketplace with simple one-line commands.</li>
<li style="font-weight:400;">The skills solve a fundamental problem where AI assistants lack a specific technical context for complex infrastructure tasks, particularly around HashiCorp’s plugin framework architectures and coding conventions. </li>
<li style="font-weight:400;">This prevents AI from suggesting outdated practices or generating code that doesn’t follow established patterns from official documentation.</li>
<li style="font-weight:400;">HashiCorp plans to expand beyond Terraform and Packer to cover additional products and welcomes community contributions through its GitHub repository. </li>
<li style="font-weight:400;">The open-standard format means these skills are portable and reusable across different AI assistants that support the Agent Skills specification.</li>
</ul>
<p>53:17 Justin – “I love this, because how many times I pointed Claude or others to the documentation, and said ‘I’m pretty sure you’re wrong, this is how it’s supposed to be done, here’s the doc.’ And it comes back and goes, you’re right, Justin, because you’re a genius. That’s what it always tells me.”</p>
<h2>AWS </h2>
<p>56:10 <a href="https://aws.amazon.com/blogs/aws/amazon-ec2-c8id-m8id-and-r8id-instances-with-up-to-22-8-tb-local-nvme-storage-are-generally-available/">Amazon EC2 C8id, M8id, and R8id instances with up to 22.8 TB local NVMe </a><a href="https://aws.amazon.com/blogs/aws/amazon-ec2-c8id-m8id-and-r8id-instances-with-up-to-22-8-tb-local-nvme-storage-are-generally-available/">storage are generally available</a></p>
<ul>
<li style="font-weight:400;">In “instances so big we don’t know what to do with them,” may we present…</li>
<li style="font-weight:400;">AWS launches <a href="https://aws.amazon.com/blogs/aws/introducing-new-compute-optimized-amazon-ec2-c8i-and-c8i-flex-instances/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">C8id</a>, <a href="https://aws.amazon.com/blogs/aws/new-general-purpose-amazon-ec2-m8i-and-m8i-flex-instances-are-now-available/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">M8id</a>, and <a href="https://aws.amazon.com/blogs/aws/best-performance-and-fastest-memory-with-the-new-amazon-ec2-r8i-and-r8i-flex-instances/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">R8id</a> <a href="https://aws.amazon.com/ec2/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">EC2</a> instances with up to 22.8TB of local NVMe storage, triple the capacity of sixth-generation instances. </li>
<li style="font-weight:400;">These new instances scale up to 96xlarge with 384 vCPUs and 3TiB of memory, delivering up to 43% higher compute performance and 3.3x more memory bandwidth than previous generation instances.</li>
<li style="font-weight:400;">The instances use custom Intel Xeon 6 processors exclusive to AWS, running at a 3.9 GHz sustained all-core turbo frequency. Performance improvements include up to 46% better I/O intensive database workload performance and 30% faster query results for real-time data analytics compared to sixth-generation instances.</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/ebs/latest/userguide/instance-bandwidth-configuration.html?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">Instance Bandwidth Configuration</a> feature allows customers to dynamically allocate resources between network and <a href="https://aws.amazon.com/ebs/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">EBS</a> bandwidth by 25%, optimizing for specific workload requirements. </li>
<li style="font-weight:400;">The local <a href="https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/aws-nvme-drivers.html?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">NVMe</a> storage is hardware-encrypted with XTS-AES-256 and ephemeral, meaning data is lost when instances stop or terminate.</li>
<li style="font-weight:400;">Currently available in US East N. Virginia, US East, Ohio, US West, Oregon, and Europe, Frankfurt regions, with additional regions planned. </li>
<li style="font-weight:400;">Instances can be purchased as <a href="https://aws.amazon.com/ec2/pricing/on-demand/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">On-Demand</a>, <a href="https://aws.amazon.com/savingsplans/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">Savings Plans</a>, <a href="https://aws.amazon.com/ec2/spot/pricing/?trk=trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">Spot Instances</a>, <a href="https://aws.amazon.com/ec2/pricing/dedicated-instances/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">Dedicated Instances</a>, or <a href="https://aws.amazon.com/ec2/dedicated-hosts/pricing/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">Dedicated Hosts</a>, with <a href="https://aws.amazon.com/ec2/pricing/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">pricing</a> varying by region and purchase model.</li>
</ul>
<p>56:47 Matt – “If it’s all core turbo, is it really turbo at that point?” </p>
<p>58:45 <a href="https://aws.amazon.com/blogs/aws/aws-iam-identity-center-now-supports-multi-region-replication-for-aws-account-access-and-application-use/">AWS IAM Identity Center now supports multi-Region replication for AWS </a><a href="https://aws.amazon.com/blogs/aws/aws-iam-identity-center-now-supports-multi-region-replication-for-aws-account-access-and-application-use/">account access and application use</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/iam/identity-center/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">AWS IAM Identity Center</a> now supports <a href="https://docs.aws.amazon.com/singlesignon/latest/userguide/identity-center-customer-managed-keys.html#replicate-kms-key?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">multi-Region replication</a>, allowing organizations to replicate workforce identities, permission sets, and metadata from a primary Region to additional Regions for improved resiliency and disaster recovery. </li>
<li style="font-weight:400;">This means if the primary Region experiences a service disruption, users can still <a href="https://docs.aws.amazon.com/singlesignon/latest/userguide/manage-your-accounts.html?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">access AWS accounts</a> through an active access portal endpoint in a secondary Region using their existing permissions.</li>
<li style="font-weight:400;">The feature requires using an organization instance of <a href="https://console.aws.amazon.com/singlesignon/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">IAM Identity Center </a>connected to an external IdP like Microsoft Entra ID or Okta, and you must first configure multi-Region <a href="https://docs.aws.amazon.com/kms/latest/cryptographic-details/basic-concepts.html?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">customer-managed KMS keys</a> before replicating to additional Regions. </li>
<li style="font-weight:400;">The primary Region remains the central management point for all configurations, while additional Regions provide read-only console access except for application management and user session revocation.</li>
<li style="font-weight:400;">Organizations can now deploy AWS managed applications closer to users and datasets to meet data residency requirements or improve performance, with applications accessing replicated workforce identities locally in each Region. This addresses compliance scenarios where datasets must remain in specific Regions while still providing centralized identity management.</li>
<li style="font-weight:400;">The feature is available at no additional cost in 17 enabled-by-default commercial AWS Regions, with only standard <a href="https://aws.amazon.com/kms/pricing/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">AWS KMS charges</a> applying for customer-managed keys. </li>
<li style="font-weight:400;">All workforce actions are logged in <a href="https://aws.amazon.com/cloudtrail/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">CloudTrail</a> in the Region where they occur, maintaining audit trails across multiple Regions for security and compliance monitoring.</li>
</ul>
<p>59:32  Justin – “I recently set up IAM Identity Center for the first time, and I was surprised that it was US East 1 only, so I’m pleased to see this is now available.” </p>
<p>1:00:25 <a href="https://aws.amazon.com/about-aws/whats-new/2026/02/amazon-ecs-nlb-linear-canary-deployments/">Amazon ECS adds Network Load Balancer support for Linear and Canary </a><a href="https://aws.amazon.com/about-aws/whats-new/2026/02/amazon-ecs-nlb-linear-canary-deployments/">deployments</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/ecs/">ECS</a> now supports linear and canary deployment strategies natively with Network Load Balancers, bringing managed traffic shifting to TCP/UDP workloads that previously required custom solutions or third-party tools. </li>
<li style="font-weight:400;">This fills a deployment gap for applications needing NLB features like static IPs, long-lived connections, and low latency.</li>
<li style="font-weight:400;">The feature integrates with CloudWatch alarms for automatic rollback if deployment issues are detected, providing safety guardrails for production updates. </li>
<li style="font-weight:400;">Teams can shift traffic incrementally (linear) or start with a small percentage for validation (canary) before completing rollouts.</li>
<li style="font-weight:400;">Primary beneficiaries are latency-sensitive and connection-oriented workloads such as online gaming backends, financial transaction systems, and real-time messaging services that depend on NLB’s Layer 4 capabilities. </li>
<li style="font-weight:400;">These applications can now use the same deployment patterns ALB users have had access to for years.</li>
<li style="font-weight:400;">Available immediately in all AWS commercial and GovCloud US regions for both new and existing ECS services. </li>
<li style="font-weight:400;">Configuration is accessible through the AWS Console, CLI, and Infrastructure-as-Code tools with no additional cost beyond standard ECS and NLB pricing.</li>
<li style="font-weight:400;">This brings ECS deployment parity between ALB and NLB, eliminating a common pain point.</li>
</ul>
<p>1:01:19  Ryan – “This is one of those rough edges that you hit unexpectedly. You want to use a network load balancer, typically because you have to. It’s easier to set up an application load balancer. You’re only using a network load balancer when it’s not your choice, but then you can’t deploy this app safely without lots of interruption or risk, and it’s kind of a problem.” </p>
<p>1:02:12 <a href="https://aws.amazon.com/about-aws/whats-new/2026/02/structured-outputs-available-amazon-bedrock/">Structured outputs now available in Amazon Bedrock</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/bedrock/">Amazon Bedrock</a> now enforces <a href="https://json.org/">JSON</a> schema compliance at the model level, eliminating the need for custom validation logic and retry mechanisms when extracting structured data from foundation models. </li>
<li style="font-weight:400;">This addresses a common production pain point where formatting errors in LLM responses break downstream API integrations and automated workflows.</li>
<li style="font-weight:400;">The feature works in two modes: custom JSON schema definitions for response formatting, or strict tool definitions that ensure model tool calls match exact specifications. </li>
<li style="font-weight:400;">This reduces operational overhead by preventing malformed outputs before they reach application code, making AI integrations more reliable for production use cases like data extraction, form processing, and API orchestration.</li>
<li style="font-weight:400;">Available now for <a href="https://www.anthropic.com/news/claude-opus-4-5">Anthropic Claude 4.5 models</a> and select <a href="https://docs.aws.amazon.com/bedrock/latest/userguide/structured-output.html#structured-output-supported-models">open-weight models</a> across all commercial <a href="https://docs.aws.amazon.com/general/latest/gr/bedrock.html">AWS Regions</a> where Bedrock operates. </li>
<li style="font-weight:400;">The capability works with Converse, ConverseStream, InvokeModel, and InvokeModelWithResponseStream APIs, providing flexibility for both synchronous and streaming applications.</li>
<li style="font-weight:400;">The practical benefit is fewer failed requests and reduced engineering time spent on output parsing and error handling. </li>
<li style="font-weight:400;">Organizations building production AI applications that feed into existing systems or databases can now rely on consistent, machine-readable responses without building extensive validation layers.</li>
<li style="font-weight:400;">int where teams had to choose between advanced deployment strategies and NLB’s technical requirements. Documentation available at <a href="https://docs.aws.amazon.com/AmazonECS/latest/developerguide/deployment-type-linear.html">https://docs.aws.amazon.com/AmazonECS/latest/developerguide/deployment-type-linear.html</a> </li>
</ul>
<p>FYI <a href="https://aws.amazon.com/about-aws/whats-new/2026/2/claude-opus-4.6-available-amazon-bedrock/">Claude Opus 4.6 now available in Amazon Bedrock</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/news/claude-opus-4-6">Claude Opus 4.6</a> is now available in <a href="https://aws.amazon.com/bedrock/">Amazon Bedrock</a>, positioning itself as Anthropic’s most capable model with particular strength in coding, agentic workflows, and enterprise applications. </li>
<li style="font-weight:400;">The model supports both 200K and 1M context windows in preview, enabling analysis of large codebases and extensive document sets without chunking.</li>
<li style="font-weight:400;">The model’s agentic capabilities allow it to manage complex multi-step tasks across dozens of tools with reduced oversight, including the ability to autonomously spin up subagents for task decomposition. </li>
<li style="font-weight:400;">This makes it suitable for enterprise workflows like financial analysis that would typically require days of manual work, cybersecurity threat detection, and cross-application data movement.</li>
<li style="font-weight:400;">For developers, Opus 4.6 handles full software lifecycle management from requirements gathering through implementation and maintenance, particularly for long-horizon projects and large-scale codebases. </li>
<li style="font-weight:400;">The model’s deep reasoning capabilities make it applicable to professional work requiring sophisticated multi-step orchestration.</li>
<li style="font-weight:400;">Regional availability varies by deployment, with specific regions listed in the AWS Bedrock documentation. Pricing follows Bedrock’s standard model-based pricing structure, though specific costs for Opus 4.6 are not detailed in the announcement and should be verified in the Bedrock console.</li>
</ul>
<p>FYI <a href="https://kiro.dev/blog/opus-4-6/">Opus 4.6 is now available in Kiro </a></p>
<ul>
<li style="font-weight:400;"><a href="https://kiro.dev/">Kiro</a> has released Claude Opus 4.6 integration in their IDE and CLI, marking Anthropic’s newest state-of-the-art model that claims to be the world’s best for coding. </li>
<li style="font-weight:400;">The model is available to Kiro Pro, Pro+, and Power customers in AWS US-East-1 region with a 2.2x credit multiplier, same as Opus 4.5.</li>
<li style="font-weight:400;">Opus 4.6 targets production code and sophisticated agents, with particular strength in large-scale codebases and long-horizon projects. </li>
<li style="font-weight:400;">Anthropic positions it as capable of helping senior engineers complete multi-day projects in hours through task delegation with reduced oversight requirements.</li>
<li style="font-weight:400;">The model integrates with Kiro’s spec-driven development workflows, enabling detailed but precise specifications on large existing projects and surgical precision updates with minimal user input. </li>
<li style="font-weight:400;">This represents a shift toward AI-assisted development at enterprise scale rather than simple code completion.</li>
<li style="font-weight:400;">Access requires authentication through Google, GitHub, AWS BuilderID, or AWS IAM Identity Center, with experimental support currently limited to the Northern Virginia region. Users can access the model immediately by downloading or restarting the Kiro app or CLI.</li>
</ul>
<p>1:03:55 <a href="https://aws.amazon.com/about-aws/whats-new/2026/02/amazon-redshift-allocate-extra-compute-for-automatic-optimizations">Amazon Redshift now supports allocating extra compute for automatic </a><a href="https://aws.amazon.com/about-aws/whats-new/2026/02/amazon-redshift-allocate-extra-compute-for-automatic-optimizations">optimizations</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/redshift/">Amazon Redshift</a> now allows database administrators to allocate dedicated compute resources specifically for automatic optimization tasks like table optimization, sorting, vacuuming, and analysis. </li>
<li style="font-weight:400;">This prevents maintenance operations from competing with user queries during peak usage periods, addressing a common pain point where DBAs had to manually schedule these tasks during off-hours.</li>
<li style="font-weight:400;">The feature includes cost controls for provisioned clusters, letting administrators cap the amount of extra compute resources that autonomics can consume. This prevents runaway costs while still enabling continuous optimization, and works alongside the new SYS_AUTOMATIC_OPTIMIZATION system table that provides visibility into what optimization operations are running and their resource consumption.</li>
<li style="font-weight:400;">This enhancement is available across all <a href="https://aws.amazon.com/about-aws/global-infrastructure/regional-product-services/">AWS Regions</a> where Redshift operates, supporting both provisioned clusters and serverless workgroups. </li>
<li style="font-weight:400;">The feature essentially decouples database maintenance from query performance, which is particularly valuable for organizations running 24/7 analytics workloads that previously had no maintenance windows.</li>
<li style="font-weight:400;">The practical benefit is that Redshift databases can now stay optimized continuously without manual intervention or performance degradation during business hours. </li>
<li style="font-weight:400;">Organizations with high-concurrency analytics workloads or those operating across multiple time zones will see the most immediate value from this capability.</li>
</ul>
<p>1:04:35  Justin – “This is why I wanted a managed service from you, Amazon, so I didn’t have to think about this. This is you failing me.”  </p>
<h2>GCP</h2>
<p>1:05:26  <a href="https://developers.googleblog.com/introducing-the-developer-knowledge-api-and-mcp-server/">Introducing the Developer Knowledge API and MCP Server</a></p>
<ul>
<li style="font-weight:400;">Google launches the <a href="https://console.cloud.google.com/start/api?id=developerknowledge.googleapis.com">Developer Knowledge API</a> and Model Context Protocol server to provide AI assistants with programmatic access to official Google developer documentation as machine-readable Markdown. </li>
<li style="font-weight:400;">This addresses the problem of LLMs relying on outdated training data or web scraping when helping developers build with Google technologies like Firebase, Android, and Google Cloud.</li>
<li style="font-weight:400;">The MCP server implements the open Model Context Protocol standard, allowing popular AI assistants and IDEs to directly query Google’s documentation for real-time answers about API changes, code examples, and best practices. Developers can enable it through gcloud CLI and configure it in their AI assistant settings, with support for tools like Claude Desktop and various IDE extensions.</li>
<li style="font-weight:400;">The service is currently in public preview with free access through standard Google Cloud API quotas. </li>
<li style="font-weight:400;">Future plans include adding structured content support for code samples and API reference entities, expanding the documentation corpus, and reducing re-indexing latency before general availability.</li>
<li style="font-weight:400;">This integration benefits developers using AI coding assistants by ensuring responses reference current Google documentation rather than potentially stale information from model training cutoffs. The approach provides a canonical source of truth that updates as Google’s documentation changes.</li>
<li style="font-weight:400;">The Developer Knowledge API requires a Google Cloud project with the API enabled through gcloud beta services, and detailed configuration instructions are available in the official documentation at developers.google.com/knowledge/api and developers.google.com/knowledge/mcp.</li>
</ul>
<p>1:04:35  Ryan – “This won’t fix the fact that Google documentation is awful, but this will make it at least better.” </p>
<p>1:12:17 <a href="https://cloud.google.com/blog/products/identity-security/delivering-a-secure-open-sovereign-digital-world/">Delivering a secure, open, and sovereign digital world</a></p>
<ul>
<li style="font-weight:400;">Google Cloud expands its <a href="https://cloud.google.com/sovereign-cloud">Sovereign Cloud portfolio</a> with three tiers – Data Boundary, <a href="https://cloud.google.com/sovereign-cloud">Dedicated</a>, and <a href="https://cloud.google.com/distributed-cloud-air-gapped">Air-Gapped</a> – designed to meet varying data sovereignty requirements. </li>
<li style="font-weight:400;">Air-Gapped operates completely disconnected from Google Cloud and the internet, with no remote access possible by Google, while Dedicated allows partners to monitor and block updates with up to 12 months of independent operation if disconnected.</li>
<li style="font-weight:400;">The company announces substantial infrastructure investments across all continents, including new cloud regions in <a href="https://www.googlecloudpresscorner.com/2026-01-21-Google-Cloud-Launches-New-Cloud-Region-in-Thailand,-Bolstering-its-Commitment-to-Advancing-the-Countrys-AI-Driven-Digital-Economy">Thailand</a>, <a href="https://www.googlecloudpresscorner.com/2024-10-1-Google-Breaks-Ground-on-US-2-Billion-Malaysia-Data-Center-and-Cloud-Region,-Announces-Support-for-New-Sustainability-and-Skilling-Initiatives">Malaysia</a>, and <a href="https://cloud.google.com/blog/products/infrastructure/google-cloud-launches-42nd-cloud-region-in-sweden">Sweden</a>, plus subsea cables like <a href="https://cloud.google.com/blog/products/infrastructure/talaylink-subsea-cable-to-connect-australia-and-thailand">TalayLink</a> and <a href="https://cloud.google.com/blog/products/networking/introducing-dhivaru-new-subsea-cable?e=48754805">Dhivaru</a> for Asia-Pacific connectivity. </li>
<li style="font-weight:400;">Google commits to legal resistance against government shutdown orders and will enable qualified third parties to operate Google Cloud using Google’s code if Google becomes unable to continue operations.</li>
<li style="font-weight:400;">External Key Management lets customers store encryption keys outside Google Cloud with detailed access justifications required, while client-side encryption for Workspace ensures Google cannot read customer collaboration data. </li>
<li style="font-weight:400;">Google eliminated data transfer fees for customers migrating off the platform and expanded local ML processing for select Gemini models to 11 countries, including Australia, Brazil, Canada, France, Germany, India, Japan, Singapore, South Korea, and the UK.</li>
<li style="font-weight:400;">Notable sovereign cloud deployments include <a href="https://www.googlecloudpresscorner.com/2025-11-24-NATO-and-Google-Cloud-Sign-Multi-Million-Dollar-Deal-for-AI-Enabled-Sovereign-Cloud">NATO Communication and Information Agency</a>, <a href="https://cloud.google.com/blog/de/topics/offentlicher-sektor/souveraenitaet-auswahl-sicherheit">German Armed Forces</a>, <a href="https://www.googlecloudpresscorner.com/2025-09-11-Google-Cloud-Awarded-Landmark-Sovereign-Cloud-Contract-with-UK-Ministry-of-Defence">UK Ministry of Defence</a>, and <a href="https://www.googlecloudpresscorner.com/2025-08-28-Google-Cloud-Makes-Gemini-Everywhere-Vision-a-Reality,-Doubles-Down-on-Enterprise-AI-Commitment-to-Singapore">Singapore government agencies</a> using Air-Gapped, while France’s S3NS offers Premi3NS built on Dedicated with <a href="https://www.thalesgroup.com/en/news-centre/press-releases/s3ns-announces-secnumcloud-qualification-premi3ns-its-trusted-cloud">SecNumCloud 3.2</a> qualification from ANSSI. </li>
<li style="font-weight:400;">The portfolio targets highly regulated sectors like defense, government, banking, and healthcare, requiring strict data residency and operational independence guarantees.</li>
</ul>
<p>FYI  <a href="https://cloud.google.com/blog/products/ai-machine-learning/expanding-vertex-ai-with-claude-opus-4-6/">Expanding Vertex AI with Claude Opus 4.6. </a></p>
<ul>
<li style="font-weight:400;">Google Cloud adds Anthropic’s <a href="https://console.cloud.google.com/vertex-ai/publishers/anthropic/model-garden/claude-opus-4-6">Claude Opus 4.6</a> to Vertex AI, positioning it as its most powerful model for enterprise workflows, including document generation, financial analysis, and complex coding tasks. </li>
<li style="font-weight:400;">The model excels at multi-step agentic workflows and can handle tasks like creating production-ready spreadsheets and presentations with fewer revision cycles, particularly valuable for finance and legal verticals requiring precision.</li>
<li style="font-weight:400;">Vertex AI provides a complete agentic stack beyond just model access, including Agent Development Kit for rapid prototyping, <a href="https://docs.cloud.google.com/agent-builder/agent-engine/overview">Agent Engine</a> for serverless deployment, and Memory Bank for persistent context across interactions. </li>
<li style="font-weight:400;">Cost optimization features include provisioned throughput for fixed pricing, prompt caching with flexible TTL, batch predictions, and a 1M token context window in preview for Claude Opus 4.6.</li>
<li style="font-weight:400;">The platform integrates with Google Cloud’s security infrastructure, including <a href="https://cloud.google.com/security/products/model-armor?e=48754805&amp;hl=en">Model Armor</a> for protection against prompt injection and tool poisoning, plus Security Command Center for AI threat detection. </li>
<li style="font-weight:400;">Customer implementations show practical results, with Palo Alto Networks reporting a 20-30% increase in code development velocity and companies like Shopify, TELUS, and Replit using Claude on Vertex AI for production workloads.</li>
<li style="font-weight:400;">Claude Opus 4.6 is generally available on Vertex AI with deployment options through Google Cloud Marketplace for streamlined procurement. Regional availability and specific pricing details are documented at cloud.google.com/vertex-ai/generative-ai/pricing#claude-models, with the model accessible through the Vertex AI console and sample notebooks available on GitHub.</li>
</ul>
<p>1:13:15 <a href="https://cloud.google.com/blog/products/ai-machine-learning/gear-program-now-available/">GEAR program now available</a></p>
<ul>
<li style="font-weight:400;">Google launches <a href="https://developers.google.com/program/gear/">GEAR (Gemini Enterprise Agent Ready)</a> as a specialized learning program within the <a href="http://developers.google.com/program">Google Developer Program</a> to help developers build production-ready AI agents. </li>
<li style="font-weight:400;">The program provides 35 monthly learning credits on the <a href="https://www.skills.google/subscriptions">Google Skills</a> platform for sandbox testing and lab access at no cost to participants.</li>
<li style="font-weight:400;">The program offers two main learning paths: <a href="https://www.skills.google/paths/3546">Introduction to Agents</a> for understanding agent architecture and integration with Gemini Enterprise, and <a href="https://www.skills.google/paths/3545">Develop Agents with Agent Development Kit (ADK)</a> for building agents with reasoning loops. </li>
<li style="font-weight:400;">Both paths focus on moving developers from experimentation to production-grade implementations using Google’s open-source ADK.</li>
<li style="font-weight:400;">GEAR includes a credential system with completion badges on Google Developer profiles and skill badges for intermediate and advanced expertise. </li>
<li style="font-weight:400;">For Google Cloud customers, a separate <a href="https://developers.google.com/program/gear/getcertified/">Get Certified</a> cohort-based program offers instructor-led training and technical mentorship to prepare for industry-recognized certifications.</li>
<li style="font-weight:400;">The program addresses the shift toward agentic AI, where software can reason, plan, and execute complex workflows autonomously. Access requires creating or signing into a Google Developer Program profile and claiming the GEAR badge at developers.google.com/program/gear.</li>
</ul>
<p>1:14:27  Ryan – “I still think there’s a very large amount of people who don’t really understand sort of putting an agentic workflow in place to do what they want, right? It’s still pretty much fire-and-forget chat operations. And so there’s a lot of power in the tool once you know how to use it, but it is sort of less than straightforward, so I think this is a great course.”</p>
<h2>Azure</h2>
<p>1:15:26 <a href="https://blogs.microsoft.com/blog/2026/02/04/updates-in-two-of-our-core-priorities/">Updates in two of our core priorities</a></p>
<ul>
<li style="font-weight:400;">Microsoft announces major security leadership change with Hayete Gallot returning as EVP of Security, reporting directly to CEO Satya Nadella, while Charlie Bell transitions from leading security to focus on engineering quality as an individual contributor. </li>
<li style="font-weight:400;">This organizational shift reflects Microsoft’s continued emphasis on security as a top priority following recent Security Copilot and Purview adoption momentum.</li>
<li style="font-weight:400;">Gallot brings 15-plus years of Microsoft experience building Windows and Office franchises, plus recent Google Cloud customer experience leadership, positioning her to connect product development with customer value realization across Microsoft’s security portfolio. </li>
<li style="font-weight:400;">Her appointment comes as Microsoft integrates security into its new commercial cohorts operating model announced during recent earnings.</li>
<li style="font-weight:400;">Charlie Bell’s move from organizational leadership to an individual contributor engineering role is notable for a senior executive, with his new focus on Quality Excellence Initiative to improve engineering standards and product durability across Microsoft’s global scale operations. He will partner with Azure leadership, including Scott Guthrie, on quality improvements.</li>
<li style="font-weight:400;">Ales Holecek takes on the Chief Architect for Security role to bring platform architecture expertise to security products and connect them with Microsoft’s existing scale businesses and the Agent Platform. This architectural focus suggests deeper integration between security services and Microsoft’s broader cloud infrastructure.</li>
<li style="font-weight:400;">The timing aligns with Microsoft’s recent earnings report, highlighting security business growth and the company’s broader reorganization around commercial cohorts, indicating security will have dedicated product development rhythms separate from other business units. No specific pricing or feature changes were announced as part of this leadership transition.</li>
</ul>
<p>1:17:19  Justin – “I think this is them recreating the engineering operations review at Amazon at Azure. I think he is basically building a weekly program team that is going to be running the wheel, if you’re familiar with Amazon’s wheel thing, where basically you – as a service owner – can be called on at any time and you have to deep dive into all your KPIs, how your system’s operating, service operations, recent incidents, and you have to answer that at Amazon. They do it every week.”</p>
<p>1:19:23 <a href="https://azure.microsoft.com/en-us/blog/enhanced-storage-resiliency-with-azure-netapp-files-elastic-zone-redundant-service/">Enhanced storage resiliency with Azure NetApp Files – </a><a href="https://azure.microsoft.com/en-us/blog/enhanced-storage-resiliency-with-azure-netapp-files-elastic-zone-redundant-service/">Elastic zone-redundant service</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/azure-netapp-files/elastic-zone-redundant-concept">Azure NetApp Files</a> <a href="https://learn.microsoft.com/en-us/azure/azure-netapp-files/elastic-account">Elastic ZRS</a> introduces synchronous replication across three or more availability zones within a <a href="https://learn.microsoft.com/en-us/azure/azure-netapp-files/elastic-zone-redundant-concept#supported-regions">region</a> with automatic service-managed failover, maintaining the same mount target and endpoint during zone failures. </li>
<li style="font-weight:400;">This eliminates the need for customers to manage HA clusters or VM-level failover while guaranteeing zero data loss for mission-critical workloads.</li>
<li style="font-weight:400;">The service costs less than running three separate ANF volumes with cross-zone replication while providing the same multi-AZ high availability in a single volume. Volumes can be created as small as 1 GiB, offering flexibility for workloads of any size with support for both NFS and SMB protocols independently.</li>
<li style="font-weight:400;">ANF Elastic ZRS delivers enterprise data management capabilities, including instant snapshots, clones, tiering, and backup integration powered by NetApp ONTAP, plus efficient metadata operations through a shared QoS architecture that dynamically allocates IOPS. </li>
<li style="font-weight:400;">The service is particularly suited for healthcare, financial services, and other regulated industries requiring continuous uptime and compliance.</li>
<li style="font-weight:400;">The service is currently available in select Azure regions with rapid expansion planned, and future capabilities will include simultaneous multi-protocol access (<a href="https://learn.microsoft.com/azure/azure-netapp-files/azure-netapp-files-create-volumes">NFS</a>, <a href="https://learn.microsoft.com/azure/azure-netapp-files/azure-netapp-files-create-volumes-smb">SMB</a>, and Object REST API), custom region pairs for cross-region replication, and a migration assistant for moving data from on-premises ONTAP systems. </li>
<li style="font-weight:400;">This represents a clear migration path for existing NetApp on-premises customers looking to modernize without re-architecting applications.</li>
</ul>
<p>1:21:21 <a href="https://azure.microsoft.com/en-us/blog/postgresql-on-azure-supercharged-for-ai/">PostgreSQL on Azure supercharged for AI </a></p>
<ul>
<li style="font-weight:400;">Microsoft has enhanced <a href="https://azure.microsoft.com/en-us/products/postgresql">Azure Database for PostgreSQL</a> with native AI capabilities, including direct integration with Microsoft Foundry for in-database LLM operations like embeddings and semantic search. </li>
<li style="font-weight:400;">The service now supports DiskANN vector indexing for high-performance similarity search and includes a new PostgreSQL extension for <a href="https://www.bing.com/ck/a?!&amp;&amp;p=5f41a19334b843b7fdbefede99ac623b802ed7a5e254c2133bcee07fda936cceJmltdHM9MTc3MTM3MjgwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=01d3685b-64b5-6f5b-0f8b-7ee865eb6e52&amp;psq=Visual+Studio+Code&amp;u=a1aHR0cHM6Ly9jb2RlLnZpc3VhbHN0dWRpby5jb20vZG93bmxvYWQ">Visual Studio Code</a> that enables database provisioning directly from the IDE with built-in Entra ID authentication.</li>
<li style="font-weight:400;">The platform introduces zero-ETL real-time analytics through Microsoft Fabric mirroring and native Parquet file support via the Azure Storage Extension, allowing direct read/write operations to Azure Storage using SQL commands. PostgreSQL 18 is now generally available on Azure with new V6 compute SKUs that deliver improved I/O performance and lower latency, while Elastic Clusters enable horizontal scaling for multi-tenant workloads.</li>
<li style="font-weight:400;"><a href="https://azure.microsoft.com/en-us/products/horizondb?msockid=01d3685b64b56f5b0f8b7ee865eb6e52">Azure HorizonDB</a> was announced at Ignite as a new PostgreSQL-compatible service in private preview, designed specifically for AI-native workloads with scale-out compute and sub-millisecond latency. </li>
<li style="font-weight:400;">This positions Azure to support both traditional PostgreSQL workloads and next-generation AI applications requiring ultra-low latency and horizontal scale.</li>
<li style="font-weight:400;">The GitHub Copilot integration provides schema-aware SQL assistance within Visual Studio Code, while the new Model Context Protocol server for PostgreSQL enables direct agent framework connections in <a href="https://ai.azure.com/">Microsoft Foundry</a>. </li>
<li style="font-weight:400;">Nasdaq demonstrated a production use case with their Boardvantage platform, using Azure Database for PostgreSQL and Microsoft Foundry to add AI-powered document analysis and summarization to their board governance system serving nearly half of the Fortune 500.</li>
</ul>
<p>1:22:49  Matt – “Nothing I like better than an LLM inside my database!” </p>
<p>FYI      <a href="https://azure.microsoft.com/en-us/blog/claude-opus-4-6-anthropics-powerful-model-for-coding-agents-and-enterprise-workflows-is-now-available-in-microsoft-foundry-on-azure/">Claude Opus 4.6: Anthropic’s powerful model for coding, agents, and </a><a href="https://azure.microsoft.com/en-us/blog/claude-opus-4-6-anthropics-powerful-model-for-coding-agents-and-enterprise-workflows-is-now-available-in-microsoft-foundry-on-azure/">enterprise workflows is now available in Microsoft Foundry</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/news/claude-opus-4-6">Claude Opus 4.6</a> is now available in <a href="https://ai.azure.com/">Microsoft Foundry</a> on Azure, bringing Anthropic’s most advanced reasoning model to enterprise customers with a 1M token context window in beta and 128K max output tokens. </li>
<li style="font-weight:400;">The model targets complex coding tasks, agentic workflows, and knowledge work across finance, legal, and cybersecurity domains, with new API features including adaptive thinking that dynamically adjusts reasoning depth and context compaction for long-running conversations.</li>
<li style="font-weight:400;">The integration connects Claude Opus 4.6 with <a href="https://techcommunity.microsoft.com/blog/azure-ai-foundry-blog/foundry-iq-unlocking-ubiquitous-knowledge-for-agents/4470812">Foundry IQ</a>, enabling access to data across Microsoft 365, Fabric, and web sources within Azure’s governance and compliance framework. </li>
<li style="font-weight:400;">Customers like Adobe, Dentons, and Macroscope are using the model for code review, legal drafting, and document generation, with deployment available through both Microsoft Foundry and Copilot Studio for no-code agent building.</li>
<li style="font-weight:400;">Technical improvements include enhanced computer use capabilities for navigating interfaces and automating multi-application workflows, plus a new max effort control level that joins existing high, medium, and low settings for finer token allocation. The model handles large codebases effectively for refactoring and bug detection, with companies like Momentic AI processing millions of tokens per hour using the Azure infrastructure.</li>
<li style="font-weight:400;">Pricing follows a premium model beyond 200K tokens for the 1M context window beta, though specific per-token costs were not disclosed in the announcement. </li>
<li style="font-weight:400;">The focus is on production-grade deployments where Azure’s managed infrastructure and operational controls help compress development timelines from days to hours while maintaining enterprise security requirements.</li>
</ul>
<p>1:23:45 <a href="https://blog.fabric.microsoft.com/en-GB/blog/microsoft-onelake-and-snowflake-interoperability-is-now-generally-available/">Microsoft OneLake and Snowflake interoperability (Generally Available) | </a><a href="https://blog.fabric.microsoft.com/en-GB/blog/microsoft-onelake-and-snowflake-interoperability-is-now-generally-available/">Microsoft Fabric Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/fabric/onelake/onelake-overview">Microsoft OneLake</a> and <a href="https://www.snowflake.com/en/">Snowflake</a> now offer bidirectional <a href="https://community.databricks.com/t5/technical-blog/how-to-set-up-databricks-snowflake-interoperability-with-unity/ba-p/128406">Iceberg table interoperability</a> in general availability, allowing customers to store and access data across both platforms without duplication. </li>
<li style="font-weight:400;">Changes made in one platform automatically reflect in the other, eliminating the need for traditional copy-heavy data integration approaches.</li>
<li style="font-weight:400;">Snowflake-managed <a href="https://quickstarts.snowflake.com/guide/getting_started_with_iceberg_in_oneLake/index.html?_ga=2.4046832.1392156976.1757344207-1553077896.1747951511&amp;_gac=1.224858344.1755023652.Cj0KCQjwzOvEBhDVARIsADHfJJQuB-FU5ZF0u7wgFyvOR-ThCJcpHxV4FxDP_-bC0JitaSkK57-dB5caAquFEALw_wcB%22%20%5Cl%20%2204">Iceberg tables</a> can now be natively stored in Microsoft OneLake, while Fabric data automatically converts to Iceberg format for direct Snowflake access. </li>
<li style="font-weight:400;">This addresses the challenge of enterprise data living across fragmented systems by providing a single copy of data accessible through either platform’s analytical engines.</li>
<li style="font-weight:400;">New UI elements launching next week include a Snowflake item in OneLake for simplified access without complex configurations, plus Snowflake UI that pushes managed Iceberg tables directly into Fabric as discoverable OneLake items. The integration also supports OneLake table APIs working with Snowflake’s catalog-linked database feature.</li>
<li style="font-weight:400;">The target use case centers on data teams managing analytics and AI workloads across multiple platforms who want to avoid vendor lock-in and proprietary formats. Organizations can now choose the optimal storage location and analytical engine for each project while maintaining a unified data estate without operational overhead from data duplication.</li>
<li style="font-weight:400;">No specific pricing details were provided in the announcement, though the integration leverages existing OneLake and Snowflake licensing models. Customers can access quickstart guides and documentation through Microsoft Learn and Snowflake’s resources, with hands-on training available at FabCon and SQLCon 2026 in Atlanta from March 16-20.</li>
</ul>
<p>1:24:44  Ryan – “This is kind of neat. I mean, it’s unexpected because it is data, and the amount of data and what you’d have in a data lake is usually one of those elements that makes using a service very sticky, so providing sort of an easy way to get out of that is a surprise to me, but it’s also – from a customer perspective – if you’ve got data across both, like how fantastic is that? To be able to use it. I like it.”</p>
<p>1:25:52 <a href="https://azure.microsoft.com/en-us/updates?id=553917">Generally Available: Azure Container Storage v2.1.0 now with Elastic SAN </a><a href="https://azure.microsoft.com/en-us/updates?id=553917">integration and on-demand installation</a></p>
<ul>
<li style="font-weight:400;">Azure Container Storage v2.1.0 brings native <a href="https://learn.microsoft.com/en-us/azure/storage/elastic-san/elastic-san-introduction">Elastic SAN</a> integration, allowing Kubernetes workloads to leverage Azure’s shared block storage service for high-performance persistent volumes. </li>
<li style="font-weight:400;">This integration provides an alternative to existing Azure Disk and ephemeral disk options, particularly beneficial for workloads requiring shared storage across multiple pods.</li>
<li style="font-weight:400;">The release introduces an on-demand installation model that reduces the deployment footprint and operational overhead compared to previous versions. Instead of pre-installing all storage components, the system now deploys only the necessary drivers and resources when specific storage types are requested, streamlining cluster management.</li>
<li style="font-weight:400;">Elastic SAN support targets enterprise customers running stateful containerized applications that need consistent low-latency performance and the ability to scale storage independently from compute. </li>
<li style="font-weight:400;">Common use cases include database workloads, analytics platforms, and applications requiring shared persistent volumes across multiple container instances.</li>
<li style="font-weight:400;">The lightweight installation approach addresses a common pain point where organizations previously had to deploy full storage stacks even when using only a subset of available storage options. </li>
<li style="font-weight:400;">This change reduces resource consumption on AKS clusters and simplifies troubleshooting by limiting the number of active storage components</li>
</ul>
<p>1:26:26  Justin – “The amount of SAN investment they’ve done in the last year is crazy to me.” </p>
<p>1:27:20 <a href="https://blog.fabric.microsoft.com/en-us/blog/32624">Five Reasons to attend SQLCon | Microsoft Fabric Blog </a></p>
<ul>
<li style="font-weight:400;"><a href="https://sqlcon.us/">SQLCon</a> is a new SQL-focused conference co-located with FabCon in Atlanta, March 16-20, offering dual access with a single registration. </li>
<li style="font-weight:400;">The event features <a href="https://sqlcon.us/program/tracks">50 SQL sessions</a> covering SQL Server, Azure SQL, and SQL database in Fabric, with hands-on workshops Monday-Tuesday and conference sessions Wednesday-Friday.</li>
<li style="font-weight:400;">Microsoft is sending over 30 SQL product team members to deliver engineering insights, roadmap announcements, and live demos of upcoming capabilities, including SSMS and VS Code extensions, Copilot integrations, and Fabric SQL experiences. This provides direct access to product teams for technical questions and future planning.</li>
<li style="font-weight:400;">The combined conference format allows attendees to mix deep SQL technical sessions with broader Fabric, Power BI, data engineering, and AI content throughout the week. </li>
<li style="font-weight:400;">This structure benefits both specialists needing deep technical content and cross-functional teams building shared understanding across data platforms.</li>
<li style="font-weight:400;">Registration includes access to both conferences, hands-on workshops, Ask-the-Experts sessions with MVPs and engineers, and an attendee party at the Georgia Aquarium. </li>
<li style="font-weight:400;">Early-bird pricing and team discounts are available, with promo code SQLCMTY200 offering $200 off registration.</li>
<li style="font-weight:400;">The event targets DBAs, developers, data engineers, architects, and data team leaders working with SQL Server, Azure SQL, or SQL database in Fabric who need practical migration, modernization, performance tuning, and AI integration guidance.</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2367231/c1e-rodobomj3vuxgp5x-ww797qrra1zq-ufzhzl.mp3" length="165445936"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 342 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Matt are in the studio today to bring you all the latest in cloud and AI news this week. How do you feel about ads? How do you feel about ads while using AI? We’ve got options! We’ve got a round-up of tech Super Bowl ads, AI ads, Earnings reports (who frankly need the ad revenue), and a plethora of Opus 4.6 announcements, plus more. Let’s get started! 
Titles we almost went with this week

 ChatGPT Goes Full Mad Men: Your AI Assistant Now Comes With Commercial Breaks
 Heroku’s New Feature: No New Features
 AWS Gives EC2 Instances a Storage Growth Spurt: 22.8TB of Local NVMe Now Available
 Identity Crisis Averted: IAM Identity Center Learns to Replicate Itself
 JSON Schema Enforcement: Because Your LLM Needs Structure in Its Life
 From Zero to Admin in 480 Seconds: A Serbian Speedrun Story
 From Proof of Concept to Proof of Claw: DigitalOcean Tames AI Agent Infrastructure
 Azure’s Growth Hits the Clouds: Microsoft’s 39% Increase Still Not Enough for Wall Street
 One Lake to Rule Them All: Microsoft and Snowflake Finally Stop Fighting Over Your Data
 Free Lunch Officially Over: ChatGPT Learns That Servers Cost Money
 Claude Won’t Sell You Anything (Except Maybe Peace of Mind)
 IAM Identity Center Goes Multi-Regional: Because One Region to Rule Them All Wasn’t Enough
 Databricks Takes the Base Out of Database with Lakebase GA
 I’m a Chrome Tab hoarder

General News 
01:30 Superbowl Ads of Note

OpenAI: https://www.youtube.com/watch?v=aCN9iCXNJqQ
Microsoft CoPilot: https://www.youtube.com/watch?v=Ndj9Jk-tGKo
Base44?: https://www.youtube.com/watch?v=iKEUWtqvsis 
Gemini: https://www.youtube.com/watch?v=Z1yGy9fELtE
Anthropic: https://www.youtube.com/watch?v=gmnjDLwZckA 
ai.com: https://www.youtube.com/watch?v=n7I-D4YXbzg&t=3s

16:35 Justin -If you ever want to knowif there’s a bubble, spending dumb money on the Super Bowl on an ad that makes no sense is probably your number one clue.” 
16:53 It’s Earnings Time!
Microsoft (MSFT) Q2 earnings report 2026

Microsoft Q2 2026 earnings show Azure cloud growth slowing to 39% from 40% in the prior quarter, missing analyst expectations of 39.4% and causing shares to drop 7% in after-hours trading. 
The company’s gross margin hit a three-year low at 68% due to substantial AI infrastructure investments totaling $37.5 billion in capital expenditures, up 66% year over year.
OpenAI now represents 45% of Microsoft’s $625 billion remaining commercial performance obligation after the company committed to a $250 billion cloud services deal during the quarter. 
This concentration raises questions about revenue dependence on a single customer, though Microsoft maintains that the remaining backlog is still larger and more diversified than most compet...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2367231/c1a-k5d5-jpqoqz6nsk3j-sbol8f.jpg"></itunes:image>
                                                                            <itunes:duration>01:25:46</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2367231/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[341: AWS Layoffs: Scaling Down Instead of Scaling Out]]>
                </title>
                <pubDate>Fri, 13 Feb 2026 03:38:58 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2361457</guid>
                                    <link>https://tcpfm.castos.com/episodes/341-aws-layoffs-scaling-down-instead-of-scaling-out</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 341 of The Cloud Pod, where the forecast is always cloudy! Matt &amp; Ryan are picking up Justin’s slack this week while he’s traveling for work, but don’t worry, because they have plenty of news! We’re talking about those mass layoffs over at AWS, a major security breach over at Notepad++, and some new slight of hand over at Elon’s companies. There’s a lot to cover, so let’s get into it! </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> Finally, a Chatbot That Actually Knows Where Your Data Lives **Anthropic</li>
<li> Microsoft Adds Security Analyzer to MSSQL Extension: Because Bobby Tables Jokes Are Only Funny Until They Happen to You</li>
<li> From Sequential Sadness to Parallel Paradise: GKE Node Pools Get Concurrent</li>
<li> From Vibe Coding to Production: AWS MCP Server Gets SOPs</li>
<li> One Prompt to Deploy Them All: AWS MCP Server Automates Infrastructure</li>
<li> AWS Layoffs: Scaling Down Instead of Scaling Out</li>
<li> Mutual TLS: Because CloudFront and Your Origin Need Couples Therapy</li>
<li> Claude Team Plan: Now With More Seats and Less Bills</li>
<li> From Snowflake to Snowball: Rolling Data and Dev Into One Platform</li>
<li> From Notepad++ to Notepad Pwned: A Six-Month Hosting Horror Story</li>
<li> EventBridge Payload Capacity Gets a 4x Upgrade: No More Event Splitting Headaches</li>
<li> CloudFront Finally Learns to Check ID Before Knocking on Origin’s Door</li>
</ul>
<h2>General News </h2>
<p>01:30 <a href="https://arstechnica.com/ai/2026/02/spacex-acquires-xai-plans-1-million-satellite-constellation-to-power-it/">SpaceX acquires xAI, plans to launch a massive satellite constellation to </a><a href="https://arstechnica.com/ai/2026/02/spacex-acquires-xai-plans-1-million-satellite-constellation-to-power-it/">power it – Ars Technica</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.spacex.com/">SpaceX</a> <a href="https://www.spacex.com/updates#xai-joins-spacex">has acquired xAI</a> to create a vertically integrated AI and space infrastructure company, with plans to deploy up to 1 million satellites as orbital data centers. </li>
<li style="font-weight:400;">This represents a significant bet that space-based compute infrastructure can be cost-competitive with traditional ground-based data centers for AI workloads.</li>
<li style="font-weight:400;">The merger combines SpaceX’s launch capabilities and satellite manufacturing expertise with xAI’s <a href="https://grok.com/">Grok chatbot</a> and <a href="https://x.com/x">X social platform</a>. </li>
<li style="font-weight:400;">The strategy assumes AI demand will continue to grow and that compute capacity, rather than other factors, is the primary bottleneck to AI adoption.</li>
<li style="font-weight:400;">The orbital data center concept raises questions about latency, power requirements, thermal management, and maintenance compared to terrestrial facilities. </li>
<li style="font-weight:400;">Traditional cloud providers have invested heavily in ground-based infrastructure optimized for these factors.</li>
<li style="font-weight:400;">This consolidation of Musk’s companies creates potential conflicts between SpaceX’s established government and commercial contracts and xAI’s more controversial products. </li>
<li style="font-weight:400;">The integration of a proven aerospace company with a newer AI venture introduces execution risk to SpaceX’s core business.</li>
<li style="font-weight:400;">The plan depends on several unproven assumptions, including sustained AI market growth, viable economics for space-based computing, and the ability to manufacture and launch satellites at unprecedented scale. </li>
<li style="font-weight:400;">Cloud providers and enterprises will need to evaluate whether orbital compute offers advantages over existing multi-region terrestrial deployments.</li>
</ul>
<p>03:22 Ryan – “I feel like this is a shell game con; taxes are over here – no, now they’re over here!” </p>
<p>06:49 <a href="..."></a></p>
<h3>Chapters</h3>
<ul><li>(00:00:00) - The Cloud Podcast</li><li>(00:01:40) - SpaceX to Deploy 1 Million Satellites as Data Centers for</li><li>(00:06:50) - Notepad Hacked by State Sponsored Hackers</li><li>(00:14:52) - Amazon Layoffs: What They Mean for Product Development</li><li>(00:18:34) - Google's Genie 3 AI World Model Available for Ultra Users</li><li>(00:23:15) - OpenAI to Retire Older ChatGPT Models</li><li>(00:27:06) - OpenAI Launches Codex on a Mac OS X App</li><li>(00:33:46) - AWS: Automatically Promote Code to Production with AI Agents</li><li>(00:38:59) -  AWS STS: Validation of Provider Specific Claims (OID</li><li>(00:44:10) - Amazon Cloudfront Announces Mutual TLS Authentication with Origin</li><li>(00:50:30) - Amazon EventBridge: Increased 1 megabyte payload size for Machine Learning</li><li>(00:56:31) - Google Cloud BigQuery: Conversational Analytics in 2020</li><li>(00:57:59) - Google Cloud Launches Single Tenant Cloud HSM</li><li>(01:02:53) - How to manage 15,000 keys on a single HSM with</li><li>(01:05:47) - Microsoft Launches DLSV7, DSV7 and ESV</li><li>(01:11:31) - This Week in the Cloud: The Cloud: AI & More</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 341 of The Cloud Pod, where the forecast is always cloudy! Matt & Ryan are picking up Justin’s slack this week while he’s traveling for work, but don’t worry, because they have plenty of news! We’re talking about those mass layoffs over at AWS, a major security breach over at Notepad++, and some new slight of hand over at Elon’s companies. There’s a lot to cover, so let’s get into it! 
Titles we almost went with this week

 Finally, a Chatbot That Actually Knows Where Your Data Lives **Anthropic
 Microsoft Adds Security Analyzer to MSSQL Extension: Because Bobby Tables Jokes Are Only Funny Until They Happen to You
 From Sequential Sadness to Parallel Paradise: GKE Node Pools Get Concurrent
 From Vibe Coding to Production: AWS MCP Server Gets SOPs
 One Prompt to Deploy Them All: AWS MCP Server Automates Infrastructure
 AWS Layoffs: Scaling Down Instead of Scaling Out
 Mutual TLS: Because CloudFront and Your Origin Need Couples Therapy
 Claude Team Plan: Now With More Seats and Less Bills
 From Snowflake to Snowball: Rolling Data and Dev Into One Platform
 From Notepad++ to Notepad Pwned: A Six-Month Hosting Horror Story
 EventBridge Payload Capacity Gets a 4x Upgrade: No More Event Splitting Headaches
 CloudFront Finally Learns to Check ID Before Knocking on Origin’s Door

General News 
01:30 SpaceX acquires xAI, plans to launch a massive satellite constellation to power it – Ars Technica

SpaceX has acquired xAI to create a vertically integrated AI and space infrastructure company, with plans to deploy up to 1 million satellites as orbital data centers. 
This represents a significant bet that space-based compute infrastructure can be cost-competitive with traditional ground-based data centers for AI workloads.
The merger combines SpaceX’s launch capabilities and satellite manufacturing expertise with xAI’s Grok chatbot and X social platform. 
The strategy assumes AI demand will continue to grow and that compute capacity, rather than other factors, is the primary bottleneck to AI adoption.
The orbital data center concept raises questions about latency, power requirements, thermal management, and maintenance compared to terrestrial facilities. 
Traditional cloud providers have invested heavily in ground-based infrastructure optimized for these factors.
This consolidation of Musk’s companies creates potential conflicts between SpaceX’s established government and commercial contracts and xAI’s more controversial products. 
The integration of a proven aerospace company with a newer AI venture introduces execution risk to SpaceX’s core business.
The plan depends on several unproven assumptions, including sustained AI market growth, viable economics for space-based computing, and the ability to manufacture and launch satellites at unprecedented scale. 
Cloud providers and enterprises will need to evaluate whether orbital compute offers advantages over existing multi-region terrestrial deployments.

03:22 Ryan – “I feel like this is a shell game con; taxes are over here – no, now they’re over here!” 
06:49 ]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[341: AWS Layoffs: Scaling Down Instead of Scaling Out]]>
                </itunes:title>
                                    <itunes:episode>341</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 341 of The Cloud Pod, where the forecast is always cloudy! Matt &amp; Ryan are picking up Justin’s slack this week while he’s traveling for work, but don’t worry, because they have plenty of news! We’re talking about those mass layoffs over at AWS, a major security breach over at Notepad++, and some new slight of hand over at Elon’s companies. There’s a lot to cover, so let’s get into it! </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> Finally, a Chatbot That Actually Knows Where Your Data Lives **Anthropic</li>
<li> Microsoft Adds Security Analyzer to MSSQL Extension: Because Bobby Tables Jokes Are Only Funny Until They Happen to You</li>
<li> From Sequential Sadness to Parallel Paradise: GKE Node Pools Get Concurrent</li>
<li> From Vibe Coding to Production: AWS MCP Server Gets SOPs</li>
<li> One Prompt to Deploy Them All: AWS MCP Server Automates Infrastructure</li>
<li> AWS Layoffs: Scaling Down Instead of Scaling Out</li>
<li> Mutual TLS: Because CloudFront and Your Origin Need Couples Therapy</li>
<li> Claude Team Plan: Now With More Seats and Less Bills</li>
<li> From Snowflake to Snowball: Rolling Data and Dev Into One Platform</li>
<li> From Notepad++ to Notepad Pwned: A Six-Month Hosting Horror Story</li>
<li> EventBridge Payload Capacity Gets a 4x Upgrade: No More Event Splitting Headaches</li>
<li> CloudFront Finally Learns to Check ID Before Knocking on Origin’s Door</li>
</ul>
<h2>General News </h2>
<p>01:30 <a href="https://arstechnica.com/ai/2026/02/spacex-acquires-xai-plans-1-million-satellite-constellation-to-power-it/">SpaceX acquires xAI, plans to launch a massive satellite constellation to </a><a href="https://arstechnica.com/ai/2026/02/spacex-acquires-xai-plans-1-million-satellite-constellation-to-power-it/">power it – Ars Technica</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.spacex.com/">SpaceX</a> <a href="https://www.spacex.com/updates#xai-joins-spacex">has acquired xAI</a> to create a vertically integrated AI and space infrastructure company, with plans to deploy up to 1 million satellites as orbital data centers. </li>
<li style="font-weight:400;">This represents a significant bet that space-based compute infrastructure can be cost-competitive with traditional ground-based data centers for AI workloads.</li>
<li style="font-weight:400;">The merger combines SpaceX’s launch capabilities and satellite manufacturing expertise with xAI’s <a href="https://grok.com/">Grok chatbot</a> and <a href="https://x.com/x">X social platform</a>. </li>
<li style="font-weight:400;">The strategy assumes AI demand will continue to grow and that compute capacity, rather than other factors, is the primary bottleneck to AI adoption.</li>
<li style="font-weight:400;">The orbital data center concept raises questions about latency, power requirements, thermal management, and maintenance compared to terrestrial facilities. </li>
<li style="font-weight:400;">Traditional cloud providers have invested heavily in ground-based infrastructure optimized for these factors.</li>
<li style="font-weight:400;">This consolidation of Musk’s companies creates potential conflicts between SpaceX’s established government and commercial contracts and xAI’s more controversial products. </li>
<li style="font-weight:400;">The integration of a proven aerospace company with a newer AI venture introduces execution risk to SpaceX’s core business.</li>
<li style="font-weight:400;">The plan depends on several unproven assumptions, including sustained AI market growth, viable economics for space-based computing, and the ability to manufacture and launch satellites at unprecedented scale. </li>
<li style="font-weight:400;">Cloud providers and enterprises will need to evaluate whether orbital compute offers advantages over existing multi-region terrestrial deployments.</li>
</ul>
<p>03:22 Ryan – “I feel like this is a shell game con; taxes are over here – no, now they’re over here!” </p>
<p>06:49 <a href="https://notepad-plus-plus.org/news/hijacked-incident-info-update/">Notepad++ Hijacked by State-Sponsored Hackers | Notepad++</a></p>
<ul>
<li style="font-weight:400;">Chinese state-sponsored hackers compromised <a href="https://notepad-plus-plus.org/downloads/">Notepad++</a> update infrastructure from June through December 2025 by exploiting vulnerabilities at the shared hosting provider level, not in Notepad++ code itself. </li>
<li style="font-weight:400;">The attackers maintained access to internal service credentials even after losing server access in September, allowing them to selectively redirect update traffic to malicious servers until December 2025.</li>
<li style="font-weight:400;">The attack exploited insufficient update verification controls in older Notepad++ versions, with <a href="https://notepad-plus-plus.org/news/clarification-security-incident">attackers specifically targeting</a> the update manifest endpoint to serve compromised installers to selected users. </li>
<li style="font-weight:400;">Version 8.8.9 added certificate and signature verification for downloaded installers, while the upcoming version 8.9.2 will enforce XMLDSig signature verification on update server responses.</li>
<li style="font-weight:400;">The hosting provider confirmed the compromise was limited to one shared hosting server and found no evidence of other clients being targeted, though the investigation of 400GB of logs yielded no concrete indicators of compromise like binary hashes or IP addresses. Rapid7 and Kaspersky later published a more detailed technical analysis with actual IoCs.</li>
<li style="font-weight:400;">This incident demonstrates supply chain attack risks even for open source software with millions of users, particularly when update infrastructure relies on shared hosting environments. </li>
<li style="font-weight:400;">The Notepad++ project has since migrated to a new hosting provider with stronger security practices and implemented multiple layers of cryptographic verification.</li>
</ul>
<p>09:24 Matt – “Getting in at this level – and that maintenance of control for 7 months – is crazy. It’s a pretty big attack.” </p>
<p>15:25 <a href="https://www.businessinsider.com/internal-messages-teams-jobs-affected-amazon-layoffs-2026-1?utm_source=copy-link&amp;utm_medium=referral&amp;utm_content=topbar">Internal Messages Reveal Teams, Jobs Affected in Amazon Layoffs – </a><a href="https://www.businessinsider.com/internal-messages-teams-jobs-affected-amazon-layoffs-2026-1?utm_source=copy-link&amp;utm_medium=referral&amp;utm_content=topbar">Business Insider</a></p>
<ul>
<li style="font-weight:400;">Amazon is <a href="https://www.businessinsider.com/amazon-new-layoffs-restructuring-continues-cultural-reset-andy-jassy-2026-1">cutting 16,000 corporate roles</a> in its second major layoff round within four months, affecting multiple AWS service teams, including Bedrock AI, Redshift data warehouse, and ProServe consulting divisions. 
<ul>
<li style="font-weight:400;">The cuts represent a significant restructuring of Amazon’s corporate workforce of approximately 350,000 employees.</li>
</ul>
</li>
<li style="font-weight:400;">AWS engineering teams appear heavily impacted based on internal Slack messages, with software engineers from core cloud services posting job searches. </li>
<li style="font-weight:400;">This raises questions about AWS’s product development velocity and customer support capacity during a period of intense AI competition with Microsoft Azure and Google Cloud.</li>
<li style="font-weight:400;">Affected US employees receive 90 days for internal job searches with severance and benefits for those unable to find new positions. </li>
<li style="font-weight:400;">The timing follows Amazon’s return-to-office mandate and broader tech industry cost-cutting trends.</li>
<li style="font-weight:400;">The layoffs touch customer-facing teams like Prime subscription services and last-mile delivery alongside cloud infrastructure groups. This dual impact on retail and AWS operations suggests company-wide efficiency initiatives rather than targeted underperformance in specific business units.</li>
</ul>
<p>17:24  Matt – “It really did affect a broad spectrum of the org.” </p>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>19:10 <a href="https://blog.google/innovation-and-ai/models-and-research/google-deepmind/project-genie/">Project Genie: AI world model now available for Ultra users in U.S.</a></p>
<ul>
<li style="font-weight:400;">Google DeepMind launches <a href="http://labs.google/projectgenie/">Project Genie</a>, an experimental web app now available to <a href="https://one.google.com/about/google-ai-plans/">Google AI Ultra subscribers</a> in the U.S. (18+), powered by the Genie 3 world model that generates interactive 3D environments in real-time based on text prompts and images. 
<ul>
<li style="font-weight:400;">Unlike static 3D snapshots, <a href="https://deepmind.google/blog/genie-3-a-new-frontier-for-world-models/">Genie 3</a> simulates physics and interactions dynamically as users navigate, creating expanding worlds on the fly.</li>
</ul>
</li>
<li style="font-weight:400;">The platform offers three core capabilities: World Sketching (using <a href="https://deepmind.google/models/gemini-image/pro/">Nano Banana Pro</a> for image preview and fine-tuning before entering), World Exploration (real-time path generation based on user actions with adjustable camera controls), and World Remixing (building on existing worlds from galleries). 
<ul>
<li style="font-weight:400;">Users can define character perspectives (first-person or third-person) and movement types (walking, flying, driving).</li>
</ul>
</li>
<li style="font-weight:400;">Current limitations include 60-second generation caps, occasional physics inconsistencies, character control issues with higher latency, and generated worlds that may not always match prompts precisely. </li>
<li style="font-weight:400;">Some Genie 3 capabilities announced in August, like promptable events that modify worlds during exploration, are not yet included in this prototype.</li>
<li style="font-weight:400;">This release represents Google’s approach to building general-purpose AI systems that can navigate diverse real-world scenarios, moving beyond domain-specific agents like AlphaGo. </li>
<li style="font-weight:400;">The technology has potential applications in robotics simulation, animation modeling, location exploration, and historical setting recreation, though it remains an early research prototype in Google Labs.</li>
</ul>
<p>24:07 <a href="https://openai.com/index/retiring-gpt-4o-and-older-models">Retiring GPT-4o, GPT-4.1, GPT-4.1 mini, and OpenAI o4-mini in ChatGPT | </a><a href="https://openai.com/index/retiring-gpt-4o-and-older-models">OpenAI</a></p>
<ul>
<li style="font-weight:400;">OpenAI will <a href="https://openai.com/index/gpt-5-1/">retire</a> GPT-4o, GPT-4.1, GPT-4.1 mini, and o4-mini from ChatGPT on February 13, 2026, though API access remains unchanged. </li>
<li style="font-weight:400;">Only 0.1% of users still select GPT-4o daily, with most usage shifted to GPT-5.2.</li>
<li style="font-weight:400;">GPT-4o was previously deprecated, then restored after user feedback about creative ideation needs and preference for its conversational warmth. </li>
<li style="font-weight:400;">This feedback directly influenced GPT-5.1 and GPT-5.2 development, which now includes <a href="https://help.openai.com/en/articles/11899719-customizing-your-chatgpt-personality">customizable personality controls</a> for warmth, enthusiasm, and conversational styles like Friendly.</li>
<li style="font-weight:400;">OpenAI is addressing user complaints about unnecessary refusals and overly cautious responses in newer models. The company is developing an adult-focused version of ChatGPT for users over 18 with expanded freedom within appropriate safeguards, supported by <a href="https://help.openai.com/en/articles/12652064-age-prediction-in-chatgpt">age prediction</a> rollout in most markets.</li>
<li style="font-weight:400;">The model retirement strategy allows OpenAI to concentrate resources on improving models with active user bases rather than maintaining legacy versions. </li>
<li style="font-weight:400;">This follows a pattern of deprecating older models as newer versions incorporate user-requested features and achieve broader adoption.</li>
</ul>
<p>25:43  Matt – “Deprecation of things is one of the hardest things; we joked a lot last year when AWS finally deprecated things, but it’s hard. People have it built in and hard-coded into their apps and workflows. They’re used to specific types of responses.” </p>
<p>28:15 <a href="https://openai.com/index/introducing-the-codex-app/">Introducing the Codex app | OpenAI</a> </p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/">OpenAI</a> launches the <a href="https://openai.com/codex">Codex</a> desktop app for macOS, a command center interface for managing multiple AI coding agents simultaneously across long-running development tasks. </li>
<li style="font-weight:400;">The app includes native support for parallel agent workflows using git worktrees, allowing multiple agents to work on isolated copies of the same repository without conflicts while maintaining separate thread contexts per project.</li>
<li style="font-weight:400;">Codex now extends beyond code generation through a Skills system that bundles instructions, resources, and scripts for tasks like <a href="https://www.figma.com/">Figma</a> design implementation, Linear project management, and cloud deployment to <a href="https://www.cloudflare.com/">Cloudflare</a>, <a href="https://www.netlify.com/">Netlify</a>, <a href="https://render.com/">Render</a>, and <a href="https://vercel.com/">Vercel</a>. </li>
<li style="font-weight:400;">OpenAI demonstrated this by having Codex autonomously build a complete racing game using 7 million tokens from a single prompt, with the agent taking on designer, developer, and QA tester roles.</li>
<li style="font-weight:400;">The app introduces Automations for scheduled background tasks like daily issue triage, CI failure analysis, and release briefs, with results landing in a review queue for developer oversight. All agents run in configurable system-level sandboxes by default, restricted to editing files in their working folder and requiring permission for elevated operations like network access.</li>
<li style="font-weight:400;">For a limited time, OpenAI is including Codex access with ChatGPT Free and Go tiers and doubling rate limits across all paid plans (Plus, Pro, Business, Enterprise, Edu). </li>
<li style="font-weight:400;">Usage has doubled since <a href="https://openai.com/index/introducing-gpt-5-2-codex/">GPT-5.2-Codex</a> launched in mid-December, with over one million developers now using the service, and Windows support is planned for future releases.</li>
</ul>
<p>29:52  Ryan – “They’ve got a lot of catching up to do. Claude Code is all I hear about…it’s everywhere. I do hear about Gemini Code, mostly because I live in that ecosystem. I haven’t had a chance to play with it and compare it to the other tools.” </p>
<h2>AWS </h2>
<p>35:20 <a href="https://aws.amazon.com/about-aws/whats-new/2025/01/aws-announces-deployment-agent-sops-in-aws-mcp-server-preview">AWS announces Deployment Agent SOPs in AWS MCP Server</a></p>
<ul>
<li style="font-weight:400;">AWS introduces Deployment Agent SOPs in the <a href="https://docs.aws.amazon.com/aws-mcp/latest/userguide/what-is-mcp-server.html">AWS MCP Server</a> in preview, enabling developers to deploy web applications to production using natural language prompts through MCP-compatible tools like <a href="https://claude.ai/login">Claude</a>, <a href="https://cursor.com/">Cursor</a>, and <a href="https://kiro.dev/">Kiro</a>. </li>
<li style="font-weight:400;">The system automatically generates CDK infrastructure, deploys CloudFormation stacks, and sets up CI/CD pipelines with AWS security best practices included.</li>
<li style="font-weight:400;">The feature addresses the gap between AI-assisted prototyping and production deployment by allowing developers to move from vibe-coded applications to production environments in a single prompt. This is fine. Just fine. </li>
<li style="font-weight:400;">Agent SOPs follow multi-step procedures to analyze project structure, create preview environments on S3 and CloudFront, and configure CodePipeline for automated deployments from source repositories.</li>
<li style="font-weight:400;">Support includes popular web frameworks like <a href="https://react.dev/">React</a>, <a href="http://vue.js">Vue.js</a>, <a href="https://angular.dev/">Angular</a>, and <a href="http://next.js">Next.js</a>, with automatic documentation generation that enables AI agents to handle future deployments and troubleshooting across sessions. The deployment process creates persistent documentation in the repository for continuity.</li>
<li style="font-weight:400;">Currently available in preview at no additional cost in US East N. Virginia region only, with customers paying standard rates for AWS resources created and applicable data transfer costs. </li>
<li style="font-weight:400;">This represents AWS’s integration of AI agents into the deployment workflow, competing with other infrastructure-as-code and deployment automation tools.</li>
</ul>
<p>36:58  Ryan – “I like and hate this all at the same time.” </p>
<p>40:54 <a href="https://aws.amazon.com/about-aws/whats-new/2026/01/aws-sts-supports-validation-identity-provider-claims">AWS STS now supports validation of select identity provider-specific </a><a href="https://aws.amazon.com/about-aws/whats-new/2026/01/aws-sts-supports-validation-identity-provider-claims">claims from Google, GitHub, CircleCI and OCI</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/STS/latest/APIReference/welcome.html">AWS STS</a> now validates provider-specific claims from Google, GitHub, <a href="https://circleci.com/">CircleCI</a>, and Oracle Cloud Infrastructure when federating into AWS via <a href="https://docs.aws.amazon.com/IAM/latest/UserGuide/reference_policies_iam-condition-keys.html#condition-keys-wif">OIDC</a>. </li>
<li style="font-weight:400;">This allows customers to reference custom claims as condition keys in IAM role trust policies and resource control policies, enabling more granular access control for federated identities beyond the standard OIDC claims.</li>
<li style="font-weight:400;">The feature addresses a common security gap where organizations previously could only validate standard OIDC claims like subject and audience, but couldn’t enforce conditions based on provider-specific attributes like GitHub repository names or <a href="https://workspace.google.com/">Google Workspace</a> domains. 
<ul>
<li style="font-weight:400;">This enhancement helps establish data perimeters by allowing customers to restrict access based on the specific context of the federated identity.</li>
</ul>
</li>
<li style="font-weight:400;">Available now in all AWS Commercial Regions at no additional cost beyond standard STS API usage. </li>
<li style="font-weight:400;">Organizations using OIDC federation for CI/CD pipelines, developer access, or multi-cloud identity management can immediately implement more restrictive trust policies without changing their authentication flows.</li>
<li style="font-weight:400;">The supported claims vary by provider and include attributes like GitHub repository visibility, CircleCI project IDs, and OCI tenancy information. Full documentation of available condition keys is provided in the IAM User Guide under Available Keys for OIDC federation.</li>
</ul>
<p>17:00  Matt – “This is a fantastic feature that I was convinced was a brand new announcement, until Matt schooled me and said, ‘I’ve been doing this for months, ‘ because I didn’t know you could do this with STS.” </p>
<p>46:33 <a href="https://aws.amazon.com/about-aws/whats-new/2026/01/amazon-cloudfront-mutual-tls-for-origins/">Amazon CloudFront announces mutual TLS support for origins</a></p>
<ul>
<li style="font-weight:400;">CloudFront now supports mutual TLS authentication for origins, allowing customers to verify that requests to their backend servers come only from authorized CloudFront distributions using certificate-based authentication. </li>
<li style="font-weight:400;">This eliminates the operational overhead of managing custom solutions like shared secret headers or IP allow-lists that previously required constant rotation and maintenance.</li>
<li style="font-weight:400;">The feature works with <a href="https://docs.aws.amazon.com/privateca/latest/userguide/PcaWelcome.html">AWS Private Certificate Authority</a> or third-party private CAs imported through <a href="https://aws.amazon.com/certificate-manager/">AWS Certificate Manager</a>, providing cryptographic verification of CloudFront’s identity to any origin that supports mTLS, including Application Load Balancers, API Gateway, on-premises servers, and third-party cloud providers. There is no additional charge for using origin mTLS beyond standard CloudFront pricing.</li>
<li style="font-weight:400;">This addresses a common security gap for organizations serving proprietary content through CloudFront, particularly when origins are publicly accessible or hosted externally. </li>
<li style="font-weight:400;">Previously, customers had to build custom authentication layers to ensure only their CloudFront distributions could access backend infrastructure, creating an ongoing operational burden.</li>
<li style="font-weight:400;">Configuration is available through the AWS Management Console, CLI, SDK, CDK, or CloudFormation, making it straightforward to implement across existing CloudFront distributions. The feature is also included in CloudFront’s Business and Premium flat-rate pricing plans at no extra cost.</li>
</ul>
<p>49:33 <a href="https://aws.amazon.com/about-aws/whats-new/2026/02/console-displays-account-name-on-nav-bar/">AWS Management Console now displays Account Name on the Navigation </a><a href="https://aws.amazon.com/about-aws/whats-new/2026/02/console-displays-account-name-on-nav-bar/">bar for easier account identification</a></p>
<ul>
<li style="font-weight:400;">The <a href="https://console.aws.com/?refid=d975e5d3-958d-49cb-8a4c-ee7a970d7a3b">AWS Management Console</a> now displays account names in the navigation bar, replacing the previous reliance on account numbers for identification. 
<ul>
<li style="font-weight:400;">This addresses a common pain point for organizations managing multiple AWS accounts across development, production, and different business units.</li>
</ul>
</li>
<li style="font-weight:400;">The feature is available at no additional cost across all public AWS regions and requires administrator enablement through IAM managed policies. </li>
<li style="font-weight:400;">Once enabled, all authorized users in an account will see the account name displayed in the console navigation bar.</li>
<li style="font-weight:400;">This update provides immediate value for teams working across multiple accounts who previously had to memorize or reference 12-digit account numbers. </li>
<li style="font-weight:400;">The visual distinction helps reduce errors when switching between environments like dev and prod.</li>
<li style="font-weight:400;">The implementation follows AWS best practices for multi-account architectures, making it easier to maintain account separation while improving operational efficiency. Organizations using AWS Organizations or Control Tower will particularly benefit from clearer account identification.</li>
</ul>
<p>51:21  Matt – “Not the sexiest feature, but for the love of God the most USEFUL feature of this podcast.” </p>
<p>53:22 <a href="https://aws.amazon.com/about-aws/whats-new/2026/01/amazon-eventbridge-increases-event-payload-size-256-kb-1-mb/">Announcing increased 1 MB payload size support in Amazon EventBridge</a> </p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/eventbridge/?refid=d975e5d3-958d-49cb-8a4c-ee7a970d7a3b">EventBridge</a> now supports 1 MB event payloads, up from the previous 256 KB limit, eliminating the need for developers to split large events, compress data, or store payloads externally in S3. </li>
<li style="font-weight:400;">This simplifies architectures for applications handling LLM prompts, telemetry data, and complex JSON structures from machine learning models.</li>
<li style="font-weight:400;">The increased payload size reduces architectural complexity and operational overhead by allowing comprehensive contextual data to be included in a single event rather than requiring chunking logic or coordination with external storage systems. 
<ul>
<li style="font-weight:400;">This is particularly relevant for AI/ML workloads where model outputs and prompts can exceed the previous size constraints.</li>
</ul>
</li>
<li style="font-weight:400;">The feature is available now in most commercial AWS regions where EventBridge operates, with notable exceptions including Asia Pacific regions like New Zealand, Thailand, Malaysia, and Taipei, plus Mexico Central. No additional cost is mentioned for the larger payload support beyond standard EventBridge pricing.</li>
<li style="font-weight:400;">This change addresses a common pain point in event-driven architectures where developers previously had to implement workarounds for large payloads, adding code complexity and potential failure points. </li>
<li style="font-weight:400;">The 4x increase in payload size aligns EventBridge more closely with modern application needs around AI and real-time data processing.</li>
</ul>
<p>54:44  Ryan – “I think this is a good thing. I was lauhging at this because I remember event size in Kinesis being a big to-do and a project forever ago, and trying to think through all the limits…but now I was thinking through the AI workloads and how much of a pain it would be to have your prompts referencing and external source everytime…so glad to see this.” </p>
<p>56:55 <a href="https://aws.amazon.com/about-aws/whats-new/2026/01/aws-network-firewall-web-category-based-filtering/">AWS Network Firewall now supports GenAI traffic visibility and </a><a href="https://aws.amazon.com/about-aws/whats-new/2026/01/aws-network-firewall-web-category-based-filtering/">enforcement with Web category-based filtering</a></p>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;">AWS Network Firewall adds URL category-based filtering that lets you control access to GenAI applications, social media, streaming services, and other web categories using pre-defined categories instead of maintaining manual domain lists. </li>
<li style="font-weight:400;">This reduces operational overhead for security teams who need to enforce consistent policies across AWS environments while gaining visibility into emerging technology usage.</li>
<li style="font-weight:400;">The GenAI traffic visibility component addresses a growing compliance need as organizations struggle to track and govern employee access to ChatGPT, Claude, Gemini, and other AI services. </li>
<li style="font-weight:400;">Security teams (booo) can now restrict GenAI usage to approved corporate tools or block access entirely based on their risk tolerance and regulatory requirements.</li>
<li style="font-weight:400;">When combined with TLS inspection, the feature enables full URL path inspection for granular control beyond just domain-level blocking. </li>
<li style="font-weight:400;">This matters for scenarios where you need to allow access to a domain but block specific paths or query parameters that might expose sensitive data.</li>
<li style="font-weight:400;">The feature is available now in all AWS commercial regions where Network Firewall operates, with no additional base cost beyond standard Network Firewall pricing, which starts at 0.395 dollars per firewall endpoint hour plus 0.065 dollars per GB processed. </li>
<li style="font-weight:400;">You can implement this through stateful rule groups using the AWS Console, CLI, or SDKs without requiring new infrastructure deployment.</li>
</ul>
</li>
<li>Did we talk about this one last week? It feels like we talked about this one already. Guess it’s time to build another bot. </li>
</ul>
<h2>GCP</h2>
<p>59:49 <a href="https://cloud.google.com/blog/products/data-analytics/introducing-conversational-analytics-in-bigquery/">Conversational Analytics in BigQuery is in preview </a></p>
<ul>
<li style="font-weight:400;">Google launches Conversational Analytics in <a href="https://cloud.google.com/bigquery">BigQuery</a> as a preview feature that lets users query data using natural language instead of SQL. </li>
<li style="font-weight:400;">The AI agent uses Gemini models to generate queries, execute them, and create visualizations while maintaining security controls and audit logging within BigQuery’s existing governance framework.</li>
<li style="font-weight:400;">The system goes beyond basic chatbots by grounding responses in actual BigQuery schemas, metadata, and custom business logic, including verified queries and User Defined Functions. </li>
<li style="font-weight:400;">This ensures generated SQL aligns with production metrics and enterprise standards rather than making generic assumptions about data structure.</li>
<li style="font-weight:400;">Users can perform predictive analytics through natural language by leveraging BigQuery AI functions like AI.FORECAST and AI.DETECT_ANOMALIES without writing code. </li>
<li style="font-weight:400;">The agent also supports querying unstructured data such as images stored in BigQuery object tables, expanding analysis beyond traditional row-column datasets.</li>
<li style="font-weight:400;">The agents can be <a href="https://cloud.google.com/blog/products/business-intelligence/looker-conversational-analytics-now-ga/">deployed across multiple surfaces, including Looker Studio</a> Pro, the BigQuery UI, custom applications via API, and existing agentic ecosystems through ADK tools. </li>
<li style="font-weight:400;"><a href="https://docs.cloud.google.com/bigquery/docs/conversational-analytics">Documentation</a> and codelabs are available at cloud.google.com for implementation guidance, though specific pricing details were not disclosed in the announcement.</li>
<li style="font-weight:400;">This addresses a common enterprise bottleneck where business users wait in queues for data teams to write queries, potentially reducing time-to-insight from hours or days to seconds for authorized users.</li>
</ul>
<p>1:01:11 Matt – “Anything that makes BigQuery easier to use.” </p>
<p>1:01:36 <a href="https://cloud.google.com/blog/products/identity-security/introducing-single-tenant-cloud-hsm-for-more-data-encryption-control/">Introducing Single-tenant Cloud HSM for more data encryption control </a></p>
<ul>
<li style="font-weight:400;">Google Cloud has launched <a href="https://docs.cloud.google.com/kms/docs/single-tenant-hsm">Single-tenant Cloud HSM</a>, a dedicated hardware security module service that gives organizations exclusive control over cryptographic keys with FIPS 140-2 Level 3 validation. </li>
<li style="font-weight:400;">Unlike multi-tenant solutions, customers get sole access to physical HSM partitions with hardware-enforced isolation, meaning their keys are cryptographically separated from other customers and Google operators. The service is generally available now in the US and EU, with “<a href="https://cloud.google.com/kms/pricing#stch_pricing">competitive</a>” pricing <a href="https://cloud.google.com/kms/pricing#stch_pricing">https://cloud.google.com/kms/pricing#stch_pricing</a> ($3500/month). </li>
<li style="font-weight:400;">The service targets highly-regulated industries like financial services, defense, healthcare, and government that need strict compliance controls but want to avoid managing physical hardware. </li>
<li style="font-weight:400;">Key security features include full ownership of root keys, quorum-based administration requiring multiple authorized users for sensitive operations, and the ability to revoke Google’s access at any time, which immediately makes all keys and encrypted data inaccessible until authorization is restored.</li>
<li style="font-weight:400;">Single-tenant Cloud HSM integrates directly with existing Cloud KMS APIs and works with Customer-Managed Encryption Keys (CMEK) across Google Cloud services. Setup takes approximately 15 minutes using standard gcloud commands, and the service automatically scales to handle peak traffic loads while maintaining high availability across multiple zones. </li>
<li style="font-weight:400;">The service has already obtained compliance certifications, including <a href="https://cloud.google.com/security/compliance/fedramp?hl=en">FedRAMP</a>, <a href="https://cloud.google.com/security/compliance/disa?hl=en#services-in-scope">DISA IL5</a>, <a href="https://cloud.google.com/security/compliance/itar?hl=en">ITAR</a>, <a href="https://cloud.google.com/security/compliance/soc-1?hl=en">SOC</a> 1/2/3, <a href="https://cloud.google.com/security/compliance/hipaa-compliance?hl=en">HIPAA</a>, and <a href="https://cloud.google.com/security/compliance/pci-dss?hl=en">PCI DSS</a>.</li>
<li style="font-weight:400;">Google manages all hardware provisioning, configuration, monitoring, and compliance, removing the operational burden of physical HSM management while maintaining the same redundancy and availability standards as multi-tenant Cloud HSM. </li>
<li style="font-weight:400;">Administrators can use hardware tokens like YubiKey or other key management systems to generate and manage their administrative credentials, with quorum requirements preventing any single individual from making unauthorized changes.</li>
</ul>
<p>1:06:21  Ryan – “And that’s why Google is announcing this. Someone had this checkbox – someone with deep enough pockets had this checkbox.” </p>
<h2>Azure</h2>
<p>44:40 <a href="https://azure.microsoft.com/en-us/updates?id=529407">Public Preview: 7th generation Intel-based VMs – Dlsv7/Dsv7/Esv7 </a></p>
<ul>
<li style="font-weight:400;">Azure launches Dlsv7, Dsv7, and Esv7 virtual machines in public preview, powered by <a href="https://www.intel.com/content/www/us/en/products/details/processors/xeon.htmlhttps://www.intel.com/content/www/us/en/products/details/processors/xeon.html">Intel Xeon 6 processors</a> codenamed Granite Rapids. </li>
<li style="font-weight:400;">These 7th-generation Intel-based VMs represent the latest iteration in Azure’s general-purpose and memory-optimized VM families, bringing newer processor architecture to cloud workloads.</li>
<li style="font-weight:400;">The new VM series targets customers running compute-intensive and memory-intensive workloads that can benefit from the latest Intel processor improvements. </li>
<li style="font-weight:400;">General-purpose Dlsv7 and Dsv7 VMs suit balanced workloads like web servers and application hosting, while Esv7 VMs are optimized for memory-heavy applications such as databases and in-memory analytics.</li>
<li style="font-weight:400;">Intel Xeon 6 processors introduce architectural improvements over previous generations, though specific performance metrics and pricing details are not provided in the announcement. </li>
<li style="font-weight:400;">Customers interested in testing these VMs should evaluate them during preview to determine if the newer processor generation delivers meaningful improvements for their specific workloads.</li>
<li style="font-weight:400;">The preview status means these VMs are available for testing but may not yet be suitable for production workloads, depending on service level agreements and regional availability. </li>
<li style="font-weight:400;">Organizations should check Azure documentation for supported regions and any preview limitations before deploying workloads on these new VM series.</li>
</ul>
<p>1:11:15  Matt – “The other reason I wanted to keep it in was, I’m still struggling to get the V6 in some regions. And granted, these are less common regions, you know, but I have a different skews based on region availability because I just can’t get it, and in some places it’s like, ‘we can do it in two zones.’ And I’m like, cool, thank you. Way to make yourself more money.”</p>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2361457/c1e-0424u7wnxrfo49ok-0v9v6p3qbdwk-gblfks.mp3" length="141357028"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 341 of The Cloud Pod, where the forecast is always cloudy! Matt & Ryan are picking up Justin’s slack this week while he’s traveling for work, but don’t worry, because they have plenty of news! We’re talking about those mass layoffs over at AWS, a major security breach over at Notepad++, and some new slight of hand over at Elon’s companies. There’s a lot to cover, so let’s get into it! 
Titles we almost went with this week

 Finally, a Chatbot That Actually Knows Where Your Data Lives **Anthropic
 Microsoft Adds Security Analyzer to MSSQL Extension: Because Bobby Tables Jokes Are Only Funny Until They Happen to You
 From Sequential Sadness to Parallel Paradise: GKE Node Pools Get Concurrent
 From Vibe Coding to Production: AWS MCP Server Gets SOPs
 One Prompt to Deploy Them All: AWS MCP Server Automates Infrastructure
 AWS Layoffs: Scaling Down Instead of Scaling Out
 Mutual TLS: Because CloudFront and Your Origin Need Couples Therapy
 Claude Team Plan: Now With More Seats and Less Bills
 From Snowflake to Snowball: Rolling Data and Dev Into One Platform
 From Notepad++ to Notepad Pwned: A Six-Month Hosting Horror Story
 EventBridge Payload Capacity Gets a 4x Upgrade: No More Event Splitting Headaches
 CloudFront Finally Learns to Check ID Before Knocking on Origin’s Door

General News 
01:30 SpaceX acquires xAI, plans to launch a massive satellite constellation to power it – Ars Technica

SpaceX has acquired xAI to create a vertically integrated AI and space infrastructure company, with plans to deploy up to 1 million satellites as orbital data centers. 
This represents a significant bet that space-based compute infrastructure can be cost-competitive with traditional ground-based data centers for AI workloads.
The merger combines SpaceX’s launch capabilities and satellite manufacturing expertise with xAI’s Grok chatbot and X social platform. 
The strategy assumes AI demand will continue to grow and that compute capacity, rather than other factors, is the primary bottleneck to AI adoption.
The orbital data center concept raises questions about latency, power requirements, thermal management, and maintenance compared to terrestrial facilities. 
Traditional cloud providers have invested heavily in ground-based infrastructure optimized for these factors.
This consolidation of Musk’s companies creates potential conflicts between SpaceX’s established government and commercial contracts and xAI’s more controversial products. 
The integration of a proven aerospace company with a newer AI venture introduces execution risk to SpaceX’s core business.
The plan depends on several unproven assumptions, including sustained AI market growth, viable economics for space-based computing, and the ability to manufacture and launch satellites at unprecedented scale. 
Cloud providers and enterprises will need to evaluate whether orbital compute offers advantages over existing multi-region terrestrial deployments.

03:22 Ryan – “I feel like this is a shell game con; taxes are over here – no, now they’re over here!” 
06:49 ]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2361457/c1a-k5d5-mkgkzjzpi2nr-nzcxsn.jpg"></itunes:image>
                                                                            <itunes:duration>01:13:29</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2361457/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[340: Azure releases a new SQL AI Assistant… Jimmy Droptables]]>
                </title>
                <pubDate>Sat, 07 Feb 2026 01:29:38 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2351545</guid>
                                    <link>https://tcpfm.castos.com/episodes/340-azure-releases-a-new-sql-ai-assistant-jimmy-droptables</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 340 of The Cloud Pod, where the forecast is always cloudy! It’s a full house (eventually) with Justin, Jonathan, Ryan, and Matt all on board for today’s episode. We’ve got a lot of announcements, from Gemini for Gov (no more CamoGPT!) to Route 52 and Claude. Let’s get started! </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> Claude’s Pricing Tiers: Free, Pro, and Maximum Overdrive</li>
<li> GitHub Copilot Learns Database Schema: Finally an AI That Understands Your Joins</li>
<li> SSMS Gets a Copilot: Your T-SQL Now Writes Itself While You Grab Coffee</li>
<li> Too Many Cooks in the Cloud Kitchen: How 32 GPUs Outcooked the Big Tech Industrial Kitchens</li>
<li> Uncle Sam Gets a Gemini Twin: Google’s AI Goes Federal</li>
<li> Route 53 Gets Domain of Its Own: .ai Joins the Party</li>
<li> Thai One On: Google Cloud Plants Its Flag in Bangkok</li>
<li> NAT So Fast: Azure’s Gateway Gets a V2 Glow-Up</li>
<li> Beware Azure’s SQL Assistant doesn’t smoke your joints.</li>
</ul>
<h2>AI Is Going Great, Or How ML Makes Money  </h2>
<p>30:10 <a href="https://www.databricks.com/blog/announcing-blackice-containerized-red-teaming-toolkit-ai-security-testing">Announcing BlackIce: A Containerized Red Teaming Toolkit for AI Security </a><a href="https://www.databricks.com/blog/announcing-blackice-containerized-red-teaming-toolkit-ai-security-testing">Testing | Databricks Blog</a></p>
<ul>
<li style="font-weight:400;">Databricks released BlackIce, an open-source containerized toolkit that bundles 14 AI security testing tools into a single Docker image available on <a href="https://hub.docker.com/r/databricksruntime/blackice">Docker Hub</a> as databricksruntime/blackice:17.3-LTS. </li>
<li style="font-weight:400;">The toolkit addresses common red teaming challenges, including conflicting dependencies, complex setup requirements, and the fragmented landscape of AI security tools, by providing a unified command-line interface similar to how <a href="https://www.kali.org/">Kali Linux</a> works for traditional penetration testing.</li>
<li style="font-weight:400;">The toolkit includes tools covering three main categories: Responsible AI, Security testing, and classical adversarial ML, with capabilities mapped to MITRE ATLAS and the Databricks AI Security Framework. </li>
<li style="font-weight:400;">Tools are organized as either static (simple CLI-based with minimal programming needed) or dynamic (Python-based with customization options), with static tools isolated in separate virtual environments and dynamic tools in a global environment with managed dependencies.</li>
<li style="font-weight:400;">BlackIce integrates directly with Databricks Model Serving endpoints through custom patches applied to several tools, allowing security teams to test for vulnerabilities like prompt injections, data leakage, hallucination detection, jailbreak attacks, and supply chain security issues. </li>
<li style="font-weight:400;">Users can deploy it via Databricks Container Services by specifying the Docker image URL when creating compute clusters.</li>
<li style="font-weight:400;">The release includes a demo notebook showing how to orchestrate multiple security tools in a single environment, with all build artifacts, tool documentation, and examples available in the GitHub repository. </li>
<li style="font-weight:400;">The <a href="https://arxiv.org/abs/2510.11823">CAMLIS Red Paper</a> provides additional technical details on tool selection criteria and the Docker image architecture.</li>
</ul>
<p>04:30 Ryan – “It’s very difficult to feel confident in your AI security practice or patterns. I feel like it’s just bleeding edge, and I’m learning so much all the time. And so I spend a lot of time reading papers and talking to others and seeing what they’re doing and meeting with vendors trying to figure out strategy, and it just feels like I’m drinking from a fire hose, and it’s really difficult to feel confident. So I like tools like t...</p>
<h3>Chapters</h3>
<ul><li>(00:00:07) - The Cloud Pod: Episode 340</li><li>(00:01:16) - Hello, How to Subscribe to our Podcast</li><li>(00:03:20) - Black Ice: A Single Toolkit for AI Security</li><li>(00:13:21) - OpenAI Launches Prism: a LaTeX workspace for scientific writing</li><li>(00:16:03) - Amazon EC2: New Graviton 4 Instances, and More</li><li>(00:21:54) - Amazon Workspaces: Advanced Printer Redirection</li><li>(00:25:50) - AWS Network Firewall Adds URL Category Based Filtering</li><li>(00:28:32) - The CEO's Executive Dinner</li><li>(00:29:21) - Gemini CLI Learning Course Launch</li><li>(00:32:43) - Google Cloud opens new Bangkok Region Asia Southeast 3</li><li>(00:36:08) - Apache Airflow 3.1 on Cloud Composer</li><li>(00:38:36) - Google's Gemini for Government Launches</li><li>(00:43:32) - BigQuery: Integrating AI into SQL queries</li><li>(00:45:46) - SQL Server Management Studio 2.22.1 New Features & Changes</li><li>(00:53:08) - Azure NAT Gateway: Standard V2 GAUNCH</li><li>(00:55:31) - Microsoft Announces Unified Socks & DORA Compliance Solutions in</li><li>(01:03:01) - IOM Deny Policies</li><li>(01:04:59) - Google's Gemini CLI for Outages & Compliance</li><li>(01:07:58) - Google's MCP for Docs</li><li>(01:10:49) - Super Bowl LII</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 340 of The Cloud Pod, where the forecast is always cloudy! It’s a full house (eventually) with Justin, Jonathan, Ryan, and Matt all on board for today’s episode. We’ve got a lot of announcements, from Gemini for Gov (no more CamoGPT!) to Route 52 and Claude. Let’s get started! 
Titles we almost went with this week

 Claude’s Pricing Tiers: Free, Pro, and Maximum Overdrive
 GitHub Copilot Learns Database Schema: Finally an AI That Understands Your Joins
 SSMS Gets a Copilot: Your T-SQL Now Writes Itself While You Grab Coffee
 Too Many Cooks in the Cloud Kitchen: How 32 GPUs Outcooked the Big Tech Industrial Kitchens
 Uncle Sam Gets a Gemini Twin: Google’s AI Goes Federal
 Route 53 Gets Domain of Its Own: .ai Joins the Party
 Thai One On: Google Cloud Plants Its Flag in Bangkok
 NAT So Fast: Azure’s Gateway Gets a V2 Glow-Up
 Beware Azure’s SQL Assistant doesn’t smoke your joints.

AI Is Going Great, Or How ML Makes Money  
30:10 Announcing BlackIce: A Containerized Red Teaming Toolkit for AI Security Testing | Databricks Blog

Databricks released BlackIce, an open-source containerized toolkit that bundles 14 AI security testing tools into a single Docker image available on Docker Hub as databricksruntime/blackice:17.3-LTS. 
The toolkit addresses common red teaming challenges, including conflicting dependencies, complex setup requirements, and the fragmented landscape of AI security tools, by providing a unified command-line interface similar to how Kali Linux works for traditional penetration testing.
The toolkit includes tools covering three main categories: Responsible AI, Security testing, and classical adversarial ML, with capabilities mapped to MITRE ATLAS and the Databricks AI Security Framework. 
Tools are organized as either static (simple CLI-based with minimal programming needed) or dynamic (Python-based with customization options), with static tools isolated in separate virtual environments and dynamic tools in a global environment with managed dependencies.
BlackIce integrates directly with Databricks Model Serving endpoints through custom patches applied to several tools, allowing security teams to test for vulnerabilities like prompt injections, data leakage, hallucination detection, jailbreak attacks, and supply chain security issues. 
Users can deploy it via Databricks Container Services by specifying the Docker image URL when creating compute clusters.
The release includes a demo notebook showing how to orchestrate multiple security tools in a single environment, with all build artifacts, tool documentation, and examples available in the GitHub repository. 
The CAMLIS Red Paper provides additional technical details on tool selection criteria and the Docker image architecture.

04:30 Ryan – “It’s very difficult to feel confident in your AI security practice or patterns. I feel like it’s just bleeding edge, and I’m learning so much all the time. And so I spend a lot of time reading papers and talking to others and seeing what they’re doing and meeting with vendors trying to figure out strategy, and it just feels like I’m drinking from a fire hose, and it’s really difficult to feel confident. So I like tools like t...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[340: Azure releases a new SQL AI Assistant… Jimmy Droptables]]>
                </itunes:title>
                                    <itunes:episode>340</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 340 of The Cloud Pod, where the forecast is always cloudy! It’s a full house (eventually) with Justin, Jonathan, Ryan, and Matt all on board for today’s episode. We’ve got a lot of announcements, from Gemini for Gov (no more CamoGPT!) to Route 52 and Claude. Let’s get started! </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> Claude’s Pricing Tiers: Free, Pro, and Maximum Overdrive</li>
<li> GitHub Copilot Learns Database Schema: Finally an AI That Understands Your Joins</li>
<li> SSMS Gets a Copilot: Your T-SQL Now Writes Itself While You Grab Coffee</li>
<li> Too Many Cooks in the Cloud Kitchen: How 32 GPUs Outcooked the Big Tech Industrial Kitchens</li>
<li> Uncle Sam Gets a Gemini Twin: Google’s AI Goes Federal</li>
<li> Route 53 Gets Domain of Its Own: .ai Joins the Party</li>
<li> Thai One On: Google Cloud Plants Its Flag in Bangkok</li>
<li> NAT So Fast: Azure’s Gateway Gets a V2 Glow-Up</li>
<li> Beware Azure’s SQL Assistant doesn’t smoke your joints.</li>
</ul>
<h2>AI Is Going Great, Or How ML Makes Money  </h2>
<p>30:10 <a href="https://www.databricks.com/blog/announcing-blackice-containerized-red-teaming-toolkit-ai-security-testing">Announcing BlackIce: A Containerized Red Teaming Toolkit for AI Security </a><a href="https://www.databricks.com/blog/announcing-blackice-containerized-red-teaming-toolkit-ai-security-testing">Testing | Databricks Blog</a></p>
<ul>
<li style="font-weight:400;">Databricks released BlackIce, an open-source containerized toolkit that bundles 14 AI security testing tools into a single Docker image available on <a href="https://hub.docker.com/r/databricksruntime/blackice">Docker Hub</a> as databricksruntime/blackice:17.3-LTS. </li>
<li style="font-weight:400;">The toolkit addresses common red teaming challenges, including conflicting dependencies, complex setup requirements, and the fragmented landscape of AI security tools, by providing a unified command-line interface similar to how <a href="https://www.kali.org/">Kali Linux</a> works for traditional penetration testing.</li>
<li style="font-weight:400;">The toolkit includes tools covering three main categories: Responsible AI, Security testing, and classical adversarial ML, with capabilities mapped to MITRE ATLAS and the Databricks AI Security Framework. </li>
<li style="font-weight:400;">Tools are organized as either static (simple CLI-based with minimal programming needed) or dynamic (Python-based with customization options), with static tools isolated in separate virtual environments and dynamic tools in a global environment with managed dependencies.</li>
<li style="font-weight:400;">BlackIce integrates directly with Databricks Model Serving endpoints through custom patches applied to several tools, allowing security teams to test for vulnerabilities like prompt injections, data leakage, hallucination detection, jailbreak attacks, and supply chain security issues. </li>
<li style="font-weight:400;">Users can deploy it via Databricks Container Services by specifying the Docker image URL when creating compute clusters.</li>
<li style="font-weight:400;">The release includes a demo notebook showing how to orchestrate multiple security tools in a single environment, with all build artifacts, tool documentation, and examples available in the GitHub repository. </li>
<li style="font-weight:400;">The <a href="https://arxiv.org/abs/2510.11823">CAMLIS Red Paper</a> provides additional technical details on tool selection criteria and the Docker image architecture.</li>
</ul>
<p>04:30 Ryan – “It’s very difficult to feel confident in your AI security practice or patterns. I feel like it’s just bleeding edge, and I’m learning so much all the time. And so I spend a lot of time reading papers and talking to others and seeing what they’re doing and meeting with vendors trying to figure out strategy, and it just feels like I’m drinking from a fire hose, and it’s really difficult to feel confident. So I like tools like this, where not only is it adding a whole bunch of value, but you can use it as a rubric against what you’ve been doing and where your gaps are.”    </p>
<p>07:28 <a href="https://www.geekwire.com/2026/ai2-cooks-up-open-source-coding-agents-with-tech-equivalent-of-a-hot-plate-and-a-frying-pan/?utm_source=GeekWire+Newsletters&amp;utm_campaign=5475565d4c-daily-digest-email&amp;utm_medium=email&amp;utm_term=0_4e93fc7dfd-5475565d4c-233353605&amp;mc_cid=5475565d4c&amp;mc_eid=04fad859c0">Ai2 cooks up open-source coding agents with a tech equivalent of ‘hot plate </a><a href="https://www.geekwire.com/2026/ai2-cooks-up-open-source-coding-agents-with-tech-equivalent-of-a-hot-plate-and-a-frying-pan/?utm_source=GeekWire+Newsletters&amp;utm_campaign=5475565d4c-daily-digest-email&amp;utm_medium=email&amp;utm_term=0_4e93fc7dfd-5475565d4c-233353605&amp;mc_cid=5475565d4c&amp;mc_eid=04fad859c0">and frying pan’ – GeekWire</a></p>
<ul>
<li style="font-weight:400;"><a href="https://allenai.org/">Allen Institute for AI</a> releases SERA (Soft-Verified Efficient Repository Agents), the first in their <a href="https://allenai.org/blog/open-coding-agents">Open Coding Agents</a> series, as a <a href="https://allenai.org/blog/open-coding-agents">fully open-source coding agent</a> that organizations can fine-tune on their own codebases for approximately $1,300 using commodity GPUs. </li>
<li style="font-weight:400;">The model handles GitHub issues, generates line-by-line patches, and submits pull requests while learning internal APIs and development conventions.</li>
<li style="font-weight:400;">SERA-32B achieves over 50% success rate on SWE-Bench, matching the performance of proprietary models like GitHub Copilot Workspace and Claude Code, but was built with just 32 GPUs and a five-person team. </li>
<li style="font-weight:400;">This demonstrates that competitive coding agents can be developed without the massive infrastructure typically required by tech giants.</li>
<li style="font-weight:400;">The model runs on organization-owned infrastructure without ongoing licensing fees and integrates with existing tools like <a href="https://claude.com/product/claude-code">Claude Code</a> out of the box. </li>
<li style="font-weight:400;">Teams can deploy it with a few lines of code and customize it for private codebases, offering an alternative to expensive closed systems from Microsoft and Anthropic.</li>
<li style="font-weight:400;">By open-sourcing both the model and training code, Ai2 enables companies to maintain control over their proprietary code while still leveraging advanced AI coding assistance. </li>
<li style="font-weight:400;">This addresses a key concern for enterprises hesitant to send sensitive code to third-party services.</li>
</ul>
<p>05:30 Justin – “I was playing with Olamma, actually, this week, plugging it into Claude, and I definitely needed to get a new M5 MacBook with much more GPU capacity – or go buy a GPU for my house to make that really perform well. But even on my Mac with the 20B open model, it was serviceable. It just wasn’t as fast as using Anthropix APIs directly.”</p>
<p>09:51 <a href="https://blog.google/innovation-and-ai/technology/developers-tools/agentic-vision-gemini-3-flash/">Introducing Agentic Vision in Gemini 3 Flash</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.google.com/">Google</a> launches <a href="https://blog.google/innovation-and-ai/technology/developers-tools/agentic-vision-gemini-3-flash/">Agentic Vision</a> in <a href="https://deepmind.google/models/gemini/flash/">Gemini 3 Flash</a>, introducing a Think-Act-Observe loop that enables the model to actively manipulate images through Python code execution rather than processing them in a single static pass. </li>
<li style="font-weight:400;">This approach delivers a 5-10% quality improvement across most vision benchmarks by allowing the model to zoom, crop, rotate, and annotate images iteratively to ground its reasoning in visual evidence.</li>
<li style="font-weight:400;">The capability enables three primary use cases: implicit zooming for fine-grained detail inspection (<a href="http://planchecksolver.com">PlanCheckSolver.com</a> improved building plan validation accuracy by 5%), image annotation with bounding boxes and labels to prevent counting errors, and visual math with deterministic Python execution to parse tables and generate charts without hallucination.</li>
<li style="font-weight:400;">Agentic Vision is available now via the Gemini API in Google AI Studio and <a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/multimodal/code-execution">Vertex AI</a>, with rollout beginning in the Gemini app under the Thinking model option. </li>
<li style="font-weight:400;">Developers can enable the feature by turning on Code Execution under Tools in the <a href="https://aistudio.google.com/prompts/new_chat?model=gemini-3-flash-preview">AI Studio Playground</a>.</li>
<li style="font-weight:400;">Google plans to expand the capability by making behaviors like image rotation and visual math fully implicit without requiring prompt nudges, adding more tools, including web and reverse image search, and extending support beyond Flash to other model sizes. </li>
<li style="font-weight:400;">Currently, some capabilities require explicit prompting to trigger code execution.</li>
<li style="font-weight:400;">The feature addresses a fundamental limitation in frontier AI models that previously had to guess when missing fine-grained details like serial numbers or distant street signs, now replacing probabilistic guessing with verifiable code execution in a deterministic Python environment.</li>
</ul>
<p>11:08 Justin – “Enhance!” </p>
<p>13:44 <a href="https://openai.com/index/introducing-prism">Introducing Prism | OpenAI</a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/">OpenAI</a> launches <a href="https://openai.com/prism/">Prism</a>, a free cloud-based <a href="https://www.latex-project.org/">LaTeX</a> workspace for scientific writing that integrates <a href="https://openai.com/index/gpt-5-2-for-science-and-math/">GPT-5.2</a> directly into the research workflow. </li>
<li style="font-weight:400;">The platform offers unlimited projects and collaborators for anyone with a ChatGPT personal account, with enterprise plans coming soon for Business, Enterprise, and Education customers.</li>
<li style="font-weight:400;">Prism builds on OpenAI’s acquisition of <a href="https://crixet.com/">Crixet</a>, a LaTeX platform, and adds native AI capabilities, including real-time collaboration, literature search from sources like <a href="https://arxiv.org/">arXiv</a>, equation conversion from whiteboard photos to LaTeX, and voice-based editing. GPT-5.2 Thinking mode operates within the document context, understanding the full paper structure, including equations, citations, and figures.</li>
<li style="font-weight:400;">The platform eliminates the fragmented workflow researchers typically face by consolidating drafting, revision, collaboration, and publication preparation into a single workspace. </li>
<li style="font-weight:400;">This removes the need for local LaTeX installations and reduces context switching between separate editors, PDF viewers, reference managers, and chat interfaces.</li>
<li style="font-weight:400;">OpenAI positions this as part of a broader shift where AI accelerates scientific discovery, following examples of GPT-5 advancing mathematical research, immune-cell analysis, and molecular biology experiments. </li>
<li style="font-weight:400;">The free tier provides immediate access to core features, while more advanced AI capabilities will be available through paid ChatGPT plans over time.</li>
</ul>
<p>14:49 Justin – “I don’t care for LaTex, but I’m not in science either, so maybe this is for those people.” </p>
<h2>AWS</h2>
<p>16:41 <a href="https://aws.amazon.com/about-aws/whats-new/2026/01/graviton4-ebs-optimized-larger-sizes">Now available: 48xlarge and metal-48xl sizes for EBS optimized Amazon </a><a href="https://aws.amazon.com/about-aws/whats-new/2026/01/graviton4-ebs-optimized-larger-sizes">EC2 instances</a></p>
<ul>
<li style="font-weight:400;">AWS launches 48xlarge and metal-48xl instance sizes for Graviton4-powered <a href="https://aws.amazon.com/ec2/instance-types/c8g/">C8gb</a>, <a href="https://aws.amazon.com/ec2/instance-types/m8g/">M8gb</a>, and <a href="https://aws.amazon.com/ec2/instance-types/r8g">R8gb</a> instances, delivering up to 30% better compute performance than <a href="https://celerdata.com/glossary/aws-graviton3">Graviton3</a> and the highest EBS bandwidth (300 Gbps) among non-accelerated <a href="https://aws.amazon.com/ec2/">EC2</a> instances. </li>
<li style="font-weight:400;">These instances support up to 1440K IOPS, making them the highest EBS IOPS performers in EC2.</li>
<li style="font-weight:400;">The new instances scale up to 48xlarge with three memory-to-vCPU ratio options (compute, general purpose, and memory optimized), plus metal sizes for C8gb and R8gb that provide direct hardware access. </li>
<li style="font-weight:400;">They include up to 400 Gbps networking bandwidth and support <a href="https://aws.amazon.com/hpc/efa/">Elastic Fabric Adapter</a> for low-latency cluster workloads.</li>
<li style="font-weight:400;">Primary use cases include high-throughput database workloads, data analytics pipelines, and tightly coupled HPC applications that require sustained high block storage performance. </li>
<li style="font-weight:400;">The EFA support makes these particularly suitable for distributed computing tasks that need consistent low-latency inter-node communication.</li>
<li style="font-weight:400;">Currently available in US East (N. Virginia) and US West (Oregon), with metal sizes limited to US East (N. Virginia) only. </li>
<li style="font-weight:400;">This follows AWS’s typical pattern of launching new instance types in primary US regions before broader global expansion.</li>
<li style="font-weight:400;">The instances represent AWS’s continued investment in Graviton ARM-based processors, offering customers an alternative to x86 instances with improved price-performance for workloads that can run on ARM architecture. </li>
</ul>
<p>18:04 Justin – They’re the only thing I used to like to buy on the spot market until AI came around and then ruined it for me.”</p>
<p>18:47 <a href="https://aws.amazon.com/about-aws/whats-new/2026/1/amazon-route-53-domains-adds-support-for-.ai-and-other-top-level-domains/">Amazon Route 53 Domains adds support for .ai, and other top-level </a><a href="https://aws.amazon.com/about-aws/whats-new/2026/1/amazon-route-53-domains-adds-support-for-.ai-and-other-top-level-domains/">domains</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/route53/">Route 53</a> now supports ten new top-level domains, including .ai, .nz, .shop, .bot, .moi, .spot, .free, .deal, .now, and .hot, expanding domain registration options directly within AWS. </li>
<li style="font-weight:400;">The .ai domain has become particularly relevant for AI companies despite originally being Anguilla’s country code, while other TLDs target specific use cases like e-commerce (.shop) and chatbot services (.bot).</li>
<li style="font-weight:400;">The new domains integrate with existing Route 53 features, including DNS management, automatic renewal, and hosted zones, allowing customers to manage domain registration and DNS records through the <a href="https://console.aws.amazon.com/">console</a>, <a href="https://aws.amazon.com/cli/">CLI</a>, or <a href="https://docs.aws.amazon.com/AWSJavaScriptSDK/latest/AWS/EC2.html">SDKs</a>. </li>
<li style="font-weight:400;">This consolidation eliminates the need for third-party domain registrars when building AWS-hosted applications.</li>
<li style="font-weight:400;">Domain registration pricing varies by TLD, with no standard rate mentioned in the announcement. </li>
<li style="font-weight:400;">Customers should check the Route 53 pricing page for specific costs per domain type, as premium TLDs like .ai typically command higher annual registration fees than traditional domains.</li>
<li style="font-weight:400;">The timing aligns with increased demand for AI-related branding, though Route 53 has historically added TLD support incrementally rather than in response to specific market trends. </li>
<li style="font-weight:400;">The service now competes more directly with dedicated domain registrars by offering industry-specific and regional domain options.</li>
<li style="font-weight:400;">follows standard EC2 on-demand and reserved instance models, with Graviton instances typically offering 20-40% better price-performance than comparable x86 instances.</li>
</ul>
<p>20:03  Ryan – “It is frustrating, and it’s not like these are new. Like, AI’s been around for a while, and so it is strange that it takes that long.”</p>
<p>22:50 <a href="https://aws.amazon.com/about-aws/whats-new/2026/01/amazon-workspaces-advanced-printer-redirection/">Amazon WorkSpaces announces advanced printer redirection</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/workspaces-family/workspaces/">Amazon WorkSpaces</a> now supports advanced printer redirection for Windows users, enabling access to printer-specific features like duplex printing, paper tray selection, and finishing options such as stapling and hole-punching directly from virtual desktops. </li>
<li style="font-weight:400;">This addresses a longstanding limitation where WorkSpaces users were restricted to basic printing capabilities through generic drivers.</li>
<li style="font-weight:400;">The feature includes configurable driver validation modes that let administrators balance compatibility with feature support, automatically falling back to basic printing when matching drivers are not available. </li>
<li style="font-weight:400;">Organizations with users who need professional document printing, specialized labels, or advanced output formatting will benefit most from this capability.</li>
<li style="font-weight:400;">Advanced printer redirection requires WorkSpaces Agent version 2.2.0.2116 or later and Windows client version 5.31 or later, with matching printer drivers installed on both the WorkSpace and client device. </li>
<li style="font-weight:400;">The feature is available in all AWS Regions where Amazon WorkSpaces Personal is offered, though it is limited to Windows WorkSpaces with Windows clients only.</li>
<li style="font-weight:400;">This enhancement brings WorkSpaces closer to feature parity with traditional desktop environments for printing workflows, which is particularly important for industries like legal, healthcare, and finance, where document formatting and specialized printing are common requirements. </li>
<li style="font-weight:400;">The addition fills a notable gap in virtual desktop infrastructure capabilities that has been a barrier for some organizations considering cloud-based desktop solutions.</li>
</ul>
<p>26:55 <a href="https://aws.amazon.com/about-aws/whats-new/2026/01/aws-network-firewall-web-category-based-filtering">AWS Network Firewall now supports GenAI traffic visibility and </a><a href="https://aws.amazon.com/about-aws/whats-new/2026/01/aws-network-firewall-web-category-based-filtering">enforcement with Web category-based filtering</a></p>
<ul>
<li style="font-weight:400;">AWS Network Firewall adds URL category-based filtering that specifically identifies and controls GenAI application traffic alongside traditional web categories like social media and streaming services. </li>
<li style="font-weight:400;">This allows security teams to enforce policies like blocking unauthorized AI tools or restricting access to approved GenAI services only, addressing a growing compliance concern as organizations struggle to govern employee use of ChatGPT, Claude, and similar platforms.</li>
<li style="font-weight:400;">The feature works by inspecting traffic against pre-defined URL categories and can be combined with AWS Network Firewall’s existing TLS inspection capability for full URL path analysis. </li>
<li style="font-weight:400;">This provides more granular control than simple domain blocking, enabling organizations to differentiate between different services from the same provider or allow specific features while blocking others.</li>
<li style="font-weight:400;">The capability is available now in all AWS commercial regions where Network Firewall operates, with no separate pricing beyond existing Network Firewall costs, which start at $0.395 per firewall endpoint hour plus $0.065 per GB processed. </li>
<li style="font-weight:400;">Organizations can implement this through stateful rule groups using the AWS Console, CLI, or SDKs without requiring additional infrastructure changes.</li>
<li style="font-weight:400;">This addresses a practical security gap where traditional firewall rules struggle to keep pace with rapidly emerging GenAI services, reducing the operational burden of manually maintaining blocklists and allowlists. The pre-defined categories are maintained by AWS, meaning customers automatically get coverage for new GenAI services as they launch without manual rule updates.</li>
</ul>
<p>28:32  Ryan – “I’m happy to see this being added to the AWS network firewall. Hoping this gets added to the Google NextGen firewall as well, because it is sort of difficult when you’re forced to do domain-based filtering on these things.” </p>
<h2>GCP</h2>
<p>30:33 <a href="https://cloud.google.com/blog/topics/developers-practitioners/mastering-gemini-cli-your-complete-guide-from-installation-to-advanced-use-cases/">Mastering Gemini CLI: Your Complete Guide from Installation to Advanced </a><a href="https://cloud.google.com/blog/topics/developers-practitioners/mastering-gemini-cli-your-complete-guide-from-installation-to-advanced-use-cases/">Use-Cases </a></p>
<ul>
<li style="font-weight:400;">Google has partnered with <a href="http://deeplearning.ai">DeepLearning.ai</a> to launch a free, comprehensive course on <a href="https://goo.gle/gemini-cli-learning-course">Gemini CLI</a>, an open-source command-line agent that integrates AI capabilities into daily workflows. </li>
<li style="font-weight:400;">The course covers installation and context management through GEMINI.md files, extensibility via Model Context Protocol servers, and practical applications across software development, data analysis, content creation, and personalized learning.</li>
<li style="font-weight:400;">The course is structured as a sub-2-hour curriculum with nine lessons that progress from foundational setup to specialized workflows. </li>
<li style="font-weight:400;">Key technical features include memory management for maintaining context across sessions, integration with external tools through MCP servers, and custom extensions that allow users tailor the CLI to specific needs.</li>
<li style="font-weight:400;">Gemini CLI targets a broad user base beyond traditional developers, with dedicated modules for data visualization from local CSVs and Google Sheets, automated blog and social media content generation, and study plan creation. </li>
<li style="font-weight:400;">The tool is available as an open-source project on GitHub with full documentation at geminicli.com.</li>
<li style="font-weight:400;">The course is completely free and available now at <a href="https://www.deeplearning.ai/short-courses/gemini-cli-code-and-create-with-an-open-source-agent/">goo.gle/gemini-cli-learning-course</a>, positioning it as an accessible entry point for users looking to incorporate AI agents into command-line workflows. </li>
<li style="font-weight:400;">This represents Google’s continued push to make Gemini models more accessible through developer-friendly tooling and educational resources.</li>
</ul>
<p>32:30  Jonathan – “It’s interesting they didn’t go for a command line coding tool. It’s not Gemini code; it’s Gemini that does a whole bunch of stuff. So they’ve seen the broader implications of what those tools can do.”</p>
<p>34:08 <a href="https://cloud.google.com/blog/products/infrastructure/google-cloud-launches-new-region-in-bangkok-thailand/">Google Cloud Launches New Region in Bangkok, Thailand</a></p>
<ul>
<li style="font-weight:400;">Google Cloud has opened its Bangkok region (asia-southeast3), backed by a <a href="https://www.googlecloudpresscorner.com/2024-09-30-Google-Announces-Plans-to-Invest-US-1-Billion-to-Build-Data-Center-and-Cloud-Region-in-Thailand,-Support-Initiatives-to-Expand-AI-Opportunities-for-Thais">$1 billion infrastructure investment</a> that’s projected to<a href="https://aiopportunity.publicfirst.co/thailand/"> contribute $41 billion to Thailand’s economy</a> and support <a href="https://aiopportunity.publicfirst.co/handouts/Thailand%E2%80%99s%20AI%20Opportunity_%20How%20AI%20will%20turbocharge%20economic%20advancement%20in%20Thailand%20%E2%80%93%20Appendix.pdf">130,000 jobs</a> annually over five years. </li>
<li style="font-weight:400;">The region addresses data residency requirements under Thailand’s Personal Data Protection Act (PDPA) while providing low-latency access to local customers and connectivity to Google’s global network via the <a href="https://cloud.google.com/blog/products/infrastructure/talaylink-subsea-cable-to-connect-australia-and-thailand?e=48754805">TalayLink</a> subsea cable.</li>
<li style="font-weight:400;">The region launches with key compliance certifications, including ISO 27001/27017/27018, PCI DSS, and SOC 1/2/3, making it suitable for regulated industries like banking and insurance. KASIKORN Business-Technology Group and True Digital Group are among the first customers leveraging the local infrastructure to meet Bank of Thailand regulatory standards while maintaining data sovereignty.</li>
<li style="font-weight:400;">The Bangkok region provides local compute and storage with millisecond-level latency for Thai users, while AI workloads can access globally-hosted services like Vertex AI, Gemini 3, and generative models through the region as a secure on-ramp. </li>
<li style="font-weight:400;">This hybrid approach lets customers run general-purpose workloads locally without investing in specialized AI hardware while still accessing Google’s AI ecosystem when needed.</li>
<li style="font-weight:400;">Launch partners, including Accenture, Deloitte, MFEC, and NTT Data, are providing local engineering and consulting support to help customers migrate to the new region. ZZZZGoogle is also running the PanyaThAI customer success program and Google Skills initiatives to build local cloud and AI talent in Thailand.</li>
<li style="font-weight:400;">The region is now available in the Google Cloud console under asia-southeast3, joining Google’s network of 43 cloud regions connected by over <a href="https://cloud.google.com/about/locations">7.75 million kilometers of fiber infrastructure</a>. </li>
<li style="font-weight:400;">Pricing follows standard Google Cloud regional pricing models with no specific Thailand-region premiums mentioned in the announcement.</li>
</ul>
<p>35:12  Justin – “It’s really Google’s way of not having to buy a billion GPUs and distribute them globally, but you can argue it as a secure onramp all you want.”  </p>
<p>37:36 <a href="https://cloud.google.com/blog/products/data-analytics/cloud-composer-supports-apache-airflow-31/">Cloud Composer supports Apache Airflow 3.1</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/composer">Cloud Composer</a> now supports Apache <a href="https://airflow.apache.org/blog/airflow-3.1.0/">Airflow 3.1</a> in preview, making Google the first hyperscaler to offer this version. </li>
<li style="font-weight:400;">The update builds on Airflow 3.0’s decoupled architecture with new features including Human-in-the-Loop workflows that pause execution for manual approvals via UI or API, Deadline Alerts that replace legacy SLAs with proactive time-based notifications, and native support for 17 languages in the React-based interface.</li>
<li style="font-weight:400;">The Human-in-the-Loop functionality integrates with Airflow Notifiers to send approval requests through Slack, email, or PagerDuty with direct links to decision points. This addresses the growing need for human oversight in AI agent workflows and complex automated pipelines, particularly for deployment approvals or reviewing generative AI outputs.</li>
<li style="font-weight:400;">Google positions Cloud Composer as an open orchestration alternative to proprietary walled garden platforms, emphasizing that Airflow-based workflows remain portable Python code rather than vendor-locked logic. The company contributes directly to the Airflow codebase and highlights access to thousands of community-built providers and custom operator development for legacy system integration.</li>
<li style="font-weight:400;">Additional developer-focused improvements include a React plugin system for embedding custom dashboards in the UI and a new streaming API endpoint for watching synchronous <a href="https://airflow.apache.org/docs/apache-airflow/stable/core-concepts/dags.html">DAG</a> execution until completion. The preview is available now for new Cloud Composer 3 environments, though specific pricing details for Airflow 3.1 support were not disclosed in the announcement.</li>
</ul>
<p>38:58  Ryan – “This is rich, because after dealing with Cloud Composer and its kind of terribleness… now with Cloud Composer 3, they’re just rebranding and saying that, no, all that stuff that you were complaining about is a feature, not a bug! We’re not going to build a complicated workflow engine where you don’t get exposed to the innards; we’re going to just let you run your own managed airflow. And it’s basically a deployment template. But it’s a feature, because they’re allowing direct access, not wall cards.”</p>
<p>40:23 <a href="https://cloud.google.com/blog/topics/public-sector/gemini-for-government-unlocking-the-next-wave-of-public-sector-innovation/">Gemini for Government: Unlocking Public Sector Innovation</a></p>
<ul>
<li style="font-weight:400;">Google launches <a href="https://cloud.google.com/blog/topics/public-sector/introducing-gemini-for-government-supporting-the-us-governments-transformation-with-ai/?e=48754805">Gemini for Government</a>, a <a href="https://www.fedramp.gov/">FedRAMP</a> High-authorized AI platform specifically designed for public sector agencies. </li>
<li style="font-weight:400;">The platform provides secure access to Gemini models and agentic AI capabilities, with the Department of Defense already deploying it to 3 million personnel through GenAI.mil and the <a href="https://www.linkedin.com/posts/karen-dahut-24135811_absolutely-thrilled-to-see-the-fda-taking-activity-7404545436725956608-0sfM/">FDA</a> implementing agentic AI across their operations.</li>
<li style="font-weight:400;">The platform emphasizes AI agents as productivity multipliers for government employees, automating administrative tasks while allowing workers to focus on strategic decision-making. </li>
<li style="font-weight:400;">At Google’s <a href="https://cloudonair.withgoogle.com/events/public-sector-summit-2025">Public Sector Summit</a>, agencies built over <a href="https://cloud.google.com/blog/topics/public-sector/the-agentic-era-is-here-300-ai-agents-built-in-one-day-at-google-public-sector-summit-to-accelerate-impact-and-advance-missions/?e=48754805">300 AI agents</a> in a single day to demonstrate potential use cases across different government functions.</li>
<li style="font-weight:400;"><a href="https://www.gartner.com/en/newsroom/press-releases/2025-12-17-gartner-identifies-the-companies-to-beat-in-the-ai-vendor-race?sbrc=19qhodCXASWliq87R27Bn5Q%3D%3D%24k4TO71FfFXXpQLfiplf_9g%3D%3D">Gartner</a> named Google a Company to Beat for Enterprise Agentic AI Platforms in their December 2025 report, citing Google’s integrated tech stack and enterprise-wide adoption capabilities. </li>
<li style="font-weight:400;">This recognition positions Google’s government AI offering against competitors in the federal marketplace, where security accreditation and compliance are critical requirements.</li>
<li style="font-weight:400;">The <a href="https://cloud.google.com/blog/topics/public-sector/driving-the-future-of-government-us-department-of-transportation-selects-google-workspace-as-new-agency-wide-collaboration-suite/?e=48754805">Department of Transportation</a> selected Google Workspace as its agency-wide collaboration suite, showing broader adoption of Google’s cloud services beyond just AI capabilities. </li>
<li style="font-weight:400;">This indicates government agencies are consolidating on Google’s platform for both productivity and AI workloads rather than using point solutions.</li>
<li style="font-weight:400;">No pricing information was disclosed in the announcement, though agencies can register for a <a href="https://cloudonair.withgoogle.com/events/gemini-for-government-your-front-door-for-mission-ai">February 5 webinar</a> and download AI agent toolkits to explore implementation options. </li>
<li style="font-weight:400;">The focus appears to be on enterprise agreements rather than public pricing, given the government procurement process.</li>
</ul>
<p>45:18 <a href="https://cloud.google.com/blog/products/data-analytics/new-bigquery-gen-ai-functions-for-better-data-analysis/">New BigQuery gen AI functions for better data analysis </a></p>
<ul>
<li style="font-weight:400;">BigQuery now integrates Gemini 3.0 and Vertex AI models directly into SQL queries through new AI functions, including <a href="https://docs.cloud.google.com/bigquery/docs/reference/standard-sql/bigqueryml-syntax-ai-generate">AI.GENERATE</a> for text and structured output, AI.EMBED for embeddings, and AI.SIMILARITY for semantic search. The setup process has been simplified by allowing End User Credentials authentication, eliminating the need for separate service account connections if users have the Vertex AI User role.</li>
<li style="font-weight:400;">The AI.GENERATE function handles multimodal inputs, including text, images, video, audio, and documents, and can perform multiple AI tasks simultaneously, like sentiment analysis, translation, and summarization in a single SQL call. Users can specify an output schema to convert unstructured data directly into structured table columns, making results immediately usable in downstream applications.</li>
<li style="font-weight:400;">The new AI.SIMILARITY function provides a streamlined approach to semantic search by computing embeddings and similarity scores in one step, ideal for interactive analysis on small to medium datasets. For larger-scale operations across millions of rows, users can transition to the VECTOR_SEARCH function with precomputed embeddings and vector indexing.</li>
<li style="font-weight:400;">These functions are fully composable within standard SQL, meaning they can be used in SELECT statements, WHERE clauses, and ORDER BY clauses alongside traditional SQL operations. The AI.GENERATE and <a href="https://docs.cloud.google.com/bigquery/docs/reference/standard-sql/bigqueryml-syntax-generate-table">AI.GENERATE_TABLE</a> functions are now generally available, while AI.EMBED and AI.SIMILARITY is currently in preview.</li>
</ul>
<p>46:28  Ryan – “I can have AI generate the query to call AI to analyze the results of the AI-generated query! I don’t see what could go wrong?” </p>
<p> Azure</p>
<p>47:35 <a href="https://techcommunity.microsoft.com/blog/sqlserver/announcing-github-copilot-code-completions-in-sql-server-management-22-2-1/4488252">SSMS 22.2.1 Release</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/ssms/release-notes-22">SQL Server Management Studio 22.2.1</a> adds GitHub Copilot <a href="https://learn.microsoft.com/ssms/github-copilot/code-completions">code completions</a> directly in the query editor, going beyond traditional IntelliSense by providing context-aware T-SQL suggestions that improve as more code is written in the editor. </li>
<li style="font-weight:400;">Microsoft customized the <a href="https://learn.microsoft.com/en-us/visualstudio/ide/visual-studio-github-copilot-get-started?view=visualstudio">Visual Studio Copilot</a> implementation to include database context, ensuring suggestions are both relevant and performant for SQL workflows.</li>
<li style="font-weight:400;">The release focuses on fundamental improvements with bug fixes addressing user-reported issues from the feedback site, while engineering teams work on the backend pipeline and testing enhancements. </li>
<li style="font-weight:400;">Microsoft spent December and January prioritizing quality and reliability improvements that may not be immediately visible but strengthen the product foundation.</li>
<li style="font-weight:400;"><a href="https://code.visualstudio.com/blogs/2025/02/24/introducing-copilot-agent-mode">GitHub Copilot Agent mode</a> is coming to SSMS, according to the updated roadmap, along with improvements to instructions functionality, which ranks as a top user request. </li>
<li style="font-weight:400;">Users can vote on specific AI features through the feedback site, with Microsoft using vote counts as the primary metric for prioritizing development work.</li>
<li style="font-weight:400;">Code completions may compete with traditional IntelliSense, so users experiencing conflicts can disable IntelliSense to get the full benefit of Copilot suggestions. </li>
<li style="font-weight:400;">The feature requires a GitHub Copilot subscription, which is separate from SSMS itself and follows standard GitHub Copilot pricing for individuals or organizations.</li>
<li style="font-weight:400;">This positions SSMS as a more AI-native database management tool, particularly relevant for SQL developers already using Copilot in other Microsoft development environments like Visual Studio and VS Code. </li>
<li style="font-weight:400;">The database context integration represents technical work specific to SQL workloads rather than a simple port of existing Copilot functionality. </li>
</ul>
<p>49:57 <a href="https://devblogs.microsoft.com/devops/whats-new-with-azure-repos/">What’s New in Azure Repos: Recent Updates – Azure DevOps Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://azure.microsoft.com/en-us/products/devops/repos/">Azure Repos</a> has rolled out several quality-of-life improvements focused on pull request workflows and TFVC modernization. </li>
<li style="font-weight:400;">The most impactful change is a breaking update that disables obsolete TFVC check-in policies, requiring teams still using the old storage format to migrate to the new system or lose policy enforcement entirely.</li>
<li style="font-weight:400;">Pull request notifications have been streamlined to reduce noise by removing low-value alerts like draft state changes and auto-complete updates, while simplifying remaining notifications to show only relevant changes like affected files. 
<ul>
<li style="font-weight:400;">This addresses a common complaint about notification overload in code review workflows.</li>
</ul>
</li>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/devops/repos/git/pull-request-templates?view=azure-devops">Pull request templates</a> now support nested folder structures that map to multi-level branch names, automatically selecting the most specific template available when targeting branches like feature/foo/december. 
<ul>
<li style="font-weight:400;">This eliminates template duplication for teams using hierarchical branching strategies.</li>
</ul>
</li>
<li style="font-weight:400;">The <a href="https://github.com/microsoft/azure-devops-mcp">Azure DevOps MCP Server</a> continues expanding with new tools for programmatic interaction with repos, branches, commits, and pull requests directly from VS Code and GitHub Copilot. 
<ul>
<li style="font-weight:400;">This enables developers to query repository metadata and inspect code without opening the Azure DevOps web interface.</li>
</ul>
</li>
<li style="font-weight:400;">Upcoming improvements include a more efficient Git policy configuration API that reduces unnecessary calls when retrieving policies across repositories and branches, plus additional pull request features like highlighting PRs with outstanding comments and filtering by tags. </li>
<li style="font-weight:400;">These changes target teams managing policies at scale and aim to keep code reviews moving more efficiently.</li>
</ul>
<p>51:17  Justin – “Wow. TFVC modernization is your feature. You’re just going to turn it off and lose your enforcement when they migrate automatically. That’s brutal. Classic Microsoft.”       </p>
<p>55:12 <a href="https://azure.microsoft.com/en-us/updates?id=547772">Generally Available: StandardV2 NAT Gateway with zone-redundancy and </a><a href="https://azure.microsoft.com/en-us/updates?id=547772">StandardV2 public IPs  </a></p>
<ul>
<li style="font-weight:400;">Azure’s StandardV2 NAT Gateway reaches general availability with zone-redundancy and improved performance while maintaining the same pricing as the original Standard SKU. </li>
<li style="font-weight:400;">This upgrade provides automatic high availability across availability zones without requiring customers to manage multiple NAT Gateways or configure complex failover scenarios.</li>
<li style="font-weight:400;">The StandardV2 SKU introduces dual-stack connectivity supporting both IPv4 and IPv6 traffic through a single NAT Gateway instance. </li>
<li style="font-weight:400;">This simplifies network architecture for organizations transitioning to IPv6 or running hybrid IP environments, eliminating the need to deploy separate NAT solutions for each protocol.</li>
<li style="font-weight:400;">StandardV2 Public IP addresses and prefixes are now available alongside the NAT Gateway upgrade, providing consistent zone-redundant capabilities across the networking stack. </li>
<li style="font-weight:400;">These resources work together to ensure outbound connectivity remains available even during zone-level failures without manual intervention.</li>
<li style="font-weight:400;">The price-neutral upgrade path means existing Standard SKU customers can migrate to StandardV2 for enhanced resiliency without budget impact. </li>
<li style="font-weight:400;">Organizations running mission-critical workloads that require guaranteed outbound connectivity should evaluate this upgrade, particularly those currently managing multiple NAT Gateways for redundancy purposes.</li>
</ul>
<p>56:27  Jonathan – “I guess it’s not as easy as it sounds. I mean, to us it’s like, well, why don’t I just deploy two, right? But if they’re NATing to public IPs, then those public IPs need to be routable to the zones, and so there’s probably a whole bunch more complexity on the back end in implementing multi-zone support for NAT than perhaps people realize.”</p>
<p>57:46 <a href="https://techcommunity.microsoft.com/blog/microsoftsentinelblog/announcing-unified-sox--dora-compliance-solutions-in-microsoft-sentinel/4484802">Announcing Unified SOX &amp; DORA Compliance Solutions in Microsoft </a><a href="https://techcommunity.microsoft.com/blog/microsoftsentinelblog/announcing-unified-sox--dora-compliance-solutions-in-microsoft-sentinel/4484802">Sentinel</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.microsoft.com/en-us/security/business/siem-and-xdr/microsoft-sentinel/?ef_id=_k_97ab281e199f18153281419ce4a4177b_k_&amp;OCID=AIDcmm5edswduu_SEM__k_97ab281e199f18153281419ce4a4177b_k_&amp;msclkid=97ab281e199f18153281419ce4a4177b">Microsoft Sentinel</a> now includes dedicated compliance solutions for SOX IT General Controls, and <a href="https://www.eiopa.europa.eu/digital-operational-resilience-act-dora_en">DORA regulations</a>, providing financial institutions with continuous monitoring and audit-ready evidence through workbook-driven dashboards. </li>
<li style="font-weight:400;">Both solutions are currently in public preview and consolidate telemetry from <a href="https://entra.microsoft.us/">Microsoft Entra ID</a>, <a href="https://learn.microsoft.com/en-us/azure/azure-monitor/platform/activity-log?tabs=log-analytics">Azure Activity Logs</a>, Defender signals, Microsoft 365 audit logs, and third-party sources into structured compliance views.</li>
<li style="font-weight:400;">The SOX IT Compliance solution maps directly to three core control domains: Access Management, monitoring unauthorized access to financial systems, Change Management tracking configuration modifications across Azure and on-premises environments, and Data Integrity controls detecting audit log tampering or gaps in critical system logging. </li>
<li style="font-weight:400;">Organizations deploy the solution by enabling data connectors, defining a SOX watchlist of authorized users and systems, and customizing queries to match internal policies.</li>
<li style="font-weight:400;">The DORA Compliance solution addresses the EU Digital Operational Resilience Act requirements through four specialized tabs covering Incident Management with MTTR tracking and SLA breach detection, Threat Intelligence correlating IOCs with MITRE ATT&amp;CK techniques, Business Continuity monitoring inactive servers and failover events, and Compliance Mapping that links security alerts directly to specific DORA Articles for audit evidence.</li>
<li style="font-weight:400;">Both solutions target financial services organizations, ICT providers, and any entity handling financial reporting systems that need to demonstrate regulatory compliance. </li>
<li style="font-weight:400;">The workbooks are fully customizable with editable KQL queries, allowing organizations to extend mappings, integrate custom logs, and adapt controls to different financial systems and regulatory frameworks over time.</li>
<li style="font-weight:400;">Deployment requires existing Microsoft Sentinel infrastructure with appropriate data connectors enabled, and organizations can define scope using watchlists to filter regulated assets. </li>
<li style="font-weight:400;">Pricing follows the standard Microsoft Sentinel consumption-based model for data ingestion and retention, with costs varying based on log volume from connected sources.</li>
</ul>
<p>1:00:55 <a href="https://blogs.microsoft.com/blog/2026/01/26/maia-200-the-ai-accelerator-built-for-inference/">Maia 200: The AI accelerator built for inference</a></p>
<ul>
<li style="font-weight:400;">Microsoft launches <a href="https://news.microsoft.com/january-2026-news">Maia 200</a>, a custom AI inference accelerator built on TSMC’s 3nm process that delivers over 10 petaFLOPS in FP4 precision and 5 petaFLOPS in FP8 within a 750W envelope. </li>
<li style="font-weight:400;">The chip offers 30% better performance per dollar than current Azure hardware and outperforms Amazon Trainium third generation and Google’s TPU seventh generation in key metrics.</li>
<li style="font-weight:400;">The accelerator features 216GB HBM3e memory at 7 TB/s bandwidth and 272MB on-chip SRAM, designed specifically for running large language models like GPT-5.2 and synthetic data generation workloads. </li>
<li style="font-weight:400;">Microsoft’s Superintelligence team will use Maia 200 for reinforcement learning and creating training data for next-generation models.</li>
<li style="font-weight:400;">Maia 200 uses a two-tier scale-up network built on standard Ethernet rather than proprietary fabrics, with each accelerator providing 2.8 TB/s bidirectional bandwidth and supporting clusters up to 6,144 accelerators. </li>
<li style="font-weight:400;">This approach reduces power consumption and total cost of ownership while maintaining predictable performance for dense inference workloads.</li>
<li style="font-weight:400;">Initial deployment is in US Central datacenter region near Des Moines, with US West 3 near Phoenix coming next, integrated with Microsoft Foundry and Microsoft 365 Copilot services. </li>
<li style="font-weight:400;">Microsoft is offering a Maia SDK preview with PyTorch integration, Triton compiler, and low-level programming tools for developers to optimize models for the new hardware.</li>
<li style="font-weight:400;">Microsoft achieved rapid deployment by validating the end-to-end system in pre-silicon environments, getting AI models running within days of receiving packaged parts and reducing time from first silicon to datacenter deployment by more than 50% compared to similar programs. </li>
<li style="font-weight:400;">The company positions this as the first in a multi-generational accelerator program with future iterations already in design.</li>
</ul>
<p>1:02:50  Ryan – “It’s a blessing and a curse that I don’t have these types of workloads. I’m not using these types of things for building models…but I’m sort of jealous because it would be kind of cool to have a use case where I could use this.”    </p>
<p>1:03:34 <a href="https://azure.microsoft.com/en-us/blog/beyond-boundaries-the-future-of-azure-storage-in-2026/">Azure Storage 2026: Built for Agentic Scale and Cloud‑Native Apps</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/storage/common/storage-introduction">Azure Storage</a> is positioning itself as the foundational platform for AI workloads across the entire lifecycle, from frontier model training to large-scale inference and agentic applications. </li>
<li style="font-weight:400;">Key capabilities include <a href="https://www.microsoft.com/en/customers/story/23427-openai-lp-azure-blob-storage/?msockid=3ed723774adc6b5418ce315e4b1b6a83">Blob scaled accounts</a> that handle millions of objects across hundreds of scale units, and <a href="https://learn.microsoft.com/en-us/azure/azure-managed-lustre/amlfs-overview">Azure Managed Lustre</a> delivering up to 512 GBps throughput with 25 PiB namespaces for keeping GPU fleets continuously fed during training and inference operations.</li>
<li style="font-weight:400;">The platform is adapting to handle agentic workloads that generate an order of magnitude more queries than traditional user-driven systems. </li>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/storage/elastic-san/elastic-san-introduction">Elastic SAN</a> is becoming the core building block for cloud-native applications, offering fully managed block storage pools with multi-tenant capabilities, while <a href="https://learn.microsoft.com/en-us/azure/storage/container-storage/container-storage-introduction">Azure Container Storage</a> has been open-sourced and now delivers 7x faster performance for Kubernetes-based stateful applications.</li>
<li style="font-weight:400;">Mission-critical workload performance has reached new levels with M-series VMs pushing disk storage to 780,000 IOPS and 16 GB/s throughput for SAP HANA deployments. </li>
<li style="font-weight:400;">Ultra Disks paired with Ebsv6 VMs can achieve 800,000 IOPS and 14 GB/s throughput with sub-500 microsecond latency, while Azure NetApp Files is introducing Elastic ZRS for zone-redundant high availability without operational complexity.</li>
<li style="font-weight:400;">Microsoft is addressing power and supply chain constraints through Azure Boost Data Processing Units that offload storage operations to dedicated hardware, reducing per-unit energy consumption while improving performance. </li>
<li style="font-weight:400;">The company is also expanding integrations with external datasets and AI frameworks, including <a href="https://ai.azure.com/">Microsoft Foundry</a>, <a href="https://www.langchain.com/">LangChain</a>, Ray, and Anyscale, to simplify data pipeline operations across hybrid environments.</li>
<li style="font-weight:400;">The partner ecosystem is expanding with co-engineered solutions from Commvault, Dell PowerScale, <a href="https://www.purestorage.com/">Pure Storage</a>, <a href="https://qumulo.com/">Qumulo</a>, and others that integrate deeply with Azure Storage services. </li>
<li style="font-weight:400;">These partnerships focus on hybrid data movement and backup solutions that enable customers to leverage Azure AI services while maintaining data across on-premises and cloud environments.</li>
</ul>
<p>1:05:07  Matt – “I just got concerned when Elastic SAN became the core building blocks of cloud native apps.” </p>
<h2>Oracle </h2>
<p>1:05:55 <a href="https://blogs.oracle.com/cloud-infrastructure/announcing-support-for-iam-deny-policies-in-the-oci-iam">Announcing Support for IAM Deny Policies in the OCI IAM</a></p>
<ul>
<li style="font-weight:400;">Oracle Cloud Infrastructure now supports IAM Deny Policies, allowing administrators to explicitly block specific actions even when allow policies would otherwise grant access. </li>
<li style="font-weight:400;">This addresses a common security gap where overly permissive policies could inadvertently grant unwanted access, particularly useful for enforcing compliance requirements and preventing accidental resource deletion in production environments.</li>
<li style="font-weight:400;">The deny policies work alongside existing allow policies using a deny-by-default model where explicit denies always override allows, following standard IAM best practices seen in AWS and other cloud providers. Organizations can now create guardrails that prevent even highly privileged users from performing certain actions, like deleting critical resources or accessing sensitive compartments.</li>
<li style="font-weight:400;">This feature integrates with Oracle’s existing IAM infrastructure, including compartments, groups, and dynamic groups, without requiring architectural changes. </li>
<li style="font-weight:400;">Customers can implement deny policies immediately through the OCI console, CLI, or API using the same policy language syntax they already know, though they’ll need to carefully plan policy hierarchies to avoid unintended lockouts.</li>
<li style="font-weight:400;">Primary use cases include preventing production resource deletion, enforcing regulatory compliance by blocking data exports to certain regions, and implementing separation of duties where even administrators cannot bypass certain security controls. </li>
<li style="font-weight:400;">The feature is available across all OCI regions at no additional cost beyond standard IAM usage.</li>
</ul>
<p>1:06:48  Justin – “Welcome to the party! How long has that been missing?”     </p>
<h2>Cloud Journey</h2>
<p>44:40 <a href="https://cloud.google.com/blog/topics/developers-practitioners/how-google-sres-use-gemini-cli-to-solve-real-world-outages/">How Google SREs Use Gemini CLI to Solve Real-World Outages</a></p>
<ul>
<li style="font-weight:400;">Google SREs are using Gemini CLI with their latest foundation model to reduce Mean Time to Mitigation during production outages, targeting a 5-minute SLO just to acknowledge incidents. </li>
<li style="font-weight:400;">The system uses function calling to fetch incident details, analyze logs, correlate time series data, and recommend specific mitigation playbooks like task restarts rather than generating arbitrary bash scripts.</li>
<li style="font-weight:400;">The implementation maintains human-in-the-loop control through multi-layer safety, including strictly typed tools via Model Context Protocol, risk assessment metadata, policy enforcement, and required confirmation steps before executing any production changes. 
<ul>
<li style="font-weight:400;">This copilot approach allows AI-speed analysis while preserving human accountability and creating automatic audit trails for compliance.</li>
</ul>
</li>
<li style="font-weight:400;">Gemini CLI integrates directly with Google’s <a href="https://dl.acm.org/doi/10.1145/2854146">monorepo</a> to analyze code changes, generate patches as Changelists, and automate the entire incident lifecycle from initial triage through postmortem generation. 
<ul>
<li style="font-weight:400;">The system can populate timelines, create action items, file bugs in issue trackers, and export documentation automatically.</li>
</ul>
</li>
<li style="font-weight:400;">The workflow creates a feedback loop where generated postmortems become training data for future incident responses, and the pattern is reproducible outside Google using open-source Gemini CLI with MCP servers connecting to tools like Grafana, Prometheus, PagerDuty, and Kubernetes. </li>
<li style="font-weight:400;">Custom commands allow teams to automate their specific operational workflows, similar to Google’s internal postmortem generator.</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2351545/c1e-0424u7ww1rfj29pr-7zrnx3rxbn70-0ybzfu.mp3" length="140816232"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 340 of The Cloud Pod, where the forecast is always cloudy! It’s a full house (eventually) with Justin, Jonathan, Ryan, and Matt all on board for today’s episode. We’ve got a lot of announcements, from Gemini for Gov (no more CamoGPT!) to Route 52 and Claude. Let’s get started! 
Titles we almost went with this week

 Claude’s Pricing Tiers: Free, Pro, and Maximum Overdrive
 GitHub Copilot Learns Database Schema: Finally an AI That Understands Your Joins
 SSMS Gets a Copilot: Your T-SQL Now Writes Itself While You Grab Coffee
 Too Many Cooks in the Cloud Kitchen: How 32 GPUs Outcooked the Big Tech Industrial Kitchens
 Uncle Sam Gets a Gemini Twin: Google’s AI Goes Federal
 Route 53 Gets Domain of Its Own: .ai Joins the Party
 Thai One On: Google Cloud Plants Its Flag in Bangkok
 NAT So Fast: Azure’s Gateway Gets a V2 Glow-Up
 Beware Azure’s SQL Assistant doesn’t smoke your joints.

AI Is Going Great, Or How ML Makes Money  
30:10 Announcing BlackIce: A Containerized Red Teaming Toolkit for AI Security Testing | Databricks Blog

Databricks released BlackIce, an open-source containerized toolkit that bundles 14 AI security testing tools into a single Docker image available on Docker Hub as databricksruntime/blackice:17.3-LTS. 
The toolkit addresses common red teaming challenges, including conflicting dependencies, complex setup requirements, and the fragmented landscape of AI security tools, by providing a unified command-line interface similar to how Kali Linux works for traditional penetration testing.
The toolkit includes tools covering three main categories: Responsible AI, Security testing, and classical adversarial ML, with capabilities mapped to MITRE ATLAS and the Databricks AI Security Framework. 
Tools are organized as either static (simple CLI-based with minimal programming needed) or dynamic (Python-based with customization options), with static tools isolated in separate virtual environments and dynamic tools in a global environment with managed dependencies.
BlackIce integrates directly with Databricks Model Serving endpoints through custom patches applied to several tools, allowing security teams to test for vulnerabilities like prompt injections, data leakage, hallucination detection, jailbreak attacks, and supply chain security issues. 
Users can deploy it via Databricks Container Services by specifying the Docker image URL when creating compute clusters.
The release includes a demo notebook showing how to orchestrate multiple security tools in a single environment, with all build artifacts, tool documentation, and examples available in the GitHub repository. 
The CAMLIS Red Paper provides additional technical details on tool selection criteria and the Docker image architecture.

04:30 Ryan – “It’s very difficult to feel confident in your AI security practice or patterns. I feel like it’s just bleeding edge, and I’m learning so much all the time. And so I spend a lot of time reading papers and talking to others and seeing what they’re doing and meeting with vendors trying to figure out strategy, and it just feels like I’m drinking from a fire hose, and it’s really difficult to feel confident. So I like tools like t...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2351545/c1a-k5d5-gp5d93dphx58-5bogbn.jpg"></itunes:image>
                                                                            <itunes:duration>01:13:07</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2351545/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[339: Just-in-Time Secrets: Because Your AI Agent Can't Keep Its Mouth Shut]]>
                </title>
                <pubDate>Thu, 29 Jan 2026 05:31:58 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2342252</guid>
                                    <link>https://tcpfm.castos.com/episodes/339-just-in-time-secrets-because-your-ai-agent-cant-keep-its-mouth-shut</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 339 of The Cloud Pod, where the forecast is always cloudy! Justin and Matt are in the studio today to bring you all the latest in cloud and AI announcements, including more personnel shifts (and it doesn’t seem like it was very friendly), a new way to get much needed copper, and Azure marketplace advertising 4,000 different models. What’s the real story? Let’s get into it and find out! </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> US-EAST-1: Still the Least Reliable Friend You Keep Inviting to Parties **OpenAI</li>
<li>0⃣ From Zero to Inference: BigQuery Makes Open Models a Two-SQL Problem</li>
<li> AWS Goes Full Brandenburg Gate: Sovereign Cloud Opens for Business</li>
<li> Seven Ate Nine: AWS Skips G7 and Goes Straight to G7e Instances</li>
<li> From Crawling to Calling: Cloudflare Buys Human Native to Fix AI’s Data Problem</li>
<li> Finally, an AI That Actually Listens to Your War Room Panic</li>
<li> Tag, You’re Governed: AWS Automation Takes the Wheel</li>
<li> Cloudflare Reaches for the Stars: Astro Framework Acquisition Lands</li>
<li> Gemini Gets Personal: Google AI Finally Reads Your Email (With Permission)</li>
<li> AWS Strikes Ore: Amazon Cuts Out the Middleman in Copper Supply Chain</li>
<li> When Your Region Goes Down More Often Than Your Kubernetes Cluster</li>
<li> ChatGPT Go: OpenAI’s New Middle Child Gets $8 Allowance</li>
<li> Cloudflare’s Space-Age Acquisition: Astro Gets Jetsons-Level Upgrade</li>
<li> Rosie the Robot Fired: Cloudflare Brings Astro Framework Into the Family</li>
<li> It took 5 years, and now we have ads in our AI. </li>
<li> AI now with Ads</li>
<li> EU says hands off my data</li>
</ul>
<p> </p>
<h2>General News </h2>
<p>00:50 Heather’s data is not unreliable </p>
<ul>
<li>Maybe it’s unreliable.</li>
<li>I blame Matt for having screwed up his outtro (as he did today), in which case I no longer recognize his participation. </li>
</ul>
<p>01:11 <a href="https://blog.cloudflare.com/astro-joins-cloudflare/">Astro is joining Cloudflare</a></p>
<ul>
<li style="font-weight:400;"><a href="https://blog.cloudflare.com/">Cloudflare</a> acquires <a href="https://astro.build/blog/joining-cloudflare/">The Astro Technology Company</a>, bringing the popular open-source web framework in-house while maintaining its MIT license and multi-cloud deployment capabilities. </li>
<li style="font-weight:400;">Major platforms like <a href="https://webflow.com/feature/cloud">Webflow Cloud</a>, <a href="http://v">Wix Vibe</a>, and <a href="https://www.stainless.com/">Stainless</a> already use Astro on Cloudflare infrastructure to power customer websites.</li>
<li style="font-weight:400;"><a href="https://github.com/withastro/astro/milestone/37">Astro 6</a> introduces a redesigned development server built on <a href="https://vite.dev/guide/api-environment">Vite Environments API</a> that runs code locally using the same runtime as production deployment. When using the Cloudflare Vite plugin, developers can test against workerd runtime with access to <a href="https://developers.cloudflare.com/durable-objects/">Durable Objects</a>, <a href="https://developers.cloudflare.com/d1/">D1</a>, <a href="https://developers.cloudflare.com/kv/">KV</a>, and other Cloudflare services during local development.</li>
<li style="font-weight:400;">The framework focuses on content-driven websites through its <a href="https://docs.astro.build/en/concepts/islands/">Islands Architecture</a>, which renders most pages as static HTML while allowing selective client-side interactivity using any UI framework. </li>
<li style="font-weight:400;">This approach addresses the complexity that made building performant websites difficult before 2021, providing a simpler foundation for both human developers and AI coding agents.</li>
<li style="font-weight:400;">Astro 6 adds stable <a href="https://docs.astro.build/en/reference/experimental-flags/live-content-collections/">Live Content Collections</a> for real-time data...</li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:00) - The Cloud Pod</li><li>(00:02:52) - Vite 6 and Cloudflare: Everything You Need to Know</li><li>(00:04:53) - Cloudflare to Acquire Human Data, Boost AI Data</li><li>(00:06:34) - Anthropic Launches a Lab for AI Product Development</li><li>(00:10:44) - Thinking Machine's Co-Founders Return to OpenAI</li><li>(00:13:29) - OpenAI to Add 750 Megawatts of Inference Capacity to Chat</li><li>(00:16:35) - Chat: More Adverts Coming to AI</li><li>(00:18:41) - 1Password for AI-Powered Development</li><li>(00:25:21) - EC2 X8i and G7E: The Bigger</li><li>(00:28:02) - Amazon Launches the AWS European Sovereign Cloud</li><li>(00:32:07) - Amazon to Become First Customer of Rio Tinto's Bio-Le</li><li>(00:34:34) - Curo CLI Update to 1.24</li><li>(00:37:21) - BigQuery adds SQL Native Inference for Open Models</li><li>(00:39:28) - Google Translate Gemma, a New Translation Model</li><li>(00:43:04) - Microsoft's AI Marketplace: Central Hub for AI Adoption</li><li>(00:47:10) - It's All In The Cloud For Azure...</li><li>(00:49:16) - Amazon's Outages for the Year 2025</li><li>(00:51:30) - US East 1 vs. Oregon: Is it Worse?</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 339 of The Cloud Pod, where the forecast is always cloudy! Justin and Matt are in the studio today to bring you all the latest in cloud and AI announcements, including more personnel shifts (and it doesn’t seem like it was very friendly), a new way to get much needed copper, and Azure marketplace advertising 4,000 different models. What’s the real story? Let’s get into it and find out! 
Titles we almost went with this week

 US-EAST-1: Still the Least Reliable Friend You Keep Inviting to Parties **OpenAI
0⃣ From Zero to Inference: BigQuery Makes Open Models a Two-SQL Problem
 AWS Goes Full Brandenburg Gate: Sovereign Cloud Opens for Business
 Seven Ate Nine: AWS Skips G7 and Goes Straight to G7e Instances
 From Crawling to Calling: Cloudflare Buys Human Native to Fix AI’s Data Problem
 Finally, an AI That Actually Listens to Your War Room Panic
 Tag, You’re Governed: AWS Automation Takes the Wheel
 Cloudflare Reaches for the Stars: Astro Framework Acquisition Lands
 Gemini Gets Personal: Google AI Finally Reads Your Email (With Permission)
 AWS Strikes Ore: Amazon Cuts Out the Middleman in Copper Supply Chain
 When Your Region Goes Down More Often Than Your Kubernetes Cluster
 ChatGPT Go: OpenAI’s New Middle Child Gets $8 Allowance
 Cloudflare’s Space-Age Acquisition: Astro Gets Jetsons-Level Upgrade
 Rosie the Robot Fired: Cloudflare Brings Astro Framework Into the Family
 It took 5 years, and now we have ads in our AI. 
 AI now with Ads
 EU says hands off my data

 
General News 
00:50 Heather’s data is not unreliable 

Maybe it’s unreliable.
I blame Matt for having screwed up his outtro (as he did today), in which case I no longer recognize his participation. 

01:11 Astro is joining Cloudflare

Cloudflare acquires The Astro Technology Company, bringing the popular open-source web framework in-house while maintaining its MIT license and multi-cloud deployment capabilities. 
Major platforms like Webflow Cloud, Wix Vibe, and Stainless already use Astro on Cloudflare infrastructure to power customer websites.
Astro 6 introduces a redesigned development server built on Vite Environments API that runs code locally using the same runtime as production deployment. When using the Cloudflare Vite plugin, developers can test against workerd runtime with access to Durable Objects, D1, KV, and other Cloudflare services during local development.
The framework focuses on content-driven websites through its Islands Architecture, which renders most pages as static HTML while allowing selective client-side interactivity using any UI framework. 
This approach addresses the complexity that made building performant websites difficult before 2021, providing a simpler foundation for both human developers and AI coding agents.
Astro 6 adds stable Live Content Collections for real-time data...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[339: Just-in-Time Secrets: Because Your AI Agent Can't Keep Its Mouth Shut]]>
                </itunes:title>
                                    <itunes:episode>339</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 339 of The Cloud Pod, where the forecast is always cloudy! Justin and Matt are in the studio today to bring you all the latest in cloud and AI announcements, including more personnel shifts (and it doesn’t seem like it was very friendly), a new way to get much needed copper, and Azure marketplace advertising 4,000 different models. What’s the real story? Let’s get into it and find out! </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> US-EAST-1: Still the Least Reliable Friend You Keep Inviting to Parties **OpenAI</li>
<li>0⃣ From Zero to Inference: BigQuery Makes Open Models a Two-SQL Problem</li>
<li> AWS Goes Full Brandenburg Gate: Sovereign Cloud Opens for Business</li>
<li> Seven Ate Nine: AWS Skips G7 and Goes Straight to G7e Instances</li>
<li> From Crawling to Calling: Cloudflare Buys Human Native to Fix AI’s Data Problem</li>
<li> Finally, an AI That Actually Listens to Your War Room Panic</li>
<li> Tag, You’re Governed: AWS Automation Takes the Wheel</li>
<li> Cloudflare Reaches for the Stars: Astro Framework Acquisition Lands</li>
<li> Gemini Gets Personal: Google AI Finally Reads Your Email (With Permission)</li>
<li> AWS Strikes Ore: Amazon Cuts Out the Middleman in Copper Supply Chain</li>
<li> When Your Region Goes Down More Often Than Your Kubernetes Cluster</li>
<li> ChatGPT Go: OpenAI’s New Middle Child Gets $8 Allowance</li>
<li> Cloudflare’s Space-Age Acquisition: Astro Gets Jetsons-Level Upgrade</li>
<li> Rosie the Robot Fired: Cloudflare Brings Astro Framework Into the Family</li>
<li> It took 5 years, and now we have ads in our AI. </li>
<li> AI now with Ads</li>
<li> EU says hands off my data</li>
</ul>
<p> </p>
<h2>General News </h2>
<p>00:50 Heather’s data is not unreliable </p>
<ul>
<li>Maybe it’s unreliable.</li>
<li>I blame Matt for having screwed up his outtro (as he did today), in which case I no longer recognize his participation. </li>
</ul>
<p>01:11 <a href="https://blog.cloudflare.com/astro-joins-cloudflare/">Astro is joining Cloudflare</a></p>
<ul>
<li style="font-weight:400;"><a href="https://blog.cloudflare.com/">Cloudflare</a> acquires <a href="https://astro.build/blog/joining-cloudflare/">The Astro Technology Company</a>, bringing the popular open-source web framework in-house while maintaining its MIT license and multi-cloud deployment capabilities. </li>
<li style="font-weight:400;">Major platforms like <a href="https://webflow.com/feature/cloud">Webflow Cloud</a>, <a href="http://v">Wix Vibe</a>, and <a href="https://www.stainless.com/">Stainless</a> already use Astro on Cloudflare infrastructure to power customer websites.</li>
<li style="font-weight:400;"><a href="https://github.com/withastro/astro/milestone/37">Astro 6</a> introduces a redesigned development server built on <a href="https://vite.dev/guide/api-environment">Vite Environments API</a> that runs code locally using the same runtime as production deployment. When using the Cloudflare Vite plugin, developers can test against workerd runtime with access to <a href="https://developers.cloudflare.com/durable-objects/">Durable Objects</a>, <a href="https://developers.cloudflare.com/d1/">D1</a>, <a href="https://developers.cloudflare.com/kv/">KV</a>, and other Cloudflare services during local development.</li>
<li style="font-weight:400;">The framework focuses on content-driven websites through its <a href="https://docs.astro.build/en/concepts/islands/">Islands Architecture</a>, which renders most pages as static HTML while allowing selective client-side interactivity using any UI framework. </li>
<li style="font-weight:400;">This approach addresses the complexity that made building performant websites difficult before 2021, providing a simpler foundation for both human developers and AI coding agents.</li>
<li style="font-weight:400;">Astro 6 adds stable <a href="https://docs.astro.build/en/reference/experimental-flags/live-content-collections/">Live Content Collections</a> for real-time data updates without site rebuilds and includes first-class Content Security Policy support. </li>
<li style="font-weight:400;">The acquisition positions Cloudflare to serve better platform builders who extend Cloudflare services to their own customers through Cloudflare for Platforms.</li>
<li style="font-weight:400;">Tailwind recently laid off 80% of their staff, ostensibly due to AI, so this may have been an opportune moment for an exit. </li>
</ul>
<p>04:15 Matt – “I would assume that they heavily use it (AI) internally, so hopefully it’s something that they can leverage and continue to grow and they don’t have to redevelop their platform.” </p>
<p>04:53 <a href="https://blog.cloudflare.com/human-native-joins-cloudflare/">Human Native is joining Cloudflare</a></p>
<ul>
<li style="font-weight:400;">Cloudflare acquired <a href="https://www.humannative.ai/">Human Native</a>, a UK-based AI data marketplace that transforms multimedia content into structured, searchable data for AI training. </li>
<li style="font-weight:400;">The acquisition accelerates Cloudflare’s AI Index initiative, which uses a pub/sub model to let websites push structured content updates to AI developers in real time, rather than relying on traditional web crawling.</li>
<li style="font-weight:400;">Human Native’s platform focuses on licensed, high-quality training data rather than scraped content, with one UK video AI company reportedly discarding its existing training data after achieving better results with Human Native’s curated datasets. 
<ul>
<li style="font-weight:400;">This approach addresses the growing problem of <a href="https://blog.cloudflare.com/crawlers-click-ai-bots-training/">crawl-to-referral ratios</a> reaching tens of thousands of bot crawls per human visitor.</li>
</ul>
</li>
<li style="font-weight:400;">The acquisition builds on Cloudflare’s existing <a href="https://www.cloudflare.com/en-gb/ai-crawl-control/">AI Crawl Control</a> and <a href="https://developers.cloudflare.com/ai-crawl-control/features/pay-per-crawl/what-is-pay-per-crawl/">Pay Per Crawl</a> products, giving content owners more control over how AI systems access their content. </li>
<li style="font-weight:400;">Human Native’s technology will help customers structure their content for both AI consumption and traditional human audiences while enabling new monetization models.</li>
<li style="font-weight:400;">Cloudflare is positioning this work alongside the <a href="https://blog.cloudflare.com/x402/">x402 Foundation</a> (partnered with Coinbase) to enable machine-to-machine transactions for digital resources. </li>
<li style="font-weight:400;">The combination aims to create new economic models where AI developers can subscribe to structured content feeds and content creators receive fair compensation for their data.</li>
</ul>
<p>05:30 Justin – “We block you from getting to people’s AI content, and now we offer you a way to buy better content. Well played.” </p>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>06:40 <a href="https://www.anthropic.com/news/introducing-anthropic-labs">Introducing Labs \ Anthropic</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/">Anthropic</a> is launching <a href="https://www.anthropic.com/news/introducing-anthropic-labs">Labs</a> as a dedicated team focused on incubating experimental AI products at the frontier of Claude’s capabilities, led by Instagram co-founder Mike Krieger and Ben Mann. </li>
<li style="font-weight:400;">This organizational shift separates rapid experimentation from production scaling, with Ami Vora taking over as head of Product to focus on enterprise-grade Claude experiences.</li>
<li style="font-weight:400;">The Labs approach has already produced several products that moved from research to production, including Claude Code, which reached <a href="https://www.anthropic.com/news/anthropic-acquires-bun-as-claude-code-reaches-usd1b-milestone">$1 billion in revenue within six months of launch</a>, and the Model Context Protocol, which now has <a href="https://www.anthropic.com/news/donating-the-model-context-protocol-and-establishing-of-the-agentic-ai-foundation">100 million monthly downloads</a> and has become an industry standard for connecting AI systems to tools and data.</li>
<li style="font-weight:400;">Recent Labs outputs include <a href="https://claude.com/blog/skills">Skills</a>, <a href="https://www.claude.com/blog/claude-for-chrome">Claude in Chrome</a>, and <a href="https://claude.com/blog/cowork-research-preview">Cowork</a>, which launched as a research preview to bring Claude’s agentic capabilities to desktop environments. This demonstrates the team’s focus on exploring new interaction models and deployment patterns for large language models beyond traditional chat interfaces.</li>
<li style="font-weight:400;">The organizational structure creates two parallel tracks: Labs for frontier experimentation with unpolished versions and early user testing, and the core Product organization partnering with CTO Rahul Patil to scale proven experiences for millions of daily users and enterprise customers. 
<ul>
<li style="font-weight:400;">This separation aims to balance innovation velocity with reliability requirements.</li>
</ul>
</li>
<li style="font-weight:400;">Anthropic is <a href="https://www.anthropic.com/careers">actively hiring for Labs positions</a>, specifically targeting builders with experience creating consumer products and working with emerging technologies. </li>
<li style="font-weight:400;">The team structure reflects the company’s view that rapid AI advancement requires different organizational approaches than traditional product development cycles.</li>
</ul>
<p>08:04 Matt – “The fact that you can get a lab to a GA customer product…is a really hard thing. They seem to have done a pretty good job of that with all these different technologies.” </p>
<p>10:56 <a href="https://techcrunch.com/2026/01/14/mira-muratis-startup-thinking-machines-lab-is-losing-two-of-its-co-founders-to-openai/">Mira Murati’s startup, Thinking Machines Lab, is losing two of its </a><a href="https://techcrunch.com/2026/01/14/mira-muratis-startup-thinking-machines-lab-is-losing-two-of-its-co-founders-to-openai/">co-founders to OpenAI </a></p>
<ul>
<li style="font-weight:400;"><a href="https://thinkingmachines.ai/">Thinking Machines Lab</a>, Mira Murati’s AI startup valued at $12 billion after a<a href="https://techcrunch.com/2025/07/15/mira-muratis-thinking-machines-lab-is-worth-12b-in-seed-round/"> $2 billion seed round last July</a>, has lost two of its three co-founders back to OpenAI within a year of founding. </li>
<li style="font-weight:400;">Barret Zoph, who served as CTO, along with <a href="https://lsvp.com/company/thinking-machines/">co-founder Luke Metz</a> and researcher Sam Schoenholz, returned to OpenAI in what reports suggest was not an amicable departure.</li>
<li style="font-weight:400;">The startup has now lost four key personnel in under a year, including co-founder Andrew Tulloch, <a href="https://techcrunch.com/2025/10/11/thinking-machines-lab-co-founder-andrew-tulloch-heads-to-meta/">who left for Meta</a> in October. </li>
<li style="font-weight:400;">Soumith Chintala has been promoted to replace Zoph as CTO, bringing over a decade of AI field experience to the role.</li>
<li style="font-weight:400;">The rapid co-founder departures raise questions about Thinking Machines’ internal dynamics and strategic direction, particularly given the company secured backing from major investors, including Andreessen Horowitz, Accel, Nvidia, and AMD. The startup has not publicly disclosed what products or services it is developing despite the substantial funding.</li>
<li style="font-weight:400;">This talent movement highlights the ongoing competition for AI research talent among major players, with OpenAI CEO of applications Fidji Simo noting the returns had been in the works for several weeks. The pattern mirrors OpenAI’s own history of co-founder departures to competing ventures, including John Schulman, who left for Anthropic before joining Thinking Machines.</li>
</ul>
<p>12:35 Matt – “It’s interesting that they’re going back to OpenAI. I’m curious, with NDAs and all of that stuff in place, how that is going to work.”  </p>
<p>13:49 <a href="https://openai.com/index/cerebras-partnership/">OpenAI partners with Cerebras </a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/">OpenAI</a> is adding 750MW of dedicated low-latency inference capacity through a partnership with <a href="https://www.cerebras.ai/">Cerebras</a>, with deployment rolling out in phases through 2028. </li>
<li style="font-weight:400;">Cerebras uses a unique architecture with a single giant chip that combines compute, memory, and bandwidth to eliminate traditional bottlenecks in AI inference.</li>
<li style="font-weight:400;">The partnership focuses specifically on accelerating real-time AI responses for workloads like complex queries, code generation, image creation, and AI agents. </li>
<li style="font-weight:400;">OpenAI’s strategy is to match specialized hardware to specific workload types rather than using one-size-fits-all infrastructure.</li>
<li style="font-weight:400;">Cerebras systems are purpose-built for fast token generation during the output phase of inference, which is critical for interactive AI applications where users expect immediate responses. This addresses the request-think-respond loop that determines user experience quality.</li>
<li style="font-weight:400;">The integration represents OpenAI’s approach to building a diversified compute portfolio, adding specialized low-latency systems alongside their existing infrastructure. </li>
<li style="font-weight:400;">This allows them to optimize different types of AI workloads based on performance requirements rather than using general-purpose hardware for everything.</li>
</ul>
<p>14:29 Justin – “In general, anybody that can get you AI capacity is apparently a musto-do.”  </p>
<p>15:49 <a href="https://openai.com/index/introducing-chatgpt-go/">Introducing ChatGPT Go, now available worldwide</a></p>
<ul>
<li style="font-weight:400;">OpenAI launches <a href="https://chatgpt.com/overview?openaicom_referred=true">ChatGPT</a> Go globally at $8 per month, creating a <a href="https://chatgpt.com/pricing?openaicom-did=ad04b478-4d65-49f6-a679-e4f8c7cd9645&amp;openaicom_referred=true">three-tier subscription model</a> with Go, Plus ($20), and Pro ($200). </li>
<li style="font-weight:400;">The Go tier provides 10x more messages, file uploads, and image creation than the free tier, with access to GPT-5.2 Instant, plus longer memory and context windows for improved conversation continuity.</li>
<li style="font-weight:400;">The pricing strategy positions Go as an entry-level paid option for users who need more capacity than the free tier but don’t require the advanced reasoning capabilities of GPT-5.2 Thinking (Plus) or GPT-5.2 Pro. </li>
<li style="font-weight:400;">OpenAI reports that Go became their fastest-growing product after initial rollout to 170 countries, with strong adoption for writing, learning, image creation, and problem-solving tasks.</li>
<li style="font-weight:400;">OpenAI plans to <a href="https://openai.com/index/our-approach-to-advertising-and-expanding-access/">introduce advertising</a> in both the free tier and ChatGPT Go in the US, while Plus, Pro, Business, and Enterprise tiers remain ad-free. </li>
<li style="font-weight:400;">This ad-supported model aims to sustain free and low-cost access points, while generating revenue from users who don’t need premium features.</li>
<li style="font-weight:400;">The tiered approach reflects a shift toward market segmentation similar to traditional SaaS models, with clear differentiation between casual users (Go), professionals (Plus), and power users (Pro). The $8 price point is localized in some markets, suggesting OpenAI is optimizing for purchasing power parity to maximize global adoption.</li>
</ul>
<p>17:00  Matt – “Ads are coming to AI. We all knew it was coming; they have to find additional ways to monetize it.” </p>
<h2>Cloud Tools</h2>
<p>19:15 <a href="https://1password.com/blog/bringing-secure-just-in-time-secrets-to-cursor-with-1password?utm_source=tldrdevops">Bringing secure, just-in-time secrets to Cursor with 1Password</a></p>
<ul>
<li style="font-weight:400;"><a href="https://1password.com/">1Password</a> has integrated with <a href="https://cursor.com/">Cursor</a>, the AI-powered IDE, to provide just-in-time secrets management through <a href="https://cursor.com/docs/agent/hooks">Cursor Hooks</a> that validate and inject credentials at runtime without ever storing them on disk. </li>
<li style="font-weight:400;">This eliminates the common security risk of developers hard-coding API keys or committing secrets to source control while working with AI coding assistants.</li>
<li style="font-weight:400;">The integration works by running a Hook Script before Cursor’s AI agent executes shell commands, verifying that the required environment files from 1Password Environments are properly configured and prompting users to authorize access only when needed. </li>
<li style="font-weight:400;">Secrets remain in memory for the runtime session only, and never touch disk or Git history, maintaining zero-trust principles while keeping development velocity high.</li>
<li style="font-weight:400;">This addresses a critical gap in AI-assisted development where AI agents could potentially access unrestricted credentials, or developers might paste tokens directly into config files for convenience. </li>
<li style="font-weight:400;">The solution lets project owners configure secrets management centrally while individual developers maintain control over authorization through 1Password’s existing access policies and vault permissions.</li>
<li style="font-weight:400;">Plans include granular, task-specific access rules for AI agents, broader support for the Model Context Protocol in external API interactions, automated secret rotation for AI workflows, and enhanced audit visibility for security teams. 
<ul>
<li style="font-weight:400;">The goal is to make secure access a native part of AI-powered development rather than an afterthought bolted on later.</li>
</ul>
</li>
<li style="font-weight:400;">This matters because AI coding tools like Cursor are rapidly becoming standard in developer workflows, but most teams lack proper secrets management for these new AI-driven interactions. </li>
<li style="font-weight:400;">The <a href="https://marketplace.1password.com/integration/cursor-hooks">integration</a> provides a practical path to adopt AI assistance without compromising security posture or requiring developers to change existing 1Password policies.</li>
</ul>
<p>20:34 Justin – “The one thing they don’t mention, which I think is also a big threat, is you’re sending your context to their servers, and if you’re putting your password into the context, that password is now going to the inference systems, and that could potentially get exposed. So it would be nice if this also had the ability to prevent a secret from getting transmitted to the third party LLM.” </p>
<p>23:36 <a href="https://www.harness.io/blog/announcing-the-harness-human-aware-change-agent">Announcing the Harness Human-Aware Change Agent</a></p>
<ul>
<li style="font-weight:400;">Harness launched the Human-Aware Change Agent, an AI system that listens to incident response conversations in Slack, Teams, and Zoom to extract operational clues like “the checkout button froze after they updated their cart” and automatically correlates them with actual production changes, including deployments, feature flags, and config updates. </li>
<li style="font-weight:400;">This solves the problem where critical incident context lives in human conversations but never makes it into automated investigation tools.</li>
<li style="font-weight:400;">The agent is part of <a href="https://www.harness.io/products/ai-sre">Harness AI SRE</a>, which includes an <a href="https://developer.harness.io/docs/ai-sre/ai-agent/">AI Scribe</a> that filters incident-related conversation from noise and feeds it to the change investigation engine. </li>
<li style="font-weight:400;">Instead of just transcribing chat or generating generic RCA summaries, it produces evidence-backed hypotheses like “deployment to checkout-service 12 minutes before the incident introduced new retry config, followed by latency spike and downstream timeouts.”</li>
<li style="font-weight:400;">The system integrates with existing observability and incident management tools, including <a href="https://www.datadoghq.com/">Datadog</a>, <a href="https://www.pagerduty.com/">PagerDuty</a>, <a href="https://www.atlassian.com/software/jira">Jira</a>, <a href="https://www.servicenow.com/">ServiceNow</a>, <a href="https://slack.com/signin#/signin">Slack</a>, and Teams through native integrations and webhooks. </li>
<li style="font-weight:400;">It also includes Automation Runbooks for standardized response and On-Call management to route incidents to the right owners.</li>
<li style="font-weight:400;">The core innovation is treating human insight as operational data rather than assuming incidents can be solved purely through logs, metrics, and traces. This addresses the reality that on-call engineers often identify patterns through conversation before they show up in dashboards, especially as AI-assisted development increases code velocity and reduces clear ownership of changes.</li>
<li style="font-weight:400;">The tool aims to shorten the incident response cycle from “What are we seeing” to “What changed” to “What should we do” by connecting human observations with machine-driven change intelligence in real time during active incidents.</li>
</ul>
<p>25:22  Justin – “Human awareness of how the system works as a whole – because typically AI systems don’t have the context to handle the whole system view – is also very valuable to the AI as well, so I guess we’re going to serving the AI someday, instead of the otherway around.” </p>
<h2>AWS </h2>
<p>26:15 <a href="https://aws.amazon.com/blogs/aws/amazon-ec2-x8i-instances-powered-by-custom-intel-xeon-6-processors-are-generally-available-for-memory-intensive-workloads/">Amazon EC2 X8i instances powered by custom Intel Xeon 6 processors are </a><a href="https://aws.amazon.com/blogs/aws/amazon-ec2-x8i-instances-powered-by-custom-intel-xeon-6-processors-are-generally-available-for-memory-intensive-workloads/">generally available for memory-intensive workloads </a></p>
<ul>
<li style="font-weight:400;">Want to burn all your moneys? Good news! </li>
<li style="font-weight:400;">AWS <a href="https://aws.amazon.com/about-aws/whats-new/2025/12/amazon-ec2-x8i-instances-preview/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">launches</a> X8i instances with custom <a href="https://www.intel.com/content/www/us/en/products/docs/xeon-6-product-brief.html">Intel Xeon 6 processors</a> offering up to 6 TB of memory and 3.9 GHz sustained all-core turbo frequency, delivering 1.5x more memory capacity and 3.4x more memory bandwidth than previous <a href="https://aws.amazon.com/ko/ec2/instance-types/x2i/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">X2i</a> generation. </li>
<li style="font-weight:400;">These SAP-certified instances target memory-intensive workloads like in-memory databases, data analytics, and EDA applications.</li>
<li style="font-weight:400;">Performance improvements are substantial across multiple workloads: 50% higher SAP HANA performance, 47% faster PostgreSQL, 88% faster Memcached, and 46% faster AI inference compared to X2i instances. Real customer deployments show <a href="https://orion.com/">Orion</a> reduced SQL Server licensing costs by 50% while maintaining performance thresholds by using fewer active cores.</li>
<li style="font-weight:400;">The instances come in 14 sizes, including three new larger options (48xlarge, 64xlarge, 96xlarge) and two bare metal variants, with network bandwidth up to 100 Gbps supporting <a href="https://aws.amazon.com/hpc/efa/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">Elastic Fabric Adapter</a> and 80 Gbps <a href="https://aws.amazon.com/ebs/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">EBS</a> throughput. </li>
<li style="font-weight:400;">The instance bandwidth configuration feature allows flexible allocation between network and EBS bandwidth with up to 25% scaling capability.</li>
<li style="font-weight:400;">Currently available in US East N. Virginia, US East Ohio, US West Oregon, and Europe Frankfurt regions with standard purchasing options including <a href="https://aws.amazon.com/ec2/pricing/on-demand/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">On-Demand</a>, <a href="https://aws.amazon.com/savingsplans/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">Savings Plans</a>, and <a href="https://aws.amazon.com/ec2/spot/pricing/?trk=trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">Spot Instances</a>. </li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/ec2/pricing/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">Pricing</a> follows standard EC2 memory-optimized instance rates available on the EC2 pricing page.</li>
</ul>
<p>27:23 <a href="https://aws.amazon.com/blogs/aws/announcing-amazon-ec2-g7e-instances-accelerated-by-nvidia-rtx-pro-6000-blackwell-server-edition-gpus/">Announcing Amazon EC2 G7e instances accelerated by NVIDIA RTX PRO </a><a href="https://aws.amazon.com/blogs/aws/announcing-amazon-ec2-g7e-instances-accelerated-by-nvidia-rtx-pro-6000-blackwell-server-edition-gpus/">6000 Blackwell Server Edition GPUs</a></p>
<ul>
<li style="font-weight:400;">AWS launches <a href="https://aws.amazon.com/ec2/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">EC2</a> G7e instances powered by <a href="https://www.nvidia.com/en-us/data-center/rtx-pro-6000-blackwell-server-edition/">NVIDIA RTX PRO 6000 Blackwell Server Edition GPUs</a>, delivering 2.3x better inference performance compared to <a href="https://aws.amazon.com/ec2/instance-types/g6e/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">G6e instances</a> and doubling GPU memory to 96GB per GPU. </li>
<li style="font-weight:400;">These instances can handle models up to 70B parameters with FP8 precision on a single GPU, with configurations scaling up to 8 GPUs and 768GB total GPU memory per node.</li>
<li style="font-weight:400;">The instances feature substantial networking improvements with 4x the bandwidth of G6e instances (up to 1,600 Gbps) and support for NVIDIA GPUDirect RDMA via Elastic Fabric Adapter for multi-node workloads. </li>
<li style="font-weight:400;">GPUDirect P2P enables direct GPU-to-GPU communication over PCIe with 4x the inter-GPU bandwidth compared to previous generation L40s GPUs, reducing latency for distributed model inference.</li>
<li style="font-weight:400;">G7e instances target generative AI inference, spatial computing, and scientific computing workloads with support for GPUDirect Storage integration with <a href="https://aws.amazon.com/fsx/lustre/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">FSx for Lustre</a>, providing up to 1.2 Tbps throughput for rapid model loading. Configurations range from single GPU instances to 8-GPU systems with up to 192 vCPUs and 2TB of system memory.</li>
<li style="font-weight:400;">Currently available in US East N. Virginia and Ohio regions with support for <a href="https://aws.amazon.com/ec2/pricing/on-demand/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">On-Demand</a>, <a href="https://aws.amazon.com/ec2/spot/pricing/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">Spot</a>, <a href="https://aws.amazon.com/savingsplans/?trk=cc9e0036-98c5-4fa8-8df0-5281f75284ca&amp;sc_channel=el">Savings Plans</a>, <a href="https://aws.amazon.com/ec2/pricing/dedicated-instances/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">Dedicated Instances</a>, and <a href="https://aws.amazon.com/ec2/dedicated-hosts/pricing/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">Dedicated Hosts</a> purchasing options. </li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/sagemaker-ai/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">SageMaker AI</a> integration is planned for future release, while <a href="https://aws.amazon.com/ecs/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">ECS</a> and <a href="https://aws.amazon.com/eks/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;sc_channel=el">EKS </a>support is available now.</li>
</ul>
<p>27:46 Justin – “That’s a lot of power, and cooling, and that where all my RAM went to, which is why my RAM is expensive now.”  </p>
<p>29:00 <a href="https://aws.amazon.com/blogs/aws/opening-the-aws-european-sovereign-cloud/">Opening the AWS European Sovereign Cloud</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.eu/">AWS European Sovereign Cloud</a> is now generally available with its first <a href="https://docs.aws.amazon.com/glossary/latest/reference/glos-chap.html#region">region</a> in Brandenburg, Germany, operating as a physically and logically separate infrastructure partition (aws-eusc) entirely within the EU. </li>
<li style="font-weight:400;">The infrastructure will be <a href="https://www.aboutamazon.eu/news/aws/aws-european-sovereign-cloud-to-be-operated-by-eu-citizens">operated exclusively by EU residents located in the EU</a>, with dedicated IAM and billing systems, and technical controls that prevent access from outside the EU.</li>
<li style="font-weight:400;">The service launches with comprehensive AWS capabilities, including <a href="https://aws.amazon.com/sagemaker/">SageMaker</a>, <a href="https://aws.amazon.com/bedrock/">Bedrock</a>, <a href="https://aws.amazon.com/ec2/">EC2</a>, <a href="https://aws.amazon.com/lambda/">Lambda</a>, <a href="https://aws.amazon.com/eks/">EKS</a>, <a href="https://aws.amazon.com/rds/aurora/">Aurora</a>, <a href="https://aws.amazon.com/dynamodb/">DynamoDB</a>, <a href="https://aws.amazon.com/s3/">S3</a>, and other core services, backed by a 7.8 billion EUR investment expected to contribute 17.2 billion EUR to the European economy through 2040. </li>
<li style="font-weight:400;">Expansion plans include sovereign Local Zones in Belgium, the Netherlands, and Portugal, plus options for Dedicated Local Zones, AI Factories, and Outposts deployments.</li>
<li style="font-weight:400;">The operational model features EU-based management through German legal entities, with <a href="https://www.aboutamazon.eu/news/aws/stephane-israel-appointed-to-lead-the-aws-european-sovereign-cloud">Stephane Israel</a> appointed as managing director and an advisory board of EU citizens providing sovereignty oversight. </li>
<li style="font-weight:400;">The infrastructure maintains AWS security standards, including <a href="https://aws.amazon.com/ec2/nitro/">Nitro System isolation</a>, ISO/IEC 27001, SOC 1/2/3 reports, and BSI C5 attestation, with a <a href="https://aws.amazon.com/blogs/security/exploring-the-new-aws-european-sovereign-cloud-sovereign-reference-framework/">Sovereign Reference Framework</a> available in AWS Artifact.</li>
<li style="font-weight:400;">Data residency guarantees ensure all customer content and metadata, including roles, permissions, and configurations, remain within the EU, using dedicated European trust service providers for certificate authority operations and European TLDs for Route 53 name servers. 
<a href="https://aws.amazon.com/legal/aws-emea/">Pricing is in EUR with billing available in eight supported currencies</a> through Amazon Web Services EMEA SARL.</li>
<li style="font-weight:400;">Major <a href="https://aws.amazon.com/blogs/apn/range-of-aws-partner-solutions-set-to-launch-on-the-aws-european-sovereign-cloud/">AWS partners</a>, including Adobe, Cisco, SAP, Snowflake, and Wiz, are making their solutions available in the sovereign cloud, enabling public sector and highly regulated industry customers to meet strict compliance requirements while accessing modern cloud capabilities without being stuck in legacy on-premises environments.</li>
</ul>
<p>31:53  Justin – “Google’s got the same thing on a partnership with Thales in France. I think Azure is doing something similar as well… but the question is kind of, a European entity owned by a US corporation, does that actually fulfill the concerns the European Union has?” </p>
<p>33:16 <a href="https://www.riotinto.com/en/news/releases/2026/rio-tinto-and-amazon-web-services-collaborate-to-bring-low-carbon-nuton-copper-to-u-s--data-centres">Rio Tinto and Amazon Web Services collaborate to bring low-carbon Nuton </a><a href="https://www.riotinto.com/en/news/releases/2026/rio-tinto-and-amazon-web-services-collaborate-to-bring-low-carbon-nuton-copper-to-u-s--data-centres">copper to U.S. data centres</a></p>
<ul>
<li style="font-weight:400;">AWS becomes the first customer for Rio Tinto’s Nuton bioleaching technology, which uses microorganisms to extract copper from ore at the Johnson Camp mine in Arizona. </li>
<li style="font-weight:400;">The process produces 99.99% pure copper cathode directly at the mine without traditional smelters or refineries, achieving a carbon footprint of 2.82 kgCO2e/kg Cu compared to the global range of 1.5-8.0 kgCO2e/kg Cu.</li>
<li style="font-weight:400;">The two-year agreement supplies low-carbon copper for AWS data center components, including electrical cables, busbars, transformers, circuit boards, and processor heat sinks.</li>
<li style="font-weight:400;">Johnson Camp is now the lowest-carbon primary copper producer in the U.S., targeting approximately 30,000 tonnes of refined copper over four years with 71 liters of water per kilogram versus the industry average of 130 liters.</li>
<li style="font-weight:400;">AWS provides cloud-based data and analytics support to optimize Nuton’s bioleaching operations, including heap-leach performance simulation and advanced analytics for acid and water usage. </li>
<li style="font-weight:400;">The modular system enables rapid scaling and customization for different ore bodies while recovering value from previously classified waste material.</li>
<li style="font-weight:400;">This collaboration addresses supply chain resilience by producing critical materials domestically for U.S. data centers while supporting Amazon’s Climate Pledge goal of net-zero carbon by 2040. </li>
<li style="font-weight:400;">The partnership demonstrates how industrial mining operations can integrate cloud technology to reduce environmental impact and shorten mine-to-market supply chains.</li>
</ul>
<p>34:39 Justin – “It also tells me how much you desperately need it (copper) for all the AI investments you’re about to be making.”  </p>
<p>35:53 <a href="https://kiro.dev/changelog/cli/1-24/?sc_channel=sm&amp;sc_publisher=LINKEDIN&amp;sc_country=global&amp;sc_geo=GLOBAL&amp;sc_outcome=awareness">Skills, Custom Diff Tools, Improved Code Intelligence, and Conversation </a><a href="https://kiro.dev/changelog/cli/1-24/?sc_channel=sm&amp;sc_publisher=LINKEDIN&amp;sc_country=global&amp;sc_geo=GLOBAL&amp;sc_outcome=awareness">Compaction</a></p>
<ul>
<li style="font-weight:400;"><a href="https://kiro.dev/">Kiro CLI version 1.24.0</a> introduces Skills, a new resource type for <a href="https://kiro.dev/docs/cli/custom-agents/configuration-reference/#skill-resources">progressive context loading</a> that only loads metadata at startup and fetches full documentation content on demand when the AI agent needs it. </li>
<li style="font-weight:400;">This addresses memory constraints when working with large documentation sets by requiring YAML frontmatter with descriptive metadata to help agents determine when to load complete content.</li>
<li style="font-weight:400;">The release adds <a href="https://kiro.dev/docs/cli/code-intelligence/">built-in code intelligence</a> for 18 programming languages, including Python, JavaScript, Go, Rust, and others, without requiring LSP setup. Developers get immediate access to symbol search, definition navigation, and structural code searches, plus a new /code overview command for quick workspace analysis.</li>
<li style="font-weight:400;">New AST-based pattern-search and pattern-rewrite tools enable precise code refactoring by matching syntax tree patterns instead of text regex. This eliminates false matches in string literals and comments, providing more reliable code transformations for AI agents.</li>
<li style="font-weight:400;">Conversation Compaction addresses context window limitations with a /compact command that summarizes conversation history while preserving key information. The feature triggers automatically when context limits are reached and creates a new session while allowing users to resume the original conversation, with configurable retention settings for message pairs and context window percentage.</li>
<li style="font-weight:400;">The update includes granular URL permissions for the web_fetch tool using regex patterns to control which domains AI agents can access, plus remote authentication support for Google and GitHub when running Kiro CLI on remote machines via SSH, SSM, or containers.</li>
</ul>
<h2>GCP</h2>
<p>38:43 <a href="https://cloud.google.com/blog/products/data-analytics/introducing-bigquery-managed-and-sql-native-inference-for-open-models/">Introducing BigQuery managed and SQL-native inference for open models | </a><a href="https://cloud.google.com/blog/products/data-analytics/introducing-bigquery-managed-and-sql-native-inference-for-open-models/">Google</a></p>
<ul>
<li style="font-weight:400;">BigQuery now supports SQL-native inference for open models from Hugging Face and Vertex AI Model Garden through a two-step process: CREATE MODEL with a model ID string, then run inference using AI.GENERATE_TEXT or AI.GENERATE_EMBEDDING functions. </li>
<li style="font-weight:400;">This eliminates the need for separate infrastructure management or API integrations outside of BigQuery.</li>
<li style="font-weight:400;">The service includes automated resource management with configurable idle timeout settings that automatically undeploy endpoints when not in use, preventing runaway costs from idle GPU instances. </li>
<li style="font-weight:400;">Users can customize machine types, replica counts, and leverage <a href="https://docs.cloud.google.com/compute/docs/instances/reservations-overview">Compute Engine reservations</a> for consistent GPU availability on demanding workloads.</li>
<li style="font-weight:400;">This extends BigQuery’s existing managed inference capabilities beyond Google’s Gemini models and partner models like Anthropic and Mistral to any compatible open model. </li>
<li style="font-weight:400;">The entire lifecycle from deployment to cleanup happens through SQL statements, making LLM inference accessible to data analysts without requiring ML engineering expertise.</li>
<li style="font-weight:400;">The feature is currently in Preview and supports both text generation and embedding generation workloads directly on data stored in BigQuery tables. </li>
<li style="font-weight:400;">Cost control includes both automated endpoint recycling based on idle time and manual undeploy options via ALTER MODEL statements, with automatic cleanup of all Vertex AI resources when models are dropped.</li>
</ul>
<p>39:45 Matt – “This all seems crazy to me; this is where we’re at, where AI is writing, creating models, running all of these things for us.” </p>
<p>40:56 <a href="https://blog.google/innovation-and-ai/technology/developers-tools/translategemma/">TranslateGemma: A new family of open translation models</a></p>
<ul>
<li style="font-weight:400;">Google released TranslateGemma, a new family of open translation models based on <a href="https://deepmind.google/models/gemma/gemma-3/">Gemma 3</a>, available in 4B, 12B, and 27B parameter sizes supporting 55 languages. </li>
<li style="font-weight:400;">The models use a two-stage training process combining supervised fine-tuning on parallel data from human translations and Gemini-generated synthetic translations, followed by reinforcement learning using MetricX-QE and AutoMQM reward models.</li>
<li style="font-weight:400;">The 12B TranslateGemma model outperforms the baseline Gemma 3 27B model on WMT24++ benchmarks while using less than half the parameters, delivering higher throughput and lower latency. </li>
<li style="font-weight:400;">The 4B model matches the performance of the 12B baseline, making it suitable for mobile inference and edge deployment.</li>
<li style="font-weight:400;">TranslateGemma retains Gemma 3’s multimodal capabilities, showing improved performance on the Vistra image translation benchmark without specific multimodal fine-tuning. </li>
<li style="font-weight:400;">The models were trained on nearly 500 language pairs beyond the core 55, providing a foundation for researchers to fine-tune for specific language pairs or low-resource languages.</li>
<li style="font-weight:400;">The models are optimized for different deployment scenarios: 4B for mobile and edge devices, 12B for consumer laptops, and 27B for a single H100 GPU or TPU cloud deployment. </li>
<li style="font-weight:400;">All three sizes are available now for developers and researchers to download and use.</li>
</ul>
<p>41:50  Justin – “I am excited about the idea of models that specialize in supporting language translations; and so this is things that power future products inside of your Android phones someday, where Apple has a feature where it can slowly translate things through your Airpods… it’s a little delayed but it works relatively well. I’m sure this will bring similar type capabilities to you and your Android phone.”       </p>
<h2>Azure</h2>
<p>44:40 <a href="https://azure.microsoft.com/en-us/blog/design-your-ai-and-agent-strategy-with-microsoft-marketplace/">Design your AI strategy with Microsoft Marketplace Solutions</a></p>
<ul>
<li style="font-weight:400;">Microsoft positions its <a href="https://marketplace.microsoft.com/en-us/home">Marketplace</a> as a central hub for AI adoption with over 11,000 pre-packaged <a href="https://azure.microsoft.com/en-us/products/ai-foundry/models/?msockid=1c424d43ffa569650cfd5b7afee668a6">models</a> (“models”) and 4,000 AI apps and <a href="https://support.microsoft.com/en-us/topic/get-started-with-agents-in-microsoft-365-copilot-943e563d-602d-40fa-bdd1-dbc83f582466">agents</a>, offering organizations flexible build-buy-blend strategies for implementing AI solutions. </li>
<li style="font-weight:400;">The platform integrates directly into existing Microsoft tools like <a href="https://www.microsoft.com/en-us/microsoft-365-copilot/microsoft-copilot-studio/">Copilot Studio</a> and <a href="https://ai.azure.com/">Azure Foundry</a>, allowing teams to discover and deploy AI components within their normal workflows rather than switching between separate procurement systems.</li>
<li style="font-weight:400;">The Marketplace supports both pro-code development with full control over custom logic and IP ownership, and low-code approaches through Copilot Studio using models from providers like <a href="https://www.microsoft.com/en-us/microsoft-copilot/blog/copilot-studio/anthropic-joins-the-multi-model-lineup-in-microsoft-copilot-studio/?msockid=3a06df49bfce66741dbfcd48be526714">Anthropic, OpenAI</a>, Meta, and NVIDIA. </li>
<li style="font-weight:400;">Organizations with Azure <a href="https://go.microsoft.com/fwlink/?linkid=2313441&amp;clcid=0x409&amp;culture=en-us&amp;country=us">consumption commitments</a> can apply Marketplace purchases dollar-for-dollar against their contracts with no limit, potentially improving ROI on existing Microsoft agreements.</li>
<li style="font-weight:400;">Microsoft emphasizes a blended approach where companies can extend partner solutions with proprietary components, illustrated by financial services firms deploying pre-built fraud detection models while customizing them with internal data pipelines and compliance workflows. </li>
<li style="font-weight:400;">This strategy reduces the engineering effort and compliance review cycles compared to building detection systems from scratch while maintaining data security through Managed Identity within Azure tenants.</li>
<li style="font-weight:400;">The platform includes try-before-you-buy capabilities with trials and proofs-of-concept that run within customer Microsoft environments, allowing validation before full deployment. </li>
<li style="font-weight:400;">Solutions are filtered by product, category, and industry to match specific organizational needs, with agents available directly in Microsoft 365 Copilot and models accessible through the Azure portal.</li>
</ul>
<p> Cloud Journey </p>
<p>52:07 <a href="https://statusgator.com/blog/aws-least-reliable-region-in-2025/">Is Northern Virginia Still the Least Reliable AWS Region in 2025? We </a><a href="https://statusgator.com/blog/aws-least-reliable-region-in-2025/">Analyzed the Data</a></p>
<ul>
<li style="font-weight:400;">StatusGator published an analysis of AWS outages from January through December 2025, focusing on regional reliability and service-level incidents across all commercial AWS regions</li>
</ul>
<ul>
<li style="font-weight:400;">N. Virginia (us-east-1) is the least reliable AWS region: 10 outages, 34 hours of downtime, 126 components affected</li>
<li style="font-weight:400;">October 20, 2025, was one of AWS’s most significant outages ever: 76 components down for ~15 hours, cascading failures across thousands of SaaS platforms</li>
<li style="font-weight:400;">Compute and ML services hit hardest: EC2 (14 outages), SageMaker (11), Glue (10), EMR (10), ECS (10)</li>
<li style="font-weight:400;">Several services exceeded 24 hours cumulative downtime: OpenSearch, CloudWatch, EMR Serverless, STS</li>
<li style="font-weight:400;">Multi-region (“Regionless”) outages increased: 12 incidents, 32 hours of downtime</li>
<li style="font-weight:400;">Status Gator speculates reasons:</li>
</ul>
<ul>
<li style="font-weight:400;">Customer density: us-east-1 has 2x the users of Oregon and 3x other regions</li>
<li style="font-weight:400;">Higher service density creates more interconnected dependencies and potential failure points</li>
<li style="font-weight:400;">Heavier API traffic and more complex multi-AZ coordination</li>
<li style="font-weight:400;">No evidence that the age of the region or architectural differences are factors</li>
</ul>
<ul>
<li style="font-weight:400;">Best Practices from Status Gator
<ul>
<li style="font-weight:400;">Avoid over-reliance on a single region, especially us-east-1</li>
<li style="font-weight:400;">Design for multi-region resilience and failover</li>
<li style="font-weight:400;">Monitor authentication/identity services (STS) as critical dependencies</li>
<li style="font-weight:400;">Consider the blast radius when selecting primary regions</li>
</ul>
</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2342252/c1e-wnmnsv4ok0f63gj2-0v97gmx5trr8-pfmjxg.mp3" length="107238929"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 339 of The Cloud Pod, where the forecast is always cloudy! Justin and Matt are in the studio today to bring you all the latest in cloud and AI announcements, including more personnel shifts (and it doesn’t seem like it was very friendly), a new way to get much needed copper, and Azure marketplace advertising 4,000 different models. What’s the real story? Let’s get into it and find out! 
Titles we almost went with this week

 US-EAST-1: Still the Least Reliable Friend You Keep Inviting to Parties **OpenAI
0⃣ From Zero to Inference: BigQuery Makes Open Models a Two-SQL Problem
 AWS Goes Full Brandenburg Gate: Sovereign Cloud Opens for Business
 Seven Ate Nine: AWS Skips G7 and Goes Straight to G7e Instances
 From Crawling to Calling: Cloudflare Buys Human Native to Fix AI’s Data Problem
 Finally, an AI That Actually Listens to Your War Room Panic
 Tag, You’re Governed: AWS Automation Takes the Wheel
 Cloudflare Reaches for the Stars: Astro Framework Acquisition Lands
 Gemini Gets Personal: Google AI Finally Reads Your Email (With Permission)
 AWS Strikes Ore: Amazon Cuts Out the Middleman in Copper Supply Chain
 When Your Region Goes Down More Often Than Your Kubernetes Cluster
 ChatGPT Go: OpenAI’s New Middle Child Gets $8 Allowance
 Cloudflare’s Space-Age Acquisition: Astro Gets Jetsons-Level Upgrade
 Rosie the Robot Fired: Cloudflare Brings Astro Framework Into the Family
 It took 5 years, and now we have ads in our AI. 
 AI now with Ads
 EU says hands off my data

 
General News 
00:50 Heather’s data is not unreliable 

Maybe it’s unreliable.
I blame Matt for having screwed up his outtro (as he did today), in which case I no longer recognize his participation. 

01:11 Astro is joining Cloudflare

Cloudflare acquires The Astro Technology Company, bringing the popular open-source web framework in-house while maintaining its MIT license and multi-cloud deployment capabilities. 
Major platforms like Webflow Cloud, Wix Vibe, and Stainless already use Astro on Cloudflare infrastructure to power customer websites.
Astro 6 introduces a redesigned development server built on Vite Environments API that runs code locally using the same runtime as production deployment. When using the Cloudflare Vite plugin, developers can test against workerd runtime with access to Durable Objects, D1, KV, and other Cloudflare services during local development.
The framework focuses on content-driven websites through its Islands Architecture, which renders most pages as static HTML while allowing selective client-side interactivity using any UI framework. 
This approach addresses the complexity that made building performant websites difficult before 2021, providing a simpler foundation for both human developers and AI coding agents.
Astro 6 adds stable Live Content Collections for real-time data...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2342252/c1a-k5d5-nd1vm9qzhnz-biprrf.jpg"></itunes:image>
                                                                            <itunes:duration>00:55:46</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2342252/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[338: T5Gemma Says "AI’ll be Back”]]>
                </title>
                <pubDate>Thu, 22 Jan 2026 00:20:14 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2333121</guid>
                                    <link>https://tcpfm.castos.com/episodes/338-t5gemma-says-aill-be-back</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 338 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, Matt, and Jonathan are in the studio today to bring you all the latest in cloud and AI news, including a bit of a buying spree (inlcuding whole power companies) Veo 3.1, Cowork, and more – today in the cloud!  </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> Snowflake’s Ironic Timing: Buying Downtime Prevention Tool While Experiencing Downtime</li>
<li> Flexera Buys ProsperOps and Chaos Genius, Promises Less Chaos and More Prosperity</li>
<li> Flexera Goes Shopping: Two FinOps Acquisitions to Prosper and Reduce Chaos</li>
<li> Token of Appreciation: Gemini CLI Now Tracks Every Penny of Your AI Spend</li>
<li> Snowflake Buys Observe to Stop Its Own Services from Melting Down</li>
<li> Google’s Veo 3.1 Goes Vertical: Finally Understanding How People Actually Hold Their Phones</li>
<li> Alphabet’s New Power Move: Buying the Company That Literally Powers Data Centers</li>
<li> Dashboard Confessional: Gemini CLI Gets Transparent About Its Usage</li>
<li> Microsoft’s New Agent Works 24/7 and Never Asks for a Raise</li>
<li>From Robot Vacuums That Climb Stairs to TVs You Can’t Feel: CES Gets Weird</li>
<li> Agent Shopping: When Your AI Has Better Taste Than You Do</li>
<li> The cloudpod hosts do not like any stories this week</li>
<li> AWS took a nap on announcements this week</li>
<li> Claude is my new co-worker</li>
<li> Wake up, AWS, and give us some fun news</li>
<li> The $200 Assistant: Is Cowork the End of Workplace Admins?</li>
<li> Azure has more interesting announcements than AWS oh noooo</li>
<li> If you can’t beat them in AI, just acquire everyone</li>
<li> Notebook LM turns the Data Tables on you</li>
</ul>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>01:11 <a href="https://arstechnica.com/ai/2026/01/anthropic-launches-cowork-a-claude-code-like-for-general-computing/">Anthropic launches Cowork, a Claude Code-like for general computing – </a><a href="https://arstechnica.com/ai/2026/01/anthropic-launches-cowork-a-claude-code-like-for-general-computing/">Ars Technica</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/">Anthropic</a> launches <a href="https://claude.com/blog/cowork-research-preview">Cowork</a>, a new feature in the macOS Claude desktop app that extends <a href="https://claude.com/product/claude-code">Claude Code</a>‘s agentic capabilities to general office work tasks. </li>
<li style="font-weight:400;">Users can grant Claude access to specific folders and use plain language instructions to automate tasks like filling expense reports from receipt photos, writing reports from notes, or reorganizing files.</li>
<li style="font-weight:400;">Cowork lowers the technical barrier compared to Claude Code by making AI-assisted file operations accessible to non-developer knowledge workers, including marketers and office staff. </li>
<li style="font-weight:400;">The feature was developed after Anthropic observed users already applying Claude Code to general knowledge work despite its developer-focused positioning.</li>
<li style="font-weight:400;">The tool provides similar functionality to what was possible through Model Context Protocol integrations, but offers a more streamlined interface with Claude Code-style usability improvements. </li>
<li style="font-weight:400;">Users can submit new requests or modifications to ongoing tasks without waiting for the initial assignment to complete.</li>
<li style="font-weight:400;">Cowork represents a strategic expansion of Anthropic’s agentic AI approach beyond software development into broader productivity workflows. The feature demonstrates how AI agents with file system access can automate routine knowledge work tasks that previously required manual processing of documents and data.</li>
</ul>
<p>02:15  Ryan – “This week is the first time I actually tried to use AI to generate a PowerPoint presentation. It did not go well. It did gener...</p>
<h3>Chapters</h3>
<ul><li>(00:00:00) - Azure Weekly</li><li>(00:00:43) - Cloud Code Launches Cowork for iOS</li><li>(00:06:53) - Google's Video Output (VO 3.1)</li><li>(00:10:10) - Snowflake to Integrate Observe into its Data Platform</li><li>(00:12:47) - Flexera Expands Cloud Commitment Management with Acquisitions</li><li>(00:17:44) - AWS: Sleeping in Seattle</li><li>(00:18:35) - GCP 10.2: Gemini CLI Monitoring with Google Cloud</li><li>(00:20:58) - Alphabet to Acquire Data Center Company</li><li>(00:23:14) - Google's Notebook LLM Adds Data Tables</li><li>(00:27:53) - Google's T5 Gemma 2: Multodal Vision Models</li><li>(00:31:30) - Google Launches Universal Commerce Protocol (UCP) for AI Agents</li><li>(00:37:36) - Microsoft's Dynamic Threat Detection Agent in Public Preview</li><li>(00:40:20) - Azure Service Bus Premium: Cross-Regional Replication</li><li>(00:44:41) - This Week in Cloud: Amazon Stories</li><li>(00:45:32) - CES 2017: The Best Tech Gadgets</li><li>(00:51:27) - How to Get Your Smoke Detector to Work</li><li>(00:53:25) - Fooled by Apple's Fold Phone</li><li>(00:55:42) - E Ink Poster and Raspberry PI</li><li>(00:59:44) - Lawyers Use the Remarkable Notebook</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 338 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, Matt, and Jonathan are in the studio today to bring you all the latest in cloud and AI news, including a bit of a buying spree (inlcuding whole power companies) Veo 3.1, Cowork, and more – today in the cloud!  
Titles we almost went with this week

 Snowflake’s Ironic Timing: Buying Downtime Prevention Tool While Experiencing Downtime
 Flexera Buys ProsperOps and Chaos Genius, Promises Less Chaos and More Prosperity
 Flexera Goes Shopping: Two FinOps Acquisitions to Prosper and Reduce Chaos
 Token of Appreciation: Gemini CLI Now Tracks Every Penny of Your AI Spend
 Snowflake Buys Observe to Stop Its Own Services from Melting Down
 Google’s Veo 3.1 Goes Vertical: Finally Understanding How People Actually Hold Their Phones
 Alphabet’s New Power Move: Buying the Company That Literally Powers Data Centers
 Dashboard Confessional: Gemini CLI Gets Transparent About Its Usage
 Microsoft’s New Agent Works 24/7 and Never Asks for a Raise
From Robot Vacuums That Climb Stairs to TVs You Can’t Feel: CES Gets Weird
 Agent Shopping: When Your AI Has Better Taste Than You Do
 The cloudpod hosts do not like any stories this week
 AWS took a nap on announcements this week
 Claude is my new co-worker
 Wake up, AWS, and give us some fun news
 The $200 Assistant: Is Cowork the End of Workplace Admins?
 Azure has more interesting announcements than AWS oh noooo
 If you can’t beat them in AI, just acquire everyone
 Notebook LM turns the Data Tables on you

AI Is Going Great – Or How ML Makes Money 
01:11 Anthropic launches Cowork, a Claude Code-like for general computing – Ars Technica

Anthropic launches Cowork, a new feature in the macOS Claude desktop app that extends Claude Code‘s agentic capabilities to general office work tasks. 
Users can grant Claude access to specific folders and use plain language instructions to automate tasks like filling expense reports from receipt photos, writing reports from notes, or reorganizing files.
Cowork lowers the technical barrier compared to Claude Code by making AI-assisted file operations accessible to non-developer knowledge workers, including marketers and office staff. 
The feature was developed after Anthropic observed users already applying Claude Code to general knowledge work despite its developer-focused positioning.
The tool provides similar functionality to what was possible through Model Context Protocol integrations, but offers a more streamlined interface with Claude Code-style usability improvements. 
Users can submit new requests or modifications to ongoing tasks without waiting for the initial assignment to complete.
Cowork represents a strategic expansion of Anthropic’s agentic AI approach beyond software development into broader productivity workflows. The feature demonstrates how AI agents with file system access can automate routine knowledge work tasks that previously required manual processing of documents and data.

02:15  Ryan – “This week is the first time I actually tried to use AI to generate a PowerPoint presentation. It did not go well. It did gener...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[338: T5Gemma Says "AI’ll be Back”]]>
                </itunes:title>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 338 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, Matt, and Jonathan are in the studio today to bring you all the latest in cloud and AI news, including a bit of a buying spree (inlcuding whole power companies) Veo 3.1, Cowork, and more – today in the cloud!  </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> Snowflake’s Ironic Timing: Buying Downtime Prevention Tool While Experiencing Downtime</li>
<li> Flexera Buys ProsperOps and Chaos Genius, Promises Less Chaos and More Prosperity</li>
<li> Flexera Goes Shopping: Two FinOps Acquisitions to Prosper and Reduce Chaos</li>
<li> Token of Appreciation: Gemini CLI Now Tracks Every Penny of Your AI Spend</li>
<li> Snowflake Buys Observe to Stop Its Own Services from Melting Down</li>
<li> Google’s Veo 3.1 Goes Vertical: Finally Understanding How People Actually Hold Their Phones</li>
<li> Alphabet’s New Power Move: Buying the Company That Literally Powers Data Centers</li>
<li> Dashboard Confessional: Gemini CLI Gets Transparent About Its Usage</li>
<li> Microsoft’s New Agent Works 24/7 and Never Asks for a Raise</li>
<li>From Robot Vacuums That Climb Stairs to TVs You Can’t Feel: CES Gets Weird</li>
<li> Agent Shopping: When Your AI Has Better Taste Than You Do</li>
<li> The cloudpod hosts do not like any stories this week</li>
<li> AWS took a nap on announcements this week</li>
<li> Claude is my new co-worker</li>
<li> Wake up, AWS, and give us some fun news</li>
<li> The $200 Assistant: Is Cowork the End of Workplace Admins?</li>
<li> Azure has more interesting announcements than AWS oh noooo</li>
<li> If you can’t beat them in AI, just acquire everyone</li>
<li> Notebook LM turns the Data Tables on you</li>
</ul>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>01:11 <a href="https://arstechnica.com/ai/2026/01/anthropic-launches-cowork-a-claude-code-like-for-general-computing/">Anthropic launches Cowork, a Claude Code-like for general computing – </a><a href="https://arstechnica.com/ai/2026/01/anthropic-launches-cowork-a-claude-code-like-for-general-computing/">Ars Technica</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/">Anthropic</a> launches <a href="https://claude.com/blog/cowork-research-preview">Cowork</a>, a new feature in the macOS Claude desktop app that extends <a href="https://claude.com/product/claude-code">Claude Code</a>‘s agentic capabilities to general office work tasks. </li>
<li style="font-weight:400;">Users can grant Claude access to specific folders and use plain language instructions to automate tasks like filling expense reports from receipt photos, writing reports from notes, or reorganizing files.</li>
<li style="font-weight:400;">Cowork lowers the technical barrier compared to Claude Code by making AI-assisted file operations accessible to non-developer knowledge workers, including marketers and office staff. </li>
<li style="font-weight:400;">The feature was developed after Anthropic observed users already applying Claude Code to general knowledge work despite its developer-focused positioning.</li>
<li style="font-weight:400;">The tool provides similar functionality to what was possible through Model Context Protocol integrations, but offers a more streamlined interface with Claude Code-style usability improvements. </li>
<li style="font-weight:400;">Users can submit new requests or modifications to ongoing tasks without waiting for the initial assignment to complete.</li>
<li style="font-weight:400;">Cowork represents a strategic expansion of Anthropic’s agentic AI approach beyond software development into broader productivity workflows. The feature demonstrates how AI agents with file system access can automate routine knowledge work tasks that previously required manual processing of documents and data.</li>
</ul>
<p>02:15  Ryan – “This week is the first time I actually tried to use AI to generate a PowerPoint presentation. It did not go well. It did generate some cool images, though.” </p>
<p>07:42 <a href="https://blog.google/innovation-and-ai/technology/developers-tools/veo-3-1-gemini-api/">Enhanced Veo 3.1 capabilities are now available in the Gemini API.</a></p>
<ul>
<li style="font-weight:400;">Google has released <a href="https://blog.google/innovation-and-ai/technology/ai/veo-3-1-ingredients-to-video/">Veo 3.1 updates</a> in the <a href="https://ai.google.dev/gemini-api/docs/video?example=dialogue">Gemini API</a> and <a href="https://aistudio.google.com/prompts/new_video?model=veo-3.1-generate-preview">Google AI Studio</a>, adding enhanced Ingredients to Video capabilities that maintain character identity and background consistency across generated videos. </li>
<li style="font-weight:400;">The model now supports native 9:16 vertical format generation optimized for mobile-first applications, eliminating the need to crop from landscape orientation.</li>
<li style="font-weight:400;">The updated model delivers professional-grade output with new 4K resolution support and improved 1080p quality using state-of-the-art enhancement techniques. All generated videos include SynthID digital watermarking for content provenance tracking.</li>
<li style="font-weight:400;">These capabilities are available today through the Gemini API for developers and <a href="http://v">Vertex AI</a> for enterprise customers. Google AI Studio provides a demo app for testing the new features at ai.studio/apps/bundled/veo_studio.</li>
<li style="font-weight:400;">The vertical video format addresses the growing demand for social media content creation, while the 4K output positions Veo 3.1 for professional video production workflows. The character consistency improvements reduce the need for manual editing and post-processing in multi-shot video projects.</li>
</ul>
<p>08:20  Justin – “Don’t make the same mistakes that I do, and go try this and then get a $35 bill, which I did the first time I tried Veo out. So, do be cautious with this one!”  </p>
<p>11:08 <a href="https://www.snowflake.com/content/snowflake-site/global/en/blog/observe-ai-powered-observability">Snowflake Announces Intent to Acquire Observe to Deliver AI-Powered </a><a href="https://www.snowflake.com/content/snowflake-site/global/en/blog/observe-ai-powered-observability">Observability</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.snowflake.com/en/">Snowflake</a> is acquiring <a href="https://www.observe.ai/">Observe</a> to integrate AI-powered observability directly into its data platform, allowing customers to analyze telemetry data like logs, metrics, and traces alongside their business data. </li>
<li style="font-weight:400;">This consolidation eliminates the need for separate observability tools and reduces data movement between systems.</li>
<li style="font-weight:400;">The acquisition addresses the growing challenge of managing observability data at scale, which has become increasingly expensive and complex as organizations generate massive volumes of telemetry information. </li>
<li style="font-weight:400;">Observe’s approach stores data in a structured format that enables more efficient querying and analysis compared to traditional observability platforms.</li>
<li style="font-weight:400;">By bringing observability into Snowflake’s platform, customers can correlate operational metrics with business outcomes using the same SQL-based tools they already use for analytics. </li>
<li style="font-weight:400;">This unified approach should help teams identify how application performance issues directly impact revenue, customer experience, and other business metrics.</li>
<li style="font-weight:400;">The deal positions Snowflake to compete more directly with observability vendors like <a href="https://www.datadoghq.com/">Datadog</a>, <a href="https://www.splunk.com/">Splunk</a>, and <a href="https://newrelic.com/">New Relic</a> by offering native capabilities rather than requiring third-party integrations. </li>
<li style="font-weight:400;">Organizations already using Snowflake for data warehousing can now consolidate their observability spend and simplify their tool stack.</li>
</ul>
<p>12:08 Ryan – “I don’t know how to feel about this; I feel like Snowflake is a part of an application, but it’s not the entirety of an application. I definitely see a use for this for data warehousing and visualizing, but I don’t think it replaces your traditional observability tools because you have too many data sources that are outside of Snowflake.” </p>
<h2>Cloud Tools</h2>
<p>13:58 <a href="https://www.flexera.com/about-us/press-center/flexera-expands-its-finops-solution-with-agentic-and-ai-enabled-cost-optimization">Flexera acquires ProsperOps and Chaos Genius to expand its FinOps </a><a href="https://www.flexera.com/about-us/press-center/flexera-expands-its-finops-solution-with-agentic-and-ai-enabled-cost-optimization">solution with agentic and AI-enabled cost optimization</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.flexera.com/">Flexera</a> acquires two FinOps companies to add autonomous AI-driven cost optimization across major cloud platforms and data analytics services: <a href="https://www.prosperops.com/">ProsperOps</a> brings automated commitment management for AWS, Azure, and Google Cloud with over $6B in annual cloud usage under management, while <a href="https://www.chaosgenius.io/">Chaos Genius</a> focuses specifically on <a href="https://www.snowflake.com/en/">Snowflake</a> and <a href="https://www.databricks.com/">Databricks</a> optimization with reported cost reductions up to 30%.</li>
<li style="font-weight:400;">The acquisitions shift Flexera’s FinOps approach from passive recommendations to active autonomous execution through agentic AI. </li>
<li style="font-weight:400;">This means the platform can automatically purchase and manage cloud commitments and optimize data workloads without requiring manual human intervention, addressing the challenge of dynamic cloud usage patterns that don’t align well with static commitment purchases.</li>
<li style="font-weight:400;">ProsperOps will continue operating as a separate brand while integrating with Flexera’s existing FinOps capabilities. The company was growing at over 90% and has generated more than $3 billion in lifetime savings for customers, suggesting strong market demand for automated rate optimization solutions.</li>
<li style="font-weight:400;">The Chaos Genius acquisition specifically targets the emerging problem of runaway costs in data analytics platforms like Snowflake and Databricks as AI workloads scale. </li>
<li style="font-weight:400;">This addresses a gap in traditional FinOps tools that primarily focused on compute and storage optimization but lacked specialized capabilities for modern data cloud platforms.</li>
<li style="font-weight:400;">These moves position Flexera to cover the complete FinOps Framework defined by the FinOps Foundation, combining cost visibility, workload optimization, and rate optimization in a single platform. </li>
<li style="font-weight:400;">This matters for enterprises struggling to manage costs across an increasingly complex mix of traditional cloud services, AI infrastructure, and specialized data platforms.</li>
</ul>
<p>15:35  Matt – “It definitely needs some pretty strong guardrails of what your business objective is, like don’t go over 90% savings plan or look at the secondary market for short term if you see a random burst for a few months. But it’s not a terrible idea…”      </p>
<h2>AWS</h2>
<p>19:12 Weirdly enough, there are no AWS stories this week. </p>
<h2>GCP</h2>
<p>20:06 <a href="https://cloud.google.com/blog/topics/developers-practitioners/instant-insights-gemini-clis-new-pre-configured-monitoring-dashboards/">Instant insights: Gemini CLI’s New Pre-Configured Monitoring Dashboards </a><a href="https://cloud.google.com/blog/topics/developers-practitioners/instant-insights-gemini-clis-new-pre-configured-monitoring-dashboards/">| Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google has added pre-configured monitoring dashboards to Gemini CLI that provide immediate visibility into usage metrics like monthly active users, token consumption, and code changes without requiring custom query writing. </li>
<li style="font-weight:400;">The dashboards integrate with <a href="https://cloud.google.com/monitoring">Google Cloud Monitoring</a> and use <a href="https://github.com/google-gemini/gemini-cli/blob/main/docs/cli/telemetry.md">OpenTelemetry</a> for standardized data collection, allowing teams to track CLI adoption and performance across their organization.</li>
<li style="font-weight:400;">The implementation uses direct GCP exporters that bypass intermediate OTLP collector configurations, simplifying setup to three steps: setting the project ID, authenticating with proper IAM roles, and updating the settings.json file. This reduces infrastructure complexity compared to traditional OpenTelemetry deployments that require separate collector services.</li>
<li style="font-weight:400;">Organizations can analyze raw OpenTelemetry <a href="https://console.cloud.google.com/logs/">logs</a> and <a href="https://console.cloud.google.com/monitoring/metrics-explorer">metrics</a> to answer specific questions like identifying power users by token consumption, tracking budget allocation by command type, and monitoring tool reliability through status codes. The data follows <a href="https://opentelemetry.io/docs/specs/semconv/gen-ai/">GenAI OpenTelemetry conventions</a>, ensuring compatibility with other observability backends like Prometheus, Jaeger, or Datadog if teams want to switch platforms.</li>
<li style="font-weight:400;">The feature targets development teams using Gemini CLI who need to understand tool adoption patterns and justify AI tooling investments through concrete usage metrics. </li>
<li style="font-weight:400;">Engineering managers can track which developers benefit most from AI assistance and where token budgets are being allocated across different command types</li>
</ul>
<p>21:55  Ryan – “As long as there’s no metric for how stupid a question is, because that. That I don’t want.”   </p>
<p>22:40 <a href="https://blog.google/innovation-and-ai/infrastructure-and-cloud/global-network/energy-innovation-intersect/">We’re advancing U.S. energy innovation with Intersect.</a></p>
<ul>
<li style="font-weight:400;"><a href="https://abc.xyz/">Alphabet</a> announced a definitive agreement to acquire Intersect, a company specializing in data center and energy infrastructure solutions. </li>
<li style="font-weight:400;">This acquisition aims to accelerate the deployment of data center capacity and energy generation infrastructure in the United States.</li>
<li style="font-weight:400;">The deal addresses a critical bottleneck in AI and cloud infrastructure expansion by bringing expertise in energy development and data center deployment under Alphabet’s umbrella. Intersect’s capabilities will help Google bring more computing capacity online faster, which is essential given the substantial power requirements of AI workloads and hyperscale cloud operations.</li>
<li style="font-weight:400;">This acquisition reflects the growing importance of energy infrastructure as a limiting factor for cloud providers, particularly as AI training and inference workloads drive unprecedented power demands. By acquiring energy infrastructure expertise, Google positions itself to better control the full stack from power generation through data center operations.</li>
<li style="font-weight:400;">The announcement provides limited technical details about integration timelines or specific projects, but signals Google’s commitment to vertical integration in the infrastructure space. This move follows similar investments by other hyperscalers in power generation and energy partnerships to support their expanding data center footprints.</li>
</ul>
<p>22:50  Justin – “If you can’t get the capacity from the vendor, just buy them – and then force them to do it. Good move!”   </p>
<p>25:00 <a href="https://blog.google/innovation-and-ai/models-and-research/google-labs/notebooklm-data-tables/">Google’s NotebookLM introduces Data Tables feature</a></p>
<ul>
<li style="font-weight:400;"><a href="https://notebooklm.google/">NotebookLM</a> now includes Data Tables, a feature that automatically synthesizes information from multiple sources into structured tables that can be exported directly to <a href="https://workspace.google.com/products/sheets/">Google Sheets</a>. </li>
<li style="font-weight:400;">The feature is available today for Pro and Ultra users, with rollout to all users planned for the coming weeks.</li>
<li style="font-weight:400;">The feature addresses a common workflow challenge where valuable information is scattered across multiple documents, requiring manual compilation. Data Tables automates this process by extracting and organizing key facts into clean, structured formats without manual data entry.</li>
<li style="font-weight:400;">Use cases span professional and personal applications, including converting meeting transcripts into action item tables with owners and priorities, synthesizing research data like clinical trial outcomes across multiple papers, creating competitor analysis tables with pricing and strategy comparisons, and building study guides organized by relevant categories.</li>
<li style="font-weight:400;">The feature represents Google’s continued integration of AI capabilities into productivity tools, positioning NotebookLM as a research and synthesis tool rather than just a note-taking application. </li>
<li style="font-weight:400;">This builds on NotebookLM’s existing source analysis capabilities by adding structured data output.</li>
<li style="font-weight:400;">The tiered rollout strategy, with Pro and Ultra users receiving immediate access, suggests Google is testing the feature with power users before broader deployment, likely to gather usage patterns and refine the table generation algorithms.</li>
</ul>
<p>25:52  Justin – “I love creating spreadsheets; my budgets, all of my tracking of things, tasks I’m doing, vacation planning – it all lives in spreadsheets. And you’re going to take that away from me, Google? How dare you. AI is coming for my passion for spreadsheets.”   </p>
<p>29:53 <a href="https://blog.google/innovation-and-ai/technology/developers-tools/t5gemma-2/">T5Gemma 2: The next generation of encoder-decoder models</a></p>
<ul>
<li style="font-weight:400;">Google releases <a href="https://arxiv.org/abs/2512.14856">T5Gemma 2</a>,  a new generation of encoder-decoder models based on Gemma 3, available now in pre-trained checkpoints at three sizes: 270M-270M (370M total), 1B-1B (1.7B total), and 4B-4B (7B total) parameters. The models use tied word embeddings and merged decoder attention to reduce parameter count while maintaining capabilities, making them suitable for on-device applications and rapid experimentation.</li>
<li style="font-weight:400;">T5Gemma 2 adds multimodal vision capabilities using an efficient vision encoder for visual question answering and reasoning tasks, extends context windows to 128K tokens using Gemma 3’s alternating local and global attention mechanism, and supports over 140 languages out of the box. </li>
<li style="font-weight:400;">These represent the first multi-modal and long-context encoder-decoder models in the Gemma family.</li>
<li style="font-weight:400;">The architecture merges decoder self-attention and cross-attention into a single unified layer, reducing model complexity and improving parallelization for better inference performance. </li>
<li style="font-weight:400;">This structural change, combined with tied embeddings, allows more active capabilities within the same memory footprint compared to the original <a href="https://developers.googleblog.com/en/t5gemma/">T5Gemma</a>.</li>
<li style="font-weight:400;">Benchmarks show T5Gemma 2 outperforms Gemma 3 on several multimodal tasks, delivers substantial quality gains on long-context problems compared to both Gemma 3 and T5Gemma, and shows improved performance on coding, reasoning, and multilingual tasks. Post-training results indicate better performance than decoder-only counterparts, making these models suitable for both research and production applications.</li>
<li style="font-weight:400;">The models are designed for developers to post-train for specific tasks before deployment, continuing the approach from the original T5Gemma of adapting pre-trained decoder-only models into an encoder-decoder architecture without the computational cost of training from scratch.</li>
<li style="font-weight:400;"> Pre-trained checkpoints are available across multiple platforms for broad developer access.</li>
</ul>
<p>31:14  Jonathan – “I’m actually looking forward to playing with the T5Gemma model because the encoder part of it is what’s going to make it really special. Transformers have always had these two halves, encoder and decoder, and most LMs only use the decoder. And what that means is that as the attention is calculated for each token in the context window, it only ever attends to previous tokens in the message. So if you have a word, that word can only ever be related to something that you’ve already said in the conversation. But people aren’t like that. People go back and forth, and they refer back to things they said… people just suck at communication most of the time. And so what the encoder model does is it looks at the entire message holistically. It doesn’t only look at the last word by the time it gets to the last word, it looks at everything and encodes the meaning of the entire text. And then from there, it passes it to the decoder, and the decoder starts generating text based on the entire knowledge of the whole thing.”</p>
<p>33:39 <a href="https://blog.google/products/ads-commerce/agentic-commerce-ai-tools-protocol-retailers-platforms/">New tech and tools for retailers to succeed in an agentic shopping era</a></p>
<ul>
<li style="font-weight:400;">Google launches Universal Commerce Protocol (UCP), an open standard for agentic commerce co-developed with Shopify, Etsy, Wayfair, Target, and Walmart. </li>
<li style="font-weight:400;">UCP enables AI agents to interact across the entire shopping journey from discovery to post-purchase support, working alongside existing protocols like <a href="https://a2a-protocol.org/latest/">A2A</a>, <a href="https://ap2-protocol.org/">AP2</a>, and <a href="https://modelcontextprotocol.io/docs/getting-started/intro">MCP</a>. The protocol is endorsed by over 20 companies, including Adyen, American Express, Mastercard, Stripe, and Visa.</li>
<li style="font-weight:400;">New agentic checkout feature goes live in AI Mode in Search and Gemini app, allowing shoppers to purchase from eligible U.S. retailers directly within Google’s AI surfaces. </li>
<li style="font-weight:400;">The integration uses Google Pay and PayPal for payments, with retailers maintaining seller of record status and the ability to customize the implementation. Global expansion and additional capabilities like loyalty rewards and product discovery are planned for the coming months.</li>
<li style="font-weight:400;"><a href="https://support.google.com/brandprofile/answer/16410382">Business Agent</a> launches tomorrow as a branded AI assistant that appears directly in Search results for retailers like Lowe’s, Michaels, Poshmark, and Reebok. U.S. retailers can activate and customize this agent through Merchant Center, with future capabilities including training on retailer data, customer insights, product offers, and direct agentic checkout within the chat experience.</li>
<li style="font-weight:400;">Google introduces Direct Offers pilot in AI Mode, allowing advertisers to present exclusive discounts and deals to shoppers during AI-powered searches. The system uses AI to determine when offers are relevant to display, initially focusing on discounts with plans to expand to bundles and free shipping. Early partners include Petco, e.l.f. Cosmetics, Samsonite, Rugs USA, and Shopify merchants.</li>
<li style="font-weight:400;">Merchant Center adds dozens of new data attributes designed for conversational commerce discovery across AI Mode, Gemini, and Business Agent. These attributes extend beyond traditional keywords to include product Q&amp;A, compatible accessories, and substitutes, rolling out first to a small group of retailers before broader expansion.</li>
</ul>
<p>35:20 Ryan – “I think it’s important to standardize. In a web transaction where you’re doing shopping, there’s so many handoffs to different things, I can see, as more and more AI and agent-based or agent-assisted transactions happen, being able to talk a common language is super important.” </p>
<p>33:38 <a href="https://blog.google/company-news/inside-google/message-ceo/nrf-2026-remarks/">Read Sundar Pichai’s remarks at the 2026 National Retail Federation</a></p>
<ul>
<li style="font-weight:400;">Google announced <a href="https://blog.google/products/ads-commerce/agentic-commerce-ai-tools-protocol-retailers-platforms/">Universal Commerce Protocol (UCP)</a>, an open standard for agentic commerce built with Shopify, Etsy, Wayfair, Target, and Walmart. The protocol enables native checkout directly in Google Search AI Mode and Gemini, allowing retailers to maintain merchant of record status and own customer relationships while offering personalized pricing and loyalty enrollment at checkout.</li>
<li style="font-weight:400;"><a href="https://cloud.google.com/gemini-enterprise">Gemini Enterprise</a> for Customer Experience is now available in preview, providing retailers with integrated shopping assistants, support bots, and agentic search capabilities. </li>
<li style="font-weight:400;">The Home Depot and McDonald’s are already using these agents for customer service, while Kroger is testing a shopping agent that brings AI Mode functionality directly into retailer apps.</li>
<li style="font-weight:400;">Google processed over 90 trillion tokens through its API in December 2025, representing an 11x increase from 8.3 trillion tokens in December 2024. This growth demonstrates the rapid adoption of AI capabilities by retailers and the scale at which Google’s infrastructure is supporting commercial AI applications.</li>
<li style="font-weight:400;">Wing delivery service expanded to Houston, with Orlando, Tampa, and Charlotte coming soon, after doubling deliveries in existing markets during 2025 through its Walmart partnership. </li>
<li style="font-weight:400;">The expansion addresses the high cost and logistical challenges of last-mile delivery for retailers.</li>
</ul>
<p>38:35  Jonathan – “So is this how Google is going to make money in the future? Because obviously serving ads through AI is both controversial and a very lame customer experience. Are they going to start skimming off a percentage of sales for sales they direct to these retailers through their AI interface?”  </p>
<h2>Azure </h2>
<p>39:58 <a href="https://techcommunity.microsoft.com/blog/microsoftthreatprotectionblog/announcing-public-preview-uncovering-hidden-threats-with-the-dynamic-threat-dete/4475313">Announcing public preview: Uncovering hidden threats with the Dynamic </a><a href="https://techcommunity.microsoft.com/blog/microsoftthreatprotectionblog/announcing-public-preview-uncovering-hidden-threats-with-the-dynamic-threat-dete/4475313">Threat Detection Agent | Microsoft Community Hub</a></p>
<ul>
<li style="font-weight:400;">Microsoft <a href="https://techcommunity.microsoft.com/blog/MicrosoftThreatProtectionBlog/ignite-2025-whats-new-in-microsoft-defender/4469996?previewMessage=true">launches the Dynamic Threat Detection Agent</a> in public preview, an AI-powered backend service that runs continuously within Defender to identify hidden threats across Defender and Sentinel environments. </li>
<li style="font-weight:400;">The agent operates autonomously with no setup required, automatically generating alerts with natural language explanations, MITRE technique mappings, and remediation steps directly into existing XDR workflows.</li>
<li style="font-weight:400;">The agent achieves over 85% precision across thousands of alerts and 28 threat types by combining adaptive GenAI detection with hyperscale threat intelligence from <a href="https://arxiv.org/abs/2411.06239">TITAN</a> and <a href="https://learn.microsoft.com/en-us/azure/sentinel/identify-threats-with-entity-behavior-analytics">UEBA</a> behavioral analytics. </li>
<li style="font-weight:400;">It runs a five-step investigation loop at machine scale, starting from high-priority incidents, building unified activity timelines, testing hypotheses through automated Q&amp;A, and closing detection gaps with explainable alerts that include transparent reasoning traces.</li>
<li style="font-weight:400;">Public preview is free for <a href="https://www.microsoft.com/en-us/security/blog/2025/11/18/agents-built-into-your-workflow-get-security-copilot-with-microsoft-365-e5/?msockid=27bd8b1d324d6b4d28eb9e2e33dd6a4f">Security Copilot</a> customers and enabled by default for eligible organizations, with general availability planned for late 2026 when it transitions to Security Copilot’s SCU-based consumption model. </li>
<li style="font-weight:400;">Starting July 2026, the agent will be included with Microsoft 365 E5 licenses that have Security Copilot entitlement, and customers can disable it or monitor usage through detailed consumption reporting at any time.</li>
<li style="font-weight:400;">The service respects data residency by running region-local and integrates deeply with the Microsoft security ecosystem, using Sentinel to correlate third-party and native telemetry while surfacing Copilot-sourced detections in Defender. </li>
<li style="font-weight:400;">Built on Azure Synapse for massive scale, it can run thousands of parallel investigations and deliver near-real-time detections while continuously learning from analyst feedback to improve detection quality and reduce alert noise.</li>
</ul>
<p>43:54  Jonathan – “You don’t want to block a potential customer who’s about to press a button to spend tens of thousands of dollars either. guess false positives are almost as bad as false negatives.”</p>
<p>45:26 <a href="https://azure.microsoft.com/en-us/updates?id=490149">Generally Available: Geo-Replication for Azure Service Bus Premium</a></p>
<ul>
<li style="font-weight:400;">Azure Service Bus Premium now includes generally available Geo-Replication, allowing customers to replicate messaging infrastructure across regions for disaster recovery. </li>
<li style="font-weight:400;">This addresses a critical need for enterprises running mission-critical messaging workloads that require protection against regional outages.</li>
<li style="font-weight:400;">The feature provides active replication of Service Bus entities, including queues, topics, and subscriptions, between paired regions, maintaining message ordering and metadata consistency. </li>
<li style="font-weight:400;">Organizations can now implement cross-region failover strategies without building custom replication logic or managing multiple Service Bus namespaces manually.</li>
<li style="font-weight:400;">This capability is exclusive to the Premium tier of Service Bus, which starts at approximately $677 per month for the base messaging unit. Customers should factor in additional costs for cross-region data transfer and the secondary namespace when planning their disaster recovery architecture.</li>
<li style="font-weight:400;">The geo-replication option complements existing Service Bus disaster recovery features like Geo-Disaster Recovery (metadata-only failover), giving customers flexibility in choosing between cost-optimized metadata replication or full data replication based on their recovery time objectives. </li>
<li style="font-weight:400;">This is particularly relevant for financial services, healthcare, and retail sectors, where message loss during regional failures is unacceptable.</li>
</ul>
<p>46:23  Justin – “I’m surprised this wasn’t already part of premium, but I’m also sort of intrigued that they think people’s messaging strategies only involve two regions, because some of the cost architectures I’ve seen are like multiple regions with active replication across these things for geodistributed applications that need to have globally low latency for user populations everywhere – and I guess I just can’t run that on this service. So I guess, screw you? Or wait for Azure Service Bus Ultra?” </p>
<h2>After Show </h2>
<p>46:38 <a href="https://www.theverge.com/tech/854159/ces-2026-best-tech-gadgets-smartphones-appliances-robots-tvs-ai-smart-home">CES 2026: The best tech announced so far | The Verge</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.cnet.com/news-live/ces-2026-news-live-updates/">CES 2026</a> showcased significant infrastructure innovations, including Wi-Fi 8 routers from Asus and others, despite the standard not being finalized until 2028, plus solid-state battery breakthroughs from Donut Lab claiming 400 Wh/kg energy density that could give EVs 30 percent more range. These developments signal major shifts in networking and power infrastructure that cloud and edge computing deployments will eventually leverage.</li>
<li style="font-weight:400;">Smart home and IoT devices are getting serious upgrades with Matter compatibility becoming standard across Ikea and Philips Hue products, while spatial awareness features like Hue’s SpatialAware use AR to map rooms for better lighting distribution. For cloud professionals, this represents the maturation of IoT protocols and edge AI processing that will drive increased demand for home automation backend services.</li>
<li style="font-weight:400;">The display technology race is heating up with Samsung showing creaseless foldable OLED panels, Dell launching a 52-inch 6K Thunderbolt hub monitor, and LG reviving its Wallpaper TV with wireless video transmission. These advances in display tech and connectivity standards like Thunderbolt 5, delivering 120Gbps speeds, will impact how professionals design workspaces and remote work setups.</li>
<li style="font-weight:400;">AI wearables are moving beyond glasses with Razer’s Project Motoko headphones featuring 4K cameras, on-device AI processing via Qualcomm chips, and 36-hour battery life that eclipses current smart glasses. This shift toward headphone-based AI assistants could influence how voice interfaces and edge AI applications are developed for consumer devices.</li>
<li style="font-weight:400;">Robotics took center stage with practical home automation like Roborock’s stair-climbing Saros Rover vacuum and LG’s CLOiD dual-arm robot that can fold laundry and handle kitchen tasks. While still in development, these robots represent the convergence of computer vision, edge AI, and mechanical engineering that will require robust cloud backends for training and coordination.</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2333121/c1e-qx4xb7om6kfkq90r-2504d6g6ijjx-t2kywf.mp3" length="119416699"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 338 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, Matt, and Jonathan are in the studio today to bring you all the latest in cloud and AI news, including a bit of a buying spree (inlcuding whole power companies) Veo 3.1, Cowork, and more – today in the cloud!  
Titles we almost went with this week

 Snowflake’s Ironic Timing: Buying Downtime Prevention Tool While Experiencing Downtime
 Flexera Buys ProsperOps and Chaos Genius, Promises Less Chaos and More Prosperity
 Flexera Goes Shopping: Two FinOps Acquisitions to Prosper and Reduce Chaos
 Token of Appreciation: Gemini CLI Now Tracks Every Penny of Your AI Spend
 Snowflake Buys Observe to Stop Its Own Services from Melting Down
 Google’s Veo 3.1 Goes Vertical: Finally Understanding How People Actually Hold Their Phones
 Alphabet’s New Power Move: Buying the Company That Literally Powers Data Centers
 Dashboard Confessional: Gemini CLI Gets Transparent About Its Usage
 Microsoft’s New Agent Works 24/7 and Never Asks for a Raise
From Robot Vacuums That Climb Stairs to TVs You Can’t Feel: CES Gets Weird
 Agent Shopping: When Your AI Has Better Taste Than You Do
 The cloudpod hosts do not like any stories this week
 AWS took a nap on announcements this week
 Claude is my new co-worker
 Wake up, AWS, and give us some fun news
 The $200 Assistant: Is Cowork the End of Workplace Admins?
 Azure has more interesting announcements than AWS oh noooo
 If you can’t beat them in AI, just acquire everyone
 Notebook LM turns the Data Tables on you

AI Is Going Great – Or How ML Makes Money 
01:11 Anthropic launches Cowork, a Claude Code-like for general computing – Ars Technica

Anthropic launches Cowork, a new feature in the macOS Claude desktop app that extends Claude Code‘s agentic capabilities to general office work tasks. 
Users can grant Claude access to specific folders and use plain language instructions to automate tasks like filling expense reports from receipt photos, writing reports from notes, or reorganizing files.
Cowork lowers the technical barrier compared to Claude Code by making AI-assisted file operations accessible to non-developer knowledge workers, including marketers and office staff. 
The feature was developed after Anthropic observed users already applying Claude Code to general knowledge work despite its developer-focused positioning.
The tool provides similar functionality to what was possible through Model Context Protocol integrations, but offers a more streamlined interface with Claude Code-style usability improvements. 
Users can submit new requests or modifications to ongoing tasks without waiting for the initial assignment to complete.
Cowork represents a strategic expansion of Anthropic’s agentic AI approach beyond software development into broader productivity workflows. The feature demonstrates how AI agents with file system access can automate routine knowledge work tasks that previously required manual processing of documents and data.

02:15  Ryan – “This week is the first time I actually tried to use AI to generate a PowerPoint presentation. It did not go well. It did gener...]]>
                </itunes:summary>
                                                                            <itunes:duration>01:02:00</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2333121/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[337: AWS Discovers Prices Can Go Both Ways,  Raises GPU Costs 15 Percent]]>
                </title>
                <pubDate>Fri, 16 Jan 2026 01:02:43 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2326337</guid>
                                    <link>https://tcpfm.castos.com/episodes/337-aws-discovers-prices-can-go-both-ways-raises-gpu-costs-15-percent</link>
                                <description>
                                            <![CDATA[<p> Welcome to episode 337 of The Cloud Pod, where the forecast is always cloudy! Justin, Matt, and Ryan have hit the recording studio to bring you all the latest in cloud and AI news, from acquisitions and price hikes to new tools that Ryan somehow loves but also hates? We don’t understand either… but let’s get started! </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li>Prompt Engineering Our Way Into Trouble</li>
<li>The Demo Worked Yesterday, We Swear</li>
<li>It Scales Horizontally, Trust Us</li>
<li>Responsible AI But Terrible Copy (Marketing Edition)</li>
</ul>
<h2>General News </h2>
<p>00:58 <a href="https://blog.google/technology/google-deepmind/the-thinking-game/">Watch ‘The Thinking Game’ documentary for free on YouTube</a></p>
<ul>
<li style="font-weight:400;"><a href="https://blog.google/innovation-and-ai/models-and-research/google-deepmind/">Google DeepMind</a> is releasing the “The Thinking Game” documentary for free on YouTube starting November 25, marking the fifth anniversary of <a href="https://www.youtube.com/watch?v=WXuK6gekU1Y">AlphaFold</a>. </li>
<li style="font-weight:400;">The feature-length film provides behind-the-scenes access to the AI lab and documents the team’s work toward artificial general intelligence over five years.</li>
<li style="font-weight:400;">The documentary captures the moment when the AlphaFold team learned they had solved the 50-year protein folding problem in biology, a scientific achievement that recently earned Demis Hassabis and John Jumper the <a href="https://deepmind.google/blog/demis-hassabis-john-jumper-awarded-nobel-prize-in-chemistry/">Nobel Prize in Chemistry</a>. </li>
<li style="font-weight:400;">This represents one of the most significant practical applications of deep learning to fundamental scientific research.</li>
<li style="font-weight:400;">The film was produced by the same award-winning team that created the AlphaGo documentary, which chronicled DeepMind’s earlier achievement in mastering the game of Go. For cloud and AI practitioners, this offers insight into how Google DeepMind approaches complex AI research problems and the development process behind their models.</li>
<li style="font-weight:400;">While this is primarily a documentary release rather than a technical product announcement, it provides context for understanding Google’s broader AI strategy and the research foundation underlying its cloud AI services. The AlphaFold model itself is available through Google Cloud for protein structure prediction workloads.</li>
</ul>
<p>01:54  Justin – “If you’re not into technology, don’t care about any of that, and don’t care about AI and how they built all the AI models that are now powering the world of LLMs we have, you will not like this documentary.” </p>
<p>04:22 <a href="https://www.theregister.com/2025/12/23/servicenow_to_buy_armis_in/">ServiceNow to buy Armis in $7.7 billion security deal • The Register</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.servicenow.com/">ServiceNow</a> is acquiring <a href="https://www.armis.com/">Armis</a> for $7.75 billion to integrate real-time security intelligence with its Configuration Management Database, allowing customers to identify vulnerabilities across IT, OT, and medical devices and remediate them through automated workflows. </li>
<li style="font-weight:400;"><a href="https://newsroom.servicenow.com/press-releases/details/2025/ServiceNow-to-acquire-Armis-to-expand-cyber-exposure-and-security-across-the-full-attack-surface-in-IT-OT-and-medical-devices-for-companies-governments-and-critical-infrastructure-worldwide/default.aspx">The deal</a> is expected to close in the second half of 2026 and aims to triple ServiceNow’s current $1 billion annual security revenue.</li>
<li style="font-weight:400;">The acquisition represents a strategic data play when combined with ServiceNow’s recent purchase of <a href="https://data.world/">Data.World</a>, giving the company both massive volumes of se...</li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:00) - Azure: Raising GPU Prices 15%.</li><li>(00:00:51) - Homework for the Week</li><li>(00:01:05) - Google's The Thinking Game Documentary</li><li>(00:04:22) - ServiceNow Acquires Armis for $7.5 Billion</li><li>(00:06:39) - What is the Cognizant Threat Management Platform?</li><li>(00:08:29) - Google's 2025: The Year of TUNE (In Depth)</li><li>(00:11:36) - MetaAcquires AI Agent Firm Manus</li><li>(00:15:27) - Migration from AWS Security Hub to OCSF</li><li>(00:21:10) - EC2 Spot Capacity for Containerized Apps</li><li>(00:23:13) - Amazon EKS now supports DNS-based and Admin Network Policies</li><li>(00:26:58) - Amazon Raises EC2 Capacity Prices</li><li>(00:31:19) - Lookinger: Upload CSV and Excel Files Directly into the BI</li><li>(00:34:01) - AlloyDB's AI Natural Language API</li><li>(00:36:23) - Google's Vertex AI Agent Builder</li><li>(00:38:05) - Google Cloud SQL for MySQL Enterprise+ Edition: Optimized Writes</li><li>(00:42:26) - Microsoft Acquires OSMOS for Unified Data Platform</li><li>(00:44:19) - Microsoft Deploys Nvidia's Next-Gen Arubin Platform</li><li>(00:46:25) - Will Oracle Use Non-Evaporative Cooling at Their New</li><li>(00:51:01) - Week in the Box: Cloud: More News?</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[ Welcome to episode 337 of The Cloud Pod, where the forecast is always cloudy! Justin, Matt, and Ryan have hit the recording studio to bring you all the latest in cloud and AI news, from acquisitions and price hikes to new tools that Ryan somehow loves but also hates? We don’t understand either… but let’s get started! 
Titles we almost went with this week

Prompt Engineering Our Way Into Trouble
The Demo Worked Yesterday, We Swear
It Scales Horizontally, Trust Us
Responsible AI But Terrible Copy (Marketing Edition)

General News 
00:58 Watch ‘The Thinking Game’ documentary for free on YouTube

Google DeepMind is releasing the “The Thinking Game” documentary for free on YouTube starting November 25, marking the fifth anniversary of AlphaFold. 
The feature-length film provides behind-the-scenes access to the AI lab and documents the team’s work toward artificial general intelligence over five years.
The documentary captures the moment when the AlphaFold team learned they had solved the 50-year protein folding problem in biology, a scientific achievement that recently earned Demis Hassabis and John Jumper the Nobel Prize in Chemistry. 
This represents one of the most significant practical applications of deep learning to fundamental scientific research.
The film was produced by the same award-winning team that created the AlphaGo documentary, which chronicled DeepMind’s earlier achievement in mastering the game of Go. For cloud and AI practitioners, this offers insight into how Google DeepMind approaches complex AI research problems and the development process behind their models.
While this is primarily a documentary release rather than a technical product announcement, it provides context for understanding Google’s broader AI strategy and the research foundation underlying its cloud AI services. The AlphaFold model itself is available through Google Cloud for protein structure prediction workloads.

01:54  Justin – “If you’re not into technology, don’t care about any of that, and don’t care about AI and how they built all the AI models that are now powering the world of LLMs we have, you will not like this documentary.” 
04:22 ServiceNow to buy Armis in $7.7 billion security deal • The Register

ServiceNow is acquiring Armis for $7.75 billion to integrate real-time security intelligence with its Configuration Management Database, allowing customers to identify vulnerabilities across IT, OT, and medical devices and remediate them through automated workflows. 
The deal is expected to close in the second half of 2026 and aims to triple ServiceNow’s current $1 billion annual security revenue.
The acquisition represents a strategic data play when combined with ServiceNow’s recent purchase of Data.World, giving the company both massive volumes of se...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[337: AWS Discovers Prices Can Go Both Ways,  Raises GPU Costs 15 Percent]]>
                </itunes:title>
                                    <itunes:episode>337</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p> Welcome to episode 337 of The Cloud Pod, where the forecast is always cloudy! Justin, Matt, and Ryan have hit the recording studio to bring you all the latest in cloud and AI news, from acquisitions and price hikes to new tools that Ryan somehow loves but also hates? We don’t understand either… but let’s get started! </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li>Prompt Engineering Our Way Into Trouble</li>
<li>The Demo Worked Yesterday, We Swear</li>
<li>It Scales Horizontally, Trust Us</li>
<li>Responsible AI But Terrible Copy (Marketing Edition)</li>
</ul>
<h2>General News </h2>
<p>00:58 <a href="https://blog.google/technology/google-deepmind/the-thinking-game/">Watch ‘The Thinking Game’ documentary for free on YouTube</a></p>
<ul>
<li style="font-weight:400;"><a href="https://blog.google/innovation-and-ai/models-and-research/google-deepmind/">Google DeepMind</a> is releasing the “The Thinking Game” documentary for free on YouTube starting November 25, marking the fifth anniversary of <a href="https://www.youtube.com/watch?v=WXuK6gekU1Y">AlphaFold</a>. </li>
<li style="font-weight:400;">The feature-length film provides behind-the-scenes access to the AI lab and documents the team’s work toward artificial general intelligence over five years.</li>
<li style="font-weight:400;">The documentary captures the moment when the AlphaFold team learned they had solved the 50-year protein folding problem in biology, a scientific achievement that recently earned Demis Hassabis and John Jumper the <a href="https://deepmind.google/blog/demis-hassabis-john-jumper-awarded-nobel-prize-in-chemistry/">Nobel Prize in Chemistry</a>. </li>
<li style="font-weight:400;">This represents one of the most significant practical applications of deep learning to fundamental scientific research.</li>
<li style="font-weight:400;">The film was produced by the same award-winning team that created the AlphaGo documentary, which chronicled DeepMind’s earlier achievement in mastering the game of Go. For cloud and AI practitioners, this offers insight into how Google DeepMind approaches complex AI research problems and the development process behind their models.</li>
<li style="font-weight:400;">While this is primarily a documentary release rather than a technical product announcement, it provides context for understanding Google’s broader AI strategy and the research foundation underlying its cloud AI services. The AlphaFold model itself is available through Google Cloud for protein structure prediction workloads.</li>
</ul>
<p>01:54  Justin – “If you’re not into technology, don’t care about any of that, and don’t care about AI and how they built all the AI models that are now powering the world of LLMs we have, you will not like this documentary.” </p>
<p>04:22 <a href="https://www.theregister.com/2025/12/23/servicenow_to_buy_armis_in/">ServiceNow to buy Armis in $7.7 billion security deal • The Register</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.servicenow.com/">ServiceNow</a> is acquiring <a href="https://www.armis.com/">Armis</a> for $7.75 billion to integrate real-time security intelligence with its Configuration Management Database, allowing customers to identify vulnerabilities across IT, OT, and medical devices and remediate them through automated workflows. </li>
<li style="font-weight:400;"><a href="https://newsroom.servicenow.com/press-releases/details/2025/ServiceNow-to-acquire-Armis-to-expand-cyber-exposure-and-security-across-the-full-attack-surface-in-IT-OT-and-medical-devices-for-companies-governments-and-critical-infrastructure-worldwide/default.aspx">The deal</a> is expected to close in the second half of 2026 and aims to triple ServiceNow’s current $1 billion annual security revenue.</li>
<li style="font-weight:400;">The acquisition represents a strategic data play when combined with ServiceNow’s recent purchase of <a href="https://data.world/">Data.World</a>, giving the company both massive volumes of security asset data from Armis and the governance tools to make that data searchable and usable with AI. </li>
<li style="font-weight:400;">This combination enhances ServiceNow’s CMDB capabilities by an order of magnitude, according to Forrester analysts.</li>
<li style="font-weight:400;">ServiceNow has completed six acquisitions this year, including Armis, <a href="https://veza.com/">Veza</a> for identity access management, and Data.World for data governance, signaling an aggressive expansion strategy focused on security and data management. </li>
<li style="font-weight:400;">The company’s integration approach will be critical as customers watch how well these separate platforms merge into ServiceNow’s unified platform.</li>
<li style="font-weight:400;">The deal positions ServiceNow to eliminate the patchwork of security tools organizations currently use by embedding security capabilities directly into its AI platform. </li>
<li style="font-weight:400;">Armis brings 950 employees, $340 million in annual recurring revenue, and recognition as a Gartner leader in cyber-physical systems protection.</li>
<li style="font-weight:400;">Despite Salesforce entering the ITSM market, analysts assess ServiceNow maintains a five-year development lead in the space, though successful integration of multiple acquisitions remains the key challenge for maintaining that advantage.</li>
</ul>
<p>05:49  Ryan – “Is this security tooling that you use for analysis or threat hunting? Or is this something that they’re adding to their existing tooling, so it’s more of an integration?” </p>
<p>Listener Note: If you have any idea what this company does, let us know! </p>
<h2>Cloud Tools</h2>
<p>08:38 <a href="https://www.digitalocean.com/community/tutorials/toon-vs-json">TOON vs. JSON | DigitalOcean</a></p>
<ul>
<li style="font-weight:400;">TOON (Token Oriented Object Notation) is a new data format designed to replace JSON in LLM prompts, claiming to reduce input token usage by approximately 40% while maintaining or improving accuracy. </li>
<li style="font-weight:400;">The format works by eliminating verbose JSON syntax and repeated tokens, converting structured data into a more compact representation that LLMs can still interpret effectively.</li>
<li style="font-weight:400;">DigitalOcean released a Python library (toon-python) that automatically converts JSON datasets to TOON format before sending them to LLM endpoints. In their testing example, a JSON dataset using 172 tokens was reduced to 71 tokens in TOON format (59% reduction) while producing identical query results across multiple model providers, including Mistral 3.</li>
<li style="font-weight:400;">TOON is specifically designed for input context containing structured data from databases or other sources, not for replacing plain text prompts or LLM outputs. Studies show that converting plain text instructions to structured formats like JSON doesn’t consistently improve accuracy, so TOON’s value proposition is primarily for applications already using JSON-formatted datasets in their prompts.</li>
<li style="font-weight:400;">The format has limitations, including a lack of proven effectiveness for model outputs, potential compatibility issues with models that haven’t been trained on TOON examples, and the need for application-specific testing to verify accuracy and token savings. Function calling, parsing, and other use cases requiring JSON outputs should continue using JSON rather than attempting TOON conversions.</li>
<li style="font-weight:400;">For cost-conscious LLM applications processing large structured datasets, TOON represents a practical optimization that could reduce token costs by 40% without requiring changes to model architecture or training. The token savings become more significant at scale, particularly for applications making frequent API calls with substantial context data.</li>
</ul>
<p>09:16  Justin – “I’d almost argue that TOON is more of what I would have wanted; very simple comma-separated values… so maybe LLMs will finally solve all my JSON complaints…but maybe not.” </p>
<p>10:40 <a href="https://blog.google/technology/ai/2025-research-breakthroughs/">Google 2025 recap: Research breakthroughs of the year</a></p>
<ul>
<li style="font-weight:400;">Google released <a href="https://blog.google/products/gemini/gemini-3/">Gemini 3 Pro</a> in November 2025 and <a href="https://blog.google/products/gemini/gemini-3-flash/">Gemini 3 Flash</a> in December 2025, with Gemini 3 Pro topping the LMArena Leaderboard and achieving 23.4% on MathArena Apex benchmark. </li>
<li style="font-weight:400;">Gemini 3 Flash delivers Pro-grade reasoning at Flash-level latency and cost, continuing Google’s trend where each generation’s Flash model surpasses the previous generation’s Pro model in quality while being substantially cheaper and faster.</li>
<li style="font-weight:400;">The company introduced several specialized AI models, including <a href="https://blog.google/technology/ai/nano-banana-pro/">Nano Banana Pro</a> for native image generation and editing,<a href="https://developers.googleblog.com/introducing-veo-3-1-and-new-creative-capabilities-in-the-gemini-api/"> Veo 3.1</a> for video generation, and <a href="https://blog.google/technology/ai/generative-media-models-io-2025/">Imagen 4</a> for image creation. </li>
<li style="font-weight:400;">Google also launched developer tools like Google Antigravity for AI-assisted software development and Jules, an asynchronous coding agent that acts as a collaborative partner for developers.</li>
<li style="font-weight:400;">Google’s <a href="https://deepmind.google/blog/alphafold-five-years-of-impact/">AlphaFold</a> celebrated its 5th anniversary with over 3 million researchers across 190+ countries using the Nobel Prize-winning protein folding system, including 1 million users in low and middle-income countries. </li>
<li style="font-weight:400;">New AI tools for genomics include <a href="https://deepmind.google/blog/alphagenome-ai-for-better-understanding-the-genome/">AlphaGenome</a> for genome understanding and <a href="https://research.google/blog/using-ai-to-identify-genetic-variants-in-tumors-with-deepsomatic/">DeepSomatic</a> for identifying genetic variants in tumors, moving beyond sequencing to the interpretation of complex genomic data.</li>
<li style="font-weight:400;">Google’s quantum computing work achieved recognition with <a href="https://blog.google/inside-google/company-announcements/googler-michel-devoret-awarded-the-nobel-prize-in-physics/">Googler Michel Devoret</a> receiving the 2025 Nobel Prize in Physics, while the <a href="https://blog.google/technology/research/quantum-echoes-willow-verifiable-quantum-advantage/">Quantum Echoes algorithm</a> demonstrated progress toward real-world quantum applications. </li>
<li style="font-weight:400;">The company also introduced <a href="https://blog.google/products/google-cloud/ironwood-google-tpu-things-to-know/">Ironwood</a>, a new TPU designed for inference workloads using the AlphaChip design method, and launched <a href="https://blog.google/technology/google-deepmind/weathernext-2/">WeatherNext 2</a>, which generates weather forecasts 8x faster with up to 1-hour resolution covering flood predictions for 2 billion people across 150 countries.</li>
<li style="font-weight:400;">Google formed the Agentic AI Foundation with other AI labs to establish open standards for agentic AI interoperability and announced Model Context Protocol support for Google services. </li>
<li style="font-weight:400;">The company also partnered with the US Department of Energy’s 17 national laboratories on the <a href="https://deepmind.google/blog/google-deepmind-supports-us-department-of-energy-on-genesis/">Genesis project</a> to transform scientific research and expand educational AI initiatives with school districts like Miami-Dade County.</li>
</ul>
<p>11:52 <a href="https://www.cnbc.com/amp/2025/12/30/meta-acquires-singapore-ai-agent-firm-manus-china-butterfly-effect-monicai.html">Meta acquires intelligent agent firm Manus, capping a year of aggressive AI </a><a href="https://www.cnbc.com/amp/2025/12/30/meta-acquires-singapore-ai-agent-firm-manus-china-butterfly-effect-monicai.html">moves</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.cnbc.com/quotes/META">Meta</a> acquired Singapore-based <a href="https://www.cnbc.com/2025/12/29/ai-agentic-shopping-price-discounts-cheap-sales-commerce-visa-mastercard-chatbots.html">AI agent</a> firm <a href="https://manus.im/">Manus</a> for over $2 billion, bringing on board a company that claims $125 million in revenue run rate just eight months after launching its general-purpose AI agent. </li>
<li style="font-weight:400;">Manus will continue operating its subscription service while its team joins Meta to enhance automation across consumer products like Meta AI assistant and business tools.</li>
<li style="font-weight:400;">Manus offers AI agents capable of executing complex tasks, including market research, coding, and data analysis, having processed over 147 trillion tokens and supported 80 million virtual computers to date. </li>
<li style="font-weight:400;">The platform provides both free and paid subscription tiers and has already been tested by Microsoft in Windows 11 PCs for tasks like creating websites from local files.</li>
<li style="font-weight:400;">The acquisition represents Meta’s continued strategy of acquiring specialized AI startups to accelerate its AI capabilities and Llama large language model development. </li>
<li style="font-weight:400;">This follows Meta’s $14.3 billion investment in Scale AI in June and its acquisition of AI-wearables startup Limitless earlier this month, demonstrating an aggressive talent and technology acquisition approach.</li>
<li style="font-weight:400;">Manus originated as a product of Chinese startup Butterfly Effect before relocating its headquarters from Beijing to Singapore in June, backed by investors including Tencent, HongShan Capital Group, and Benchmark, which led a $75 million Series B round. The company maintains strategic partnerships with Chinese tech firms, including Alibaba’s Qwen AI team, despite its geographic shift.</li>
</ul>
<p>13:04  Ryan – “You know, the upside, if they’ve just been around for 8 months, they don’t have the terrible tech debt that all these other firms have…they have 8 months of it.” </p>
<h2>AWS</h2>
<p>15:59 <a href="https://aws.amazon.com/blogs/security/security-hub-cspm-automation-rule-migration-to-security-hub/">Security Hub CSPM automation rule migration to Security Hub | AWS </a><a href="https://aws.amazon.com/blogs/security/security-hub-cspm-automation-rule-migration-to-security-hub/">Security Blog</a></p>
<ul>
<li style="font-weight:400;">AWS has split <a href="https://aws.amazon.com/security-hub/">Security Hub</a> into two services: the new Security Hub with enhanced capabilities using the Open Cybersecurity Schema Framework (OCSF), and <a href="https://aws.amazon.com/security-hub/cspm/features/">Security Hub CSPM</a>, which continues as a separate service focused on cloud security posture management. </li>
<li style="font-weight:400;">The schema change from <a href="https://docs.aws.amazon.com/securityhub/latest/userguide/securityhub-findings-format.html">AWS Security Finding Format (ASFF)</a> to OCSF means existing automation rules need migration to work with the new service.</li>
<li style="font-weight:400;">AWS released an open-source Python migration tool on GitHub that automatically discovers Security Hub CSPM automation rules, transforms them to OCSF schema, and generates CloudFormation templates for deployment. </li>
<li style="font-weight:400;">The tool handles Regional differences intelligently, supporting both home Region deployments where rules apply across linked Regions and Region-by-Region deployments for unlinked Regions.</li>
<li style="font-weight:400;">Not all automation rules can be fully migrated due to schema differences between ASFF and OCSF. The tool generates a migration report identifying rules that cannot be migrated or are only partially migrated, and creates all new rules in a disabled state by default so administrators can validate them before enabling.</li>
<li style="font-weight:400;">The migration tool preserves the original order of automation rules, which matters when multiple rules operate on the same findings or fields. </li>
<li style="font-weight:400;">For organizations using a delegated administrator account with AWS Organizations, rules must be created in that account’s home Region, and the tool is designed to work with this model while also supporting single-account deployments.</li>
<li style="font-weight:400;">This migration capability is included in the Security Hub essentials plan at no additional cost beyond standard <a href="https://aws.amazon.com/security-hub/pricing/">Security Hub pricing</a>. </li>
<li style="font-weight:400;">Organizations should review the ASFF to OCSF field mapping tables in the documentation before migration, as some criteria fields, like ComplianceAssociatedStandardsId and ProductName have no OCSF equivalents and require manual rule redesign.</li>
</ul>
<p>18:21  Matt – “The problem I always have with CPAMs – and this is a larger rant or conversation we can have – is there’s no interoperability. So if you have a CPAM and you want to then set up a GRC tool, or your other security tool can also run it, there’s no interoperability. So you then have to acknowledge things in three different spots, and there’s no single source of truth.</p>
<p>20:08 <a href="https://aws.amazon.com/blogs/containers/proactive-amazon-eks-monitoring-with-amazon-cloudwatch-operator-and-aws-control-plane-metrics/">Proactive Amazon EKS monitoring with Amazon CloudWatch Operator and </a><a href="https://aws.amazon.com/blogs/containers/proactive-amazon-eks-monitoring-with-amazon-cloudwatch-operator-and-aws-control-plane-metrics/">AWS Control Plane metrics | Containers</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/eks/">EKS</a> clusters running version <a href="https://docs.aws.amazon.com/eks/latest/userguide/observability-dashboard.html#observability-control-plane">1.28</a> and above now automatically send control plane metrics to <a href="https://aws.amazon.com/cloudwatch/">CloudWatch</a> at no extra cost, covering API server health, scheduler performance, and etcd database status. </li>
<li style="font-weight:400;">The new CloudWatch Observability Operator <a href="https://docs.aws.amazon.com/eks/latest/userguide/eks-add-ons.html">add-on</a> extends this with Container Insights and Application Signals for deeper visibility into workloads and applications without code changes.</li>
<li style="font-weight:400;">The enhanced monitoring addresses common operational challenges like detecting pod scheduling bottlenecks through metrics such as scheduler_pending_pods and scheduler_schedule_attempts_UNSCHEDULABLE, which help identify under-resourced worker nodes. API server throttling issues become visible through apiserver_request_total_429 metrics, showing when the default 600 in-flight request limit is approached.</li>
<li style="font-weight:400;">Critical infrastructure components like admission webhooks, which power AWS Load Balancer Controller and IRSA functionality, can now be monitored for failures and latency issues. The apiserver_admission_webhook_rejection_count metric helps catch silent webhook failures that could prevent deployments, with CloudWatch Log Insights providing correlated log data for troubleshooting.</li>
<li style="font-weight:400;">The etcd database monitoring is particularly important since EKS has an 8 GB recommended limit, and exceeding it makes clusters read-only. CloudWatch alarms can trigger at 80 percent capacity (6.4 GB) using the apiserver_storage_size_bytes metric, giving teams time to clean up unnecessary resources before hitting the limit.</li>
<li style="font-weight:400;">Application Signals provides automatic instrumentation for Java applications with pre-built dashboards tracking traffic, latency, and availability at a 5 percent sampling rate. </li>
<li style="font-weight:400;">The feature integrates with CloudWatch anomaly detection using machine learning to identify unusual patterns in metrics like node_cpu_utilization without manual threshold configuration.</li>
</ul>
<p>21:15  Ryan – “I like this, except for the fact that it’s an operator…I don’t understand why this isn’t just configuration options in your cluster.” </p>
<p>21:59 <a href="https://aws.amazon.com/about-aws/whats-new/2025/12/amazon-ecs-managed-instances-ec2-spot-instances/">Amazon ECS Managed Instances now supports Amazon EC2 Spot </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/12/amazon-ecs-managed-instances-ec2-spot-instances/">Instances</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/ecs/managed-instances/">ECS Managed Instances</a> now supports <a href="https://aws.amazon.com/ec2/spot/">EC2 Spot capacity</a>, allowing customers to run fault-tolerant containerized workloads at up to 90% discount compared to On-Demand pricing while AWS handles all infrastructure management. </li>
<li style="font-weight:400;">You configure a new capacityOptionType parameter as spot or on-demand in your capacity provider settings.</li>
<li style="font-weight:400;">This extends ECS Managed Instances beyond its existing capabilities of automatic provisioning, dynamic scaling, and cost-optimized task placement. AWS still handles the infrastructure operations through AWS-controlled access in your account, but now you can choose between spot and on-demand capacity types alongside existing options for GPU, network-optimized, and burstable instance families.</li>
<li style="font-weight:400;">The feature is available in all AWS Regions where ECS Managed Instances currently operate. Pricing includes both the spot EC2 instance costs and an additional management fee for the compute provisioning service, though specific management costs are not disclosed in the announcement.</li>
<li style="font-weight:400;">This targets customers running stateless or fault-tolerant containerized applications like batch processing, CI/CD pipelines, or web services that can handle interruptions. </li>
<li style="font-weight:400;">The combination of managed infrastructure and spot pricing addresses a common challenge where teams want cost savings from spot instances but lack resources to manage the complexity of spot interruptions and capacity management.</li>
</ul>
<p>24:07 <a href="https://aws.amazon.com/blogs/containers/enhance-amazon-eks-network-security-posture-with-dns-and-admin-network-policies/">Enhance Amazon EKS network security posture with DNS and admin </a><a href="https://aws.amazon.com/blogs/containers/enhance-amazon-eks-network-security-posture-with-dns-and-admin-network-policies/">network policies | Containers</a></p>
<ul>
<li style="font-weight:400;">Amazon EKS now supports <a href="https://aws.amazon.com/about-aws/whats-new/2025/12/amazon-eks-enhanced-network-security-policies/">DNS-based and Admin network policies</a>, allowing teams to control pod traffic using stable domain names instead of constantly changing IP addresses. </li>
<li style="font-weight:400;">This eliminates the operational overhead of maintaining IP allowlists for AWS services, on-premises systems, and third-party APIs while providing centralized policy management across multiple namespaces.</li>
<li style="font-weight:400;">Admin network policies operate in two tiers with hierarchical enforcement that cannot be overridden by namespace-level policies, enabling platform teams to enforce mandatory security controls like blocking access to EC2 Instance Metadata Service at 169.254.169.254. </li>
<li style="font-weight:400;">The policies use label-based segmentation to apply security standards across multiple namespaces simultaneously, reducing the need for per-namespace policy management.</li>
<li style="font-weight:400;">DNS-based policies are available in EKS Auto mode clusters version 1.29 and later, while Admin policies work in both EKS Auto mode and EC2-based clusters running VPC CNI version 1.21.1 or later. </li>
<li style="font-weight:400;">The feature removes the need for third-party network policy tools and integrates with existing Kubernetes NetworkPolicy resources for defense-in-depth security.</li>
<li style="font-weight:400;">The policy evaluation order follows a strict hierarchy: Admin tier Deny rules take precedence over everything, followed by Admin Allow rules, then namespace-scoped policies, and finally Baseline tier policies. </li>
<li style="font-weight:400;">This ensures security teams can enforce organization-wide controls while still allowing application teams flexibility for namespace-specific requirements.</li>
<li style="font-weight:400;">Real-world applications include multi-tenant environments where different applications need controlled access to specific AWS services like S3 or DynamoDB using patterns like asterisk.s3.amazonaws.com, and hybrid cloud scenarios where workloads access on-premises databases through stable DNS names that remain valid even as underlying infrastructure changes.</li>
</ul>
<p>24:17  Justin – “Thank you, Jesus.” </p>
<p>27:46  Ryan – “If you are a traditional engineer listening to our show, this is an example of something where you can take your skillset and add a ton of value.” </p>
<p>28:00 <a href="https://www.theregister.com/2026/01/05/aws_price_increase/">AWS raises GPU prices 15% on a Saturday • The Register</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.lastweekinaws.com/">AWS</a> <a href="https://aws.amazon.com/ec2/capacityblocks/pricing/">increased prices</a> for EC2 Capacity Blocks for ML by approximately 15 percent over the weekend, with p5e.48xlarge instances jumping from $34.61 to $39.80 per hour in most regions. </li>
<li style="font-weight:400;">This marks a departure from AWS’s two-decade pattern of price reductions and represents one of the first straight increases to a line item not tied to regulatory requirements.</li>
<li style="font-weight:400;">Capacity Blocks allow customers to reserve guaranteed GPU capacity for ML training jobs from one day to several weeks in advance with locked-in rates paid upfront. </li>
<li style="font-weight:400;">AWS attributes the increase to supply and demand patterns for this quarter, reflecting the global GPU shortage driven by increased AI workload demand across the industry.</li>
<li style="font-weight:400;">The price increase creates complications for customers with Enterprise Discount Programs, as their percentage discounts remain the same, but absolute costs rise by 15 percent. </li>
<li style="font-weight:400;">This gives competitors like Azure and GCP a direct talking point for enterprise sales conversations, though whether they can absorb the demand remains uncertain given industry-wide GPU constraints.</li>
<li style="font-weight:400;">The change establishes a precedent that could extend to other resource-constrained services, particularly RAM-intensive offerings that touch nearly every AWS service. </li>
<li style="font-weight:400;">The timing and execution on a Saturday with minimal announcement suggest AWS is testing customer response to price increases after conditioning the market to expect only decreases.</li>
<li style="font-weight:400;">This affects primarily enterprise customers running serious ML workloads with budgets in the millions, as Capacity Block pricing targets teams that cannot afford training run interruptions. </li>
<li style="font-weight:400;">The broader concern is whether this signals a shift in AWS’s pricing strategy across other services where supply constraints or cost increases exist.</li>
</ul>
<p>29:31  Matt – “I don’t think it’s a broader concern; but I think it’s the first real time you’re seeing a dramatic increase, and it’s been a fear for many companies for many years…what if they raise the prices and there’s nothing we can do because we’re already there? And they’re doing it, and there’s not much you CAN do.” </p>
<p>30:51 <a href="https://aws.amazon.com/about-aws/whats-new/2026/01/ec2-capacity-manager-spot-interruption-metrics/">EC2 Capacity Manager now includes Spot interruption metrics</a></p>
<ul>
<li style="font-weight:400;">EC2 Capacity Manager adds three new Spot interruption metrics at no additional cost across all commercial AWS regions. </li>
<li style="font-weight:400;">The metrics track total Spot instance count, interruption counts, and interruption rates across regions, availability zones, and accounts to help optimize Spot placement strategies.</li>
<li style="font-weight:400;">The new visibility helps customers make data-driven decisions about Spot instance diversification by identifying patterns in interruptions. </li>
<li style="font-weight:400;">Organizations can use this data to determine which availability zones or instance types experience fewer interruptions and adjust their Spot strategies accordingly.</li>
<li style="font-weight:400;">This enhancement integrates with existing <a href="https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/spot-placement-score.html">Spot placement score</a> functionality to provide a complete picture of Spot capacity management. </li>
<li style="font-weight:400;">Customers can now correlate predicted availability scores with actual interruption data to validate and refine their capacity planning decisions.</li>
<li style="font-weight:400;">The metrics are particularly valuable for organizations running large-scale Spot fleets where even small improvements in interruption rates translate to meaningful cost savings. </li>
<li style="font-weight:400;">By tracking interruption rates over time, teams can measure the effectiveness of their diversification strategies and identify opportunities to expand into more stable capacity pools.</li>
</ul>
<p>31:11  Justin – “Or…you could just make this a service I could subscribe to.”</p>
<h2>GCP</h2>
<p>32:50 <a href="https://cloud.google.com/blog/products/business-intelligence/looker-self-service-explores-tabbed-dashboards-custom-themes/">Looker self-service Explores, tabbed dashboards, custom themes | Google </a><a href="https://cloud.google.com/blog/products/business-intelligence/looker-self-service-explores-tabbed-dashboards-custom-themes/">Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Y’all can thank Ryan if you’re not into this particular story. Hit him up on Slack and let him know your thoughts. </li>
<li style="font-weight:400;"><a href="https://lookerstudio.google.com/u/0/navigation/reporting">Looker</a> <a href="https://docs.cloud.google.com/looker/docs/exploring-self-service">now allows users to upload CSV and spreadsheet files directly into the platform</a> through a drag-and-drop interface in the new self-service Explores feature, currently in Public Preview. </li>
<li style="font-weight:400;">This bridges the gap between governed data models and ad-hoc analysis by letting users combine local files with existing Looker data while maintaining administrator oversight on uploads and permissions.</li>
<li style="font-weight:400;">The new tabbed dashboard feature helps organize complex dashboards into logical sections with automatic filter propagation across tabs, reducing visual clutter by showing only relevant filters per view. </li>
<li style="font-weight:400;">Users can share specific tab URLs and export entire multi-tab dashboards as single PDF documents, making it easier to present cohesive data narratives.</li>
<li style="font-weight:400;">Internal dashboard theming is now available in Public Preview, enabling organizations to customize tile styles, colors, fonts, and formatting to match corporate branding within the Looker application. </li>
<li style="font-weight:400;">Administrators can create reusable theme templates and set default themes across entire instances to ensure consistency.</li>
<li style="font-weight:400;">A new <a href="https://docs.cloud.google.com/looker/docs/content-certification">content certification flow</a> helps distinguish between ad-hoc experiments and vetted data sources, addressing governance concerns when users upload their own datasets. </li>
<li style="font-weight:400;">This feature works alongside administrator controls to maintain data quality standards while enabling self-service capabilities.</li>
<li style="font-weight:400;">These features are available starting with <a href="https://docs.cloud.google.com/looker/docs/release-notes#December_03_2025">Looker version 25.20</a> and can be enabled through the Admin Labs page, with no specific pricing changes announced as they appear to be included in existing Looker subscriptions.</li>
</ul>
<p>34:06  Ryan – “For everyone that has to supply you with pretty graphs and pictures, this is very important. It is very difficult to sort of modify and work with existing data sets in any BI tool, and so this is another knob that you can put. And I could use something like this for just uploading a very easy CSV of like product names or usernames or something that’s just a list, versus having to parse that out of a very large data set, which may have a combination of structured and unstructured data or just bad schema adherence. And so this is sort of a nice tool for being able to create those types of things.” </p>
<p>35:28 <a href="https://cloud.google.com/blog/products/databases/optimizing-alloydb-ai-text-to-sql-accuracy/">Optimizing AlloyDB AI text-to-SQL accuracy | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.cloud.google.com/alloydb/docs/ai/natural-language-landing?_gl=1*1upfho8*_ga*NjY0MDA3MDYuMTc2MzUyNzU2Mw..*_ga_WH2QY8WWF5*czE3NjQxMTAwNTgkbzYkZzEkdDE3NjQxMTAyNTEkajI0JGwwJGgw">AlloyDB AI’s natural language API</a>, currently in preview, enables developers to build agentic applications that translate <a href="https://cloud.google.com/blog/products/databases/how-to-get-gemini-to-deeply-understand-your-database?e=48754805">natural language questions into SQL</a> queries with near-100% accuracy. </li>
<li style="font-weight:400;">The system uses descriptive context like table descriptions, prescriptive context including SQL templates and facets for complex conditions, and a <a href="https://cloud.google.com/alloydb/docs/ai/use-natural-language-generate-sql-queries#value-index">value index</a> to disambiguate database-specific terms that foundation models wouldn’t recognize.</li>
<li style="font-weight:400;">The API addresses a critical business need where 80-90% accuracy isn’t sufficient, particularly in industries like real estate search and retail, where poor query interpretation directly impacts conversions and revenue. </li>
<li style="font-weight:400;">Users can iteratively improve accuracy through a hill-climbing approach, starting with out-of-the-box capabilities and progressively adding context to handle nuanced questions like “homes near good schools” that require specific business logic for terms like “near” and “good.”</li>
<li style="font-weight:400;">The system provides explainability features that show users what the API understood their question to mean, allowing agents and end users to verify the interpretation even when accuracy isn’t perfect. </li>
<li style="font-weight:400;">This transparency helps mitigate the impact of occasional misinterpretations while the system approaches 100% accuracy for specific use cases.</li>
<li style="font-weight:400;">Integration options include <a href="https://cloud.google.com/blog/products/ai-machine-learning/mcp-toolbox-for-databases-now-supports-model-context-protocol">MCP Toolbox for Databases</a> for developers writing AI tools or <a href="https://cloud.google.com/gemini-enterprise">Gemini Enterprise</a> for no-code agentic programming, allowing conversational applications that combine web knowledge with database queries. The technology works across structured, unstructured, and multimodal data using AlloyDB’s <a href="https://cloud.google.com/blog/products/databases/alloydb-ai-auto-vector-embeddings-and-auto-vector-index">vector search</a>, text search, and AI operators like <a href="http://ai.if">AI.IF</a> for semantic conditions.</li>
<li style="font-weight:400;">Google plans to expand this natural language capability beyond AlloyDB to a broader set of Google Cloud databases, though specific timelines and pricing details for the preview or general availability weren’t disclosed in the: announcement.</li>
</ul>
<p>36:43  Justin – “Natural language query – I am here for it.” </p>
<p>37:56 <a href="https://cloud.google.com/blog/products/ai-machine-learning/new-enhanced-tool-governance-in-vertex-ai-agent-builder/">New Enhanced Tool Governance in Vertex AI Agent Builder | Google Cloud </a><a href="https://cloud.google.com/blog/products/ai-machine-learning/new-enhanced-tool-governance-in-vertex-ai-agent-builder/">Blog</a></p>
<ul>
<li style="font-weight:400;">Google introduces enhanced tool governance for Vertex AI <a href="https://cloud.google.com/products/agent-builder?e=48754805&amp;hl=en">Agent Builder</a> through <a href="https://docs.cloud.google.com/api-registry/docs/overview">Cloud API Registry</a> integration, allowing administrators to centrally manage and curate approved tools across their organization while developers access them via a new <a href="https://google.github.io/adk-docs/tools/google-cloud/api-registry/">ApiRegistry</a> object in the Agent Development Kit. </li>
<li style="font-weight:400;">This addresses the duplicate work problem where developers previously built tools separately for each agent and gives enterprises better control over what data and APIs their AI agents can access.</li>
<li style="font-weight:400;">The <a href="https://google.github.io/adk-docs/">Agent Development Kit</a> now supports <a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/models/gemini/3-pro">Gemini 3 Pro</a> and Flash models with full <a href="https://github.com/google/adk-js">TypeScript</a> compatibility, plus improved <a href="https://google.github.io/adk-docs/sessions/state/">state management</a> features including automatic recovery from failures, human-in-the-loop pause and resume capabilities, and conversation rewind functionality. </li>
<li style="font-weight:400;">The new <a href="https://developers.googleblog.com/building-agents-with-the-adk-and-the-new-interactions-api/">Interactions API</a> integration provides consistent multimodal input/output handling across agents, while <a href="https://a2ui.org/">A2UI</a> enables agents to pass UI components directly to applications without the security risks of executable code.</li>
<li style="font-weight:400;">Agent Engine sessions and memory bank reach general availability, powered by Google Cloud AI Research’s topic-based approach for managing both short-term and long-term agent memory across interactions. </li>
<li style="font-weight:400;">The service expands to seven additional regions globally, with runtime pricing reduced and billing for additional <a href="https://docs.cloud.google.com/agent-builder/agent-engine/overview">Agent Engine</a> services beginning January 28, 2026 (specific pricing details available in documentation).</li>
<li style="font-weight:400;">Customer implementations show practical benefits: Burns &amp; McDonnell uses <a href="https://cloud.google.com/products/agent-builder?e=48754805&amp;hl=en">Agent Builder</a> to transform project data into real-time intelligence, Payhawk reduced expense submission time by over 50 percent through Memory Bank’s context retention, and Gurunavi projects a 30 percent improvement in user experience for their restaurant discovery app by remembering user preferences and patterns.</li>
<li style="font-weight:400;">The platform now includes <a href="http://console.cloud.google.com/vertex-ai/agents/agent-garden">Vertex AI Agent Garden</a> with one-click deployment of curated agent samples and an <a href="https://googlecloudplatform.github.io/agent-starter-pack/">Agent Starter Pack</a> providing production-ready templates for building, testing, and deploying agents. </li>
<li style="font-weight:400;">Apigee integration allows organizations to transform existing managed APIs into custom MCP servers, bringing multi-cloud tools into a centralized catalog through <a href="https://docs.cloud.google.com/api-registry/docs/overview">Cloud API Registry</a>.</li>
</ul>
<p>38:47   Ryan – “This just goes to show how early we are in this ecosystem. Companies are just starting to sort of get wise that they’ve got a whole bunch of developers using these platforms, and they’re all kind of doing their own things and separate little silos and there’s very little ability to share or get any kind of optimization with those central resources… I do think that this is a good thing.” </p>
<p>39:55 <a href="https://cloud.google.com/blog/products/compute/introducing-vm-extensions-manager/">Introducing VM Extensions Manager | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google launches VM Extensions Manager in preview to centralize and automate the installation and lifecycle management of OS agents across Compute Engine fleets. </li>
<li style="font-weight:400;">The service eliminates manual scripting and startup script dependencies by providing policy-driven control that can reduce operational overhead from months to hours, according to Google.</li>
<li style="font-weight:400;">The preview supports three critical extensions at launch: <a href="https://docs.cloud.google.com/logging/docs/agent/ops-agent/agent-vmem-policies">Cloud Ops Agent</a> for telemetry collection, <a href="https://docs.cloud.google.com/workload-manager/docs/evaluate/set-up-agent-for-sap">Agent for SAP</a> for monitoring SAP workloads, and <a href="https://docs.cloud.google.com/compute/docs/instances/agent-for-compute-workloads">Agent for Compute Workloads</a> for workload evaluation. </li>
<li style="font-weight:400;">Administrators can pin specific extension versions or let the system automatically deploy the latest releases, with more extensions planned for future support.</li>
<li style="font-weight:400;">VM Extensions Manager offers two rollout speeds for global policies: SLOW mode executes zone-by-zone deployments over 5 days by default to minimize risk, while FAST mode enables immediate fleet-wide updates for urgent security patches. </li>
<li style="font-weight:400;">Zonal policies at the project level are available now, with global policies and organization or folder-level policies coming in the following months.</li>
<li style="font-weight:400;">The service integrates directly into the existing compute.googleapis.com API without requiring new API enablement or discovery, allowing administrators to start creating policies immediately through the Cloud Console or gcloud CLI. Documentation is available <a href="https://docs.cloud.google.com/compute/docs/vm-extensions/about-vm-extension-manager">here</a>. </li>
</ul>
<p>42:18   Matt – “I like that they released both of those day one – both slow and fast mode.” </p>
<p>43:23 <a href="https://cloud.google.com/blog/products/databases/cloud-sql-for-mysql-introduces-optimized-writes/">Cloud SQL for MySQL introduces optimized writes | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Cloud SQL for MySQL Enterprise Plus edition now includes <a href="https://cloud.google.com/sql/docs/mysql/flags#tips-optimized-write">optimized writes</a>, a feature that automatically tunes five different MySQL parameters and configurations based on real-time workload metrics to improve write performance. </li>
<li style="font-weight:400;">The feature is enabled by default on all <a href="https://cloud.google.com/sql/docs/editions-intro">Enterprise Plus</a> instances and requires no manual intervention or configuration changes.</li>
<li style="font-weight:400;">Google reports up to 3x better write throughput compared to the standard Enterprise edition, with reduced latency, particularly beneficial for write-intensive OLTP workloads. </li>
<li style="font-weight:400;">Performance gains vary based on machine configuration, and the feature complements the existing SSD-backed data cache that provides up to 3x higher read throughput.</li>
<li style="font-weight:400;">The optimized writes feature works by automatically adjusting MySQL flags, data handling, and parameters in response to instance and workload characteristics. </li>
<li style="font-weight:400;">Customers can benchmark the improvements using sysbench by comparingthe  Enterprise edition, the Enterprise Plus without optimized writes, and the Enterprise Plus with optimized writes enabled.</li>
<li style="font-weight:400;">Existing Cloud SQL instances can upgrade to the Enterprise Plus edition in-place to access optimized writes, though specific pricing details for the Enterprise Plus tier are not provided in the announcement. </li>
<li style="font-weight:400;">The feature targets organizations running write-heavy database workloads that previously required manual MySQL tuning and optimization efforts.</li>
</ul>
<h2>Azure </h2>
<p>43:23 <a href="https://blogs.microsoft.com/blog/2026/01/05/microsoft-announces-acquisition-of-osmos-to-accelerate-autonomous-data-engineering-in-fabric/">Microsoft announces acquisition of Osmos to accelerate autonomous data </a><a href="https://blogs.microsoft.com/blog/2026/01/05/microsoft-announces-acquisition-of-osmos-to-accelerate-autonomous-data-engineering-in-fabric/">engineering in Fabric – The Official Microsoft Blog</a></p>
<ul>
<li style="font-weight:400;">Microsoft acquires Osmos to bring agentic AI capabilities to <a href="https://www.microsoft.com/en-us/microsoft-fabric/blog">Fabric</a> for autonomous data engineering workflows. Osmos uses AI agents to automate data preparation tasks that typically consume most of data teams’ time, transforming raw data into analytics-ready assets in <a href="https://learn.microsoft.com/en-us/fabric/onelake/onelake-overview">OneLake</a> without manual intervention.</li>
<li style="font-weight:400;">The acquisition addresses a common enterprise challenge where organizations have abundant data but lack efficient ways to make it actionable. </li>
<li style="font-weight:400;">Osmos will integrate into Microsoft Fabric’s unified data platform, allowing AI agents to handle data connection, preparation, and transformation tasks that currently require significant manual effort and technical expertise.</li>
<li style="font-weight:400;">The Osmos team joins Microsoft’s Fabric engineering organization to advance autonomous data operations within the existing Fabric ecosystem. </li>
<li style="font-weight:400;">This builds on Fabric’s existing capabilities around OneLake, Power BI, and unified data analytics by adding intelligent automation for data engineering workflows.</li>
<li style="font-weight:400;">No pricing details or availability timeline were announced, though Microsoft indicates integration updates will be shared through the Microsoft Fabric Blog. The acquisition targets organizations spending excessive resources on data preparation rather than analysis, particularly those already invested in the Fabric ecosystem.</li>
</ul>
<p>45:32   Ryan – “As long as they deliver on the promise. There’s been solutions that make the same promise – not with AI… and it just never works the way it should. Drives me nuts that it’s so failure prone. As long as the AI and Agentic add to these things, that’s fantastic.” </p>
<p>46:38 <a href="https://azure.microsoft.com/en-us/blog/microsofts-strategic-ai-datacenter-planning-enables-seamless-large-scale-nvidia-rubin-deployments/">Microsoft’s strategic AI datacenter planning enables seamless, large-scale </a><a href="https://azure.microsoft.com/en-us/blog/microsofts-strategic-ai-datacenter-planning-enables-seamless-large-scale-nvidia-rubin-deployments/">NVIDIA Rubin deployments | Microsoft Azure Blog</a></p>
<ul>
<li style="font-weight:400;">Azure is deploying NVIDIA’s next-generation Rubin platform at scale, with infrastructure already designed to handle its power, cooling, and networking requirements. </li>
<li style="font-weight:400;">Microsoft’s Fairwater datacenters in Wisconsin and Atlanta can accommodate Rubin’s 50 petaflops per chip and 3.6 exaflops per rack without retrofitting, representing a five-times performance jump over <a href="https://www.nvidia.com/en-us/data-center/gb200-nvl72/">GB200</a> systems.</li>
<li style="font-weight:400;">The deployment leverages Azure’s systems approach where compute, networking, storage, and infrastructure work as an integrated platform. </li>
<li style="font-weight:400;">Key technical enablements include support for sixth-generation <a href="https://www.nvidia.com/en-us/data-center/nvlink/">NVLink</a> with 260 TB/s bandwidth, ConnectX-9 1,600 Gb/s networking, HBM4 memory thermal management, and pod exchange architecture for rapid hardware servicing without extensive rewiring.</li>
<li style="font-weight:400;">Azure’s track record includes operating the world’s largest commercial <a href="https://www.nvidia.com/en-eu/networking/quantum2/">InfiniBand</a> deployments and being first to deploy both GB200 and <a href="https://www.nvidia.com/en-us/data-center/gb300-nvl72/">GB300</a> NVL72 platforms at scale. </li>
<li style="font-weight:400;">The company’s multi-year collaboration with NVIDIA on co-design means Rubin integrates directly into existing infrastructure, enabling faster customer deployments compared to competitors who need infrastructure upgrades.</li>
<li style="font-weight:400;">Microsoft’s regional superfactory approach differs from other hyperscalers’ single megasite strategy, allowing more predictable global rollout of new AI capabilities. </li>
<li style="font-weight:400;">This modular design combined with <a href="https://azure.microsoft.com/en-us/products/virtual-machines/boost/?msockid=2d15e68042986f6815c7f05343506e7e">Azure Boost</a> offload engines, liquid cooling systems, and optimized orchestration through CycleCloud and AKS aims to maximize GPU utilization and deliver better performance per dollar at cluster scale.</li>
</ul>
<h2>Oracle </h2>
<p>46:38 <a href="https://www.oracle.com/news/announcement/blog/oracle-is-set-to-power-on-new-data-center-in-michigan-2025-1018/">Oracle is Set to Power on New Data Center in Michigan</a></p>
<ul>
<li style="font-weight:400;">Oracle is building a new data center in Saline Township, Michigan specifically to serve OpenAI’s infrastructure needs, marking another major cloud capacity expansion for AI workloads. </li>
<li style="font-weight:400;">The facility will use closed-loop non-evaporative cooling systems that consume water comparable to an average office building rather than millions of gallons daily like traditional evaporative systems.</li>
<li style="font-weight:400;">The project includes a 17-year power agreement with DTE Energy where Oracle pays 100% of energy costs including new transmission lines and an onsite substation, with Michigan law prohibiting utilities from passing data center costs to existing ratepayers. </li>
<li style="font-weight:400;">Oracle claims its large customer contribution to DTE’s fixed costs will reduce overall energy costs for other customers by approximately $300 million annually by 2029-2030.</li>
<li style="font-weight:400;">The facility will create 2,500 union construction jobs and 450 permanent on-site positions plus an estimated 1,500 jobs across Washtenaw County, with construction scheduled to begin in Q1 2026. The project includes $8 million annually for local schools, $1.6 million yearly in direct tax revenue for Saline Township, and over $14 million in community benefits.</li>
<li style="font-weight:400;">Oracle is developing only 250 of 575 acres with the remaining land protected as open space, farmland, wetlands and woodlands including 47.5 acres in conservation easement. </li>
<li style="font-weight:400;">This represents Oracle’s 148th data center with 64 more under construction globally, though the company provides no specific pricing or service details for customers beyond OpenAI.</li>
</ul>
<p>52:15  Ryan – “But are you trading the water concern for the high energy costs?”</p>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2326337/c1e-k5d5sdmg6kf5qmxw-mkgxr2kob7v5-en3fbh.mp3" length="100100206"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[ Welcome to episode 337 of The Cloud Pod, where the forecast is always cloudy! Justin, Matt, and Ryan have hit the recording studio to bring you all the latest in cloud and AI news, from acquisitions and price hikes to new tools that Ryan somehow loves but also hates? We don’t understand either… but let’s get started! 
Titles we almost went with this week

Prompt Engineering Our Way Into Trouble
The Demo Worked Yesterday, We Swear
It Scales Horizontally, Trust Us
Responsible AI But Terrible Copy (Marketing Edition)

General News 
00:58 Watch ‘The Thinking Game’ documentary for free on YouTube

Google DeepMind is releasing the “The Thinking Game” documentary for free on YouTube starting November 25, marking the fifth anniversary of AlphaFold. 
The feature-length film provides behind-the-scenes access to the AI lab and documents the team’s work toward artificial general intelligence over five years.
The documentary captures the moment when the AlphaFold team learned they had solved the 50-year protein folding problem in biology, a scientific achievement that recently earned Demis Hassabis and John Jumper the Nobel Prize in Chemistry. 
This represents one of the most significant practical applications of deep learning to fundamental scientific research.
The film was produced by the same award-winning team that created the AlphaGo documentary, which chronicled DeepMind’s earlier achievement in mastering the game of Go. For cloud and AI practitioners, this offers insight into how Google DeepMind approaches complex AI research problems and the development process behind their models.
While this is primarily a documentary release rather than a technical product announcement, it provides context for understanding Google’s broader AI strategy and the research foundation underlying its cloud AI services. The AlphaFold model itself is available through Google Cloud for protein structure prediction workloads.

01:54  Justin – “If you’re not into technology, don’t care about any of that, and don’t care about AI and how they built all the AI models that are now powering the world of LLMs we have, you will not like this documentary.” 
04:22 ServiceNow to buy Armis in $7.7 billion security deal • The Register

ServiceNow is acquiring Armis for $7.75 billion to integrate real-time security intelligence with its Configuration Management Database, allowing customers to identify vulnerabilities across IT, OT, and medical devices and remediate them through automated workflows. 
The deal is expected to close in the second half of 2026 and aims to triple ServiceNow’s current $1 billion annual security revenue.
The acquisition represents a strategic data play when combined with ServiceNow’s recent purchase of Data.World, giving the company both massive volumes of se...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2326337/c1a-k5d5-jpq2ogg0ammm-jmjbfj.jpg"></itunes:image>
                                                                            <itunes:duration>00:52:01</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2326337/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[336: We Were Right (Mostly), 2026: The New Prophecies]]>
                </title>
                <pubDate>Tue, 13 Jan 2026 04:49:10 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2320455</guid>
                                    <link>https://tcpfm.castos.com/episodes/336-we-were-right-mostly-2026-the-new-prophecies</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 335 of The Cloud Pod, where the forecast is always cloudy! Welcome to the first show of 2026, and it’s a full house, too! Justin, Jonathan, Ryan,  and Matt are all here to reflect on 2025, plus bring you their predictions for 2026.</p>
<p>Let’s get started! </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> SQL Me Maybe: AlloyDB Gets Chatty With Your Database **OpenAI</li>
<li>SELECT * FROM natural_language WHERE accuracy LIKE ‘100%’ **Anthropic</li>
<li> etcd You Were Worried About Database Limits: CloudWatch Has Your Back</li>
<li> CSV You Later: Looker Adds Drag-and-Drop Data Uploads</li>
<li> AWS Spots an Opportunity to Manage Your Container Costs</li>
<li> EKS Network Policies: No More IP Address Whack-a-Mole</li>
<li> AWS Security Hub Splits: It’s Not You, It’s CSPM</li>
<li> Spot On: ECS Finally Manages Your Cheapest Compute</li>
<li> TOON Squad: DigitalOcean’s New Format Makes JSON Look Bloated</li>
<li> The Price is Wrong: AWS Breaks Two Decades of Downward Pricing Tradition</li>
<li> Show Your Work: Why AI-Generated Code Without Tests is Just Expensive Spam</li>
<li> No More Agent Orange: Google Simplifies VM Extension Deployment</li>
<li> AWS Discovers Prices Can Go Both Ways, Raises GPU Costs 15 Percent</li>
<li> Sovereignty Washing: When Your European Cloud Still Answers to Uncle Sam</li>
<li> Agent Builder Gets a Memory Upgrade: Google’s AI Finally Remembers Where It Put Its Keys</li>
<li> Ctrl+F for the Future: A year-end Scorecard &amp; Next-Gen Bets</li>
<li> AI Agents, GPU Prices, and The best of the Cloud Pod 2025</li>
<li> Beyond the Hype: The Cloud Pods Definitive 2025 Year in Review</li>
<li> Apocalypse Now… What? Our 2026 Forecast</li>
</ul>
<p> </p>
<h2>Follow Up </h2>
<p>01:27 RYAN’S PREDICTIONS</p>



Prediction
Status
Notes


Quick LLM models for individuals
 ACCURATE
Meta-Llama-3.1-8B-Instruct, GLM-4-9B-0414, and Qwen2.5-VL-7B-Instruct—each chosen for an outstanding balance of performance and computational efficiency, making them ideal for edge AI deployment. A new AI inference application called Inferencer allows even modest Apple Mac computers to run the largest open-source LLMs.


AI at the edge natively (Lambda-esque)
 ACCURATE
Akamai launched a new Inference Cloud product for edge AI using Nvidia’s Blackwell 6000 GPUs in 17 cities. AWS IoT Greengrass with Lambda functions for edge logic. “Edge AI allows for instant decision-making where it matters most—close to the data source.”


Cloud native security mesh multi-cloud
 UNCLEAR
Service mesh technologies continue to evolve (Istio, Linkerd), but I didn’t find a breakthrough “app-to-app at the edge” security mesh product announcement in 2025. This one needs more specific evidence.



<p>Ryan Score: 2/3 </p>
<p>02:25 MATTHEW’S PREDICTIONS</p>



Prediction
Status
Notes


FOCUS adopted by Snowflake or Databricks
 ACCURATE
FOCUS version 1.2 was ratified on May 29, 2025. Three new providers announced support: Alibaba Cloud, Databricks, and Grafana. Databricks officially adopted FOCUS!


AI security/ethical standard (SOC or ISO)
 ACCURATE
ISO 42001 is the first international standard outlining requirements for AI governance. Major companies achieving certification in 2025: Automation Anywhere is among the first 100 companies worldwide to earn ISO/IEC 42001:2023 certification. Anthropic also achieved ISO 42001 certification.


Amazon deprecates 5+ services (WorkMail bonus)
 ACCURATE (no bonus)
19 services are mothballed, four are being sunset, and one is end of its supported life. Deprecated services include CodeCommit, Cloud9, S3 Select, CloudSearch, SimpleDB, Forecast, Data Pipeline, QLDB, Snowball Edge, and more. WorkMail NOT deprecated – WorkDocs was (April 2025), but WorkMail remains active.



<p>Matthew Score: 3/3 </p>
<p>03:22 JONATHAN’S PREDICTIONS</p>



Prediction
Status
Notes


Company claims AGI achieved
 ACCURATE
Integral AI, founded by ex-Google veteran Jad Tarifi, claims to have built a world-first AGI mo...
<h3>Chapters</h3>
<ul><li>(00:00:00) - 2019: The New Prophecies</li><li>(00:01:16) - 2018 Cloud Predictions: The Best Ever</li><li>(00:06:21) - Cloud Provider Coverage on The Show</li><li>(00:08:20) - Ryan on Host Participation</li><li>(00:09:22) - AI Spelled Out 596 Times in 2025</li><li>(00:10:51) - A Year in the Life of AWS</li><li>(00:12:03) - A Year in the Life of AI</li><li>(00:14:38) - How to Build an AI Chatbot</li><li>(00:21:43) - Cloud Hub: Update the Website, Build a CMS</li><li>(00:24:48) - Top 3 Stories From 2025</li><li>(00:27:12) - Agent to Agent: The Technology Standard</li><li>(00:29:26) - Amazon's Nova: Underused, but Solid</li><li>(00:31:51) - GitHub's migration to Azure</li><li>(00:35:29) - Cloud 2.8 & Cloud 4</li><li>(00:40:34) - ECS 12. Quality of Life</li><li>(00:46:09) - Top 10 Cloud Outages predicted for 2021</li><li>(00:47:05) - Top 10 Predictions for 2021</li><li>(00:47:54) - Predictions for the AI Industry in 2017</li><li>(00:50:23) - Quantum Computing: A Step Forward in 2026</li><li>(00:53:24) - I Predict the First AI Agent Security Breach</li><li>(00:55:02) - 2026: Infrastructure as a Human Language</li><li>(00:55:49) - Will AI End the SaaS Business?</li><li>(00:59:26) - I Predict One More AI-Specific Cloud Hitter</li><li>(01:02:00) - AI-First Design on Websites</li><li>(01:03:00) - Top 4 Predictions for the Future of Content</li><li>(01:05:56) - Last Year's Prediction: AI-Generated Podcast</li><li>(01:07:38) - Week in the Cloud: January 1</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 335 of The Cloud Pod, where the forecast is always cloudy! Welcome to the first show of 2026, and it’s a full house, too! Justin, Jonathan, Ryan,  and Matt are all here to reflect on 2025, plus bring you their predictions for 2026.
Let’s get started! 
Titles we almost went with this week

 SQL Me Maybe: AlloyDB Gets Chatty With Your Database **OpenAI
SELECT * FROM natural_language WHERE accuracy LIKE ‘100%’ **Anthropic
 etcd You Were Worried About Database Limits: CloudWatch Has Your Back
 CSV You Later: Looker Adds Drag-and-Drop Data Uploads
 AWS Spots an Opportunity to Manage Your Container Costs
 EKS Network Policies: No More IP Address Whack-a-Mole
 AWS Security Hub Splits: It’s Not You, It’s CSPM
 Spot On: ECS Finally Manages Your Cheapest Compute
 TOON Squad: DigitalOcean’s New Format Makes JSON Look Bloated
 The Price is Wrong: AWS Breaks Two Decades of Downward Pricing Tradition
 Show Your Work: Why AI-Generated Code Without Tests is Just Expensive Spam
 No More Agent Orange: Google Simplifies VM Extension Deployment
 AWS Discovers Prices Can Go Both Ways, Raises GPU Costs 15 Percent
 Sovereignty Washing: When Your European Cloud Still Answers to Uncle Sam
 Agent Builder Gets a Memory Upgrade: Google’s AI Finally Remembers Where It Put Its Keys
 Ctrl+F for the Future: A year-end Scorecard & Next-Gen Bets
 AI Agents, GPU Prices, and The best of the Cloud Pod 2025
 Beyond the Hype: The Cloud Pods Definitive 2025 Year in Review
 Apocalypse Now… What? Our 2026 Forecast

 
Follow Up 
01:27 RYAN’S PREDICTIONS



Prediction
Status
Notes


Quick LLM models for individuals
 ACCURATE
Meta-Llama-3.1-8B-Instruct, GLM-4-9B-0414, and Qwen2.5-VL-7B-Instruct—each chosen for an outstanding balance of performance and computational efficiency, making them ideal for edge AI deployment. A new AI inference application called Inferencer allows even modest Apple Mac computers to run the largest open-source LLMs.


AI at the edge natively (Lambda-esque)
 ACCURATE
Akamai launched a new Inference Cloud product for edge AI using Nvidia’s Blackwell 6000 GPUs in 17 cities. AWS IoT Greengrass with Lambda functions for edge logic. “Edge AI allows for instant decision-making where it matters most—close to the data source.”


Cloud native security mesh multi-cloud
 UNCLEAR
Service mesh technologies continue to evolve (Istio, Linkerd), but I didn’t find a breakthrough “app-to-app at the edge” security mesh product announcement in 2025. This one needs more specific evidence.



Ryan Score: 2/3 
02:25 MATTHEW’S PREDICTIONS



Prediction
Status
Notes


FOCUS adopted by Snowflake or Databricks
 ACCURATE
FOCUS version 1.2 was ratified on May 29, 2025. Three new providers announced support: Alibaba Cloud, Databricks, and Grafana. Databricks officially adopted FOCUS!


AI security/ethical standard (SOC or ISO)
 ACCURATE
ISO 42001 is the first international standard outlining requirements for AI governance. Major companies achieving certification in 2025: Automation Anywhere is among the first 100 companies worldwide to earn ISO/IEC 42001:2023 certification. Anthropic also achieved ISO 42001 certification.


Amazon deprecates 5+ services (WorkMail bonus)
 ACCURATE (no bonus)
19 services are mothballed, four are being sunset, and one is end of its supported life. Deprecated services include CodeCommit, Cloud9, S3 Select, CloudSearch, SimpleDB, Forecast, Data Pipeline, QLDB, Snowball Edge, and more. WorkMail NOT deprecated – WorkDocs was (April 2025), but WorkMail remains active.



Matthew Score: 3/3 
03:22 JONATHAN’S PREDICTIONS



Prediction
Status
Notes


Company claims AGI achieved
 ACCURATE
Integral AI, founded by ex-Google veteran Jad Tarifi, claims to have built a world-first AGI mo...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[336: We Were Right (Mostly), 2026: The New Prophecies]]>
                </itunes:title>
                                    <itunes:episode>336</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 335 of The Cloud Pod, where the forecast is always cloudy! Welcome to the first show of 2026, and it’s a full house, too! Justin, Jonathan, Ryan,  and Matt are all here to reflect on 2025, plus bring you their predictions for 2026.</p>
<p>Let’s get started! </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> SQL Me Maybe: AlloyDB Gets Chatty With Your Database **OpenAI</li>
<li>SELECT * FROM natural_language WHERE accuracy LIKE ‘100%’ **Anthropic</li>
<li> etcd You Were Worried About Database Limits: CloudWatch Has Your Back</li>
<li> CSV You Later: Looker Adds Drag-and-Drop Data Uploads</li>
<li> AWS Spots an Opportunity to Manage Your Container Costs</li>
<li> EKS Network Policies: No More IP Address Whack-a-Mole</li>
<li> AWS Security Hub Splits: It’s Not You, It’s CSPM</li>
<li> Spot On: ECS Finally Manages Your Cheapest Compute</li>
<li> TOON Squad: DigitalOcean’s New Format Makes JSON Look Bloated</li>
<li> The Price is Wrong: AWS Breaks Two Decades of Downward Pricing Tradition</li>
<li> Show Your Work: Why AI-Generated Code Without Tests is Just Expensive Spam</li>
<li> No More Agent Orange: Google Simplifies VM Extension Deployment</li>
<li> AWS Discovers Prices Can Go Both Ways, Raises GPU Costs 15 Percent</li>
<li> Sovereignty Washing: When Your European Cloud Still Answers to Uncle Sam</li>
<li> Agent Builder Gets a Memory Upgrade: Google’s AI Finally Remembers Where It Put Its Keys</li>
<li> Ctrl+F for the Future: A year-end Scorecard &amp; Next-Gen Bets</li>
<li> AI Agents, GPU Prices, and The best of the Cloud Pod 2025</li>
<li> Beyond the Hype: The Cloud Pods Definitive 2025 Year in Review</li>
<li> Apocalypse Now… What? Our 2026 Forecast</li>
</ul>
<p> </p>
<h2>Follow Up </h2>
<p>01:27 RYAN’S PREDICTIONS</p>



Prediction
Status
Notes


Quick LLM models for individuals
 ACCURATE
Meta-Llama-3.1-8B-Instruct, GLM-4-9B-0414, and Qwen2.5-VL-7B-Instruct—each chosen for an outstanding balance of performance and computational efficiency, making them ideal for edge AI deployment. A new AI inference application called Inferencer allows even modest Apple Mac computers to run the largest open-source LLMs.


AI at the edge natively (Lambda-esque)
 ACCURATE
Akamai launched a new Inference Cloud product for edge AI using Nvidia’s Blackwell 6000 GPUs in 17 cities. AWS IoT Greengrass with Lambda functions for edge logic. “Edge AI allows for instant decision-making where it matters most—close to the data source.”


Cloud native security mesh multi-cloud
 UNCLEAR
Service mesh technologies continue to evolve (Istio, Linkerd), but I didn’t find a breakthrough “app-to-app at the edge” security mesh product announcement in 2025. This one needs more specific evidence.



<p>Ryan Score: 2/3 </p>
<p>02:25 MATTHEW’S PREDICTIONS</p>



Prediction
Status
Notes


FOCUS adopted by Snowflake or Databricks
 ACCURATE
FOCUS version 1.2 was ratified on May 29, 2025. Three new providers announced support: Alibaba Cloud, Databricks, and Grafana. Databricks officially adopted FOCUS!


AI security/ethical standard (SOC or ISO)
 ACCURATE
ISO 42001 is the first international standard outlining requirements for AI governance. Major companies achieving certification in 2025: Automation Anywhere is among the first 100 companies worldwide to earn ISO/IEC 42001:2023 certification. Anthropic also achieved ISO 42001 certification.


Amazon deprecates 5+ services (WorkMail bonus)
 ACCURATE (no bonus)
19 services are mothballed, four are being sunset, and one is end of its supported life. Deprecated services include CodeCommit, Cloud9, S3 Select, CloudSearch, SimpleDB, Forecast, Data Pipeline, QLDB, Snowball Edge, and more. WorkMail NOT deprecated – WorkDocs was (April 2025), but WorkMail remains active.



<p>Matthew Score: 3/3 </p>
<p>03:22 JONATHAN’S PREDICTIONS</p>



Prediction
Status
Notes


Company claims AGI achieved
 ACCURATE
Integral AI, founded by ex-Google veteran Jad Tarifi, claims to have built a world-first AGI model (December 2025). Also, Sam Altman called GPT-5 “a significant step along the path to AGI” at release.


AI agents booking reservations/real-world tasks
 FULLY ACCURATE
OpenAI’s Operator can execute tasks like filling out forms, managing online reservations, and even booking tickets to sporting events. Google AI Mode’s agentic capabilities help take the hassle out of booking restaurant reservations, event tickets, or beauty and wellness appointments.


Models that can learn in real-time
 PARTIALLY ACCURATE
Extended context windows and memory systems have improved dramatically. Claude 4 has “memory capabilities, extracting and saving key facts to maintain continuity.” However, true real-time learning/weight updates during conversations haven’t fully materialized yet.



<p>Jonathan Score: 2.5/3 </p>
<p>05:07  JUSTIN’S PREDICTIONS</p>



Prediction
Status
Notes


GPT-5, Claude 4, and Gemini 3.0
 FULLY ACCURATE
GPT-5 (August 7, 2025), Claude 4 (May 22, 2025), Gemini 3 (November 18, 2025). All three major models have been released! Plus, we’ve already seen GPT-5.1, GPT-5.2, and Claude Opus 4.5.


OpenAI is not seen as a leader
 ACCURATE
ChatGPT’s user growth is slowing, and Google’s Gemini is gaining ground. Anthropic now holds 32% of the enterprise LLM market share by usage, with OpenAI at 25%—a sharp reversal from 50% vs. 12% in 2023. Sam Altman issued a “code red” memo following the release of Gemini 3.


10+ companies RTO 5 days after Q2
 PARTIALLY ACCURATE
Major announcements after Q2: Novo Nordisk, Paramount Skydance, NBCUniversal, Instagram, Starbucks, Samsung, Freddie Mac. Many 5-day mandates took effect in 2025 (Amazon, AT&amp;T, JPMorgan, Dell), but several were announced pre-Q2. Close call.



<p>Justin Score: 2.5/3 </p>
<p>JONATHAN’S PREDICTIONS</p>



Prediction
Status
Notes


Company claims AGI achieved
 ACCURATE
Integral AI, founded by ex-Google veteran Jad Tarifi, claims to have built a world-first AGI model (December 2025). Also, Sam Altman called GPT-5 “a significant step along the path to AGI” at release.


AI agents booking reservations/real-world tasks
 FULLY ACCURATE
OpenAI’s Operator can execute tasks like filling out forms, managing online reservations, and even booking tickets to sporting events. Google AI Mode’s agentic capabilities help take the hassle out of booking restaurant reservations, event tickets, or beauty and wellness appointments.


Models that can learn in real-time
 PARTIALLY ACCURATE
Extended context windows and memory systems have improved dramatically. Claude 4 has “memory capabilities, extracting and saving key facts to maintain continuity.” However, true real-time learning/weight updates during conversations haven’t fully materialized yet.



<p>Jonathan Score: 2.5/3 </p>
<p> FINAL STANDINGS</p>



Host
Score
Grade


Matthew
3/3
 A+


Justin
2.5/3
 A


Jonathan
2.5/3
 A


Ryan
2/3
 B+



<p> Key Takeaways for the Pod</p>
<ol>
<li style="font-weight:400;">The AI model predictions were NAILED – All three major model releases happened exactly as predicted.</li>
<li style="font-weight:400;">OpenAI’s dominance really did slip – Anthropic now leads enterprise, Gemini is surging, Sam issued “code red.”</li>
<li style="font-weight:400;">AI agents are HERE – OpenAI Operator and Google AI Mode are booking real reservations.</li>
<li style="font-weight:400;">AWS deprecation wave was massive – Way more than 5 services axed (but WorkMail survived!)</li>
<li style="font-weight:400;">Edge AI exploded – Akamai, AWS, and others went all-in on inference at the edge.e</li>
</ol>
<p>Solid predictions all around – Matthew takes the crown! </p>
<p>06:08  Jonathan – “That’s good; it only took us 6 years to know what the hell we’re talking about!” </p>
<p>06:23 2025 Stats Review</p>
<ul>
<li style="font-weight:400;">We covered 1,308 stories from 15 different, unique sources.</li>
<li style="font-weight:400;">Amazon accounted for 39% of those stories.</li>
<li style="font-weight:400;">Ryan’s favorite, Azure, made up 22.9% of the stories (Thanks, Matt…) </li>
<li style="font-weight:400;">GCP was 38.1% of our news announcements. </li>
<li style="font-weight:400;">The official blogs from cloud providers, including <a href="https://aws.amazon.com/blogs/">AWS</a>, <a href="https://azure.microsoft.com/en-us/blog/">Azure</a>, and <a href="https://cloud.google.com/blog">GCP</a>, made up the bulk of the sources for the above stories. </li>
<li style="font-weight:400;">This is an interesting change from the first year we recorded, 2019, when AWS accounted for 73% of the announcements. </li>
<li style="font-weight:400;">When it comes to host participation, only 6 shows had all four hosts participating. Justin was present for 95%, Ryan for 85%, Matt recorded 78% (not bad with a new baby, honestly), and we had Jonathan for 12 episodes. </li>
<li style="font-weight:400;">We only had one guest, and increasing the number of guests is one of our 2026 resolutions, so thanks to Elise for joining us. </li>
<li style="font-weight:400;">AI was mentioned 526 times, averaging 12.2x per episode (which seems low to the show note editor), and has definitely been growing each year exponentially. </li>
<li style="font-weight:400;">Outages were discussed 19 times (boooo). </li>
<li style="font-weight:400;">And we got to talk about our favorite topic, deep-sea cables, 5 times. </li>
<li style="font-weight:400;">There were 58.9 hours of runtime over the course of 49 shows, with an average length of 72 minutes.</li>
<li style="font-weight:400;">The in memorium includes AWS Cloud Search, Glacier, Migration Hub, S3 Object Lambda, Azure Consumption API, dial-up internet, and RC4 encryption, among many others. RIP.  </li>
<li style="font-weight:400;">The most mentioned non-hyperscaler company was OpenAI, followed closely by Nvidia and Antropic. </li>
<li style="font-weight:400;">Lastly, Justin has updated our show LLM Bolt, building a brand new data pipeline for the podcast, which will include show notes, transcripts, etc., all with a new AI-based search. Want to check it out? Join our Slack channel! </li>
</ul>
<p>16:28  Ryan – “I’m having a similar experience mostly in my day job… trying to use AI for different workloads and then falling back into more traditional technologies or different ways, and at first I thought it was just like old dog, new tricks, just falling back in the comfort zone. But I find more and more I’m identifying things that, you know, the large language models just are not good at. And I think a lot of stats and the metrics, it feels like it should be able to do that, right? Because it’s conversational and you’re building a corpus of data for the model to query and do all that, but that it really can’t, right? And so, fortunately, we do have machine learning technologies and the ability to do notebooks and stuff. And agentic can absolutely help you make the notebook, but it can’t do the analysis for you, which I find funny.”</p>
<p>To be a good vibe coder, you need to be an experienced programmer, you need to have business experience, and I don’t think the people who are vibe coding right now are getting really good results if they don’t have that kind of background.” </p>
<p><a href="https://tcp-media.s3.us-west-2.amazonaws.com/2025_year_in_review.html">https://tcp-media.s3.us-west-2.amazonaws.com/2025_year_in_review.html</a> </p>
<p>25:54  Favorite Announcements</p>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;">Justin:
<ul>
<li style="font-weight:400;">Amazon saying F*** your security to Microsoft was great. 
<ul>
<li style="font-weight:400;">Episode 287: Recorded for the week of Jan 8th, 2025: The Cloud Pod rebrands to The Cloud AI so we can get a 1B valuation.</li>
<li style="font-weight:400;"><a href="https://www.csoonline.com/article/3625205/amazon-refuses-microsoft-365-deployment-because-of-lax-cybersecurity.html">https://www.csoonline.com/article/3625205/amazon-refuses-microsoft-365-deployment-because-of-lax-cybersecurity.html</a></li>
</ul>
</li>
<li style="font-weight:400;">Episode 303 – Someday You Will Find Me, Caught beneath the AI Landslide, in a Champagne Premier Nova in the Sky, from May 18th.  
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/amazon-nova-premier-our-most-capable-model-for-complex-tasks-and-teacher-for-model-distillation/">https://aws.amazon.com/blogs/aws/amazon-nova-premier-our-most-capable-model-for-complex-tasks-and-teacher-for-model-distillation/</a></li>
</ul>
</li>
<li style="font-weight:400;">Episode 288: Recorded for the week of Jan 14th, 2025: You might be able to retrain Notebook LM hosts to be less annoyed, but not your cloud pod hosts
<ul>
<li style="font-weight:400;"><a href="https://www.theverge.com/2025/1/6/24337530/nvidia-ces-digits-super-computer-ai">https://www.theverge.com/2025/1/6/24337530/nvidia-ces-digits-super-computer-ai</a> </li>
</ul>
</li>
<li style="font-weight:400;">Episode 322: Recorded for September 16th, 2025: Did OpenAI and Microsoft break up?… It’s complicated
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/news/claude-4">https://www.anthropic.com/news/claude-4</a> </li>
</ul>
</li>
</ul>
</li>
<li style="font-weight:400;">Matt: 
<ul>
<li style="font-weight:400;">Chime is dead: Update on Support for Amazon Chime 
<ul>
<li style="font-weight:400;">episode 294: “Ding: Chime is Dead”** (recorded for the week of February 25th, 2025).</li>
</ul>
</li>
<li style="font-weight:400;">GitHub Will Prioritize Migrating to Azure Over Feature Development – The New Stack
<ul>
<li style="font-weight:400;">Episode 317** (“I Got 99 Problems, But a Hallucination Ain’t One”).</li>
<li style="font-weight:400;"><a href="https://thenewstack.io/github-will-prioritize-migrating-to-azure-over-feature-development/">https://thenewstack.io/github-will-prioritize-migrating-to-azure-over-feature-development/</a></li>
</ul>
</li>
<li style="font-weight:400;">Claude on Azure
<ul>
<li style="font-weight:400;">**Episode 331** is where Claude’s big Azure announcement happened! </li>
<li style="font-weight:400;">The episode title says it all: “Claude Gets a $30 Billion Azure Wardrobe and Two New Best Friends” (published November 18, 2025).</li>
</ul>
</li>
</ul>
</li>
<li style="font-weight:400;">Ryan:
<ul>
<li style="font-weight:400;">A2A protocol</li>
</ul>
</li>
</ul>
</li>
</ul>
<ul>
<li>Jonathan:</li>
</ul>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;">DeepSeek is stirring things up</li>
<li style="font-weight:400;">AWS Frontier Agents</li>
</ul>
</li>
</ul>
<p>47:35 2026 Predictions</p>
<ul>
<li>Matt
<ul>
<li>A Major GCP Outage will occur</li>
<li>A step forward in quantum computing (A quantum leap into 2026)</li>
<li>A new MicroHyperscaler will go into the market at the same level as Digital Ocean</li>
</ul>
</li>
<li>Justin
<ul>
<li>AI Layoff Regret</li>
<li>AI Agent Security Breach (Agent that breaches an organization and exfiltrates data)</li>
<li>AI-designed web instead of Eyeballs/Humans</li>
</ul>
</li>
<li>Ryan
<ul>
<li>Multi-Agent Orchestration will blow up in a big way. Major providers of more A2A integrations of workflows between services/clouds</li>
<li>Infrastructure as Code will turn into Infrastructure as Intent. </li>
<li>Full Stack Media Creation company with AI? With CMS and Providence tracking and watermarking. Tooling/etc.</li>
</ul>
</li>
<li>Jonathan
<ul>
<li>Highly Visible company bankruptcy due to rising AI/GPU/Inference Costs.</li>
<li>Explosion of Competition against existing SaaS companies</li>
<li>An entirely AI-generated Podcast episode from the cloud pod</li>
</ul>
</li>
</ul>
<p>56:11  Ryan – “Trying to think through emerging threats on technology that I barely understand – because it’s coming out so fast – it’s changing the way we work. You’re already starting to see AI in attacks where groups of people are using AI to put together pretty sophisticated attacks on companies. It’s a lot easier for natural language speakers to generate content for spearfishing; it’s a lot easier for malicious actors to have an AI agent to do a bunch of research on a company real quick, and this is where I think it will be weak.” </p>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2320455/c1e-60w0com436uj0wrx-rk2d6gxvf241-kfvk5f.mp3" length="131428641"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 335 of The Cloud Pod, where the forecast is always cloudy! Welcome to the first show of 2026, and it’s a full house, too! Justin, Jonathan, Ryan,  and Matt are all here to reflect on 2025, plus bring you their predictions for 2026.
Let’s get started! 
Titles we almost went with this week

 SQL Me Maybe: AlloyDB Gets Chatty With Your Database **OpenAI
SELECT * FROM natural_language WHERE accuracy LIKE ‘100%’ **Anthropic
 etcd You Were Worried About Database Limits: CloudWatch Has Your Back
 CSV You Later: Looker Adds Drag-and-Drop Data Uploads
 AWS Spots an Opportunity to Manage Your Container Costs
 EKS Network Policies: No More IP Address Whack-a-Mole
 AWS Security Hub Splits: It’s Not You, It’s CSPM
 Spot On: ECS Finally Manages Your Cheapest Compute
 TOON Squad: DigitalOcean’s New Format Makes JSON Look Bloated
 The Price is Wrong: AWS Breaks Two Decades of Downward Pricing Tradition
 Show Your Work: Why AI-Generated Code Without Tests is Just Expensive Spam
 No More Agent Orange: Google Simplifies VM Extension Deployment
 AWS Discovers Prices Can Go Both Ways, Raises GPU Costs 15 Percent
 Sovereignty Washing: When Your European Cloud Still Answers to Uncle Sam
 Agent Builder Gets a Memory Upgrade: Google’s AI Finally Remembers Where It Put Its Keys
 Ctrl+F for the Future: A year-end Scorecard & Next-Gen Bets
 AI Agents, GPU Prices, and The best of the Cloud Pod 2025
 Beyond the Hype: The Cloud Pods Definitive 2025 Year in Review
 Apocalypse Now… What? Our 2026 Forecast

 
Follow Up 
01:27 RYAN’S PREDICTIONS



Prediction
Status
Notes


Quick LLM models for individuals
 ACCURATE
Meta-Llama-3.1-8B-Instruct, GLM-4-9B-0414, and Qwen2.5-VL-7B-Instruct—each chosen for an outstanding balance of performance and computational efficiency, making them ideal for edge AI deployment. A new AI inference application called Inferencer allows even modest Apple Mac computers to run the largest open-source LLMs.


AI at the edge natively (Lambda-esque)
 ACCURATE
Akamai launched a new Inference Cloud product for edge AI using Nvidia’s Blackwell 6000 GPUs in 17 cities. AWS IoT Greengrass with Lambda functions for edge logic. “Edge AI allows for instant decision-making where it matters most—close to the data source.”


Cloud native security mesh multi-cloud
 UNCLEAR
Service mesh technologies continue to evolve (Istio, Linkerd), but I didn’t find a breakthrough “app-to-app at the edge” security mesh product announcement in 2025. This one needs more specific evidence.



Ryan Score: 2/3 
02:25 MATTHEW’S PREDICTIONS



Prediction
Status
Notes


FOCUS adopted by Snowflake or Databricks
 ACCURATE
FOCUS version 1.2 was ratified on May 29, 2025. Three new providers announced support: Alibaba Cloud, Databricks, and Grafana. Databricks officially adopted FOCUS!


AI security/ethical standard (SOC or ISO)
 ACCURATE
ISO 42001 is the first international standard outlining requirements for AI governance. Major companies achieving certification in 2025: Automation Anywhere is among the first 100 companies worldwide to earn ISO/IEC 42001:2023 certification. Anthropic also achieved ISO 42001 certification.


Amazon deprecates 5+ services (WorkMail bonus)
 ACCURATE (no bonus)
19 services are mothballed, four are being sunset, and one is end of its supported life. Deprecated services include CodeCommit, Cloud9, S3 Select, CloudSearch, SimpleDB, Forecast, Data Pipeline, QLDB, Snowball Edge, and more. WorkMail NOT deprecated – WorkDocs was (April 2025), but WorkMail remains active.



Matthew Score: 3/3 
03:22 JONATHAN’S PREDICTIONS



Prediction
Status
Notes


Company claims AGI achieved
 ACCURATE
Integral AI, founded by ex-Google veteran Jad Tarifi, claims to have built a world-first AGI mo...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2320455/c1a-k5d5-6z9wj857fo18-37u0qj.jpg"></itunes:image>
                                                                            <itunes:duration>01:08:15</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2320455/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[335: EKS Network Policies:  Now With More Layers Than Your Security Team's Org Chart]]>
                </title>
                <pubDate>Wed, 24 Dec 2025 16:43:16 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2304352</guid>
                                    <link>https://tcpfm.castos.com/episodes/335-eks-network-policies-now-with-more-layers-than-your-security-teams-org-chart</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 335 of The Cloud Pod, where the forecast is always cloudy! This pre-Christmas week, Ryan and Justin have hit the studio to bring you the final show of 2025. We’ve got lots of AI images, EKS Network Policies, Gemini 3, and even some Disney drama. </p>
<p>Let’s get into it! </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> From Roomba to Tomb-ba: How the Robot Vacuum Pioneer Got Cleaned Out **OpenAI</li>
<li> From Napkin Sketch to Production: Google’s App Design Center Goes GA</li>
<li> Terraform Gets a Canvas: Google Paints Infrastructure Design with AI</li>
<li> Mickey Mouse Takes Off the Gloves: Disney vs Google AI Showdown</li>
<li> From Data Silos to Data Solos: Google Conducts the Integration Orchestra</li>
<li> No More Thread Dread: AWS Brings AI to JVM Performance Troubleshooting</li>
<li> MCP: More Corporate Plumbing Than You Think</li>
<li> GPT-5.2 Beats Humans at Work Tasks, Still Can’t Get You Out of Monday Meetings</li>
<li> Kerberos More Like Kerbero-Less: Microsoft Axes Ancient Encryption Standard</li>
<li> OpenAI Teaches GPT-5.2 to PowerPoint: Death by Bullet Points Now AI-Generated</li>
<li> MCP: Like USB-C, But Everyone’s Keeping Theirs in the Drawer</li>
<li> Flash Gordon: Google’s Gemini 3 Gets a Speed Boost Without the Sacrifice</li>
<li> Tag, You’re It: AWS Finally Knows Who to Bill</li>
<li> Snowflake Gets a GPT-5.2 Upgrade: Now With More Intelligence Per Query</li>
<li> OpenAI and Snowflake: Making Data Warehouses Smarter Than Your Average Analyst</li>
<li>GPT-5.2 Moves Into the Snowflake: No Melting Required</li>
</ul>
<h2>AI Is Going Great, or How ML Makes Money </h2>
<p>01:06 <a href="https://www.cnbc.com/2025/12/09/meta-avocado-ai-strategy-issues.html?utm_source=tldrnewsletter">Meta’s multibillion-dollar AI strategy overhaul creates culture clash</a>:</p>
<ul>
<li style="font-weight:400;"><a href="https://www.meta.com/about/">Meta</a> is developing Avocado, a new frontier AI model codenamed to succeed Llama, now expected to launch in Q1 2026 after internal delays related to training performance testing. </li>
<li style="font-weight:400;">The model may be proprietary rather than open source, marking a significant shift from Meta’s previous strategy of freely distributing Llama’s weights and architecture to developers. We feel like this is an interesting choice for Meta, but what do we know? </li>
<li style="font-weight:400;">Meta spent 14.3 billion dollars in June 2025 to hire <a href="https://scale.com/">Scale AI</a> founder Alexandr Wang as Chief AI Officer and acquire a stake in Scale, while raising 2026 capital expenditure guidance to 70-72 billion dollars. 
<ul>
<li style="font-weight:400;">Wang now leads the elite <a href="https://www.msn.com/en-us/money/companies/meet-tbd-lab-meta-s-superintelligence-swat-team/ar-AA1K7PEg">TBD Lab</a> developing Avocado, operating separately from traditional Meta teams and not using the company’s internal workplace network.</li>
</ul>
</li>
<li style="font-weight:400;">The company has restructured its AI leadership following the poor reception of Llama 4 in April, with Chief Product Officer Chris Cox no longer overseeing the GenAI unit. </li>
<li style="font-weight:400;">Meta cut 600 jobs in <a href="https://builtin.com/artificial-intelligence/meta-superintelligence-labs">Meta Superintelligence Labs</a> in October, contributing to the departure of Chief AI Scientist Yann LeCun to launch a startup, while implementing 70-hour workweeks across AI organizations.</li>
<li style="font-weight:400;">Meta’s new AI leadership under Wang and former GitHub CEO Nat Friedman has introduced a “demo, don’t memo” development approach, replacing traditional multi-step approval processes with rapid prototyping using AI agents and newer tools. </li>
<li style="font-weight:400;">The company is also leveraging third-party cloud services from <a href="https://www.coreweave.com/">CoreWeave</a> and <a href="https://www.oracle.com/">Oracle</a> while buil...</li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:00) - A Year in Cloud</li><li>(00:01:21) - Meta Developing New Frontier AI Model</li><li>(00:03:15) - Disney Sues Google AI for Copyright Infringement</li><li>(00:04:59) - OpenAI to License Disney's 'Sora' Characters</li><li>(00:07:13) - OpenAI GPT Image 1.5 and 1.6</li><li>(00:08:41) - ChatGPT 5.2 Release</li><li>(00:10:45) - Cedar Open-Sourcing and CNCF</li><li>(00:12:38) - AWS GuardDuty Extended Threat Detection</li><li>(00:15:51) - Amazon EKS: Admin Network Policies and Application Network Policies for Ku</li><li>(00:18:43) - Amazon Web Services: Thread dump analysis solution</li><li>(00:22:16) - Amazon EC2: Automatic Cost Allocation based on User Attributes</li><li>(00:25:47) - GCP's Gemini 3 Flash for Enterprises</li><li>(00:27:19) - Google's MCP Server Integration into Anti Gravity</li><li>(00:30:59) - Google's Application Design Center (GAA) Now General Availability</li><li>(00:33:12) - Microsoft to deprecate RC4 Authentication by default</li><li>(00:35:50) - Azure Storage: 50 Terabit Bucket Support</li><li>(00:38:23) - Microsoft Expands Azure's Network for AI and Disaster Recovery</li><li>(00:42:14) - This Week in Cloud: Looking Back & Looking Forward</li><li>(00:43:29) - IRobot's bankruptcy throws a cloud spotlight</li><li>(00:49:27) - RIP iRobot: Ben Kehoe</li><li>(00:50:35) - Christmas wishes for everyone</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 335 of The Cloud Pod, where the forecast is always cloudy! This pre-Christmas week, Ryan and Justin have hit the studio to bring you the final show of 2025. We’ve got lots of AI images, EKS Network Policies, Gemini 3, and even some Disney drama. 
Let’s get into it! 
Titles we almost went with this week

 From Roomba to Tomb-ba: How the Robot Vacuum Pioneer Got Cleaned Out **OpenAI
 From Napkin Sketch to Production: Google’s App Design Center Goes GA
 Terraform Gets a Canvas: Google Paints Infrastructure Design with AI
 Mickey Mouse Takes Off the Gloves: Disney vs Google AI Showdown
 From Data Silos to Data Solos: Google Conducts the Integration Orchestra
 No More Thread Dread: AWS Brings AI to JVM Performance Troubleshooting
 MCP: More Corporate Plumbing Than You Think
 GPT-5.2 Beats Humans at Work Tasks, Still Can’t Get You Out of Monday Meetings
 Kerberos More Like Kerbero-Less: Microsoft Axes Ancient Encryption Standard
 OpenAI Teaches GPT-5.2 to PowerPoint: Death by Bullet Points Now AI-Generated
 MCP: Like USB-C, But Everyone’s Keeping Theirs in the Drawer
 Flash Gordon: Google’s Gemini 3 Gets a Speed Boost Without the Sacrifice
 Tag, You’re It: AWS Finally Knows Who to Bill
 Snowflake Gets a GPT-5.2 Upgrade: Now With More Intelligence Per Query
 OpenAI and Snowflake: Making Data Warehouses Smarter Than Your Average Analyst
GPT-5.2 Moves Into the Snowflake: No Melting Required

AI Is Going Great, or How ML Makes Money 
01:06 Meta’s multibillion-dollar AI strategy overhaul creates culture clash:

Meta is developing Avocado, a new frontier AI model codenamed to succeed Llama, now expected to launch in Q1 2026 after internal delays related to training performance testing. 
The model may be proprietary rather than open source, marking a significant shift from Meta’s previous strategy of freely distributing Llama’s weights and architecture to developers. We feel like this is an interesting choice for Meta, but what do we know? 
Meta spent 14.3 billion dollars in June 2025 to hire Scale AI founder Alexandr Wang as Chief AI Officer and acquire a stake in Scale, while raising 2026 capital expenditure guidance to 70-72 billion dollars. 

Wang now leads the elite TBD Lab developing Avocado, operating separately from traditional Meta teams and not using the company’s internal workplace network.


The company has restructured its AI leadership following the poor reception of Llama 4 in April, with Chief Product Officer Chris Cox no longer overseeing the GenAI unit. 
Meta cut 600 jobs in Meta Superintelligence Labs in October, contributing to the departure of Chief AI Scientist Yann LeCun to launch a startup, while implementing 70-hour workweeks across AI organizations.
Meta’s new AI leadership under Wang and former GitHub CEO Nat Friedman has introduced a “demo, don’t memo” development approach, replacing traditional multi-step approval processes with rapid prototyping using AI agents and newer tools. 
The company is also leveraging third-party cloud services from CoreWeave and Oracle while buil...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[335: EKS Network Policies:  Now With More Layers Than Your Security Team's Org Chart]]>
                </itunes:title>
                                    <itunes:episode>335</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 335 of The Cloud Pod, where the forecast is always cloudy! This pre-Christmas week, Ryan and Justin have hit the studio to bring you the final show of 2025. We’ve got lots of AI images, EKS Network Policies, Gemini 3, and even some Disney drama. </p>
<p>Let’s get into it! </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> From Roomba to Tomb-ba: How the Robot Vacuum Pioneer Got Cleaned Out **OpenAI</li>
<li> From Napkin Sketch to Production: Google’s App Design Center Goes GA</li>
<li> Terraform Gets a Canvas: Google Paints Infrastructure Design with AI</li>
<li> Mickey Mouse Takes Off the Gloves: Disney vs Google AI Showdown</li>
<li> From Data Silos to Data Solos: Google Conducts the Integration Orchestra</li>
<li> No More Thread Dread: AWS Brings AI to JVM Performance Troubleshooting</li>
<li> MCP: More Corporate Plumbing Than You Think</li>
<li> GPT-5.2 Beats Humans at Work Tasks, Still Can’t Get You Out of Monday Meetings</li>
<li> Kerberos More Like Kerbero-Less: Microsoft Axes Ancient Encryption Standard</li>
<li> OpenAI Teaches GPT-5.2 to PowerPoint: Death by Bullet Points Now AI-Generated</li>
<li> MCP: Like USB-C, But Everyone’s Keeping Theirs in the Drawer</li>
<li> Flash Gordon: Google’s Gemini 3 Gets a Speed Boost Without the Sacrifice</li>
<li> Tag, You’re It: AWS Finally Knows Who to Bill</li>
<li> Snowflake Gets a GPT-5.2 Upgrade: Now With More Intelligence Per Query</li>
<li> OpenAI and Snowflake: Making Data Warehouses Smarter Than Your Average Analyst</li>
<li>GPT-5.2 Moves Into the Snowflake: No Melting Required</li>
</ul>
<h2>AI Is Going Great, or How ML Makes Money </h2>
<p>01:06 <a href="https://www.cnbc.com/2025/12/09/meta-avocado-ai-strategy-issues.html?utm_source=tldrnewsletter">Meta’s multibillion-dollar AI strategy overhaul creates culture clash</a>:</p>
<ul>
<li style="font-weight:400;"><a href="https://www.meta.com/about/">Meta</a> is developing Avocado, a new frontier AI model codenamed to succeed Llama, now expected to launch in Q1 2026 after internal delays related to training performance testing. </li>
<li style="font-weight:400;">The model may be proprietary rather than open source, marking a significant shift from Meta’s previous strategy of freely distributing Llama’s weights and architecture to developers. We feel like this is an interesting choice for Meta, but what do we know? </li>
<li style="font-weight:400;">Meta spent 14.3 billion dollars in June 2025 to hire <a href="https://scale.com/">Scale AI</a> founder Alexandr Wang as Chief AI Officer and acquire a stake in Scale, while raising 2026 capital expenditure guidance to 70-72 billion dollars. 
<ul>
<li style="font-weight:400;">Wang now leads the elite <a href="https://www.msn.com/en-us/money/companies/meet-tbd-lab-meta-s-superintelligence-swat-team/ar-AA1K7PEg">TBD Lab</a> developing Avocado, operating separately from traditional Meta teams and not using the company’s internal workplace network.</li>
</ul>
</li>
<li style="font-weight:400;">The company has restructured its AI leadership following the poor reception of Llama 4 in April, with Chief Product Officer Chris Cox no longer overseeing the GenAI unit. </li>
<li style="font-weight:400;">Meta cut 600 jobs in <a href="https://builtin.com/artificial-intelligence/meta-superintelligence-labs">Meta Superintelligence Labs</a> in October, contributing to the departure of Chief AI Scientist Yann LeCun to launch a startup, while implementing 70-hour workweeks across AI organizations.</li>
<li style="font-weight:400;">Meta’s new AI leadership under Wang and former GitHub CEO Nat Friedman has introduced a “demo, don’t memo” development approach, replacing traditional multi-step approval processes with rapid prototyping using AI agents and newer tools. </li>
<li style="font-weight:400;">The company is also leveraging third-party cloud services from <a href="https://www.coreweave.com/">CoreWeave</a> and <a href="https://www.oracle.com/">Oracle</a> while building the 27 billion dollar Hyperion data center in Louisiana.</li>
<li style="font-weight:400;">Meta’s <a href="https://metavibes.studio/">Vibes AI</a> video product, launched in September, trails OpenAI’s Sora 2 in downloads, and was criticized for lacking features like realistic lip-synced audio, while the company increasingly relies on external AI models from <a href="https://bfl.ai/">Black Forest Labs</a> and <a href="https://www.midjourney.com/home">Midjourney</a> rather than exclusively using internal technology.</li>
</ul>
<p>02:23  Ryan – “I guess I really don’t understand the business of the AI models. I guess if you’re going to offer a chat service, you have to have a proprietary model, but it’s kind of strange.”</p>
<p>03:04 <a href="https://arstechnica.com/google/2025/12/disney-says-google-ai-infringes-copyright-on-a-massive-scale/">Disney says Google AI infringes copyright “on a massive scale” – Ars </a><a href="https://arstechnica.com/google/2025/12/disney-says-google-ai-infringes-copyright-on-a-massive-scale/">Technica</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.disney.com/">Disney</a> has issued a cease and desist letter to Google alleging copyright infringement through its generative AI models, claiming <a href="https://www.google.com/?zx=1766519431570&amp;no_sw_cr=1">Google</a> trained its systems on Disney’s copyrighted content without authorization and now enables users to generate Disney-owned characters like those from The Lion King, Deadpool, and Star Wars. </li>
<li style="font-weight:400;">This represents one of the first major legal challenges from a content owner with substantial legal resources against a cloud AI provider.</li>
<li style="font-weight:400;">The legal notice targets two specific violations: Google’s use of Disney’s copyrighted works in training data for its image and video generation models, and the distribution of Disney character reproductions to end users through AI-generated outputs. 
<ul>
<li style="font-weight:400;">Disney demands the immediate cessation of using its content and the implementation of safeguards to prevent the future generation of Disney-owned intellectual property.</li>
</ul>
</li>
<li style="font-weight:400;">This case could establish important precedents for how cloud providers handle copyrighted training data and implement content filtering in AI services. </li>
<li style="font-weight:400;">The outcome may force cloud AI platforms to develop more sophisticated copyright detection systems or negotiate licensing agreements with content owners before deploying generative models.</li>
<li style="font-weight:400;">Disney’s involvement brings considerable legal firepower to the AI copyright debate, as the company has historically shaped US copyright law through decades of litigation to protect its intellectual property. </li>
<li style="font-weight:400;">Cloud providers offering generative AI services may need to reassess their training data sources and output filtering mechanisms to avoid similar legal challenges from other major content owners.</li>
</ul>
<p>04:06  Ryan – “Disney – suing for copyright infringement – shocking.” </p>
<p>04:54 <a href="https://arstechnica.com/ai/2025/12/disney-invests-1-billion-in-openai-licenses-200-characters-for-ai-video-app-sora/">Disney invests $1 billion in OpenAI, licenses 200 characters for AI video </a><a href="https://arstechnica.com/ai/2025/12/disney-invests-1-billion-in-openai-licenses-200-characters-for-ai-video-app-sora/">app Sora – Ars Technica</a></p>
<ul>
<li style="font-weight:400;">Disney invests $1 billion in <a href="https://openai.com/">OpenAI</a> and licenses over 200 characters from Disney, Marvel, Pixar, and Star Wars franchises for use in <a href="https://openai.com/index/sora/">Sora</a> video generator. </li>
<li style="font-weight:400;">This marks the first major Hollywood studio content licensing deal for OpenAI’s AI video platform, which launched in late September and faced industry criticism over copyright concerns.</li>
<li style="font-weight:400;">The three-year licensing agreement allows Sora users to create short video clips featuring licensed Disney characters, representing a shift from OpenAI’s previous approach of training models on copyrighted material without permission. </li>
<li style="font-weight:400;">This deal is notable given Disney’s history of aggressive copyright protection and lobbying that shaped modern US copyright law in the 1990s.</li>
<li style="font-weight:400;">OpenAI has been pursuing content licensing deals with major IP holders after facing multiple lawsuits over unauthorized use of copyrighted training data. </li>
<li style="font-weight:400;">The company previously argued that useful AI models cannot be created without copyrighted material, but has shifted strategy since becoming well-funded through investments.</li>
<li style="font-weight:400;">The partnership aims to extend Disney’s storytelling reach through generative AI while addressing creator concerns about unauthorized use of intellectual property. </li>
<li style="font-weight:400;">Disney CEO Robert Iger emphasized the company’s commitment to respecting and protecting creators’ works while leveraging AI technology for content creation.</li>
<li style="font-weight:400;">This deal could establish a precedent for how AI companies and content owners structure licensing agreements, potentially influencing how other studios and IP holders approach AI-generated content partnerships. </li>
<li style="font-weight:400;">The financial terms suggest significant value in controlled character licensing for AI applications.</li>
</ul>
<p>06:26  Ryan – “Is it just a way to get out of the lawsuit so they can generate the content?” </p>
<p>07:12  <a href="https://openai.com/index/new-chatgpt-images-is-here/">The new ChatGPT Images is here | OpenAI</a></p>
<ul>
<li style="font-weight:400;">OpenAI released <a href="http://chatgpt.com/images?openaicom-did=cd2bc678-e6bc-4ba0-9ff6-73d16056dd65&amp;openaicom_referred=true">GPT Image 1.5</a>, their new flagship image generation model, now available in ChatGPT for all users and via API. </li>
<li style="font-weight:400;">The model <a href="http://chatgpt.com/images?openaicom-did=cd2bc678-e6bc-4ba0-9ff6-73d16056dd65&amp;openaicom_referred=true">generates images</a> up to 4x faster than the previous version and includes a dedicated Images feature in the <a href="https://chatgpt.com/">ChatGPT</a> sidebar with preset filters and prompts for quick exploration.</li>
<li style="font-weight:400;">The model delivers improved image editing capabilities with better preservation of original elements like lighting, composition, and people’s appearance across edits. </li>
<li style="font-weight:400;">It handles precise modifications, including adding, subtracting, combining, and blending elements while maintaining consistency, making it suitable for practical photo edits and creative transformations.</li>
<li style="font-weight:400;">GPT Image 1.5 shows improvements in text rendering with support for denser and smaller text, better handling of multiple small faces, and more natural-looking outputs. </li>
<li style="font-weight:400;">The model follows instructions more reliably than the initial version, enabling more intricate compositions where relationships between elements are preserved as intended.</li>
<li style="font-weight:400;">API pricing for GPT Image 1.5 is 20% cheaper than GPT Image 1 for both inputs and outputs, allowing developers to generate and iterate on more images within the same budget. </li>
<li style="font-weight:400;">The model is particularly useful for marketing teams, ecommerce product catalogs, and brand work requiring consistent logo and visual preservation across multiple edits.</li>
<li style="font-weight:400;">The new ChatGPT Images model works across all ChatGPT models without requiring manual selection, while the earlier version remains available as a custom GPT. </li>
<li style="font-weight:400;">Business and Enterprise users will receive access to the new Images experience later, with the API version available now through OpenAI Playground. </li>
</ul>
<p>07:38  Justin – “It’s very competitive against Nano Banana, and I was looking at some of the charts, and it’s already jumped to the top of the charts.” </p>
<p>08:52 <a href="https://openai.com/index/introducing-gpt-5-2/">Introducing GPT-5.2 | OpenAI</a></p>
<ul>
<li style="font-weight:400;">OpenAI has released <a href="https://openai.com/index/introducing-gpt-5-2/">GPT-5.2</a>, now generally available in ChatGPT for paid users and via API as gpt-5.2, with three variants: Instant for everyday tasks, Thinking for complex work, and Pro for the highest-quality outputs. </li>
<li style="font-weight:400;">The model introduces native spreadsheet and presentation generation capabilities, with ChatGPT Enterprise users reporting 40-60 minutes saved daily on average.</li>
<li style="font-weight:400;">GPT-5.2 Thinking achieves a 70.9% win rate against human experts on GDPval benchmark spanning 44 occupations and sets new records on <a href="https://scale.com/leaderboard/swe_bench_pro_public">SWE-Bench Pro</a> at 55.6% (80% on SWE-bench Verified). </li>
<li style="font-weight:400;">The model demonstrates 11x faster output generation and less than 1% the cost of expert professionals on knowledge work tasks, though human oversight remains necessary.</li>
<li style="font-weight:400;">Long-context performance reaches near 100% accuracy on the 4-needle MRCR variant up to 256k tokens, with a new Responses compact endpoint extending the effective context window for tool-heavy workflows. Vision capabilities show roughly 50% error reduction on chart reasoning and interface understanding compared to GPT-5.1.</li>
<li style="font-weight:400;">API pricing is set at $1.75 per million input tokens and $14 per million output tokens, with a 90% discount on cached inputs. </li>
<li style="font-weight:400;">OpenAI reports that despite higher per-token costs, GPT-5.2 achieves a lower total cost for given quality levels due to improved token efficiency. </li>
<li style="font-weight:400;">The company has no current plans to deprecate GPT-5.1, GPT-5, or GPT-4.1.</li>
<li style="font-weight:400;">The model introduces improved safety features, including strengthened responses for mental health and self-harm scenarios, plus a gradual rollout of age prediction for content protections. </li>
<li style="font-weight:400;">GPT-5.2 was built on <a href="https://www.nvidia.com/en-us/data-center/h100/">NVIDIA H100</a>, <a href="https://www.nvidia.com/en-us/data-center/h200/">H200</a>, and <a href="https://www.nvidia.com/en-us/data-center/gb200-nvl72/">GB200-NVL72</a> GPUs in <a href="https://portal.azure.com">Microsoft Azure</a> data centers, with a Codex-optimized version planned for the coming weeks.</li>
</ul>
<p>10:06  Ryan – “I’m happy to see the improved safety features because that’s come up in the news recently and had some high-profile events happen, where it’s become a concern, for sure. So I want to see more protection in that space from all the providers.” </p>
<h2>Cloud Tools</h2>
<p>10:58 <a href="https://aws.amazon.com/blogs/opensource/cedar-joins-cncf-as-a-sandbox-project/">Cedar Joins CNCF as a Sandbox Project | AWS Open Source Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/about-aws/whats-new/2023/05/cedar-open-source-language-access-control/">Cedar</a> is an <a href="https://aws.amazon.com/blogs/opensource/category/open-source/">open source</a> authorization policy language that just joined <a href="https://www.cncf.io/">CNCF</a> as a Sandbox project, solving the problem of hard-coded access control by letting developers define fine-grained permissions as policies separate from application code. 
<ul>
<li style="font-weight:400;">It supports RBAC, ABAC, and ReBAC models with fast real-time evaluation.</li>
</ul>
</li>
<li style="font-weight:400;">The language stands out for its formal verification using the Lean theorem prover and differential random testing against its specification, providing mathematical guarantees for security-critical authorization logic. This rigor addresses the growing complexity of cloud-native authorization, where traditional ad-hoc systems fall short.</li>
<li style="font-weight:400;">Production adoption is already strong with users including <a href="https://www.cloudflare.com/">Cloudflare</a>, <a href="https://www.mongodb.com/">MongoDB</a>, <a href="https://aws.amazon.com/bedrock/">AWS Bedrock</a>, and <a href="https://kubernetes.io/">Kubernetes</a> integrations like <a href="https://github.com/upbound/kubernetes-cedar-authorizer">kubernetes-cedar-authorizer</a>. </li>
<li style="font-weight:400;">The CNCF move provides vendor-neutral governance and broader community access beyond AWS stewardship.</li>
<li style="font-weight:400;">Cedar offers an interactive policy playground and <a href="https://aws.amazon.com/sdk-for-rust/">Rust SDK</a> for developers to test authorization logic before deployment. </li>
<li style="font-weight:400;">The analyzability features enable automated policy optimization and verification, reducing the risk of misconfigured permissions in production.</li>
<li style="font-weight:400;">The CNCF acceptance fills a gap in the cloud-native landscape for a foundation-backed authorization standard, complementing existing projects and potentially becoming the go-to solution as it progresses from Sandbox to Incubation status.</li>
</ul>
<p>12:05 Ryan – “I think this kind of policy is going to be absolutely key to managing permissions going forward.” </p>
<h2>AWS</h2>
<p>12:50 <a href="https://aws.amazon.com/blogs/security/cryptomining-campaign-targeting-amazon-ec2-and-amazon-ecs/">GuardDuty Extended Threat Detection uncovers a cryptomining campaign on </a><a href="https://aws.amazon.com/blogs/security/cryptomining-campaign-targeting-amazon-ec2-and-amazon-ecs/">Amazon EC2 and Amazon ECS | AWS Security Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/guardduty/">GuardDuty</a> <a href="https://docs.aws.amazon.com/guardduty/latest/ug/guardduty-extended-threat-detection.html">Extended Threat Detection</a> identified a coordinated cryptomining campaign starting November 2, 2025, where attackers used compromised <a href="https://aws.amazon.com/iam/">IAM</a> credentials to deploy miners across <a href="https://aws.amazon.com/ec2/">EC2</a> and <a href="https://aws.amazon.com/ecs/">ECS</a> within 10 minutes of initial access. </li>
<li style="font-weight:400;">The new AttackSequence: EC2/CompromisedInstanceGroup finding correlated signals across multiple data sources to detect the sophisticated attack pattern, demonstrating how Extended Threat Detection capabilities launched at re:Invent 2025 can identify coordinated campaigns.</li>
<li style="font-weight:400;">The attackers employed a novel persistence technique using ModifyInstanceAttribute to disable API termination on all launched instances, forcing victims to manually re-enable termination before cleanup and disrupting automated remediation workflows. </li>
<li style="font-weight:400;">They also created public <a href="https://aws.amazon.com/pm/lambda/?trk=a701c515-a0b8-4910-ab98-44b7b0a4ec4b&amp;sc_channel=ps&amp;s_kwcid=AL!4422!10!71674651309632!!!!71675180089924!!482503115!1146791634290665&amp;ef_id=3cf36dc0a3001f76b9dc048a5b112548:G:s">Lambda</a> endpoints without authentication and established backdoor IAM users with SES permissions, showing advancement in cryptomining persistence methodologies beyond typical mining operations.</li>
<li style="font-weight:400;">The campaign targeted high-value GPU and ML instances (g4dn, g5, p3, p4d) through auto scaling groups configured to scale from 20 to 999 instances, with attackers first using DryRun flags to validate permissions without triggering costs. The malicious Docker Hub image yenik65958/secret accumulated over 100,000 pulls before takedown, and attackers created up to 50 ECS clusters per account with <a href="https://aws.amazon.com/fargate/">Fargate</a> tasks configured for maximum CPU allocation of 16,384 units.</li>
<li style="font-weight:400;">AWS recommends enabling <a href="https://docs.aws.amazon.com/guardduty/latest/ug/runtime-monitoring.html">GuardDuty Runtime Monitoring</a> alongside the foundational protection plan for comprehensive coverage, as Runtime Monitoring provides host-level signals critical for Extended Threat Detection correlation and detects crypto mining execution through <a href="https://docs.aws.amazon.com/guardduty/latest/ug/findings-runtime-monitoring.html#impact-runtime-cryptominerexecuted">Impact:Runtime/CryptoMinerExecuted</a> findings. </li>
<li style="font-weight:400;">Organizations should implement <a href="https://docs.aws.amazon.com/organizations/latest/userguide/orgs_manage_policies_scps.html">SCPs</a> to deny Lambda URL creation with an AuthType of NONE and monitor <a href="https://aws.amazon.com/cloudtrail/">CloudTrail</a> for unusual DryRun API patterns as early warning indicators.</li>
<li style="font-weight:400;">The attack demonstrates the importance of temporary credentials over long-term access keys, MFA enforcement, and least privilege IAM policies, as the compromise exploited valid credentials rather than AWS service vulnerabilities. GuardDuty’s multilayered detection using threat intelligence, anomaly detection, and Extended Threat Detection successfully identified all attack stages from initial access through persistence.</li>
</ul>
<p>55:31 Justin – “Hackers have the same tools we do for development.” </p>
<p>16:17 <a href="https://aws.amazon.com/blogs/containers/amazon-eks-introduces-enhanced-network-policy-capabilities/">Amazon EKS introduces enhanced network policy capabilities | Containers</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/eks/">Amazon EKS</a> now supports Admin Network Policies and Application Network Policies, giving cluster administrators centralized control over network security across all namespaces while allowing namespace administrators to filter outbound traffic using domain names instead of maintaining IP address lists. </li>
<li style="font-weight:400;">This addresses a key limitation of standard Kubernetes Network Policies, which only work within individual namespaces and lack explicit deny rules or policy hierarchies.</li>
<li style="font-weight:400;">The new Admin Network Policies operate in two tiers: Admin Tier rules that cannot be overridden by developers, and Baseline Tier rules that provide default connectivity but can be overridden by standard Network Policies. </li>
<li style="font-weight:400;">This enables platform teams to enforce cluster-wide security requirements like isolating sensitive workloads or ensuring monitoring access while still giving application teams flexibility within those boundaries.</li>
<li style="font-weight:400;">Application Network Policies, exclusive to EKS Auto Mode clusters, add Layer 7 FQDN-based filtering to traditional Layer 3/4 network policies, solving the problem of managing egress to external services with frequently changing IP addresses. Instead of maintaining IP lists for SaaS providers or on-premises resources behind load balancers, teams can simply whitelist domain names like internal-api.company.com, and policies remain valid even when underlying IPs change.</li>
<li style="font-weight:400;">Requirements include Kubernetes 1.29 or later, <a href="https://github.com/aws/amazon-vpc-cni-k8s">Amazon VPC CNI</a> plugin v1.21.0 for standard EKS clusters, and EKS Auto Mode for Application Network Policies with DNS filtering. </li>
<li style="font-weight:400;">The feature is available now for new clusters, with support for existing clusters coming in the following weeks, though pricing remains unchanged, as this is a native capability of the VPC CNI plugin.</li>
</ul>
<p>17:30 Ryan – “This is one of those things that’s showing a maturity level of container-driven applications. It’s been a while since security teams have been aware of some of the things you can do with network policies and routing, and so you want to empower your developers, but also being able to have a comprehensive way to ban and approve has been missing from a lot of these ingress controllers. So this is a great thing for security teams, and probably terrible for developers.” </p>
<p>19:12 <a href="https://aws.amazon.com/blogs/containers/automate-java-performance-troubleshooting-with-ai-powered-thread-dump-analysis-on-amazon-ecs-and-eks/">Automate java performance troubleshooting with AI-Powered thread dump </a><a href="https://aws.amazon.com/blogs/containers/automate-java-performance-troubleshooting-with-ai-powered-thread-dump-analysis-on-amazon-ecs-and-eks/">analysis on Amazon ECS and EKS | Containers</a></p>
<ul>
<li style="font-weight:400;">AWS has released an automated Java thread dump analysis solution that combines <a href="https://prometheus.io/">Prometheus</a> monitoring, <a href="https://grafana.com/">Grafana</a> alerting, <a href="https://aws.amazon.com/lambda/">Lambda</a> orchestration, and <a href="https://aws.amazon.com/bedrock/?trk=eef97220-36a4-4cb3-9953-f388c339e321&amp;sc_channel=ps&amp;s_kwcid=AL!4422!10!71262362722793!!!!71262906255191!!485433981!1140195300946547&amp;ef_id=f7964839b8361bac73992173428796ed:G:s">Amazon Bedrock AI</a> to diagnose JVM performance issues in seconds rather than hours. </li>
<li style="font-weight:400;">The system works across both <a href="https://aws.amazon.com/ecs/">ECS</a> and <a href="https://aws.amazon.com/eks/">EKS</a> environments, automatically detecting high thread counts and generating actionable insights without requiring deep JVM expertise from operations teams.</li>
<li style="font-weight:400;">The solution uses <a href="https://docs.spring.io/spring-boot/reference/actuator/jmx.html">Spring Boot</a> Actuator endpoints for ECS deployments and Kubernetes API commands for EKS to capture thread dumps when Grafana alerts trigger. </li>
<li style="font-weight:400;">Amazon Bedrock then analyzes the dumps to identify deadlocks, performance bottlenecks, and thread states while providing structured recommendations across six key areas, including executive summary and optimization guidance.</li>
<li style="font-weight:400;">Deployment is handled through CloudFormation templates available in the <a href="https://catalog.workshops.aws/java-on-aws/en-US/">Java on AWS Immersion Day Workshop</a>, with all thread dumps and AI analysis reports automatically stored in S3 for historical trending. </li>
<li style="font-weight:400;">The architecture follows event-driven principles with modular components that can be extended to other diagnostic tools like heap dump analysis or automated remediation workflows.</li>
<li style="font-weight:400;">The system enriches JVM metrics with contextual tags, including cluster identification and container metadata, enabling the Lambda function to determine the appropriate thread dump collection method. This metadata-driven approach allows a single solution to handle heterogeneous container environments without manual configuration for each deployment type.</li>
<li style="font-weight:400;">Pricing follows standard AWS service costs for Lambda invocations, Bedrock LLM usage per token, S3 storage, and CloudWatch metrics, with no additional licensing fees for the open source monitoring components. </li>
<li style="font-weight:400;">The solution addresses the common problem where only a handful of engineers on most teams can interpret thread dumps, democratizing JVM troubleshooting across operations teams.</li>
</ul>
<p>20:55 Justin – “This tells me that if you have a bad container that crashes a lot, you could spend a lot of money on LLM usage for tokens analyzing your exact same crash dump every time. Do keep that in mind.” </p>
<p>22:50 <a href="https://aws.amazon.com/about-aws/whats-new/2025/12/ec2-auto-scaling-synchronous-api-launch-instances-auto-scaling-group/">EC2 Auto Scaling now offers a synchronous API to launch instances inside </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/12/ec2-auto-scaling-synchronous-api-launch-instances-auto-scaling-group/">an Auto Scaling group</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/autoscaling/ec2/userguide/what-is-amazon-ec2-auto-scaling.html">EC2 Auto Scaling</a> introduces a new LaunchInstances API that provides synchronous feedback when launching instances, allowing customers to immediately know if capacity is available in their specified Availability Zone or subnet. </li>
<li style="font-weight:400;">This addresses scenarios where customers need precise control over instance placement and real-time confirmation of scaling operations rather than waiting for asynchronous results.</li>
<li style="font-weight:400;">The API enables customers to override default Auto Scaling group configurations by specifying exact Availability Zones and subnets for new instances, while still maintaining the benefits of automated fleet management like health checks and scaling policies. Optional asynchronous retries are included to help reach the desired capacity if initial synchronous attempts fail.</li>
<li style="font-weight:400;">This feature is particularly useful for workloads that require strict placement requirements or need to implement fallback strategies quickly when capacity constraints occur in specific zones. Customers can now build more sophisticated scaling logic that responds immediately to capacity availability rather than discovering issues after the fact.</li>
<li style="font-weight:400;">Available immediately in all AWS Regions and GovCloud at no additional cost beyond standard EC2 and EBS charges. Customers can access the feature through AWS CLI and SDKs, with documentation available at https://docs.aws.amazon.com/autoscaling/ec2/userguide/launch-instances-synchronously.</li>
</ul>
<p>23:47 Ryan – “I find that the things that it’s allowing you to tune – it’s the things that I moved to autoscaling for; I don’t want to deal with any of this nonsense. And so you still have to maintain your own orchestration, which understands which zone that you need to roll out to, because it’s going to have to call that API.” </p>
<p>24:28 <a href="https://aws.amazon.com/about-aws/whats-new/2025/12/cost-allocation-using-users-attributes/">Announcing cost allocation using users’ attributes</a></p>
<ul>
<li style="font-weight:400;">AWS now enables cost allocation based on workforce user attributes like cost center, division, and department imported from <a href="https://docs.aws.amazon.com/singlesignon/latest/userguide/what-is.html">IAM Identity Center</a>. </li>
<li style="font-weight:400;">This allows organizations to automatically tag per-user subscription and on-demand fees for services like <a href="https://aws.amazon.com/q/business/">Amazon Q Business</a>, <a href="https://aws.amazon.com/q/developer/">Q Developer</a>, and QuickSight with organizational metadata for chargeback purposes.</li>
<li style="font-weight:400;">The feature addresses a common <a href="https://www.finops.org/introduction/what-is-finops/">FinOps</a> challenge where companies struggle to attribute SaaS-style AWS application costs back to specific business units. Once user attributes are imported to IAM Identity Center and enabled as cost allocation tags in the Billing Console, usage automatically flows to <a href="https://aws.amazon.com/aws-cost-management/aws-cost-explorer/">Cost Explorer</a> and <a href="https://docs.aws.amazon.com/cur/latest/userguide/table-dictionary-cur2.html">CUR 2.0</a> with the appropriate organizational tags attached.</li>
<li style="font-weight:400;">This capability is particularly relevant for enterprises deploying Amazon Q Business or QuickSight at scale, where individual user subscriptions can quickly add up across departments. Instead of manually tracking which users belong to which cost centers, the system automatically associates costs based on existing identity data.</li>
<li style="font-weight:400;">The feature is generally available in all commercial AWS regions except GovCloud and China regions. </li>
<li style="font-weight:400;">No additional pricing is mentioned beyond the standard costs of the underlying AWS applications being tracked.</li>
</ul>
<p>25:26 Justin – “There’s lots of use cases; this gets interesting real quickly. It’s a really nice feature that I’m really happy about.”  </p>
<h2>GCP</h2>
<p>26:34 <a href="https://blog.google/products/gemini/gemini-3-flash/">Introducing Gemini 3 Flash: Benchmarks, global availability</a></p>
<ul>
<li style="font-weight:400;">Google launches <a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/models/gemini/3-flash">Gemini 3 Flash</a> in general availability, positioning it as a frontier intelligence model optimized for speed at reduced cost. </li>
<li style="font-weight:400;">The model processes over 1 trillion tokens daily through Google’s API and replaces Gemini 2.5 Flash as the default model in the Gemini app globally at no cost to users.</li>
<li style="font-weight:400;">Gemini 3 Flash achieves strong benchmark performance with 90.4% on GPQA Diamond and 81.2% on MMMU Pro while running 3x faster than Gemini 2.5 Pro and using 30% fewer tokens on average for typical tasks. </li>
<li style="font-weight:400;">Pricing is set at $0.50 per million input tokens and $3 per million output tokens, with audio input at $1 per million tokens.</li>
<li style="font-weight:400;">The model demonstrates strong coding capabilities with a 78% score on SWE-bench Verified, outperforming both the 2.5 series and <a href="https://blog.google/products/gemini/gemini-3/#note-from-ceo">Gemini 3 Pro</a>. This makes it suitable for agentic workflows, production systems, and interactive applications requiring both speed and reasoning depth.</li>
<li style="font-weight:400;">Gemini 3 Flash is available through multiple channels, including <a href="https://blog.google/technology/developers/build-with-gemini-3-flash">Google AI Studio</a>, <a href="https://cloud.google.com/vertex-ai">Vertex AI</a>, <a href="https://cloud.google.com/gemini-enterprise">Gemini Enterprise</a>, <a href="https://antigravity.google/blog/gemini-3-flash-in-google-antigravity">Google Antigravity</a> platform, <a href="https://developers.googleblog.com/gemini-3-flash-is-now-available-in-gemini-cli/">Gemini CLI</a>, and <a href="https://developer.android.com/studio">Android Studio</a>. </li>
<li style="font-weight:400;">The model is also rolling out as the default for AI Mode in Search globally, combining real-time information retrieval with multimodal reasoning capabilities.</li>
<li style="font-weight:400;">Early enterprise adopters, including JetBrains, Bridgewater Associates, and Figma, are using the model for applications ranging from video analysis and data extraction to visual Q&amp;A and in-game assistance. </li>
<li style="font-weight:400;">The multimodal capabilities support real-time analysis of images, video, and audio content for actionable insights.</li>
</ul>
<p>27:01 Justin – “This, just in general, is a pretty big improvement from not only the cost perspective, but also the overall performance, and the ability to run this on local devices, for like Android phones, is gonna be a huge breakthrough in LM performance on the device. So I suspect you’ll see a lot of Gemini 3 flash getting rolled out all over the place because it does a lot of things really darn well.”</p>
<p>28:16 <a href="https://cloud.google.com/blog/products/data-analytics/connect-google-antigravity-ide-to-googles-data-cloud-services/">Connect Google Antigravity IDE to Google’s Data Cloud services | Google </a><a href="https://cloud.google.com/blog/products/data-analytics/connect-google-antigravity-ide-to-googles-data-cloud-services/">Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google has integrated Model Context Protocol servers into its new <a href="https://antigravity.google/">Antigravity</a> IDE, allowing AI agents to directly connect to Google Cloud data services, including <a href="https://cloud.google.com/alloydb">AlloyDB</a>, <a href="https://cloud.google.com/bigquery">BigQuery</a>, <a href="https://cloud.google.com/spanner?e=48754805">Spanner</a>, <a href="https://cloud.google.com/sql?e=48754805">Cloud SQL</a>, and <a href="https://cloud.google.com/looker">Looker</a>. </li>
<li style="font-weight:400;"><a href="https://googleapis.github.io/genai-toolbox/getting-started/introduction/">The MCP Toolbox for Databases</a> provides pre-built connectors that eliminate manual configuration, letting developers access enterprise data through a UI-driven setup process within the IDE.</li>
<li style="font-weight:400;">The integration enables AI agents to perform database administration tasks, generate SQL code, and run queries without switching between tools. </li>
<li style="font-weight:400;">For AlloyDB and Cloud SQL, agents can explore schemas, develop queries, and optimize performance using tools like list_tables, execute_sql, and get_query_plan directly in the development environment.</li>
<li style="font-weight:400;">BigQuery and Looker connections extend agent capabilities into analytics and business intelligence workflows. </li>
<li style="font-weight:400;">Agents can forecast trends, search data catalogs, validate metric definitions against semantic models, and run ad-hoc queries to ensure application logic matches production reporting standards.</li>
<li style="font-weight:400;">The MCP servers use IAM credentials or secure password storage to maintain security while giving agents access to production data sources. This approach positions Antigravity as a data-aware development environment where AI assistance is grounded in actual enterprise data rather than abstract reasoning alone.</li>
<li style="font-weight:400;">The feature is available now through the Antigravity MCP Store with documentation at <a href="http://cloud.google.com/alloydb/docs">cloud.google.com/alloydb/docs</a> and the open-source MCP Toolbox on GitHub at googleapis/genai-toolbox. </li>
<li style="font-weight:400;">No specific pricing information was provided for the MCP integration itself, though standard data service costs for AlloyDB, BigQuery, and other connected services apply.</li>
</ul>
<p>29:15 <a href="https://cloud.google.com/blog/products/ai-machine-learning/announcing-official-mcp-support-for-google-services/">Announcing official MCP support for Google services | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google now offers fully-managed, remote <a href="https://www.anthropic.com/news/model-context-protocol">Model Context Protocol</a> (MCP) servers for its services, eliminating the need for developers to deploy and maintain individual local MCP servers. </li>
<li style="font-weight:400;">This provides a unified, enterprise-ready endpoint for connecting AI agents to Google and Google Cloud services with built-in IAM, audit logging, and Model Armor security.</li>
<li style="font-weight:400;">Initial MCP support launches for four key services: <a href="https://developers.google.com/maps/ai/grounding-lite">Google Maps Platform</a> for location grounding, BigQuery for querying enterprise data in-place, Compute Engine for infrastructure management, and GKE for container operations. Additional services, including Cloud Run, Cloud Storage, AlloyDB, Spanner, and SecOps, will receive MCP support in the coming months.</li>
<li style="font-weight:400;">Apigee integration allows enterprises to expose their own custom APIs and third-party APIs as discoverable tools for AI agents, extending MCP capabilities beyond Google services to the broader enterprise stack. </li>
<li style="font-weight:400;">Organizations can use Cloud API Registry and Apigee API Hub to discover and govern available MCP tools across their environment.</li>
<li style="font-weight:400;">The implementation enables agents to perform complex multi-step workflows like analyzing BigQuery sales data for revenue forecasting while simultaneously querying Google Maps for location intelligence, all through standardized MCP interfaces. </li>
<li style="font-weight:400;">This approach keeps data in place rather than moving it into context windows, reducing security risks and latency.</li>
</ul>
<p>30:34 <a href="https://cloud.google.com/blog/products/ai-machine-learning/mcp-support-for-apigee/">MCP support for Apigee | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Apigee now supports Model Context Protocol (MCP), allowing organizations to expose their existing APIs as tools for AI agents without writing code or managing MCP servers. Google handles the infrastructure, transcoding, and protocol management while Apigee applies its <a href="https://cloud.google.com/apigee/docs/api-platform/reference/policies/reference-overview-policy">30+ built-in policies</a> for authentication, authorization, and security to govern agentic interactions.</li>
<li style="font-weight:400;">The implementation automatically registers deployed MCP proxies in Apigee API hub as searchable MCP APIs, enabling centralized tool catalogs and granular access controls through <a href="https://docs.cloud.google.com/apigee/docs/api-platform/publish/what-api-product">API products</a>. </li>
<li style="font-weight:400;">Organizations can apply quota policies and identity controls to restrict which agents and clients can access specific MCP tools, with full visibility through Apigee Analytics and the new <a href="https://cloud.google.com/apigee/docs/apihub/api-insights-overview">API Insights</a> feature.</li>
<li style="font-weight:400;">Integration with Google’s <a href="https://google.github.io/adk-docs/">Agent Development Kit</a> (ADK) provides streamlined access to Apigee MCP endpoints for developers building custom agents, with an <a href="https://google.github.io/adk-docs/agents/models/#using-apigee-gateway-for-ai-models">ApigeeLLM wrapper</a> available for routing LLM calls through Apigee proxies. </li>
<li style="font-weight:400;">The feature works with multiple agent frameworks, including LangGraph, though ADK users get optimized tooling for the Google ecosystem, including <a href="https://docs.cloud.google.com/agent-builder/agent-engine/overview">Vertex AI Agent Engine</a> and <a href="https://cloud.google.com/gemini-enterprise?hl=en">Gemini Enterprise deployment</a> options.</li>
<li style="font-weight:400;">Security capabilities extend beyond standard API protection to include <a href="https://cloud.google.com/security/products/dlp?hl=en">Cloud Data Loss Prevention</a> for sensitive data classification and <a href="https://docs.cloud.google.com/security-command-center/docs/model-armor-overview">Model Armor</a> for defending against prompt injection attacks. </li>
<li style="font-weight:400;">The feature is currently in preview with select customers, requiring contact with Apigee or Google Cloud account teams for access, with no pricing information disclosed yet.</li>
</ul>
<p>31:07 Ryan – “I just did some real-time analysis about the features of the MCP and then also the browser and stuff. It’s one of those things where it is the newer model of coding, where you’re having distributed agents do tasks, and that, so the new IDs are taking advantage of that… And it is a VS Code fork. So it’s very comfortable to your VS Code users.”</p>
<p>32:05 <a href="https://cloud.google.com/blog/products/application-development/application-design-center-now-ga/">Application Design Center now GA | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google’s Application Design Center reaches general availability as a visual, AI-powered platform for designing and deploying Terraform-backed application infrastructure on GCP. </li>
<li style="font-weight:400;">The service integrates with Gemini Cloud Assist to let users describe infrastructure needs in natural language and receive deployable architecture diagrams with Terraform code, while automatically registering applications with <a href="https://cloud.google.com/app-hub/docs/overview">App Hub</a> for unified management.</li>
<li style="font-weight:400;">The platform addresses platform engineering needs by providing a curated catalog of opinionated <a href="https://docs.cloud.google.com/application-design-center/docs/manage-application-instances#create-application-revision">application templates</a>, including specialized GKE templates for AI inference workloads using various LLM models. </li>
<li style="font-weight:400;">Organizations can bring their own Terraform configurations from Git repositories and combine them with Google-provided components to create standardized infrastructure patterns for reuse across development teams.</li>
<li style="font-weight:400;">New GA features include<a href="https://cloud.google.com/sdk/gcloud/reference/design-center"> public APIs and gcloud CLI support</a>, <a href="https://docs.cloud.google.com/application-design-center/docs/set-up-secure-perimeter">VPC service controls compatibility</a>, and <a href="https://docs.cloud.google.com/application-design-center/docs/download-and-deploy#export_terraform_code">GitOps integration</a> for CI/CD workflows. </li>
<li style="font-weight:400;">The service offers application template revisions as an immutable audit trail and automatically detects configuration drift between intended designs and deployed applications to maintain compliance.</li>
<li style="font-weight:400;">The platform is available free of cost for building and deploying application templates, with pricing details at cloud.google.com/products/application-design-center/pricing. </li>
<li style="font-weight:400;">Integration with Cloud Hub provides operational insights and a unified control plane for managing application portfolios across the organization.</li>
<li style="font-weight:400;">Platform teams can create secure, shareable catalogs of approved templates that give developers self-service access to compliant infrastructure while maintaining governance and security standards. </li>
<li style="font-weight:400;">The service supports downloading templates as infrastructure-as-code for direct editing in local IDEs with changes flowing through standard Git pull request workflows.</li>
</ul>
<p>33:10 Ryan – “It’s kind of the pangea that everyone’s been hoping for, for a long time. With AI making it possible. Being able to plain text speak your infrastructure into existence…I definitely like this model better than like Beanstalk or the hosted application model, which has been the solution until this. This is the answer I want.” </p>
<h2>Azure</h2>
<p>34:30 <a href="https://arstechnica.com/security/2025/12/microsoft-will-finally-kill-obsolete-cipher-that-has-wreaked-decades-of-havoc/">Microsoft will finally kill obsolete cipher that has wreaked decades of havoc </a><a href="https://arstechnica.com/security/2025/12/microsoft-will-finally-kill-obsolete-cipher-that-has-wreaked-decades-of-havoc/">– Ars Technica</a></p>
<ul>
<li style="font-weight:400;">Microsoft is deprecating RC4 encryption in Windows Active Directory after 26 years of default support, following its role in major breaches, including the 2024 Ascension healthcare attack that affected 5.6 million patient records. </li>
<li style="font-weight:400;">The cipher has been cryptographically weak since 1994 and enabled Kerberoasting attacks that have compromised enterprise networks for over a decade.</li>
<li style="font-weight:400;">Windows servers have continued to accept RC4-based authentication requests by default even after AES support was added, creating a persistent attack vector that hackers routinely exploit. </li>
<li style="font-weight:400;">Senator Ron Wyden called for an FTC investigation of Microsoft in September 2025 for gross cybersecurity negligence related to this default configuration.</li>
<li style="font-weight:400;">The deprecation addresses a fundamental security gap in enterprise identity management that has existed since Active Directory launched in 2000. Organizations using Windows authentication will need to ensure their systems are configured to use AES encryption and disable RC4 fallback to prevent downgrade attacks.</li>
<li style="font-weight:400;">This change affects any organization running Active Directory for user authentication and access control, particularly those in healthcare, finance, and other regulated industries where credential theft can lead to catastrophic breaches. (Or literally anyone running Windows.) </li>
<li style="font-weight:400;">The move comes after years of security researchers and government officials pressuring Microsoft to remove the obsolete cipher from default configurations.</li>
</ul>
<p>36:06 Ryan – “It’s so complex, everyone just accepts the defaults just to get it up and going, and if you don’t know how compromised the cipher is, you don’t really prioritize getting back and fixing the encryption. So I’m really happy to see this; it’s always been a black mark that’s made me not trust Windows.” </p>
<p>37:11 <a href="https://azure.microsoft.com/en-us/blog/azure-storage-innovations-unlocking-the-future-of-data/">Azure Storage innovations: Unlocking the future of data </a></p>
<ul>
<li style="font-weight:400;"><a href="https://azure.microsoft.com/en-us/products/storage/blobs/?msockid=0d36bfb9b86d68ee3afdae84b944695f">Azure Blob Storage</a> now scales to exabytes with 50+ Tbps throughput and millions of IOPS, specifically architected to keep GPUs continuously fed during AI training workloads. </li>
<li style="font-weight:400;">The platform powers <a href="https://www.microsoft.com/en/customers/story/23427-openai-lp-azure-blob-storage/?msockid=3ed723774adc6b5418ce315e4b1b6a83">OpenAI’s</a> model training and includes a new Smart Tier preview that automatically moves data between hot, cool, and cold tiers based on 30 and 90-day access patterns to optimize costs without manual intervention.</li>
<li style="font-weight:400;">Azure Ultra Disk delivers sub-0.5ms latency with 30% improvement on Azure Boost VMs, scaling to 400K IOPS per disk and up to 800K IOPS per VM on new Ebsv6 instances. </li>
<li style="font-weight:400;">The new <a href="https://azure.microsoft.com/en-us/updates?id=520805">Instant Access Snapshots</a> preview eliminates pre-warming requirements and reduces recovery times from hours to seconds for Premium SSD v2 and Ultra Disk, while flexible provisioning can reduce total cost of ownership by up to 50%.</li>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/azure-managed-lustre/amlfs-overview">Azure Managed Lustre</a> AMLFS 20 preview supports 25 PiB namespaces with 512 GBps throughput, featuring <a href="https://azure.microsoft.com/en-us/updates?id=529342">auto-import</a> and auto-export capabilities for seamless data movement between AMLFS and Azure Blob Storage. </li>
<li style="font-weight:400;">This addresses the specific challenge of training AI models at terabyte and petabyte scale by maintaining high GPU utilization through parallel I/O operations.</li>
<li style="font-weight:400;">Azure Files introduces Entra-only identity support for SMB shares, eliminating the need for on-premises Active Directory infrastructure and enabling cloud-native identity management, including external identities for Azure Virtual Desktop. Storage Mover adds cloud-to-cloud transfers and on-premises NFS to Azure Files NFS 4.1 migration, while Azure NetApp Files large volumes now scale to 7.2 PiB capacity with 50 GiBps throughput, representing a 3x and 4x increase, respectively.</li>
<li style="font-weight:400;">Azure Native offers now include Pure Storage and Dell PowerScale for customers wanting to migrate existing on-premises partner solutions to Azure using familiar technology stacks. The Storage Migration Program provides access to partners like Atempo, Cirata, Cirrus Data, and Komprise for SAN and NAS workload migrations, with a new Storage Migration Solution Advisor in Copilot to streamline decision-making. Pricing details were not disclosed in the announcement.</li>
</ul>
<p>38:26 Ryan – “It just dawned on me, as you’re reading through here… this is interesting; getting all this high performance from object stores just sort of blows my mind. And then I realized that all these sorts of ‘cloud file systems’ have been backed underneath by these object stores for a long time; like, of course, they need this.”</p>
<p>39:49 <a href="https://azure.microsoft.com/en-us/blog/microsofts-commitment-to-supporting-cloud-infrastructure-demand-in-the-united-states/">Future-Ready Cloud: Microsoft’s U.S. Infrastructure Investments</a></p>
<ul>
<li style="font-weight:400;">Microsoft is expanding its U.S. datacenter footprint with a new East US 3 region launching in Greater <a href="https://news.microsoft.com/source/features/ai/from-wisconsin-to-atlanta-microsoft-connects-datacenters-to-build-its-first-ai-superfactory/?msockid=21b51488f70560c918500229f66f61114">Atlanta</a> in early 2027, plus adding Availability Zones to five existing regions by the end of 2027. </li>
<li style="font-weight:400;">The Atlanta, <a href="https://blogs.microsoft.com/blog/2025/11/12/infinite-scale-the-architecture-behind-the-azure-ai-superfactory/">Georgia</a> region will support advanced AI workloads and feature zone-redundant storage for improved application resilience, designed to meet LEED Gold certification standards for sustainability.</li>
<li style="font-weight:400;">The expansion adds Availability Zones to North Central US, West Central US, and US Gov Arizona regions, plus enhances existing zones in East US 2 Virginia and South Central US Texas. </li>
<li style="font-weight:400;">This provides customers with more options for multi-region architectures to improve recovery time objectives and meet compliance requirements like CMMC and NIST guidance for government workloads.</li>
<li style="font-weight:400;">Azure Government customers get dedicated infrastructure expansion with three Availability Zones coming to US Gov Arizona in early 2026, specifically supporting Defense Industrial Base requirements. </li>
<li style="font-weight:400;">This complements the Azure for US Government Secret cloud region launched earlier in 2025, offering an alternative to US Gov Virginia for latency-sensitive and mission-critical deployments.</li>
<li style="font-weight:400;">The infrastructure investments support organizations like the <a href="https://www.microsoft.com/en/customers/story/1452353711502789549-university-of-miami-higher-education-azure">University of Miami </a>using Availability Zones for disaster recovery in hurricane-prone regions, and the <a href="https://www.microsoft.com/en/customers/story/1747116642806599053-state-of-alaska-azure-government-en-united-states">State of Alaska</a> consolidating legacy systems while improving reliability. </li>
<li style="font-weight:400;">Microsoft emphasizes its global network of over 70 regions, 400 datacenters, and 370,000 miles of fiber as a foundation for resilient cloud strategies using its <a href="https://learn.microsoft.com/en-us/azure/cloud-adoption-framework/">Cloud Adoption Framework</a> and <a href="https://learn.microsoft.com/en-us/azure/well-architected/">Well-Architected Framework</a> guidance.</li>
<li style="font-weight:400;">ai.azure.com for building production-ready AI agents.</li>
</ul>
<p>40:33 Ryan – “AI is definitely driving a lot of this, but like with large data sets, you don’t really want that distributed globally. But I also think that they’re just purely running out of space.”</p>
<p>41:17 <a href="https://azure.microsoft.com/en-us/blog/azure-networking-updates-on-security-reliability-and-high-availability/">Azure Networking Updates: Secure, Scalable, and AI-Optimized</a></p>
<ul>
<li style="font-weight:400;">Azure is tripling down on AI infrastructure with its global network now reaching 18 petabits per second of total capacity, up from 6 Pbps at the end of FY24. </li>
<li style="font-weight:400;">The network spans over 60 AI regions with 500,000 miles of fiber and 4 Pbps of WAN capacity, using InfiniBand and high-speed Ethernet for lossless data transfer between GPU clusters.</li>
<li style="font-weight:400;"><a href="https://techcommunity.microsoft.com/blog/azurenetworkingblog/announcing-the-public-preview-of-standardv2-nat-gateway-and-standardv2-public-ip/4458292">NAT Gateway</a> Standard V2 enters public preview with zone redundancy by default at no additional cost, delivering 100 Gbps throughput and 10 million packets per second. </li>
<li style="font-weight:400;">This joins ExpressRoute, VPN, and Application Gateway in offering zone-resilient SKUs as part of Azure’s resiliency-by-default strategy.</li>
<li style="font-weight:400;">Security updates include <a href="https://azure.microsoft.com/en-us/updates/?id=530183">DNS Security Policy with Threat Intel</a> now generally available for blocking malicious domains, <a href="https://azure.microsoft.com/en-us/updates/?id=503988">Private Link Direct Connect</a> in preview for extending connectivity to any routable private IP, and <a href="https://learn.microsoft.com/en-us/azure/application-gateway/json-web-token-overview">JWT validation at Layer 7 in Application Gateway</a> preview to offload token validation from backend servers.</li>
<li style="font-weight:400;">ExpressRoute is getting 400G direct ports in select locations starting in 2026 for multi-terabit throughput, while VPN Gateway, now generally available, supports 5 Gbps single TCP flow and 20 Gbps total throughput with four tunnels. </li>
<li style="font-weight:400;">Private Link scales to 5,000 endpoints per VNet and 20,000 across peered VNets.</li>
<li style="font-weight:400;">Container networking improvements for AKS include <a href="https://azure.microsoft.com/en-us/updates/?id=523100">eBPF Host Routing</a> for lower latency, <a href="https://azure.microsoft.com/en-us/updates/?id=523086">Pod CIDR Expansion</a> without cluster redeployment, <a href="https://azure.microsoft.com/en-us/updates/?id=525419">WAF for Application Gateway for Containers</a> now generally available, and <a href="https://learn.microsoft.com/en-us/azure/bastion/bastion-connect-to-aks-private-cluster">Azure Bastion</a> support for private AKS cluster access.</li>
</ul>
<p>42:45 Ryan – “If you have those high-end network throughput needs, that’s fantastic! It’s been a while since I’ve really got into cloud at that deep layer, but I do remember in AWS the VPN limitations really biting; it was easy to hit those limits really fast.” </p>
<h2>After Show </h2>
<p>44:22 <a href="https://arstechnica.com/information-technology/2025/12/roomba-maker-irobot-swept-into-bankruptcy/">Roomba maker iRobot swept into bankruptcy</a></p>
<ul>
<li style="font-weight:400;">iRobot’s bankruptcy marks the end of an era for the company that pioneered consumer robotics with the Roomba, now being acquired by its Chinese supplier Picea Robotics after losing ground to cheaper competitors. </li>
<li style="font-weight:400;">The stock crashed from Amazon’s $52 offer in 2023 to just $4, showing how quickly market leaders can fall when undercut on price.</li>
<li style="font-weight:400;">The failed Amazon acquisition in 2023 due to EU antitrust concerns looks particularly painful in hindsight, as iRobot might have been better off with Amazon’s resources than facing bankruptcy. </li>
<li style="font-weight:400;">This highlights how regulatory decisions intended to preserve competition can sometimes accelerate a company’s decline instead.</li>
<li style="font-weight:400;">For cloud professionals, this demonstrates how hardware IoT companies struggle without strong cloud services and ecosystem lock-in that could justify premium pricing. iRobot’s inability to differentiate beyond hardware shows why companies like Amazon, Google, and Apple integrate devices tightly with their cloud platforms.</li>
<li style="font-weight:400;">The Chinese supplier takeover raises questions about data privacy and security for the millions of Roombas already mapping homes worldwide. </li>
<li style="font-weight:400;">This could become a cautionary tale about supply chain dependencies and what happens when your manufacturer becomes your owner.</li>
<li style="font-weight:400;">Founded by MIT engineers in 1990 and selling 40 million devices, iRobot’s fall shows that innovation alone isn’t enough without sustainable competitive advantages in manufacturing costs and ongoing software value.</li>
<li style="font-weight:400;">This is a sad day, especially if you’re a fan of all things serverless, as they were the poster child of all things serverless.  </li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2304352/c1e-rodobwg4r4f8d507-ndvg6gzkfgv9-qkjs9t.mp3" length="97458013"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 335 of The Cloud Pod, where the forecast is always cloudy! This pre-Christmas week, Ryan and Justin have hit the studio to bring you the final show of 2025. We’ve got lots of AI images, EKS Network Policies, Gemini 3, and even some Disney drama. 
Let’s get into it! 
Titles we almost went with this week

 From Roomba to Tomb-ba: How the Robot Vacuum Pioneer Got Cleaned Out **OpenAI
 From Napkin Sketch to Production: Google’s App Design Center Goes GA
 Terraform Gets a Canvas: Google Paints Infrastructure Design with AI
 Mickey Mouse Takes Off the Gloves: Disney vs Google AI Showdown
 From Data Silos to Data Solos: Google Conducts the Integration Orchestra
 No More Thread Dread: AWS Brings AI to JVM Performance Troubleshooting
 MCP: More Corporate Plumbing Than You Think
 GPT-5.2 Beats Humans at Work Tasks, Still Can’t Get You Out of Monday Meetings
 Kerberos More Like Kerbero-Less: Microsoft Axes Ancient Encryption Standard
 OpenAI Teaches GPT-5.2 to PowerPoint: Death by Bullet Points Now AI-Generated
 MCP: Like USB-C, But Everyone’s Keeping Theirs in the Drawer
 Flash Gordon: Google’s Gemini 3 Gets a Speed Boost Without the Sacrifice
 Tag, You’re It: AWS Finally Knows Who to Bill
 Snowflake Gets a GPT-5.2 Upgrade: Now With More Intelligence Per Query
 OpenAI and Snowflake: Making Data Warehouses Smarter Than Your Average Analyst
GPT-5.2 Moves Into the Snowflake: No Melting Required

AI Is Going Great, or How ML Makes Money 
01:06 Meta’s multibillion-dollar AI strategy overhaul creates culture clash:

Meta is developing Avocado, a new frontier AI model codenamed to succeed Llama, now expected to launch in Q1 2026 after internal delays related to training performance testing. 
The model may be proprietary rather than open source, marking a significant shift from Meta’s previous strategy of freely distributing Llama’s weights and architecture to developers. We feel like this is an interesting choice for Meta, but what do we know? 
Meta spent 14.3 billion dollars in June 2025 to hire Scale AI founder Alexandr Wang as Chief AI Officer and acquire a stake in Scale, while raising 2026 capital expenditure guidance to 70-72 billion dollars. 

Wang now leads the elite TBD Lab developing Avocado, operating separately from traditional Meta teams and not using the company’s internal workplace network.


The company has restructured its AI leadership following the poor reception of Llama 4 in April, with Chief Product Officer Chris Cox no longer overseeing the GenAI unit. 
Meta cut 600 jobs in Meta Superintelligence Labs in October, contributing to the departure of Chief AI Scientist Yann LeCun to launch a startup, while implementing 70-hour workweeks across AI organizations.
Meta’s new AI leadership under Wang and former GitHub CEO Nat Friedman has introduced a “demo, don’t memo” development approach, replacing traditional multi-step approval processes with rapid prototyping using AI agents and newer tools. 
The company is also leveraging third-party cloud services from CoreWeave and Oracle while buil...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2304352/c1a-k5d5-v6pkok53s8g1-qumvcj.jpg"></itunes:image>
                                                                            <itunes:duration>00:50:41</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2304352/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[334: AWS Makes Kubernetes Conversational]]>
                </title>
                <pubDate>Fri, 19 Dec 2025 01:56:26 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2296281</guid>
                                    <link>https://tcpfm.castos.com/episodes/aws-makes-kubernetes-conversational</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 334 of The Cloud Pod, where the forecast is always cloudy! This week, we’re bringing you a jam-packed recap of re:Invent! We’ve got all the news, from keynotes to announcements. Whether you were there live or catching up on all the news, Justin, Matt, and Ryan are here to break it all down. Let’s get started! </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> EKS Gets Chatty: Natural Language Replaces Command Line Nightmares</li>
<li> Harvest Now, Decrypt Later: Why Your RSA Keys Need a Quantum Makeover Before 2026</li>
<li> NAT So Fast: AWS Helps You Find Gateways Doing Absolutely Nothing</li>
<li> AWS Finally Admits You Have Too Many Log Buckets</li>
<li> AWS Finally Lets You Log In Like a Normal Human</li>
<li> Lambda Gets a Memory: Checkpoint Your Way to Multi-Step Workflows</li>
<li> Step Functions at Home: Lambda Durable Functions Let You Write Workflows in Actual Code</li>
<li> No More Bucket List: S3 Public Access Gets Organization-Wide Lockdown</li>
<li> AWS Hits Ctrl-Z on CodeCommit Deprecation</li>
<li> AWS Puts a Cap on CloudFront: Unlimited Traffic, Limited Anxiety</li>
<li> AWS Tells SQL Server to Take a Thread Off: Optimize CPU Cuts Costs by 55%</li>
<li> Amazon Bedrock Gets a Bouncer: AgentCore Identity Checks IDs at the Door</li>
<li> AI Brings on the Developer Renaissance</li>
</ul>
<h2>Follow Up </h2>
<p>01:27 re:Invent </p>
<ul>
<li style="font-weight:400;">Matt Garman- 14th Reinvent, which is weird, since we’ve been doing cloud stuff for 87 years…</li>
<li style="font-weight:400;">Warner – Open Mind for a different View and nothing else matters T-shirt.</li>
</ul>
<p>02:59 re:Invent predictions</p>
<p>Jonathan</p>
<ol>
<li style="list-style-type:none;">
<ol>
<li style="font-weight:400;">Serverless GPU support (extension in Lambda or a different service), it’s about time we have a serverless GPU/Inference capability.
<ol>
<li style="font-weight:400;">It is talked about in the keynote with DeSantis.</li>
</ol>
</li>
</ol>
</li>
</ol>
<ul>
<li>AI Agent with a goal/instructions that can run when they need to, periodically, or always, and perform an action (Agentic Platform that runs agents) –</li>
</ul>
<ul>
<li> Garman – Bedrock AgentCore and Kiro Autonomous Agent</li>
</ul>
<ul>
<li style="font-weight:400;">Werner will announce this is his last keynote and he will retire</li>
</ul>
<ul>
<li style="font-weight:400;">He retired from re:Invent Presentations</li>
</ul>
<p>Ryan</p>
<ul>
<li>New Tranium 3 chips, Inferentia, and Graviton chips</li>
</ul>
<ul>
<li>Garman – announced Tranium 3 Ultraservers.</li>
</ul>
<ul>
<li>They brought the Rack Ryan</li>
</ul>
<ul>
<li>Expand the number of models in or via bedrock</li>
</ul>
<ul>
<li>Doubled the number of models and announced Gemma, Minimax M2, Nvidia Nemotron, Mistral Large, and Mistral 3</li>
<li style="font-weight:400;">Refresh to AWS Organizations</li>
</ul>
<p>Justin</p>
<ul>
<li>New Nova Model &amp; Sonic with Multi-modal</li>
</ul>
<ul>
<li>Garman Nova 2 – Lite, Pro, and Sonic (the lack of Sonic the Hedgehog/Sega reference is a shame)</li>
</ul>
<ul>
<li>Nova 2 Omni</li>
</ul>
<ul>
<li style="font-weight:400;">Announce a partnership with OpenAI (likely on stage)</li>
</ul>
<ol>
<li style="list-style-type:none;">
<ol>
<li style="list-style-type:none;">
<ol>
<li style="font-weight:400;">Not announced as new, but said they’re running on AWS and that EC2 Ultraservers are in use. </li>
</ol>
</li>
</ol>
</li>
</ol>
<ul>
<li>Advanced Agentic AI Capabilities for Security Hub (Automate the SOC teams)</li>
</ul>
<ul>
<li>Garman – Advanced Agentic AI Capabilities for Security Hub – with NEW AWS Security Agent</li>
</ul>
<p>Matt</p>
<ol>
<li style="font-weight:400;">A model router to route LLM queries to different AI models</li>
<li style="font-weight:400;">Well-architected framework expansion </li>
<li style="font-weight:400;">End user Authentication that doesn’t suck (not current Cognito)</li>
</ol>
<p>Tie Breaker – How many times w...</p>
<h3>Chapters</h3>
<ul><li>(00:00:00) - AWS + GCP: Kubectl Goodbye</li><li>(00:01:31) - Reinvent Prediction: Who Won The PC World Awards</li><li>(00:02:28) - AWS 10.2: Serverless and AI Agents</li><li>(00:03:35) - Amazon Keynotes: Ryan Will Retire From Speaking</li><li>(00:07:15) - AWS Security Hub: Advanced Agentic AI capabilities</li><li>(00:08:06) - Treat Time: The AI Conference</li><li>(00:11:04) - Matt Garmin's Conference Keynote</li><li>(00:13:49) - Amazon Cloud Conference 2018: Highlights and Disclosures</li><li>(00:19:05) - Swami's Keynote</li><li>(00:20:33) - Peter Desantis at Reinvent:</li><li>(00:21:55) - Peter Desantis's keynote</li><li>(00:24:36) - Bedrock Reinforcement Learning Keynotes</li><li>(00:29:23) - EC2 and Lambda: Computing with AWS, AI factories</li><li>(00:30:43) - AWS Lambda Managed Instances</li><li>(00:33:32) - AWS Lambda: Durable Functions Invite</li><li>(00:37:37) - Amazon's Step Functions vs. AWS Lambda</li><li>(00:40:40) - ECS x Kubernetes, NAT & More</li><li>(00:47:16) - AWS: VPC Encryption Control (Nitro)</li><li>(00:49:38) - AWS Network Firewall Proxy</li><li>(00:50:58) -  AWS S3: New Block Public Access Controls and More</li><li>(00:54:19) - Amazon FSX for NetApp ONTAP Adds S3</li><li>(00:55:56) - Database Enhancements in 2017</li><li>(00:56:35) - AWS Adds Four New Features to SQL Server & Oracle RDS</li><li>(00:57:30) - AWS Database Savings Plan Announcement</li><li>(00:59:28) - RDS 10.2: SQL Server Resource Governor</li><li>(01:00:41) - WAF and Security Identity</li><li>(01:01:36) - Guardduty: Extended Threat Detection for Amazon EC2 & ECS</li><li>(01:03:45) - AWS Security Agent: Automated Application Security Reviews, Code Scan</li><li>(01:06:14) - Amazon IAM Policy Autopilot Release</li><li>(01:08:36) -  AWS data exports in the Focus 1.2 format and then</li><li>(01:09:36) - AWS Compute Optimizer: Cost Efficiency and Cost Optimization</li><li>(01:12:58) - Amazon Rescues CodeCommun from the AWS Cloud</li><li>(01:17:10) - CloudWatch: Governance, Control Tower, and More</li><li>(01:18:24) - AWS: AMI Ancestry</li><li>(01:20:58) - Amazon Support Plans Reshuffled</li><li>(01:25:29) - Amazon Cloud: Announcements #271</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 334 of The Cloud Pod, where the forecast is always cloudy! This week, we’re bringing you a jam-packed recap of re:Invent! We’ve got all the news, from keynotes to announcements. Whether you were there live or catching up on all the news, Justin, Matt, and Ryan are here to break it all down. Let’s get started! 
Titles we almost went with this week

 EKS Gets Chatty: Natural Language Replaces Command Line Nightmares
 Harvest Now, Decrypt Later: Why Your RSA Keys Need a Quantum Makeover Before 2026
 NAT So Fast: AWS Helps You Find Gateways Doing Absolutely Nothing
 AWS Finally Admits You Have Too Many Log Buckets
 AWS Finally Lets You Log In Like a Normal Human
 Lambda Gets a Memory: Checkpoint Your Way to Multi-Step Workflows
 Step Functions at Home: Lambda Durable Functions Let You Write Workflows in Actual Code
 No More Bucket List: S3 Public Access Gets Organization-Wide Lockdown
 AWS Hits Ctrl-Z on CodeCommit Deprecation
 AWS Puts a Cap on CloudFront: Unlimited Traffic, Limited Anxiety
 AWS Tells SQL Server to Take a Thread Off: Optimize CPU Cuts Costs by 55%
 Amazon Bedrock Gets a Bouncer: AgentCore Identity Checks IDs at the Door
 AI Brings on the Developer Renaissance

Follow Up 
01:27 re:Invent 

Matt Garman- 14th Reinvent, which is weird, since we’ve been doing cloud stuff for 87 years…
Warner – Open Mind for a different View and nothing else matters T-shirt.

02:59 re:Invent predictions
Jonathan



Serverless GPU support (extension in Lambda or a different service), it’s about time we have a serverless GPU/Inference capability.

It is talked about in the keynote with DeSantis.






AI Agent with a goal/instructions that can run when they need to, periodically, or always, and perform an action (Agentic Platform that runs agents) –


 Garman – Bedrock AgentCore and Kiro Autonomous Agent


Werner will announce this is his last keynote and he will retire


He retired from re:Invent Presentations

Ryan

New Tranium 3 chips, Inferentia, and Graviton chips


Garman – announced Tranium 3 Ultraservers.


They brought the Rack Ryan


Expand the number of models in or via bedrock


Doubled the number of models and announced Gemma, Minimax M2, Nvidia Nemotron, Mistral Large, and Mistral 3
Refresh to AWS Organizations

Justin

New Nova Model & Sonic with Multi-modal


Garman Nova 2 – Lite, Pro, and Sonic (the lack of Sonic the Hedgehog/Sega reference is a shame)


Nova 2 Omni


Announce a partnership with OpenAI (likely on stage)






Not announced as new, but said they’re running on AWS and that EC2 Ultraservers are in use. 






Advanced Agentic AI Capabilities for Security Hub (Automate the SOC teams)


Garman – Advanced Agentic AI Capabilities for Security Hub – with NEW AWS Security Agent

Matt

A model router to route LLM queries to different AI models
Well-architected framework expansion 
End user Authentication that doesn’t suck (not current Cognito)

Tie Breaker – How many times w...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[334: AWS Makes Kubernetes Conversational]]>
                </itunes:title>
                                    <itunes:episode>334</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 334 of The Cloud Pod, where the forecast is always cloudy! This week, we’re bringing you a jam-packed recap of re:Invent! We’ve got all the news, from keynotes to announcements. Whether you were there live or catching up on all the news, Justin, Matt, and Ryan are here to break it all down. Let’s get started! </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> EKS Gets Chatty: Natural Language Replaces Command Line Nightmares</li>
<li> Harvest Now, Decrypt Later: Why Your RSA Keys Need a Quantum Makeover Before 2026</li>
<li> NAT So Fast: AWS Helps You Find Gateways Doing Absolutely Nothing</li>
<li> AWS Finally Admits You Have Too Many Log Buckets</li>
<li> AWS Finally Lets You Log In Like a Normal Human</li>
<li> Lambda Gets a Memory: Checkpoint Your Way to Multi-Step Workflows</li>
<li> Step Functions at Home: Lambda Durable Functions Let You Write Workflows in Actual Code</li>
<li> No More Bucket List: S3 Public Access Gets Organization-Wide Lockdown</li>
<li> AWS Hits Ctrl-Z on CodeCommit Deprecation</li>
<li> AWS Puts a Cap on CloudFront: Unlimited Traffic, Limited Anxiety</li>
<li> AWS Tells SQL Server to Take a Thread Off: Optimize CPU Cuts Costs by 55%</li>
<li> Amazon Bedrock Gets a Bouncer: AgentCore Identity Checks IDs at the Door</li>
<li> AI Brings on the Developer Renaissance</li>
</ul>
<h2>Follow Up </h2>
<p>01:27 re:Invent </p>
<ul>
<li style="font-weight:400;">Matt Garman- 14th Reinvent, which is weird, since we’ve been doing cloud stuff for 87 years…</li>
<li style="font-weight:400;">Warner – Open Mind for a different View and nothing else matters T-shirt.</li>
</ul>
<p>02:59 re:Invent predictions</p>
<p>Jonathan</p>
<ol>
<li style="list-style-type:none;">
<ol>
<li style="font-weight:400;">Serverless GPU support (extension in Lambda or a different service), it’s about time we have a serverless GPU/Inference capability.
<ol>
<li style="font-weight:400;">It is talked about in the keynote with DeSantis.</li>
</ol>
</li>
</ol>
</li>
</ol>
<ul>
<li>AI Agent with a goal/instructions that can run when they need to, periodically, or always, and perform an action (Agentic Platform that runs agents) –</li>
</ul>
<ul>
<li> Garman – Bedrock AgentCore and Kiro Autonomous Agent</li>
</ul>
<ul>
<li style="font-weight:400;">Werner will announce this is his last keynote and he will retire</li>
</ul>
<ul>
<li style="font-weight:400;">He retired from re:Invent Presentations</li>
</ul>
<p>Ryan</p>
<ul>
<li>New Tranium 3 chips, Inferentia, and Graviton chips</li>
</ul>
<ul>
<li>Garman – announced Tranium 3 Ultraservers.</li>
</ul>
<ul>
<li>They brought the Rack Ryan</li>
</ul>
<ul>
<li>Expand the number of models in or via bedrock</li>
</ul>
<ul>
<li>Doubled the number of models and announced Gemma, Minimax M2, Nvidia Nemotron, Mistral Large, and Mistral 3</li>
<li style="font-weight:400;">Refresh to AWS Organizations</li>
</ul>
<p>Justin</p>
<ul>
<li>New Nova Model &amp; Sonic with Multi-modal</li>
</ul>
<ul>
<li>Garman Nova 2 – Lite, Pro, and Sonic (the lack of Sonic the Hedgehog/Sega reference is a shame)</li>
</ul>
<ul>
<li>Nova 2 Omni</li>
</ul>
<ul>
<li style="font-weight:400;">Announce a partnership with OpenAI (likely on stage)</li>
</ul>
<ol>
<li style="list-style-type:none;">
<ol>
<li style="list-style-type:none;">
<ol>
<li style="font-weight:400;">Not announced as new, but said they’re running on AWS and that EC2 Ultraservers are in use. </li>
</ol>
</li>
</ol>
</li>
</ol>
<ul>
<li>Advanced Agentic AI Capabilities for Security Hub (Automate the SOC teams)</li>
</ul>
<ul>
<li>Garman – Advanced Agentic AI Capabilities for Security Hub – with NEW AWS Security Agent</li>
</ul>
<p>Matt</p>
<ol>
<li style="font-weight:400;">A model router to route LLM queries to different AI models</li>
<li style="font-weight:400;">Well-architected framework expansion </li>
<li style="font-weight:400;">End user Authentication that doesn’t suck (not current Cognito)</li>
</ol>
<p>Tie Breaker – How many times will they say AI or Artificial Intelligence</p>
<p>Matt: 200</p>
<p>Justin: 160</p>
<p>Ryan: 99
Jonathan: 1</p>
<p>Matt Garman’s Keynote: 77</p>
<p>DeSantis’ Keynote: 31</p>
<p>Swami: 44</p>
<p>Werner: 31</p>
<p>Total: 183</p>
<p>This means Justin wins this year! </p>
<p>10:05 Honorable Mentions:</p>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;">Mathematical Proof that one of Amazon’s Models has output that can be verifiable with math</li>
</ul>
</li>
</ul>
<ul>
<li>Marketplace for AI Work</li>
</ul>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;">New Device to go along with the Nova Models</li>
<li style="font-weight:400;">Cost Savings for Networking</li>
<li style="font-weight:400;">FinOps AI recommender for Model Usage</li>
<li style="font-weight:400;">Savings Plans for AI/Bedrock Models</li>
<li style="font-weight:400;">S3 Vectors with integration bedrock</li>
<li style="font-weight:400;">FinOps Kubernetes Service</li>
</ul>
</li>
</ul>
<ul>
<li>Q Developer with Autonomous Agents</li>
</ul>
<ul>
<li>Next Generation Silicone for a combined TPU competitor, ie GPU/Graviton/Learning</li>
<li style="font-weight:400;">Bedrock Model Marketplace with Revenue Share for fine-tuned models (Ryan)</li>
<li style="font-weight:400;">Sustainability Dashboard</li>
<li style="font-weight:400;">Aurora/DSQL is an AI feature</li>
</ul>
<h2>AWS</h2>
<p>11:59 re:Invent keynote Recap</p>
<ul>
<li>Matt – started the weekend strong, although we struggled with his keynotes. (Sounds like he could use a good copywriter to help with his speeches.) </li>
<li>Swami – Solid B from us, but that’s because we’re not super interested in his topics. Sorry. </li>
<li>Peter – we enjoyed this one more. Cool tech, lots of mentions, and one of the better presenters. A for him. </li>
<li>Werner – Great Intro Video. Welcome to the Renaissance Coder</li>
</ul>
<p>15:00 A Quick Recap </p>
<p>Look.  We know you care about non-AI things (and so do we), so we’re going to do 25 exciting new announcements in 10 minutes. x8, elon instance, c8a, c8ine instances, m8azn, m3 and m4 max macs, lambda durable functions, 50tb s3 object, s3 batch ops 10x faster, intelligent tiering for s3 tables, automatic replication for s3 tables, s3 access points for FSX netapp, S3 Vectors, GPU Index for Amazon Opensearch, Amazon EMR Serverless with no storage provisioning, Guardduty to ECS &amp; Ec2, Security Hub is GA, Unified data store in cloudwatch, Increases STorage for SQL and Oracle RDS, Optimize CPus for RDS for SQL server, SQL Server Development support, Database Savings Plans. 2 hours on AI…when we would have been really happy with all of THIS as the keynote. </p>
<p>26:08  AI/ML &amp; Amazon Bedrock</p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/bedrock/latest/userguide/service-tiers-inference.html">Bedrock Service Tiers</a> (Priority/Standard/Flex) – Match AI workload performance with cost</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/bedrock/latest/userguide/service-tiers-inference.html">Bedrock Reserved Service Tier</a> – Pre-purchase guaranteed tokens-per-minute capacity with 99.5% SLA</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/bedrock-agentcore/">Bedrock AgentCore</a> – Policy controls, evaluations, episodic memory for AI agents</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/bedrock/latest/userguide/reinforcement-fine-tuning.html">Bedrock Reinforcement Fine-tuning</a> – RLVR and RLAIF for model customization</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/introducing-amazon-nova-2-lite-a-fast-cost-effective-reasoning-model/">Amazon Nova 2 Lite</a> – Fast, cost-effective reasoning model with configurable thinking</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/nova/forge/'">Nova Forge</a> – Build your own foundational models</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/amazon-bedrock-adds-fully-managed-open-weight-models/">18 New Open Weight Models</a> – Mistral Large 3, Ministral 3 variants, others</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/cost-management/latest/userguide/ce-q-overview.html">Amazon Q Developer Cost Management</a> – Natural language queries for AWS spending analysis</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/new-serverless-customization-in-amazon-sagemaker-ai-accelerates-model-fine-tuning/">SageMaker Serverless Customization</a> – Automated infrastructure for fine-tuning</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/sagemaker/latest/dg/sagemaker-hyperpod.html">SageMaker HyperPod</a> – Checkpointless and elastic training capabilities</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/clean-rooms/latest/userguide/machine-learning.html">AWS Clean Rooms ML</a> – Privacy-enhancing synthetic dataset generation</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/bedrock-agentcore/latest/devguide/evaluations.html">AgentCore Evaluations</a> – Continuously inspect agent quality based on real-world behavior</li>
</ul>
<p>29:09  Ryan – “I do agree with you that no one should be building their own foundational models unless it’s really, truly built on a data set that’s unique, but I do think that everyone should go through the exercise of building a model to understand how AI works.” </p>
<p>30:58  Compute (EC2 &amp; Lambda)</p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/about-aws/whats-new/2025/11/amazon-ec2-p6-b300-instances-nvidia-blackwell-ultra-gpus-available/">EC2 P6-B300 Instances</a> – NVIDIA Blackwell Ultra GPUs, 6.4Tbps networking</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/ec2/instance-types/x8aedz/">EC2 X8aedz Instances</a> – AMD EPYC 5GHz, memory-optimized for EDA/databases
<ul>
<li style="font-weight:400;">X Æ A-Xii Musk</li>
</ul>
</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/ec2/instance-types/c8a/">EC2 C8a Instances</a> – AMD EPYC Turin, 30% higher compute performance</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/ec2/instance-types/m9g/">EC2 M9g Instances</a> – Graviton5 powered, 25% better than Graviton4</li>
<li style="font-weight:400;"><a href="https://www.aboutamazon.com/news/aws/aws-graviton-5-cpu-amazon-ec2">Graviton5 Processor</a> – 192 cores, 5x larger cache</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/lambda/latest/dg/tenant-isolation.html">Lambda Tenant Isolation Mode</a> – Built-in multi-tenant separation</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/lambda/latest/dg/lambda-managed-instances.html">Lambda Managed Instances</a> – Run Lambda on your EC2 with AWS management</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/lambda/latest/dg/durable-functions.html">Lambda Durable Functions</a> – Multi-step workflows with automatic state management</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/about-aws/whats-new/2025/12/aws-ai-factories/">AWS AI Factories</a> – Cloud-scale AI infrastructure in customer data centers|</li>
</ul>
<p>33:46  Matt – “I feel like we should have seen this coming, given that they just released the ECS management system a couple of months ago, and it feels like the next step.” </p>
<p>42:24  Containers (EKS &amp; ECS)</p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/eks/latest/userguide/capabilities.html">EKS Capabilities</a> – Managed Argo CD, ACK, KRO in AWS-owned infrastructure</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/eks/latest/userguide/eks-mcp-getting-started.html">EKS MCP Server</a> – Natural language Kubernetes management (preview)</li>
<li style="font-weight:400;"><a href="https://aws-news.com/article/2025-11-19-amazon-eks-introduces-enhanced-container-network-observability">EKS Container Network Observability</a> – Service maps, flow tables, performance metrics</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/about-aws/whats-new/2025/11/amazon-ecs-eks-ai-powered-troubleshooting-console/">EKS/ECS Amazon Q Troubleshooting</a> – AI-powered console diagnostics</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/AmazonECS/latest/developerguide/express-service-overview.html">ECS Express Mode</a> – Simplified deployment with automatic ALB, domains, HTTPS</li>
</ul>
<p>43:36  Ryan – “I think this is what I’ve always wanted Beanstalk and Lightsail to be, is this service. This, for me, feels like the best of both worlds.” </p>
<p>45:34  Networking &amp; Content Delivery</p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/AmazonCloudFront/latest/DeveloperGuide/flat-rate-pricing-plan.html">CloudFront Flat-Rate Pricing</a> – Bundled delivery, WAF, DDoS protection ($0-$1K/month tiers)</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/vpn/latest/s2svpn/vpn-concentrator.html">VPN Concentrator</a> – 25-100 low-bandwidth sites via a single Transit Gateway attachment</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/Route53/latest/DeveloperGuide/accelerated-recovery.html">Route 53 Accelerated Recovery</a> – 60-minute RTO for DNS during regional outages</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/route53/global-resolver/">Route 53 Global Resolver</a> (preview) – Anycast DNS for remote/distributed clients</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/about-aws/whats-new/2025/11/aws-nat-gateway-regional-availability/">NAT Gateway Regional Availability</a> – Auto-scale across AZs, simplified management</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/vpc/latest/userguide/vpc-encryption-controls.html">VPC Encryption Controls </a>– Enforce encryption in transit within/across VPCs</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/network-firewall/latest/developerguide/network-firewall-proxy-developer-guide.html">Network Firewall Proxy</a> (preview) – Explicit proxy for outbound traffic filtering
</li>
</ul>
<p>50:29  Ryan – “If you’ve ever had to do any kind of compliance evidence, that’s the reason why this exists and that’s why I love it so much. The song and dance that you have to do to illustrate your use of encryption across your environment is painful.”</p>
<p>53:14  Storage (S3 &amp; FSx)</p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/s3/features/vectors/">S3 Vectors GA</a> – Native vector support, 2B vectors/index, 20T vectors/bucket</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/announcing-replication-support-and-intelligent-tiering-for-amazon-s3-tables/">S3 Tables Replication &amp; Intelligent-Tiering</a> – Cross-region/account Iceberg replication</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/AmazonS3/latest/userguide/storage_lens_basics_metrics_recommendations.html">S3 Storage Lens Enhancements</a> – Performance metrics, billions of prefixes, S3 Tables export</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/AmazonS3/latest/userguide/UsingEncryption.html">S3 Encryption Controls</a> – Bucket-level encryption type enforcement</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/AmazonS3/latest/userguide/access-control-block-public-access.html">S3 Block Public Access</a> – Organization-level enforcement</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/about-aws/whats-new/2025/12/amazon-s3-maximum-object-size-50-tb/">S3 50TB Object Size</a> – 10x increase from previous 5TB limit</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/about-aws/whats-new/2025/12/amazon-fsx-netapp-ontap-s3-access/">FSx for NetApp ONTAP S3 Access Points</a> – Access file data via S3 API</li>
</ul>
<p>54:38  Matt – “This is just a nice quality of life improvement.” </p>
<p>58:24 Databases</p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/rds/aurora/dsql/pricing/">Aurora DSQL Cost Estimates</a> – Statement-level DPU usage in query plans</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/AmazonRDS/latest/AuroraUserGuide/AuroraPostgreSQL.Security.DynamicMasking.html">Aurora PostgreSQL Dynamic Data Masking</a> – pg_columnmask extension</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/about-aws/whats-new/2025/11/amazon-opensearch-service-opensearch-version-3-3/">OpenSearch 3.3</a> – Agentic search, semantic highlighter improvements</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/opensearch-service/latest/developerguide/gpu-acceleration-vector-index.html">OpenSearch GPU Acceleration</a> – 6-14x faster vector indexing</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/amazon-rds-for-oracle-and-rds-for-sql-server-add-new-capabilities-to-enhance-performance-and-optimize-costs/">RDS SQL Server/Oracle Optimizations</a> – Free Developer Edition, 256 TiB storage, CPU optimization</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/AmazonRDS/latest/UserGuide/Appendix.SQLServer.Options.ResourceGovernor.html">RDS SQL Server Resource Governor</a> – Workload resource control</li>
<li style="font-weight:400;"><a href="http://v">Database Savings Plans</a> – Up to 35% savings across 9 database services</li>
</ul>
<p>1:01:01  Justin – “This is quite nice, and quite broad, so they definitely heard all of the community saying please bring us database savings plans.” </p>
<p>1:03:33 Security &amp; Identity</p>
<ul>
<li><a href="https://aws.amazon.com/about-aws/whats-new/2025/12/security-hub-near-real-time-risk-analytics/">Security Hub GA</a> – Near real-time analytics, risk prioritization, Trends feature</li>
<li><a href="https://aws.amazon.com/about-aws/whats-new/2025/11/aws-secrets-manager-managed-external-secrets/">Secrets Manager External Secrets</a> – Managed rotation for Salesforce, Snowflake, BigID</li>
<li><a href="https://github.com/hashicorp/terraform-provider-aws/issues/45146">IAM Outbound Identity Federation</a> – Short-lived JWTs for external service authentication</li>
<li>AWS<a href="https://docs.aws.amazon.com/signin/latest/userguide/command-line-sign-in.html"> login CLI Command</a> – Eliminate long-term access keys with OAuth 2.0</li>
<li><a href="https://aws.amazon.com/about-aws/whats-new/2025/11/aws-waf-web-bot-auth-support/">WAF Web Bot Auth</a> – Cryptographic signature verification for legitimate AI agents</li>
<li><a href="https://github.com/awslabs/amazon-bedrock-agentcore-samples/blob/main/01-tutorials/03-AgentCore-identity/02-how_it_works.md">Agentcore Identity</a> </li>
<li><a href="https://docs.aws.amazon.com/guardduty/latest/ug/guardduty-extended-threat-detection.html">GuardDuty Extended Threat Detection</a> – EC2/ECS multistage attack correlation</li>
<li><a href="https://aws.amazon.com/security-agent/">AWS Security Agent</a> (preview) – AI-powered security reviews, code scanning, pen testing</li>
<li><a href="https://aws.amazon.com/blogs/aws/simplify-iam-policy-creation-with-iam-policy-autopilot-a-new-open-source-mcp-server-for-builders/">IAM Policy Autopilot</a> – Open source MCP server for generating IAM policies from code.</li>
</ul>
<p>1:08:18  Matt – “…it’s definitely competing with Azure releasing the same thing during their conference. The piece I like about this is the pen test piece because it now lives in your source code, which you probably already have in SCA or a static code analysis tool.”</p>
<p>1:11:46 Cost Management &amp; FinOps</p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/about-aws/whats-new/2025/11/cost-explorer-18-month-forecasting-ai-powered-forecasts/">Cost Explorer 18-Month Forecasting</a> – Extended from 12 months to 18 months, explainable with AI (in preview).</li>
<li style="font-weight:400;"><a href="https://fastercapital.com/content/Cost-Efficiency--Cost-Efficiency-Metrics-and-How-to-Improve-Them.html">Cost Efficiency Metric</a> – Single percentage score combining optimization opportunities.</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/about-aws/whats-new/2025/11/aws-data-exports-focus-1-2-available/">AWS Data Exports FOCUS 1.2</a> – Standardized multi-cloud billing format</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/aws-cost-management/aws-billing-transfer/">Billing Transfer</a> – Centralized billing across multiple Organizations</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/about-aws/whats-new/2025/11/aws-compute-optimizer-unused-nat-gateway-recommendations/">Compute Optimizer NAT Gateway Recommendations</a> – Identify unused NAT Gateways</li>
</ul>
<p>1:14:09 Developer Tools &amp; Modernization</p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/step-functions/latest/dg/sfn-local.html">Step Functions Local Testing</a> – TestState API with mocking support</li>
<li style="font-weight:400;"><a href="http://v">AWS Transform Custom</a> – AI-powered code modernization (Java, Node.js, Python)</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/transform/mainframe/">AWS Transform Mainframe</a> – COBOL to microservices with automated testing</li>
<li style="font-weight:400;"><a href="http://v">API Gateway Developer Portals</a> – Native API discovery and documentation</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/devops/aws-codecommit-returns-to-general-availability/">CodeCommit Restored to GA</a> – Git LFS (Q1 2026), regional expansion (Q3 2026)</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/transform/windows/">AWS Transform Windows</a> – Full-stack .NET/SQL Server modernization</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/about-aws/whats-new/2025/12/amazon-cloudwatch-unified-management-analytics/">CloudWatch Unified Data Management</a> – Consolidated ops/security/compliance logs</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/about-aws/whats-new/2025/11/amazon-cloudwatch-deletion-protection-logs/">CloudWatch Deletion Protection</a> – Prevent accidental log group removal. </li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/AmazonCloudWatch/latest/monitoring/CloudWatch-NetworkFlowMonitor.html">CloudWatch Network Flow Monitor</a> – Container network observability for EKS
</li>
</ul>
<p>1:18:09  Matt – “I mean, I hope all customers have some sort of plan, knowing that I’ve seen many companies say ‘we got this notice six months ago, we’ll deal with it in six months’ and now it’s three weeks and six days, and it expires tomorrow…there’s probably a lot of customers still there.” </p>
<p>1:20:58 Observability &amp; Monitoring</p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/about-aws/whats-new/2025/12/amazon-cloudwatch-unified-management-analytics/">CloudWatch Unified Data Management</a> – Consolidated ops/security/compliance logs</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/about-aws/whats-new/2025/11/amazon-cloudwatch-deletion-protection-logs/">CloudWatch Deletion Protection</a> – Prevent accidental log group removal</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/AmazonCloudWatch/latest/monitoring/CloudWatch-NetworkFlowMonitor.html">CloudWatch Network Flow Monitor</a> – Container network observability for EKS</li>
</ul>
<p>1:21:39 Governance &amp; Management</p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/about-aws/whats-new/2025/11/aws-control-tower-controls-dedicated-experience/">Control Tower Controls Dedicated</a> – Use managed controls without a full landing zone.</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/servicequotas/latest/userguide/automatic-management.html">Service Quotas Automatic Management</a> – Auto-adjust limits based on usage</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/about-aws/whats-new/2025/11/supplementary-packages-amazon-linux/">Supplementary Packages for Amazon Linux</a> – Pre-built EPEL9 packages</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/ami-ancestry.html">AMI Ancestry</a> – Automatic lineage tracking for AMIs</li>
</ul>
<p>1:23:05  Matt – “I’ve built three different ways to do this in my career. You always want to know where it came from, so if there’s a vulnerability, you know where to start patching and go up from there…but if you have multiple teams, it’s hard to track. So knowing I can track it is a godsend.” </p>
<p>1:25:35 DevOps &amp; Operations</p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/devops-agent/">AWS DevOps Agent</a> (preview) – Autonomous incident investigation and root cause analysis</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/awssupport/latest/user/changing-support-plans.html">AWS Support Plan Restructure</a> – Business Support+ ($29/mo), Enterprise ($5K/mo), Unified Ops ($50K/mo)</li>
</ul>
<p>1:26:41  Ryan – “I hope this ends up being decent service, but in my head I’m thinking they’re lowering the cost because they’re getting rid of all their support staff.” </p>
<p>1:29:29 Marketplace &amp; Partner</p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/partners/partner-central/">Partner Central in Console</a> – Unified customer/partner experience</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/about-aws/whats-new/2025/11/introducing-multi-product-solutions-aws-marketplace/">Multi-Product Solutions</a> – Bundled offerings from multiple vendors</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/about-aws/whats-new/2025/11/automated-integration-crowdstrike-falcon-next-gen/">CrowdStrike Falcon Integration</a> – Automated SIEM setup wizard</li>
</ul>
<p>1:30:15 Connectivity &amp; Contact Center</p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/connect/latest/adminguide/customer-profiles-predictive-insights.html">Amazon Connect Predictive Insights</a> (preview) – AI-powered recommendations</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/about-aws/whats-new/2025/11/amazon-connect-mcp-support/">Amazon Connect MCP Support </a>– Standardized tools for AI agents</li>
</ul>
<p>Noteable Announcments We Didn’t Cover in the Show:</p>
<ul>
<li><a href="https://aws.amazon.com/about-aws/whats-new/2025/11/aws-flat-rate-pricing-plans">AWS announces flat-rate pricing plans for website delivery and security</a></li>
<li><a href="https://aws.amazon.com/blogs/aws/accelerate-workflow-development-with-enhanced-local-testing-in-aws-step-functions/">Accelerate workflow development with enhanced local testing in AWS Step Functions</a></li>
<li><a href="https://aws.amazon.com/blogs/aws/streamlined-multi-tenant-application-development-with-tenant-isolation-mode-in-aws-lambda/">Streamlined multi-tenant application development with tenant isolation mode in AWS Lambda</a></li>
<li><a href="https://aws.amazon.com/blogs/aws/aws-control-tower-introduces-a-controls-dedicated-experience/">AWS Control Tower introduces a Controls Dedicated experience</a></li>
<li><a href="https://aws.amazon.com/blogs/aws/monitor-network-performance-and-traffic-across-your-eks-clusters-with-container-network-observability/">Monitor network performance and traffic across your EKS clusters with Container Network Observability </a></li>
<li><a href="https://aws.amazon.com/blogs/aws/new-aws-billing-transfer-for-centrally-managing-aws-billing-and-costs-across-multiple-organizations/">New AWS Billing Transfer for centrally managing AWS billing and costs across multiple organizations</a></li>
<li><a href="https://aws.amazon.com/about-aws/whats-new/2025/11/cost-explorer-18-month-forecasting-ai-powered-forecasts/">AWS Cost Explorer now provides 18-month forecasting and explainable AI-powered forecasts</a></li>
<li><a href="https://aws.amazon.com/about-aws/whats-new/2025/11/enhanced-cost-management-amazon-q-developer/">Announcing enhanced cost management capabilities in Amazon Q Developer</a></li>
<li><a href="https://aws.amazon.com/blogs/aws/simplify-access-to-external-services-using-aws-iam-outbound-identity-federation/">Simplify access to external services using AWS IAM Outbound Identity Federation</a></li>
<li><a href="https://aws.amazon.com/about-aws/whats-new/2025/11/aws-glue-5-1">Introducing AWS Glue 5.1</a></li>
<li><a href="https://www.allthingsdistributed.com/2025/11/tech-predictions-for-2026-and-beyond.html?utm_source=tldrnewsletter">Tech predictions for 2026 and beyond | All Things Distributed</a></li>
<li><a href="https://aws.amazon.com/about-aws/whats-new/2025/11/introducing-multi-product-solutions-aws-marketplace/">Introducing multi-product solutions in AWS Marketplace</a></li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2296281/c1e-0424uk0wgquor4q2-1p7659gjc973-7zccfd.mp3" length="169469195"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 334 of The Cloud Pod, where the forecast is always cloudy! This week, we’re bringing you a jam-packed recap of re:Invent! We’ve got all the news, from keynotes to announcements. Whether you were there live or catching up on all the news, Justin, Matt, and Ryan are here to break it all down. Let’s get started! 
Titles we almost went with this week

 EKS Gets Chatty: Natural Language Replaces Command Line Nightmares
 Harvest Now, Decrypt Later: Why Your RSA Keys Need a Quantum Makeover Before 2026
 NAT So Fast: AWS Helps You Find Gateways Doing Absolutely Nothing
 AWS Finally Admits You Have Too Many Log Buckets
 AWS Finally Lets You Log In Like a Normal Human
 Lambda Gets a Memory: Checkpoint Your Way to Multi-Step Workflows
 Step Functions at Home: Lambda Durable Functions Let You Write Workflows in Actual Code
 No More Bucket List: S3 Public Access Gets Organization-Wide Lockdown
 AWS Hits Ctrl-Z on CodeCommit Deprecation
 AWS Puts a Cap on CloudFront: Unlimited Traffic, Limited Anxiety
 AWS Tells SQL Server to Take a Thread Off: Optimize CPU Cuts Costs by 55%
 Amazon Bedrock Gets a Bouncer: AgentCore Identity Checks IDs at the Door
 AI Brings on the Developer Renaissance

Follow Up 
01:27 re:Invent 

Matt Garman- 14th Reinvent, which is weird, since we’ve been doing cloud stuff for 87 years…
Warner – Open Mind for a different View and nothing else matters T-shirt.

02:59 re:Invent predictions
Jonathan



Serverless GPU support (extension in Lambda or a different service), it’s about time we have a serverless GPU/Inference capability.

It is talked about in the keynote with DeSantis.






AI Agent with a goal/instructions that can run when they need to, periodically, or always, and perform an action (Agentic Platform that runs agents) –


 Garman – Bedrock AgentCore and Kiro Autonomous Agent


Werner will announce this is his last keynote and he will retire


He retired from re:Invent Presentations

Ryan

New Tranium 3 chips, Inferentia, and Graviton chips


Garman – announced Tranium 3 Ultraservers.


They brought the Rack Ryan


Expand the number of models in or via bedrock


Doubled the number of models and announced Gemma, Minimax M2, Nvidia Nemotron, Mistral Large, and Mistral 3
Refresh to AWS Organizations

Justin

New Nova Model & Sonic with Multi-modal


Garman Nova 2 – Lite, Pro, and Sonic (the lack of Sonic the Hedgehog/Sega reference is a shame)


Nova 2 Omni


Announce a partnership with OpenAI (likely on stage)






Not announced as new, but said they’re running on AWS and that EC2 Ultraservers are in use. 






Advanced Agentic AI Capabilities for Security Hub (Automate the SOC teams)


Garman – Advanced Agentic AI Capabilities for Security Hub – with NEW AWS Security Agent

Matt

A model router to route LLM queries to different AI models
Well-architected framework expansion 
End user Authentication that doesn’t suck (not current Cognito)

Tie Breaker – How many times w...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2296281/c1a-k5d5-v6px4gd0bnpk-l1msfq.jpg"></itunes:image>
                                                                            <itunes:duration>01:28:07</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2296281/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[333: The Cloud Pod Goes Nano Banana]]>
                </title>
                <pubDate>Wed, 10 Dec 2025 23:25:34 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2283771</guid>
                                    <link>https://tcpfm.castos.com/episodes/333-the-cloud-pod-goes-nano-banana</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 333 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Matt are taking a quick break from re:Invent festivities. They bring you the latest and greatest in Cloud and AI news. This week, we discuss Norad and Anthropic teaming up to bring you Christmas cheer. Wait, is that right? Huh. We also have undersea cables, some Turkish region delight, and a LOT of Opus 4.5 news. Let’s get into it!</p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> Boring Error Pages Not Found</li>
<li> Claude Goes Native in Snowflake: Finally, AI That Stays Where Your Data Lives</li>
<li> Cross-Cloud Romance: AWS and Google Make It Official with Interconnect</li>
<li> Google Gemini Puts OpenAI in Code Red: The Tables Have Turned</li>
<li> Azure NAT Gateway V2: Now With More Zones Than a Parking Lot</li>
<li> From ChatGPT to Chat-Uh-Oh: OpenAI Sounds the Alarm as Gemini Steals 200 Million </li>
<li>      Users **Anthropic</li>
<li> Scheduled Actions: Because Your VMs Need a Work-Life Balance Too</li>
<li> Finally, Your 500 Errors Can Look as Good as Your Homepage</li>
<li> Foundry Model Router: Because Choosing Between 47 AI Models is Nobody’s Idea of Fun</li>
<li> Google Takes the Scenic Route: New Cable Avoids the Sunda Strait Traffic Jam</li>
<li> Azure Application Gateway Gets Its TCP/IP Diploma</li>
<li> Google Cloud Gets Its Türkiye Dinner: 2 Billion Dollar Cloud Feast Coming Soon</li>
<li> Microsoft Foundry: Turning AI Chaos into Compliance Gold</li>
</ul>
<h2>AI Is Going Great, or How ML Makes Money </h2>
<p>02:59 <a href="https://cloud.google.com/blog/products/ai-machine-learning/nano-banana-pro-available-for-enterprise/">Nano Banana Pro available for enterprise</a></p>
<ul>
<li>Google launches <a href="https://blog.google/technology/ai/nano-banana-pro"> Nano Banana Pro</a> (Gemini 3 Pro Image) in general availability on <a href="https://cloud.google.com/vertex-ai">Vertex AI</a> and <a href="https://workspace.google.com/solutions/ai/">Google Workspace</a>, with <a href="https://cloud.google.com/gemini-enterprise?utm_source=google&amp;utm_medium=cpc&amp;utm_campaign=1710046-Workspace-DR-NA-US-en-Google-BKWS-EXA-na&amp;utm_content=c-Hybrid+%7C+BKWS+-+EXA+%7C+Txt_Pollux-435278751514&amp;utm_term=gemini+enterprise&amp;gclsrc=aw.ds&amp;gad_source=1&amp;gad_campaignid=23071651391&amp;gclid=CjwKCAiA8vXIBhAtEiwAf3B-gzz0u3LSmwxBXBql13NaVpNPlpH_mOOIT4UTcP-HIo_ei8K5e5OQ8RoCDjIQAvD_BwE&amp;e=48754805">Gemini Enterprise</a> support coming soon.</li>
<li style="font-weight:400;">The model supports up to 14 reference images for style consistency and generates 4K resolution outputs with multilingual text rendering capabilities.</li>
<li style="font-weight:400;">The model includes Google Search grounding for factual accuracy in generated infographics and diagrams, plus built-in SynthID watermarking for transparency. Copyright indemnification will be available at general availability under Google’s shared responsibility framework.</li>
<li style="font-weight:400;">Enterprise integrations are live with <a href="https://www.adobe.com/products/firefly.html?msockid=01d3685b64b56f5b0f8b7ee865eb6e52">Adobe Firefly</a>, <a href="https://www.adobe.com/products/photoshop.html?promoid=RBS7NL7F&amp;mv=other">Photoshop</a>, <a href="https://www.canva.com/">Canva</a>, and <a href="https://www.figma.com/">Figma</a>, enabling production-grade creative workflows. Major retailers, including Klarna, Shopify, and Wayfair, report using the model for product visualization and marketing asset generation at scale.</li>
<li style="font-weight:400;">Developers can access Nano Banana Pro through Vertex AI with Provisioned Throughput and Pay As You Go pricing options, plus advanced safety filters. Business users get access through Google Workspace apps, including Slides, Vids, and <a href="https://www.bing.com/ck/a?!&amp;&amp;p=dc75ccf6e9f1bc1b758926fdcff3ced77d48c762387c8376b3aa8781f73a8a45JmltdHM9MTc2NTIzODQwMA&amp;ptn=3&amp;ver=2&amp;h..."></a></li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:00) - The Cloud Pod: This Week's News</li><li>(00:03:02) - Google Launches Nano Banana Pro in Google Workspace</li><li>(00:05:59) - Cloud Opus 4.5 Availability and Performance</li><li>(00:10:41) - OpenAI Declares Code Red as Google's Gemini GPT G</li><li>(00:14:00) - AWS 10: Prediction vs. Keynotes</li><li>(00:14:49) - Google Cloud Region Coming to Turkey</li><li>(00:18:52) - Google to Build New Subsea Cable Link Between Australia and Thailand</li><li>(00:22:12) - Google Cloud Next</li><li>(00:25:57) - Google Cloud VPN Flow Logs now support Cross-Cloud Networks</li><li>(00:29:43) - Amazon Cloud Connects to Google Cloud</li><li>(00:32:10) - Azure Application Gateway: TLS and TCP Protocol Termination</li><li>(00:35:39) - Azure 2.8: Agent to Agent in Public Preview</li><li>(00:37:02) - Microsoft Cloud Open Sport 5</li><li>(00:39:10) - Azure DNS & Security: Threat Intelligence Feed Blocking</li><li>(00:41:22) - NAT Gateway: Standard V2 SKU and Public Preview</li><li>(00:45:23) - Azure app service: Custom Error Pages now in general availability</li><li>(00:47:22) - Microsoft Foundry</li><li>(00:51:02) - Microsoft's AI Orchestration Layer Gets Scheduled Tasks</li><li>(00:56:18) - Week in the Cloud: AWS Extravaganza</li><li>(00:57:06) - NORAD's AI-powered Holiday Tools</li><li>(01:00:34) - Elf Photo Day</li><li>(01:01:20) - Unifi: Printer v2 local</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 333 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Matt are taking a quick break from re:Invent festivities. They bring you the latest and greatest in Cloud and AI news. This week, we discuss Norad and Anthropic teaming up to bring you Christmas cheer. Wait, is that right? Huh. We also have undersea cables, some Turkish region delight, and a LOT of Opus 4.5 news. Let’s get into it!
Titles we almost went with this week

 Boring Error Pages Not Found
 Claude Goes Native in Snowflake: Finally, AI That Stays Where Your Data Lives
 Cross-Cloud Romance: AWS and Google Make It Official with Interconnect
 Google Gemini Puts OpenAI in Code Red: The Tables Have Turned
 Azure NAT Gateway V2: Now With More Zones Than a Parking Lot
 From ChatGPT to Chat-Uh-Oh: OpenAI Sounds the Alarm as Gemini Steals 200 Million 
      Users **Anthropic
 Scheduled Actions: Because Your VMs Need a Work-Life Balance Too
 Finally, Your 500 Errors Can Look as Good as Your Homepage
 Foundry Model Router: Because Choosing Between 47 AI Models is Nobody’s Idea of Fun
 Google Takes the Scenic Route: New Cable Avoids the Sunda Strait Traffic Jam
 Azure Application Gateway Gets Its TCP/IP Diploma
 Google Cloud Gets Its Türkiye Dinner: 2 Billion Dollar Cloud Feast Coming Soon
 Microsoft Foundry: Turning AI Chaos into Compliance Gold

AI Is Going Great, or How ML Makes Money 
02:59 Nano Banana Pro available for enterprise

Google launches  Nano Banana Pro (Gemini 3 Pro Image) in general availability on Vertex AI and Google Workspace, with Gemini Enterprise support coming soon.
The model supports up to 14 reference images for style consistency and generates 4K resolution outputs with multilingual text rendering capabilities.
The model includes Google Search grounding for factual accuracy in generated infographics and diagrams, plus built-in SynthID watermarking for transparency. Copyright indemnification will be available at general availability under Google’s shared responsibility framework.
Enterprise integrations are live with Adobe Firefly, Photoshop, Canva, and Figma, enabling production-grade creative workflows. Major retailers, including Klarna, Shopify, and Wayfair, report using the model for product visualization and marketing asset generation at scale.
Developers can access Nano Banana Pro through Vertex AI with Provisioned Throughput and Pay As You Go pricing options, plus advanced safety filters. Business users get access through Google Workspace apps, including Slides, Vids, and ]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[333: The Cloud Pod Goes Nano Banana]]>
                </itunes:title>
                                    <itunes:episode>333</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 333 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Matt are taking a quick break from re:Invent festivities. They bring you the latest and greatest in Cloud and AI news. This week, we discuss Norad and Anthropic teaming up to bring you Christmas cheer. Wait, is that right? Huh. We also have undersea cables, some Turkish region delight, and a LOT of Opus 4.5 news. Let’s get into it!</p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> Boring Error Pages Not Found</li>
<li> Claude Goes Native in Snowflake: Finally, AI That Stays Where Your Data Lives</li>
<li> Cross-Cloud Romance: AWS and Google Make It Official with Interconnect</li>
<li> Google Gemini Puts OpenAI in Code Red: The Tables Have Turned</li>
<li> Azure NAT Gateway V2: Now With More Zones Than a Parking Lot</li>
<li> From ChatGPT to Chat-Uh-Oh: OpenAI Sounds the Alarm as Gemini Steals 200 Million </li>
<li>      Users **Anthropic</li>
<li> Scheduled Actions: Because Your VMs Need a Work-Life Balance Too</li>
<li> Finally, Your 500 Errors Can Look as Good as Your Homepage</li>
<li> Foundry Model Router: Because Choosing Between 47 AI Models is Nobody’s Idea of Fun</li>
<li> Google Takes the Scenic Route: New Cable Avoids the Sunda Strait Traffic Jam</li>
<li> Azure Application Gateway Gets Its TCP/IP Diploma</li>
<li> Google Cloud Gets Its Türkiye Dinner: 2 Billion Dollar Cloud Feast Coming Soon</li>
<li> Microsoft Foundry: Turning AI Chaos into Compliance Gold</li>
</ul>
<h2>AI Is Going Great, or How ML Makes Money </h2>
<p>02:59 <a href="https://cloud.google.com/blog/products/ai-machine-learning/nano-banana-pro-available-for-enterprise/">Nano Banana Pro available for enterprise</a></p>
<ul>
<li>Google launches <a href="https://blog.google/technology/ai/nano-banana-pro"> Nano Banana Pro</a> (Gemini 3 Pro Image) in general availability on <a href="https://cloud.google.com/vertex-ai">Vertex AI</a> and <a href="https://workspace.google.com/solutions/ai/">Google Workspace</a>, with <a href="https://cloud.google.com/gemini-enterprise?utm_source=google&amp;utm_medium=cpc&amp;utm_campaign=1710046-Workspace-DR-NA-US-en-Google-BKWS-EXA-na&amp;utm_content=c-Hybrid+%7C+BKWS+-+EXA+%7C+Txt_Pollux-435278751514&amp;utm_term=gemini+enterprise&amp;gclsrc=aw.ds&amp;gad_source=1&amp;gad_campaignid=23071651391&amp;gclid=CjwKCAiA8vXIBhAtEiwAf3B-gzz0u3LSmwxBXBql13NaVpNPlpH_mOOIT4UTcP-HIo_ei8K5e5OQ8RoCDjIQAvD_BwE&amp;e=48754805">Gemini Enterprise</a> support coming soon.</li>
<li style="font-weight:400;">The model supports up to 14 reference images for style consistency and generates 4K resolution outputs with multilingual text rendering capabilities.</li>
<li style="font-weight:400;">The model includes Google Search grounding for factual accuracy in generated infographics and diagrams, plus built-in SynthID watermarking for transparency. Copyright indemnification will be available at general availability under Google’s shared responsibility framework.</li>
<li style="font-weight:400;">Enterprise integrations are live with <a href="https://www.adobe.com/products/firefly.html?msockid=01d3685b64b56f5b0f8b7ee865eb6e52">Adobe Firefly</a>, <a href="https://www.adobe.com/products/photoshop.html?promoid=RBS7NL7F&amp;mv=other">Photoshop</a>, <a href="https://www.canva.com/">Canva</a>, and <a href="https://www.figma.com/">Figma</a>, enabling production-grade creative workflows. Major retailers, including Klarna, Shopify, and Wayfair, report using the model for product visualization and marketing asset generation at scale.</li>
<li style="font-weight:400;">Developers can access Nano Banana Pro through Vertex AI with Provisioned Throughput and Pay As You Go pricing options, plus advanced safety filters. Business users get access through Google Workspace apps, including Slides, Vids, and <a href="https://www.bing.com/ck/a?!&amp;&amp;p=dc75ccf6e9f1bc1b758926fdcff3ced77d48c762387c8376b3aa8781f73a8a45JmltdHM9MTc2NTIzODQwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=01d3685b-64b5-6f5b-0f8b-7ee865eb6e52&amp;psq=NotebookLM&amp;u=a1aHR0cHM6Ly9ub3RlYm9va2xtLmdvb2dsZS8">NotebookLM</a>, starting today.</li>
<li style="font-weight:400;">The model handles complex editing tasks like translating text within images while preserving visual elements, and maintains character and brand consistency across multiple generated assets. This addresses a key enterprise challenge of maintaining creative control when using AI for production assets.</li>
</ul>
<p>03:59  Justin – “The thing that’s the most important about this is when Nano Banana messes up the text (which it doesn’t do as often), you can now edit it without generating a whole completely different image.” </p>
<p>05:58 <a href="https://www.anthropic.com/news/claude-opus-4-5">Introducing Claude Opus 4.5 </a></p>
<ul>
<li><a href="https://www.anthropic.com/news/claude-opus-4-5">Claude Opus 4.5</a> is now generally available across <a href="https://claude.com/platform/api">Anthropic’s API</a>, apps, and all three major cloud platforms at $5 per million input tokens and $25 per million output tokens. This represents a substantial price reduction that makes Opus-level capabilities more accessible.</li>
<li style="font-weight:400;">Developers can access it via the claude-opus-4-5-20251101 model identifier.</li>
<li style="font-weight:400;">The model achieves state-of-the-art performance on software engineering benchmarks, scoring higher than any human candidate on Anthropic’s internal performance engineering exam within a 2-hour time limit on SWE-bench Verified. It matches <a href="https://www.anthropic.com/claude/sonnet">Sonnet 4.5</a>‘s best score while using 76% fewer output tokens at medium effort, and exceeds it by 4.3 percentage points at highest effort while still using 48% fewer tokens.</li>
<li style="font-weight:400;">Anthropic introduces a new effort parameter in the API that lets developers control the tradeoff between speed and capability, allowing optimization for either minimal time and cost or maximum performance depending on the task requirements. 
<ul>
<li style="font-weight:400;">This combines with new context management and memory capabilities to boost performance on agentic tasks by nearly 15 percentage points in testing.</li>
</ul>
</li>
<li style="font-weight:400;"><a href="https://www.claude.com/product/claude-code">Claude Code</a> gains Plan Mode that builds a user-editable plan.md files before execution, and is now available in the desktop app for running multiple parallel sessions. The consumer apps remove message limits for Opus 4.5 through automatic context summarization, and <a href="https://support.claude.com/en/articles/12012173-getting-started-with-claude-for-chrome">Claude for Chrome</a> and <a href="https://claude.com/claude-for-excel">Claude for Excel</a> expand to all Max, Team, and Enterprise users.</li>
<li style="font-weight:400;">The model demonstrates improved robustness against prompt injection attacks compared to other frontier models and is described as the most robustly aligned model Anthropic has released. 
<ul>
<li style="font-weight:400;">It shows better performance across vision, reasoning, and mathematics tasks while using dramatically fewer tokens than predecessors, reaching similar or better outcomes.</li>
</ul>
</li>
</ul>
<p>08:01  Justin – “The most important part of the whole announcement is the cheaper context input and output tokens.” </p>
<p>09:58 <a href="https://www.snowflake.com/content/snowflake-site/global/en/blog/claude-opus-sonnet-4-5-snowflake-cortex-ai">Announcing Claude Opus 4.5 on Snowflake Cortex AI</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.snowflake.com/en/product/features/cortex/">Snowflake Cortex AI</a> now offers <a href="https://www.anthropic.com/news/claude-opus-4-5">Claude Opus 4.5</a> and <a href="https://www.anthropic.com/claude/sonnet">Claude Sonnet 4.5</a> in general availability, bringing Anthropic’s latest models directly into Snowflake’s data platform. </li>
<li style="font-weight:400;">Users can access these models through SQL, Python, or <a href="https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-llm-rest-api">REST APIs</a> without moving data outside their Snowflake environment.</li>
<li style="font-weight:400;">Claude Opus 4.5 delivers improved performance on complex reasoning tasks, coding, and multilingual capabilities compared to previous versions, while Claude Sonnet 4.5 provides a balanced option for speed and intelligence. 
<ul>
<li style="font-weight:400;">Both models support 200K token context windows and can process text and images natively within Snowflake queries.</li>
</ul>
</li>
<li style="font-weight:400;">The integration enables enterprises to build AI applications using their Snowflake data with built-in governance and security controls, eliminating the need to export sensitive data to external AI services. 
<ul>
<li style="font-weight:400;">Pricing follows Snowflake’s credit-based model, with costs varying by model and token usage.</li>
</ul>
</li>
<li style="font-weight:400;">Developers can combine Claude models with other Cortex AI features like vector search, document understanding, and fine-tuning capabilities to create end-to-end AI workflows. 
<ul>
<li style="font-weight:400;">This allows for use cases ranging from customer service automation to financial analysis and code generation, all within the Snowflake ecosystem.</li>
</ul>
</li>
</ul>
<p>11:03 <a href="https://arstechnica.com/ai/2025/12/openai-ceo-declares-code-red-as-gemini-gains-200-million-users-in-3-months/">OpenAI CEO declares “code red” as Gemini gains 200 million users in 3 </a><a href="https://arstechnica.com/ai/2025/12/openai-ceo-declares-code-red-as-gemini-gains-200-million-users-in-3-months/">months </a></p>
<ul>
<li style="font-weight:400;">Oh, how the turn tables have turned…</li>
<li style="font-weight:400;">OpenAI CEO Sam Altman issued an internal code red memo to refocus the company on improving ChatGPT after Google’s Gemini 3 model topped the LMArena leaderboard and gained 200 million users in three months. </li>
<li style="font-weight:400;">The directive delays planned features, including advertising integration, AI agents for health and shopping, and the Pulse personal assistant feature.</li>
<li style="font-weight:400;">Google’s Gemini 3 model, released in mid-November, has outperformed ChatGPT on industry benchmark tests and attracted high-profile users like Salesforce CEO Marc Benioff, who publicly announced switching from ChatGPT after three years. </li>
<li style="font-weight:400;">The model’s performance represents a significant shift in the competitive landscape since OpenAI’s initial ChatGPT launch in December 2022.</li>
<li style="font-weight:400;">The situation mirrors December 2022, when Google declared its own code red after ChatGPT’s rapid adoption, with CEO Sundar Pichai reassigning teams to develop competing AI products. </li>
<li style="font-weight:400;">This role reversal demonstrates how quickly competitive positions can shift in the AI model space, particularly around user experience and benchmark performance.</li>
<li style="font-weight:400;">OpenAI is implementing daily calls for teams responsible for ChatGPT improvements and encouraging temporary team transfers to address the competitive pressure. </li>
<li style="font-weight:400;">The company’s response indicates that maintaining market leadership in conversational AI requires continuous iteration even for established products with large user bases.</li>
</ul>
<p>13:11  Ryan – “I started on ChatGPT and tried to use it after adopting Claude, and I try to go back every once in a while – especially when they would announce a new model, but I always end up going back to one of the Anthropic models.” </p>
<h2>GCP</h2>
<p>15:19 <a href="https://cloud.google.com/blog/products/infrastructure/new-google-cloud-region-coming-to-turkiye/">New Google Cloud region coming to Türkiye</a></p>
<ul>
<li style="font-weight:400;">Google Cloud is launching a new region in Türkiye as part of a 2 billion dollar investment over 10 years, partnering with local telecom provider Turkcell, which will invest an additional 1 billion dollars in data centers and cloud infrastructure. </li>
<li style="font-weight:400;">This brings Google Cloud’s global footprint to 43 regions and 127 zones, with Türkiye serving as a strategic hub for EMEA customers.</li>
<li style="font-weight:400;">The region targets three key verticals already committed as customers: financial services with Garanti BBVA and Yapi Kredi Bank modernizing core banking systems, airlines with Turkish Airlines improving flight operations and passenger systems, and government entities focused on digital sovereignty. </li>
<li style="font-weight:400;">The local presence addresses data residency requirements and provides low-latency access for organizations that need to keep data within national borders.</li>
<li style="font-weight:400;">Technical capabilities include standard Google Cloud services for data analytics, AI, and cybersecurity with data encryption at rest and in transit, granular access controls, and threat detection systems meeting international security standards. The region will serve both Türkiye and neighboring countries with reduced latency compared to existing European regions.</li>
<li style="font-weight:400;">The announcement emphasizes digital sovereignty as a primary driver, with government officials highlighting the importance of local infrastructure for maintaining control over national data while accessing hyperscale cloud capabilities. </li>
<li style="font-weight:400;">This follows a pattern of Google Cloud expanding into regions where data localization requirements create demand for in-country infrastructure.</li>
<li style="font-weight:400;">No specific pricing details were provided for the Türkiye region, though standard Google Cloud pricing models based on compute, storage, and network usage will apply once the region launches. </li>
<li style="font-weight:400;">The timeline for when the region will be operational was not disclosed in the announcement.</li>
<li style="font-weight:400;">Show note editor Heather note: If you enjoy history, you need to travel to Türkiye immediately! </li>
</ul>
<p>17:03 <a href="https://cloud.google.com/blog/products/data-analytics/introducing-bigquery-agent-analytics/">Introducing BigQuery Agent Analytics</a></p>
<ul>
<li style="font-weight:400;">Google launches <a href="https://google.github.io/adk-docs/tools/google-cloud/bigquery-agent-analytics/">BigQuery Agent Analytics</a>, a new plugin for their <a href="https://google.github.io/adk-docs/">Agent Development Kit</a> that streams AI agent interaction data directly to BigQuery with a single line of code. </li>
<li style="font-weight:400;">The plugin captures metrics like latency, token consumption, tool usage, and user interactions in real-time using the <a href="https://docs.cloud.google.com/bigquery/docs/write-api">BigQuery Storage Write API</a>, enabling developers to analyze agent performance and optimize costs without complex instrumentation.</li>
<li style="font-weight:400;">The integration allows developers to leverage BigQuery’s advanced capabilities, including generative AI functions, vector search, and embedding generation to perform sophisticated analysis on agent conversations. </li>
<li style="font-weight:400;">Teams can cluster similar interactions, identify failure patterns, and join agent data with business metrics like CSAT scores to measure real-world impact, going beyond basic operational metrics to quality analysis.</li>
<li style="font-weight:400;">The plugin includes three core components: an ADK plugin that requires minimal code changes, a predefined optimized BigQuery schema for storing interaction data, and low-cost streaming via the BigQuery Storage Write API. 
<ul>
<li style="font-weight:400;">Developers maintain full control over what data gets streamed and can customize pre-processing, such as redacting sensitive information before logging.</li>
</ul>
</li>
<li style="font-weight:400;">Currently available in preview for ADK users, with support for other agent frameworks like LangGraph coming soon. </li>
<li style="font-weight:400;">The feature addresses a critical gap in agentic AI development where understanding user interaction patterns and agent performance is essential for refinement, particularly as organizations move from building agents to optimizing them at scale.</li>
<li style="font-weight:400;">Pricing follows standard BigQuery costs for storage and queries, with the Storage Write API offering cost-effective real-time streaming compared to traditional batch loading methods. </li>
<li style="font-weight:400;">Documentation and a hands-on codelab are available at <a href="http://google.github.io/adk-docs">google.github.io/adk-docs</a> for developers ready to implement agent analytics.</li>
</ul>
<p>18:16  Ryan – “This is an interesting model; providing both the schema and the already instrumented integration. I feel like a lot of times with other types of development, you’re left to your own devices, and so this is a neat thing. As you’re developing an agent, everyone is instrumenting these things in odd ways, and it’s very difficult to compile the data in a way where you get usable queries out of it. So it’s kind of an interesting concept.” </p>
<p>19:35 <a href="https://cloud.google.com/blog/products/infrastructure/talaylink-subsea-cable-to-connect-australia-and-thailand/">TalayLink subsea cable to connect Australia and Thailand</a></p>
<ul>
<li style="font-weight:400;">You know how much we love a good undersea cable…</li>
<li style="font-weight:400;">Google announces TalayLink, a new subsea cable connecting Australia and <a href="https://www.googlecloudpresscorner.com/2024-09-30-Google-Announces-Plans-to-Invest-US-1-Billion-to-Build-Data-Center-and-Cloud-Region-in-Thailand,-Support-Initiatives-to-Expand-AI-Opportunities-for-Thais">Thailand</a> via the Indian Ocean, taking a western route around the Sunda Strait to avoid congestion from existing cable paths. 
<ul>
<li style="font-weight:400;">This cable extends the Interlink system from the <a href="https://cloud.google.com/blog/products/infrastructure/bosun-australia-connect-initiative-for-indo-pacific-connectivity">Australia Connect </a>initiative and will directly connect to Google’s planned Thailand cloud region and data centers.</li>
</ul>
</li>
<li style="font-weight:400;">The project includes two new connectivity hubs in Mandurah, Western Australia, and South Thailand, providing diverse landing points away from existing cable concentrations in Perth and enabling cable switching, content caching, and colocation capabilities. 
<ul>
<li style="font-weight:400;">Google is partnering with AIS for the South Thailand hub to leverage existing infrastructure.</li>
</ul>
</li>
<li style="font-weight:400;">TalayLink forms part of a broader Indian Ocean connectivity strategy, linking with previously announced hubs in the Maldives and Christmas Island to create redundant paths connecting Australia, Southeast Asia, Africa, and the Middle East. </li>
<li style="font-weight:400;">This routing diversity aims to improve network resilience across multiple regions.</li>
<li style="font-weight:400;">The infrastructure supports Thailand’s digital economy transformation goals and Western Australia’s digital future roadmap, with the Thailand Board of Investment actively backing the project. 
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;">No pricing or specific completion timeline was disclosed in the announcement.</li>
</ul>
</li>
</ul>
<p>The Cloud Pod is excited to cover the latest innovations and trends. We aim to keep you informed about the evolving landscape of cloud technology and artificial intelligence.</p></li>
</ul>
<p>20:34  Matt – “It’s amazing…subsea cable congestion. How many cables can be there that there’s congestion?”  </p>
<p>23:16 <a href="https://cloud.google.com/blog/products/ai-machine-learning/claude-opus-4-5-on-vertex-ai/">Claude Opus 4.5 on Vertex AI </a></p>
<ul>
<li style="font-weight:400;"><a href="https://console.cloud.google.com/vertex-ai/publishers/anthropic/model-garden/claude-opus-4-5">Claude Opus 4.5</a> is now generally available on <a href="https://console.cloud.google.com/vertex-ai/studio/multimodal">Vertex AI</a>, delivering Anthropic’s most advanced model at one-third the cost of its predecessor Opus 4.1. </li>
<li style="font-weight:400;">The model excels in coding tasks that can compress multi-day development projects into hours, agentic workflows with dynamic tool discovery from hundreds of tools without context window bloat, and office productivity tasks with improved memory for maintaining consistency across documents.</li>
<li style="font-weight:400;">Google is positioning Vertex AI as a unified platform for deploying Claude with enterprise features, including <a href="https://cloud.google.com/blog/products/ai-machine-learning/global-endpoint-for-claude-models-generally-available-on-vertex-ai?e=48754805">global endpoints</a> for reduced latency, provisioned throughput for dedicated capacity at fixed costs, and <a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/partner-models/claude/prompt-caching">prompt caching</a> with flexible Time To Live up to one hour. </li>
<li style="font-weight:400;">The platform integrates with Google’s <a href="https://cloud.google.com/products/agent-builder?utm_source=google&amp;utm_medium=cpc&amp;utm_campaign=na-US-all-en-dr-skws-all-all-trial-b-dr-1710134&amp;utm_content=text-ad-none-any-DEV_c-CRE_772251321546-ADGP_Hybrid+%7C+SKWS+-+BRO+%7C+Txt-AIML-Conversational+AI-Agent+Builder-KWID_302905484362-kwd-302905484362&amp;utm_term=KW_ai+search-ST_ai+search&amp;gclsrc=aw.ds&amp;gad_source=1&amp;gad_campaignid=22980675808&amp;gclid=CjwKCAiAw9vIBhBBEiwAraSATsGA3xoiyHkKW3qLfsEE8H7MAbOdemUXCP8mp_SMaBDQChS5XiIT8xoCY1AQAvD_BwE&amp;e=48754805">Agent Builder stack</a>, including the open Agent Development Kit, Agent2Agent protocol, and fully managed Agent Engine for moving multi-step workflows from prototype to production.</li>
<li style="font-weight:400;">Security and governance capabilities include Google Cloud’s foundational security controls, data residency options, and <a href="https://cloud.google.com/security/products/model-armor?e=48754805&amp;hl=en">Model Armor</a> protection against AI-specific threats like prompt injection and tool poisoning through <a href="https://cloud.google.com/security/products/security-command-center">Security Command Center</a>. </li>
<li style="font-weight:400;">Customers like <a href="https://cloud.google.com/customers/palo-alto-networks?e=48754805">Palo Alto Networks</a> report 20-30 percent increases in code development velocity when using Claude on Vertex AI.</li>
<li style="font-weight:400;">The model supports a 1 million token context window, batch predictions for cost efficiency, and web search capabilities in preview. </li>
<li style="font-weight:400;">Regional availability and specific pricing details are available in the Vertex AI documentation, with the model accessible through both the Model Garden and <a href="https://console.cloud.google.com/marketplace/product/anthropic/claude-opus-4-5">Google Cloud Marketplace</a>.</li>
</ul>
<p>23:58 <a href="https://cloud.google.com/blog/topics/google-cloud-next/registration-is-open-for-google-cloud-next/">Registration is live for Google Cloud Next 2026 in Las Vegas</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.googlecloudevents.com/next-vegas/?utm_source=cgc-blog&amp;utm_medium=blog&amp;utm_campaign=FY26-Q2-GLOBAL-GLO27877-physicalevent-er-next26-mc-105752&amp;utm_content=cgc-blog-reg-is-open-dec-2&amp;utm_term=-">Google Cloud Next</a> 2026 takes place April 22-24 in Las Vegas, with registration now open at an early bird price of $999 for a limited time. </li>
<li style="font-weight:400;">This represents the standard pricing structure for Google’s flagship annual conference following their record-breaking attendance in 2025.</li>
<li style="font-weight:400;">The conference focuses heavily on AI agent development and implementation, featuring interactive demos, hackathons, and workshops designed to help attendees build intelligent agents. </li>
<li style="font-weight:400;">Organizations can learn from real-world case studies of companies deploying AI solutions at scale.</li>
<li style="font-weight:400;">Next 2026 offers hands-on technical training through deep-dive sessions, keynotes, and practical labs aimed at developers and technical practitioners. The format emphasizes actionable learning with direct access to Google engineers and product experts.</li>
<li style="font-weight:400;">The event serves as a networking hub for cloud practitioners to connect with peers facing similar technical challenges and to provide feedback that influences Google Cloud’s product roadmap. This direct line to product teams can be valuable for organizations planning their cloud strategy.</li>
<li style="font-weight:400;">Ready to register? You can do that <a href="https://www.googlecloudevents.com/next-vegas/?utm_source=cgc-blog&amp;utm_medium=blog&amp;utm_campaign=FY26-Q2-GLOBAL-GLO27877-physicalevent-er-next26-mc-105752&amp;utm_content=cgc-blog-reg-is-open-dec-2&amp;utm_term=-">here</a>. </li>
</ul>
<p>27:19 <a href="https://cloud.google.com/blog/products/networking/vpc-flow-logs-for-cross-cloud-network/">VPC Flow Logs for Cross-Cloud Network</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/vpc/docs/flow-logs">VPC Flow Logs</a> now support Cloud VPN <a href="https://cloud.google.com/vpc/docs/using-flow-logs#enable-vpn-tunnel">tunnels</a> and <a href="https://cloud.google.com/vpc/docs/using-flow-logs#enable-vlan-attachment">VLAN attachments</a> for Cloud Interconnect and Cross-Cloud Interconnect, extending visibility beyond traditional VPC subnet traffic to hybrid and multi-cloud connections. 
<ul>
<li style="font-weight:400;">This addresses a critical gap for organizations running <a href="https://cloud.google.com/solutions/cross-cloud-network">Cross-Cloud Network</a> architectures who previously lacked detailed telemetry on traffic flowing between Google Cloud, on-premises infrastructure, and other cloud providers.</li>
</ul>
</li>
<li style="font-weight:400;">The feature provides 5-tuple granularity logging (source/destination IP, port, and protocol) with new gateway annotations that identify traffic direction and context through reporter and gateway object fields. </li>
<li style="font-weight:400;">Flow Analyzer integration eliminates the need for complex SQL queries, offering built-in analysis capabilities including Gemini-powered natural language queries and in-context Connectivity Tests to correlate flow data with firewall policies and network configurations.</li>
<li style="font-weight:400;">Primary use cases include identifying elephant flows that congest specific tunnels or attachments, auditing <a href="https://docs.cloud.google.com/vpc/docs/shared-vpc">Shared VPC</a> bandwidth consumption by service projects, and troubleshooting connectivity issues by verifying whether traffic reaches Google Cloud gateways. 
<ul>
<li style="font-weight:400;">Organizations can also validate <a href="https://docs.cloud.google.com/network-connectivity/docs/interconnect/how-to/cci/configure-traffic-differentiation">DSCP</a> markings for application-aware Cloud Interconnect policy configurations, which is particularly valuable for enterprises with quality-of-service requirements.</li>
</ul>
</li>
<li style="font-weight:400;">The feature is available now for both new and existing deployments through Console, CLI, API, and Terraform, with <a href="https://cloud.google.com/network-intelligence-center/docs/flow-analyzer/overview">Flow Analyzer</a> providing no-cost analysis of logs stored in Cloud Logging. </li>
<li style="font-weight:400;">This capability is particularly relevant for financial services, healthcare, and enterprises with strict compliance requirements that need comprehensive audit trails of cross-cloud and hybrid network traffic.</li>
</ul>
<p>28:37  Ryan – “The controls say that you have to have logging, not what the logging is – and so very frequently it is sort of ‘turn it on and sort of forget it’. I do think this is great, but it is sort of, they say the five-tuple granularity will help you measure congestion, but I don’t see them actually producing any sort of bandwidth or request size metrics. So it is sort of an interesting thing, but it’s at least better than the nothing that we had before. So I’ll take it.”</p>
<p>30:35 <a href="https://cloud.google.com/blog/products/networking/aws-and-google-cloud-collaborate-on-multicloud-networking/">AWS and Google Cloud collaborate on multicloud networking</a></p>
<ul>
<li style="font-weight:400;">AWS and Google Cloud jointly engineered a multicloud networking solution that eliminates the need for manual physical infrastructure setup between their platforms. </li>
<li style="font-weight:400;">Customers can now provision dedicated bandwidth and establish connectivity in minutes instead of weeks through either cloud console or API.</li>
<li style="font-weight:400;">The solution uses <a href="https://aws.amazon.com/interconnect/">AWS Interconnect multicloud</a> and <a href="https://cloud.google.com/hybrid-connectivity#multicloud-networking-connectivity">Google Cloud Cross-Cloud Interconnect</a> with quad-redundancy across physically separate facilities and MACsec encryption between edge routers. </li>
<li style="font-weight:400;">Both providers published open <a href="https://github.com/aws/AWSInterconnect">API specifications</a> on GitHub for other cloud providers to adopt the same standard.</li>
<li style="font-weight:400;">Previously, connecting AWS and Google Cloud required customers to manually coordinate physical connections, equipment, and multiple teams over weeks or months. </li>
<li style="font-weight:400;">This new managed service abstracts away physical connectivity, network addressing, and routing policy complexity into a cloud-native experience.</li>
<li style="font-weight:400;">Salesforce is using this capability to connect its Data 360 platform across clouds using pre-built capacity pools and familiar AWS tooling. 
<ul>
<li style="font-weight:400;">The integration allows them to ground AI and analytics in trusted data regardless of which cloud it resides in.</li>
</ul>
</li>
<li style="font-weight:400;">The collaboration represents a shift toward cloud provider interoperability through open standards rather than proprietary solutions. </li>
<li style="font-weight:400;">The published specifications enable any cloud provider or partner to implement compatible multicloud connectivity using the same framework.</li>
</ul>
<p>31:38 Justin – “I do want you guys to check the weather. Do you see pigs flying or anything crazy?” </p>
<h2>Azure</h2>
<p>33:17  <a href="https://azure.microsoft.com/en-us/updates?id=532202">Generally Available: TLS and TCP termination on Azure Application </a><a href="https://azure.microsoft.com/en-us/updates?id=532202">Gateway</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/application-gateway/overview">Azure Application Gateway</a> now supports TLS and TCP protocol termination at general availability, expanding beyond its traditional HTTP/HTTPS load balancing capabilities. </li>
<li style="font-weight:400;">This allows customers to use Application Gateway for non-web workloads like database connections, message queuing systems, and other TCP-based applications that previously required separate load balancing solutions.</li>
<li style="font-weight:400;">The feature consolidates infrastructure by letting organizations use a single gateway service for both web and non-web traffic, reducing the need to deploy and manage multiple load balancers. </li>
<li style="font-weight:400;">This is particularly useful for enterprises running mixed workloads that include legacy applications, databases like <a href="https://www.microsoft.com/en-us/sql-server/sql-server-downloads">SQL Server</a> or <a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=RSgHXStC1Mbh4ocPzJNtjrhNPkVz5OI6q5rYSNLNeV4FQLjJUSFPePSdkda6ON9CdbmTG3--85_Ojo9MayFX0utyeqyEIXOFRzPceFATRKQpJIVwerx1KC2L7hxFrY8EhuRArTlr-i4SAXVMBnkzkQ.NM_bhSTYopaNpdwCnXSW2w&amp;eddgt=biKxe-tGIJYAix6bqhI4vQ%3D%3D&amp;rut=8a07ac358ddfab73d434b1df8dc7729df2933dbc30185b4a36ca01682a131424&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De88WARnpezfvI7ekeqj2DArzVUCUyR6glhYloiAa-uvHopDTHDZPzNFaJl3VViC1UnI4u_VP30NTkpuKSdUrcquTMz9RpJGWKWELjRf53Ana3DrvBbive7fr7GXlvU5N5uN6MeHjj6n1hxG8ACUBw9UMqsdNZLA2XhrrxCr9Hi3Mthe3xqm7llNYqsN8mY-E7TU_UFfWKE792TJ1dpv5mz4jUUIxuQVa0a8i4-q-Jpxo_8DtWZYShtxFmKYpcnRKuylog-AZXjEuBTPUhmKpxOUhwCQ3PSN5OhJ5codjOJrSOzkgFM29iOVd_9Idl-vtxgb1ELU3KZHHdHTuTaRcIET37U4zhWimk4CEPixFmTtKr7gcS8UZKJN8Mtb8bJnOi23scMc8WAWiD5keMIgFRoXTyzYgohLP0iN1Xt1qZdNfK99OQ0ZMrUKeJZ3ddqARo_lpmnPWdJG4ug3VEaWxUbbj_JVr6aStnqMhM0A0awO0k38xrfHeE0YUHd1KsYcyYZrJf7_dJtm7U0mgODLQqn2YIXRUhVmBtO8WM9YUjrUbmrd1XBMMOqoFN0Wwo9FY5088R2yDhe4GS0LbRUZLCqH2XpLOhmU2hclKF0Rq6eawsLr_TzEpTUFE21atJMXRYrRD6hzJnCTvKJRXQ17fkfvw-7b3FeR-LJqYXRIn0DrC55ulM_JGsf6iV3TJM2EQ_-bk1_n1XDZVIveE9Ld5zEF9Yz25ZtrtpFImUZfmQ3eVToW33KAcqJ4bXKC863c3yhhjOw3mU0xa0MzfmAwbBMBJQvRQg%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0MzklMjZjYW1wJTNkMTY4ODA2JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDE5MjElMjZjcml0ZXJpYWlkJTNka3dkLTc5MzAyODgzNzY0Mjg3JTNhbG9jLTE5MCUyNmNhbXBhaWduaWQlM2Q1OTAyMzczNTclMjZsb2NwaHklM2Q4MDU2MCUyNmFkZ3JvdXBpZCUzZDEyNjg4MzgwNzEwODY5ODQlMjZjaWQlM2Q3OTMwMjQ4NTk5MDUxNyUyNmtkdiUzZGMlMjZrZXh0JTNkJTI2a3BnJTNkJTI2a3BpZCUzZCUyNnF1ZXJ5U3RyJTNkUG9zdGdyZVNRTCUyNnVybCUzZGh0dHBzJTNhJTJmJTJmYXp1cmUubWljcm9zb2Z0LmNvbSUyZmVuLXVzJTJmcHJvZHVjdHMlMmZwb3N0Z3Jlc3FsJTJmJTNmZWZfaWQlM2Rfa18yYjU5MTdmMTZiZTExYThjOGRiYzdlYzg0ZmZlNmJkM19rXyUyNk9DSUQlM2RBSURjbW01ZWRzd2R1dV9TRU1fX2tfMmI1OTE3ZjE2YmUxMWE4YzhkYmM3ZWM4NGZmZTZiZDNfa18lMjZtc2Nsa2lkJTNkMmI1OTE3ZjE2YmUxMWE4YzhkYmM3ZWM4NGZmZTZiZDM%26rlid%3D2b5917f16be11a8c8dbc7ec84ffe6bd3&amp;vqd=4-189158647996339608114228267293118061137&amp;iurl=%7B1%7DIG%3D8EAE83A9AB974F95993C789AD6C157F2%26CID%3D13A0E0B0C0E364FD13E2F60EC174650B%26ID%3DDevEx%2C5046.1">PostgreSQL</a>, and custom TCP services alongside modern web applications.</li>
<li style="font-weight:400;">Application Gateway’s existing features, like <a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=aJiMY9zz-lYy1Jv5XHI4j28wosXINvuV-6naiNXuyCApWs6isfw4gOcQrdSpw7z3lgA3s3kIcEimJUYz7R775S9KhVXpBf86YBOliji5GM0NaNT0VMIE0CE0QJdix3Zhvq-WK5JSBp6wTvaSpqX81Q.EMPx4mg5G0j1vgF70oXurw&amp;eddgt=nENAisaaffN3xoZSdizNMg%3D%3D&amp;rut=19e2c7a32370ff686c06827b86a6bb6d0a50215f39ab8061b4f42b598a9e5fbf&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8MJKR08ZVdL09rzPwYbr85zVUCUzfh6RXQsBOfmwAf6XyY_oIrYC-5O6Pr-nB3q_Oa17dP7XB5z7pXyPkdpu4W4FsFfWVVhR3QYH55CgN0qMcYiPFueyLFBb0WycSquS0ooErlzqzDGq6LHBYEdM6hhvNrs2VmtHcQW3jAYs3ctWGYt1tYP4BXe4DAbhUhQdrpQSRWHzbsnmitrvbowr9rlodX8VfG6ACIQhHNRt8L4tJJewgxhJYpa-R3AN8is7KLgQCQHZbKSiAi3vmq474hwy_VsgaRLIwpskRzKea8ByC7jPJblpTagi4Z9HAQaQiwEF2VTz5G9QlCe9eXTgMJWO8BChfsuVxo7GjKoXcXQdNAewf1ikMdfVs1oXZ6EiRDKIRGnfAKVwPjO3So9sz4-knCmByiwYNqNnppCAvMh9_TiB3MsvluoN1npLgjqnp3Uee7ey_JZrVc4UJ2UcUEyV7TQ6epqkMStgjDkUfxdAx1QX2WVWRwTgi12GLbBVEx22a0Ge1Q3ICpjXQk9Mr5yOICpJlqNE0arOIS4blewQAIEAy-nSzzFs2dBhLU0amDBAFgWmbwlyKYBWZr92PjdjmdBqWOMHp79_rnE73VRSLwac165ziTrhQqp2a_g2rL8nEgV76WGCJMkBJxhtUd5_eAvt9aVPulL_4AhxDeGwEdJoJeS6l63U6G551gZtTJgwXiS2d8QhVPI-yBwIGtb_Q0_Cg77SKryN-PsZZXhWbMl9mFZ1tKXDM4BeDRRv8YyyuKF9EA6Et6aELPGezqZ7MuOU%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0MzklMjZjYW1wJTNkMTY4ODA2JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDE5MjElMjZjcml0ZXJpYWlkJTNka3dkLTc5NTA5MDQxMTQ1MDIyJTNhbG9jLTE5MCUyNmNhbXBhaWduaWQlM2Q1OTAyMzczNTclMjZsb2NwaHklM2Q4MDUxMSUyNmFkZ3JvdXBpZCUzZDEyNzIxMzY2MDUzMjIxMzklMjZjaWQlM2Q3OTUwODY0MjM1NjI4OSUyNmtkdiUzZGMlMjZrZXh0JTNkJTI2a3BnJTNkJTI2a3BpZCUzZCUyNnF1ZXJ5U3RyJTNkV2ViJTI1MjBBcHBsaWNhdGlvbiUyNTIwRmlyZXdhbGwlMmMlMjZ1cmwlM2RodHRwcyUzYSUyZiUyZmF6dXJlLm1pY3Jvc29mdC5jb20lMmZlbi11cyUyZnByb2R1Y3RzJTJmd2ViLWFwcGxpY2F0aW9uLWZpcmV3YWxsJTJmJTNmZWZfaWQlM2Rfa180Y2NjZGYxMTYzMGExYzExZjc3OGI2Mjg0NzNiNmI2ZV9rXyUyNk9DSUQlM2RBSURjbW01ZWRzd2R1dV9TRU1fX2tfNGNjY2RmMTE2MzBhMWMxMWY3NzhiNjI4NDczYjZiNmVfa18lMjZtc2Nsa2lkJTNkNGNjY2RmMTE2MzBhMWMxMWY3NzhiNjI4NDczYjZiNmU%26rlid%3D4cccdf11630a1c11f778b628473b6b6e&amp;vqd=4-204457218300644183029125697658138046897&amp;iurl=%7B1%7DIG%3D4E0E55A08DB04114AB76A7B575D58001%26CID%3D30F5037D3BBE61931C3615C33A29607D%26ID%3DDevEx%2C5039.1">Web Application Firewall</a>,  autoscaling, and zone redundancy, now extend to TCP and TLS traffic, providing consistent security and availability across all application types. </li>
<li style="font-weight:400;">The pricing model follows Application Gateway’s standard consumption-based structure with charges for gateway hours and data processing, though specific costs for TCP/TLS termination were not detailed in the announcement.</li>
<li style="font-weight:400;">Common use cases include load balancing for database clusters, securing MQTT or AMQP message broker connections, and providing SSL offloading for legacy applications that don’t natively support modern TLS versions. 
<ul>
<li style="font-weight:400;">This positions Application Gateway as a more versatile Layer 4-7 load balancing solution competing with dedicated TCP load balancers and third-party appliances.</li>
</ul>
</li>
</ul>
<p>33:38 Justin – “Thank you for developing network load balancers.” </p>
<p>34:48 <a href="https://azure.microsoft.com/en-us/updates?id=488990">Generally Available: Azure Application Gateway mTLS passthrough </a><a href="https://azure.microsoft.com/en-us/updates?id=488990">support</a></p>
<ul>
<li style="font-weight:400;">Want to make your life even more complicated? Well, it’s GOOD NEWS!</li>
<li style="font-weight:400;">Azure Application Gateway now supports mutual TLS passthrough in general availability, allowing backend applications to validate client certificates and authorization headers directly while still benefiting from Web Application Firewall inspection. 
<ul>
<li style="font-weight:400;">This addresses a specific compliance requirement where organizations need end-to-end certificate validation but cannot terminate TLS at the gateway layer.</li>
</ul>
</li>
<li style="font-weight:400;">The feature enables scenarios where backend services must verify client identity through certificates for regulatory compliance or zero-trust architectures, particularly relevant for financial services, healthcare, and government workloads. Previously, customers had to choose between WAF protection or backend certificate validation, creating security or compliance gaps.</li>
<li style="font-weight:400;">Application Gateway continues to inspect traffic through WAF rules even as the mTLS connection passes through to the backend, maintaining protection against common web exploits and OWASP vulnerabilities. </li>
<li style="font-weight:400;">This dual-layer approach means organizations can enforce both perimeter security policies and application-level authentication without architectural compromises.</li>
<li style="font-weight:400;">The capability is available across all Azure regions where Application Gateway v2 SKU operates, with standard Application Gateway pricing applying based on capacity units consumed. </li>
<li style="font-weight:400;">No additional charges exist specifically for the mTLS passthrough feature itself, though backend certificate validation may increase processing overhead slightly.</li>
</ul>
<p>36:30 Matt – “I did S tunnel and MongoDB because it didn’t support encryption for the longest time…that was a fun one.” </p>
<p>36:50 <a href="https://azure.microsoft.com/en-us/updates?id=527635">Public Preview: Azure API Management adds support for A2A Agent APIs</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/api-management/genai-gateway-capabilities">Azure API Management</a> now supports <a href="https://a2a-protocol.org/dev/specification/">Agent-to-Agent (A2A) APIs</a> in public preview, allowing organizations to manage AI agent APIs alongside traditional REST APIs, AI model APIs, and Model Context Protocol tools within a single governance framework. 
<ul>
<li style="font-weight:400;">This addresses the growing need to standardize how autonomous agents communicate and interact across enterprise systems.</li>
</ul>
</li>
<li style="font-weight:400;">The feature enables centralized management of agent interactions, which is particularly relevant as organizations deploy multiple AI agents that need to coordinate tasks and share information. </li>
<li style="font-weight:400;">API Management can now apply consistent security policies, rate limiting, and monitoring across all agent communications, reducing the operational complexity of multi-agent architectures.</li>
<li style="font-weight:400;">This capability positions Azure API Management as a unified control plane for the full spectrum of API types emerging in AI-driven applications. </li>
<li style="font-weight:400;">Organizations already using API Management for traditional APIs can extend their existing governance practices to cover agent-based workflows without deploying separate infrastructure.</li>
<li style="font-weight:400;">The preview is available in Azure regions where API Management is currently supported, though specific pricing for A2A API features has not been disclosed separately from standard API Management tiers. </li>
<li style="font-weight:400;">Organizations should evaluate this against their existing API Management costs, which start at approximately $50 per month for the Developer tier.</li>
</ul>
<p>38:13 <a href="https://azure.microsoft.com/en-us/blog/introducing-claude-opus-4-5-in-microsoft-foundry/">Introducing Claude Opus 4.5 in Microsoft Foundry</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/news/claude-opus-4-5">Claude Opus 4.5</a> is now available in public preview on <a href="https://ai.azure.com/">Microsoft Foundry</a>, <a href="http://v">GitHub Copilot</a> paid plans, and <a href="https://aka.ms/MCSOpus4.5">Microsoft Copilot Studio</a>, expanding Azure’s frontier model portfolio following the <a href="https://blogs.microsoft.com/blog/2025/11/18/microsoft-nvidia-and-anthropic-announce-strategic-partnerships/">Microsoft-Anthropic partnership</a> announced at Ignite. </li>
<li style="font-weight:400;">The model achieves 80.9% on SWE-bench software engineering <a href="https://www.anthropic.com/news/claude-opus-4-5">benchmarks</a> and is priced at one-third the cost of previous Opus-class models, making advanced AI capabilities more accessible for enterprise workloads.</li>
<li style="font-weight:400;">The model introduces three key developer features on Foundry: an Effort Parameter in beta that lets teams control computational allocation across thinking and tool calls, Compaction Control for managing context in long-running agentic tasks, and enhanced programmatic tool calling with dynamic tool discovery that doesn’t consume context window space. 
<ul>
<li style="font-weight:400;">These capabilities enable sophisticated multi-tool workflows across cybersecurity, financial modeling, and full-stack development.</li>
</ul>
</li>
<li style="font-weight:400;">Opus 4.5 serves as Anthropic’s strongest vision model and delivers improved computer use performance for automating desktop tasks, particularly for creating spreadsheets, presentations, and documents with professional polish. </li>
<li style="font-weight:400;">The model maintains context across complex projects using memory features, making it suitable for precision-critical verticals like finance and legal,   where consistency matters.</li>
<li style="font-weight:400;">Microsoft Foundry’s rapid integration strategy gives Azure customers immediate access to the latest frontier models while maintaining centralized governance, security, and observability at scale. </li>
<li style="font-weight:400;">This positions Azure as offering the widest selection of advanced AI models among cloud providers, with Opus 4.5 available now through the Foundry portal and coming soon to Visual Studio Code via the <a href="https://marketplace.visualstudio.com/items?itemName=TeamsDevApp.vscode-ai-foundry">Foundry extension</a>.</li>
</ul>
<p>38:37 Justin – “Cool, it’s in Foundry – hooray!” </p>
<p>40:21 <a href="https://azure.microsoft.com/en-us/updates?id=530183">Generally Available: DNS security policy Threat Intelligence feed</a></p>
<ul>
<li style="font-weight:400;">Azure DNS security policy now includes a managed <a href="https://learn.microsoft.com/en-us/azure/dns/dns-traffic-log-how-to#secure-dns-traffic-with-threat-intelligence-feed">Threat Intelligence feed</a> that blocks queries to known malicious domains. </li>
<li style="font-weight:400;">This feature addresses the common attack vector where nearly all cyber attacks begin with a DNS query, providing an additional layer of protection at the DNS resolution level.</li>
<li style="font-weight:400;">The service integrates with Azure’s existing DNS infrastructure and uses Microsoft’s threat intelligence data to automatically update the list of malicious domains. 
<ul>
<li style="font-weight:400;">Organizations can enable this protection without managing their own threat feeds or maintaining blocklists, reducing operational overhead for security teams.</li>
</ul>
</li>
<li style="font-weight:400;">This capability is particularly relevant for enterprises looking to implement defense-in-depth strategies, as it stops threats before they can establish connections to command and control servers or phishing sites. </li>
<li style="font-weight:400;">The feature works alongside existing <a href="https://learn.microsoft.com/en-us/azure/firewall/overview">Azure Firewall</a> and network security tools to provide comprehensive protection.</li>
<li style="font-weight:400;">The general availability means the service is now production-ready with full SLA support across Azure regions. </li>
<li style="font-weight:400;">Pricing details were not specified in the announcement, so customers should check Azure pricing documentation for DNS security policy costs.</li>
</ul>
<p>41:28 Ryan – “It is something, being able to automatically take the results of a feed, I will do any day just because these things are updated by many more parties and faster than I can ever react to, and you know, our own threat intelligence. So that’s pretty great. I like it.”</p>
<p>42:46 <a href="https://azure.microsoft.com/en-us/updates?id=525405">Public Preview: Standard V2 NAT Gateway and StandardV2 Public IPs</a></p>
<ul>
<li style="font-weight:400;">Azure introduces <a href="https://learn.microsoft.com/en-us/azure/nat-gateway/nat-overview#standardv2-nat-gateway">StandardV2 NAT Gateway</a> in public preview, adding zone-redundancy for high availability in regions with availability zones. </li>
<li style="font-weight:400;">This upgrade addresses a key limitation of the original NAT Gateway by ensuring outbound connectivity survives zone failures, which matters for enterprises running mission-critical workloads that require consistent internet egress.</li>
<li style="font-weight:400;">The StandardV2 SKU includes matching <a href="https://learn.microsoft.com/en-us/azure/virtual-network/ip-services/public-ip-addresses#sku">StandardV2 Public IPs</a> that work together with the new NAT Gateway tier. Organizations using the original Standard SKU will need to evaluate migration paths since zone-redundancy represents a fundamental architectural change requiring new resource types rather than an in-place upgrade.</li>
<li style="font-weight:400;">This release targets customers who previously had to architect complex workarounds for zone-resilient outbound connectivity, particularly those running multi-zone deployments of containerized applications or database clusters. 
<ul>
<li style="font-weight:400;">The preview allows testing of failover scenarios before production deployment.</li>
</ul>
</li>
<li style="font-weight:400;">The announcement lacks specific pricing details for the StandardV2 tier, though NAT Gateway typically charges based on hourly resource fees plus data processing costs. </li>
<li style="font-weight:400;">Customers should monitor Azure pricing pages as the preview progresses toward general availability for cost comparisons against the Standard SKU.</li>
</ul>
<p>43:48 Justin – “The fact that this is not an upgrade that I can just check, and I have to redeploy a whole new thing, annoys the crap out of me.” </p>
<p>46:51 <a href="https://azure.microsoft.com/en-us/updates?id=492303">Generally Available: Custom error pages on Azure App Service</a></p>
<ul>
<li style="font-weight:400;">Custom error pages on <a href="https://learn.microsoft.com/en-us/azure/app-service/configure-common?source=recommendations">Azure App Service</a> have moved to general availability, allowing developers to replace default HTTP error pages with branded or customized alternatives. </li>
<li style="font-weight:400;">This addresses a common requirement for production applications where maintaining a consistent user experience during errors is important for brand identity and user trust.</li>
<li style="font-weight:400;">The feature integrates directly into App Service configuration without requiring additional Azure services or third-party tools. </li>
<li style="font-weight:400;">Developers can specify custom HTML pages for different HTTP error codes like 404 or 500, which App Service will serve automatically when those errors occur.</li>
<li style="font-weight:400;">This capability is particularly relevant for customer-facing web applications, e-commerce sites, and SaaS platforms where error handling needs to align with corporate branding guidelines. </li>
<li style="font-weight:400;">The feature works across all App Service tiers that support custom domains and SSL certificates.</li>
<li style="font-weight:400;">No additional cost is associated with custom error pages beyond standard App Service hosting fees, which start at approximately $13 per month for the Basic tier. Implementation requires uploading error page files to the app’s file system and updating configuration settings through Azure Portal or deployment templates.</li>
<li style="font-weight:400;">The general availability status means the feature is now production-ready with full support coverage, moving beyond the preview phase where it was available for testing. 
<ul>
<li style="font-weight:400;">Documentation is available at the Azure App Service custom error pages guide.</li>
</ul>
</li>
</ul>
<p>48:17  Matt – “It’s crazy that this wasn’t already there. The workarounds you had to do to make your own error page was messy at best.” </p>
<p>49:01 <a href="https://azure.microsoft.com/en-us/updates?id=525942">Generally Available: Streamline IT governance, security, and cost </a><a href="https://azure.microsoft.com/en-us/updates?id=525942">management experiences with Microsoft Foundry</a></p>
<ul>
<li style="font-weight:400;"><a href="https://azure.microsoft.com/en-us/products/ai-foundry/">Microsoft Foundry</a> reaches general availability as an enterprise AI governance platform that consolidates security, compliance, and cost management controls for IT administrators deploying AI solutions. </li>
<li style="font-weight:400;">The platform addresses the growing need for centralized oversight as organizations scale their AI initiatives across Azure infrastructure.</li>
<li style="font-weight:400;">The service integrates with existing Azure management tools to provide unified visibility and control over AI workloads, allowing IT teams to enforce policies and monitor resource usage from a single interface. 
<ul>
<li style="font-weight:400;">This reduces the operational overhead of managing disparate AI projects while maintaining enterprise security standards.</li>
</ul>
</li>
<li style="font-weight:400;">Foundry targets large enterprises and regulated industries that require strict governance frameworks for AI deployment, particularly organizations balancing innovation speed with compliance requirements. </li>
<li style="font-weight:400;">The platform helps bridge the gap between data science teams pushing for rapid AI adoption and IT departments responsible for risk management.</li>
<li style="font-weight:400;">The general availability announcement indicates Microsoft is positioning Azure as the enterprise-ready AI cloud, competing directly with AWS and Google Cloud for organizations prioritizing governance alongside AI capabilities. </li>
<li style="font-weight:400;">Specific pricing details were not disclosed in the announcement, suggesting costs likely vary based on usage and existing Azure commitments.</li>
</ul>
<p>50:22  Justin – “It’s like a combination of SageMaker and Vertex married Databricks and then had a baby – plus a report interface.” </p>
<p>52:44 <a href="https://azure.microsoft.com/en-us/updates?id=526330">Generally Available: Model Router in Microsoft Foundry</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/ai-foundry/what-is-azure-ai-foundry?view=foundry-classic#microsoft-foundry-portals">Microsoft Foundry’s</a> <a href="https://learn.microsoft.com/en-us/azure/ai-foundry/openai/concepts/model-router?view=foundry-classic">Model Router</a> is now generally available as an AI orchestration layer that automatically selects the optimal language model for each prompt based on factors like complexity, cost, and performance requirements. </li>
<li style="font-weight:400;">This eliminates the need for developers to manually choose between different AI models for each use case.</li>
<li style="font-weight:400;">The service supports an expanded range of models, including the <a href="https://openai.com/index/gpt-4-1/">GPT-4 family</a>, <a href="https://openai.com/gpt-5/">GPT-5 family</a>, <a href="https://openai.com/index/introducing-gpt-oss/">GPT-oss</a>, and <a href="https://www.deepseek.com/en">DeepSeek</a> models, giving organizations flexibility to balance performance needs against cost considerations. </li>
<li style="font-weight:400;">The router can dynamically switch between models within a single application based on prompt characteristics.</li>
<li style="font-weight:400;">This addresses a practical challenge for enterprises deploying multiple AI models where different tasks require different model capabilities. For example, simple queries could route to smaller, less expensive models while complex reasoning tasks automatically use more capable models.</li>
<li style="font-weight:400;">The orchestration layer integrates with Microsoft Foundry’s broader AI infrastructure, allowing customers to manage multiple model deployments through a single interface rather than building custom routing logic. This reduces operational complexity for teams managing diverse AI workloads across their organization.</li>
<li style="font-weight:400;">No specific pricing details are provided in the announcement, though costs will likely vary based on the underlying models selected by the router and usage patterns. Organizations should evaluate potential cost savings from routing simpler queries to less expensive models versus always using premium models.</li>
</ul>
<p>54:50  <a href="https://azure.microsoft.com/en-us/updates?id=530797">Generally Available: Scheduled Actions </a></p>
<ul>
<li style="font-weight:400;">Azure’s Scheduled Actions feature is now generally available, providing automated VM lifecycle management at scale with built-in handling of subscription throttling and transient error retries. </li>
<li style="font-weight:400;">This eliminates the need for custom scripting or third-party tools to start, stop, or deallocate VMs on a recurring schedule.</li>
<li style="font-weight:400;">The feature addresses common cost optimization scenarios where organizations need to automatically shut down development and test environments during off-hours or scale down non-production workloads on weekends. </li>
<li style="font-weight:400;">This can reduce compute costs by 40-70% for environments that don’t require 24/7 availability.</li>
<li style="font-weight:400;">Scheduled Actions integrates directly with Azure Resource Manager and works across VM scale sets, making it suitable for both individual VMs and large-scale deployments. The automatic retry logic and throttling management means operations complete reliably even when managing hundreds or thousands of VMs simultaneously.</li>
<li style="font-weight:400;">The service is available in all Azure public cloud regions where VMs are supported, with no additional cost beyond standard VM compute charges. </li>
<li style="font-weight:400;">Organizations pay only for the time VMs are running, so automated shutdown schedules directly translate to reduced monthly bills.</li>
</ul>
<p>55:31 Justin – “Thank you for copying every other cloud that’s had this forever…”</p>
<h2>After Show </h2>
<p>51:46 <a href="https://openai.com/index/norad-holiday-collaboration">OpenAI and NORAD team up to bring new magic to “NORAD Tracks Santa.”</a></p>
<ul>
<li style="font-weight:400;">OpenAI partnered with NORAD to add AI-powered holiday tools to the annual Santa tracking tradition, creating three ChatGPT-based features that turn kids’ photos into elf portraits, generate custom toy coloring pages, and build personalized Christmas stories. This represents a consumer-friendly application of generative AI that demonstrates how large language models can be packaged for mainstream family use during the holidays.</li>
<li style="font-weight:400;">The collaboration shows OpenAI pursuing brand-building partnerships with trusted institutions like NORAD to normalize AI tools in everyday contexts. By embedding ChatGPT features into a 68-year-old military tradition that reaches millions of families, OpenAI gains exposure to non-technical users who might otherwise be hesitant about AI adoption.</li>
<li style="font-weight:400;">From a technical perspective, these tools showcase practical implementations of image generation and text-to-image capabilities that parents can use without understanding the underlying models. The focus on simple, single-purpose GPTs rather than complex interfaces suggests OpenAI is testing how to make their technology more accessible to casual users.</li>
<li style="font-weight:400;">The partnership raises interesting questions about AI companies seeking legitimacy through associations with government organizations and cultural traditions. While the tools are harmless holiday fun, they demonstrate how AI providers are moving beyond enterprise sales to embed their technology into cultural moments and family activities.</li>
<li style="font-weight:400;">This is essentially a marketing play disguised as holiday cheer, but it does illustrate how cloud-based AI services are becoming infrastructure for consumer experiences rather than just backend business tools. The real story is about distribution strategy and making AI feel safe and familiar to mainstream audiences.</li>
<li style="font-weight:400;">The Cloud Pod has one message: keep Skynet out of Christmas! </li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2283771/c1e-0424uk484ouo88mp-0v7352q5iwx0-tlwyte.mp3" length="120445497"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 333 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Matt are taking a quick break from re:Invent festivities. They bring you the latest and greatest in Cloud and AI news. This week, we discuss Norad and Anthropic teaming up to bring you Christmas cheer. Wait, is that right? Huh. We also have undersea cables, some Turkish region delight, and a LOT of Opus 4.5 news. Let’s get into it!
Titles we almost went with this week

 Boring Error Pages Not Found
 Claude Goes Native in Snowflake: Finally, AI That Stays Where Your Data Lives
 Cross-Cloud Romance: AWS and Google Make It Official with Interconnect
 Google Gemini Puts OpenAI in Code Red: The Tables Have Turned
 Azure NAT Gateway V2: Now With More Zones Than a Parking Lot
 From ChatGPT to Chat-Uh-Oh: OpenAI Sounds the Alarm as Gemini Steals 200 Million 
      Users **Anthropic
 Scheduled Actions: Because Your VMs Need a Work-Life Balance Too
 Finally, Your 500 Errors Can Look as Good as Your Homepage
 Foundry Model Router: Because Choosing Between 47 AI Models is Nobody’s Idea of Fun
 Google Takes the Scenic Route: New Cable Avoids the Sunda Strait Traffic Jam
 Azure Application Gateway Gets Its TCP/IP Diploma
 Google Cloud Gets Its Türkiye Dinner: 2 Billion Dollar Cloud Feast Coming Soon
 Microsoft Foundry: Turning AI Chaos into Compliance Gold

AI Is Going Great, or How ML Makes Money 
02:59 Nano Banana Pro available for enterprise

Google launches  Nano Banana Pro (Gemini 3 Pro Image) in general availability on Vertex AI and Google Workspace, with Gemini Enterprise support coming soon.
The model supports up to 14 reference images for style consistency and generates 4K resolution outputs with multilingual text rendering capabilities.
The model includes Google Search grounding for factual accuracy in generated infographics and diagrams, plus built-in SynthID watermarking for transparency. Copyright indemnification will be available at general availability under Google’s shared responsibility framework.
Enterprise integrations are live with Adobe Firefly, Photoshop, Canva, and Figma, enabling production-grade creative workflows. Major retailers, including Klarna, Shopify, and Wayfair, report using the model for product visualization and marketing asset generation at scale.
Developers can access Nano Banana Pro through Vertex AI with Provisioned Throughput and Pay As You Go pricing options, plus advanced safety filters. Business users get access through Google Workspace apps, including Slides, Vids, and ]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2283771/c1a-k5d5-9j38nn1ruv4-uyfi6t.jpg"></itunes:image>
                                                                            <itunes:duration>01:02:32</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2283771/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[332: 2025 Re:Invent Predictions Draft – May The Odds Be Ever In Your Favor]]>
                </title>
                <pubDate>Fri, 28 Nov 2025 17:00:41 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2248333</guid>
                                    <link>https://tcpfm.castos.com/episodes/332-2025-reinvent-predictions-draft-may-the-odds-be-ever-in-your-favor</link>
                                <description>
                                            <![CDATA[<p class="font-claude-response-body whitespace-normal break-words">Welcome to episode 332 of The Cloud Pod – where the forecast is always cloudy! It’s Thanksgiving week, which can only mean one thing: AWS Re:Invent predictions! In this special episode, Justin, Jonathan, Ryan, and Matt engage in the annual tradition of drafting their best guesses for what AWS will announce at the biggest cloud conference of the year. Justin is the reigning champion (probably because he actually reads the show notes), but with a reverse snake draft order determined by dice roll, anything could happen. Will Werner announce his retirement? Is Cognito finally getting a much-needed overhaul? And just how many times will “AI” be uttered on stage? Grab your turkey and let’s get predicting!</p>
<h2 class="font-claude-response-heading text-text-100 mt-1 -mb-0.5">Titles we almost went with this week:</h2>
<ul class="[&amp;:not(:last-child)_ul]:pb-1 [&amp;:not(:last-child)_ol]:pb-1 list-disc space-y-2.5 pl-7">
<li class="whitespace-normal break-words"> Roll For Initiative: The Re:Invent Prediction Draft</li>
<li class="whitespace-normal break-words"> Justin’s Winning Streak: A Study in Actually Doing Your Homework</li>
<li class="whitespace-normal break-words"> Serverless GPUs and Broken Dreams: Our Re:Invent Wishlist</li>
<li class="whitespace-normal break-words"> Shooting in the Dark: AWS Predictions Edition</li>
<li class="whitespace-normal break-words"> We’re Never Good at This, But Here We Go Again</li>
<li class="whitespace-normal break-words"> Vegas Odds: What Happens at Re:Invent, Gets Predicted Wrong</li>
</ul>

<h2 class="font-claude-response-heading text-text-100 mt-1 -mb-0.5">AWS Re:Invent Predictions 2025</h2>
<p class="font-claude-response-body whitespace-normal break-words">The annual prediction draft is here! Draft order was determined by dice roll: Jonathan first, followed by Ryan, Justin, and Matt in last position. As always, it’s a reverse order format, with points awarded for each correct prediction announced during the Tuesday, Wednesday, and Thursday keynotes.</p>
<h3 class="font-claude-response-subheading text-text-100 mt-1 -mb-1.5">Jonathan’s Predictions</h3>
<ol class="[&amp;:not(:last-child)_ul]:pb-1 [&amp;:not(:last-child)_ol]:pb-1 list-decimal space-y-2.5 pl-7">
<li class="whitespace-normal break-words"><strong>Serverless GPU Support</strong> – An extension to Lambda or a different service that provides on-demand serverless GPU/inference capability. Likely with requirements for pre-warmed provisioned instances.</li>
<li class="whitespace-normal break-words"><strong>Agentic Platform for Continuous AI Agents</strong> – A service that allows agents to run continuously with goals or instructions, performing actions periodically or on-demand in the real world. Think: running agents on a schedule that can check conditions and take automated actions.</li>
<li class="whitespace-normal break-words"><strong>Werner Vogels Retirement Announcement</strong> – Werner will announce that this is his last Re:Invent keynote and that he is retiring.</li>
</ol>
<h3 class="font-claude-response-subheading text-text-100 mt-1 -mb-1.5">Ryan’s Predictions</h3>
<ol class="[&amp;:not(:last-child)_ul]:pb-1 [&amp;:not(:last-child)_ol]:pb-1 list-decimal space-y-2.5 pl-7">
<li class="whitespace-normal break-words"><strong>New Trainium 3 Chips, Inferentia, and Graviton Chips</strong> – New generation of AWS custom silicon across training, inference, and general compute.</li>
<li class="whitespace-normal break-words"><strong>Expanded Model Availability in Bedrock</strong> – AWS will significantly expand the number of models available in Bedrock, potentially via partnerships or integrations with additional providers.</li>
<li class="whitespace-normal break-words"><strong>Major Refresh to AWS Organizations</strong> – UI-based or functionality refresh providing better visibility into SCPs, OU mappings, and stack sets across organizations.</li>
</ol>
<h3 class="fon...&lt;/div&gt;&lt;/body&gt;&lt;/html&gt;"></h3>
<h3>Chapters</h3>
<ul><li>(00:00:02) - Episode 332: Reinvent Predictions For</li><li>(00:01:26) - Reinvent: The Contest</li><li>(00:03:35) - How to Predict the AI Announcement</li><li>(00:04:23) - Serverless GPUs: First Step</li><li>(00:05:58) - SageMaker vs. Amazon: The Fight</li><li>(00:09:56) - What is the Future of AI Agents?</li><li>(00:11:03) - Facebook is an Agent Platform, but...</li><li>(00:11:38) - AWS: Bedrock Expansion & OpenAI Partnership</li><li>(00:15:09) - Top Tech Speakers: ML, AI and the Warner Key</li><li>(00:16:15) - Third and Final Prediction</li><li>(00:17:15) - WSJDLive: Future of AWS IT refresh</li><li>(00:18:18) - 3 of the Best Security Hub Features</li><li>(00:19:22) -  AWS: Cognito 2.0 or Agentic Identities?</li><li>(00:21:27) - Tiebreaker: How Many Times Will AI Be Said?</li><li>(00:23:28) - What to Do to Reinvent Yourself at Reinvent 2012</li><li>(00:24:00) - Amazon's AI Wish List</li><li>(00:29:50) - A Taste of Re Invent 2018</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 332 of The Cloud Pod – where the forecast is always cloudy! It’s Thanksgiving week, which can only mean one thing: AWS Re:Invent predictions! In this special episode, Justin, Jonathan, Ryan, and Matt engage in the annual tradition of drafting their best guesses for what AWS will announce at the biggest cloud conference of the year. Justin is the reigning champion (probably because he actually reads the show notes), but with a reverse snake draft order determined by dice roll, anything could happen. Will Werner announce his retirement? Is Cognito finally getting a much-needed overhaul? And just how many times will “AI” be uttered on stage? Grab your turkey and let’s get predicting!
Titles we almost went with this week:

 Roll For Initiative: The Re:Invent Prediction Draft
 Justin’s Winning Streak: A Study in Actually Doing Your Homework
 Serverless GPUs and Broken Dreams: Our Re:Invent Wishlist
 Shooting in the Dark: AWS Predictions Edition
 We’re Never Good at This, But Here We Go Again
 Vegas Odds: What Happens at Re:Invent, Gets Predicted Wrong


AWS Re:Invent Predictions 2025
The annual prediction draft is here! Draft order was determined by dice roll: Jonathan first, followed by Ryan, Justin, and Matt in last position. As always, it’s a reverse order format, with points awarded for each correct prediction announced during the Tuesday, Wednesday, and Thursday keynotes.
Jonathan’s Predictions

Serverless GPU Support – An extension to Lambda or a different service that provides on-demand serverless GPU/inference capability. Likely with requirements for pre-warmed provisioned instances.
Agentic Platform for Continuous AI Agents – A service that allows agents to run continuously with goals or instructions, performing actions periodically or on-demand in the real world. Think: running agents on a schedule that can check conditions and take automated actions.
Werner Vogels Retirement Announcement – Werner will announce that this is his last Re:Invent keynote and that he is retiring.

Ryan’s Predictions

New Trainium 3 Chips, Inferentia, and Graviton Chips – New generation of AWS custom silicon across training, inference, and general compute.
Expanded Model Availability in Bedrock – AWS will significantly expand the number of models available in Bedrock, potentially via partnerships or integrations with additional providers.
Major Refresh to AWS Organizations – UI-based or functionality refresh providing better visibility into SCPs, OU mappings, and stack sets across organizations.

]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[332: 2025 Re:Invent Predictions Draft – May The Odds Be Ever In Your Favor]]>
                </itunes:title>
                                    <itunes:episode>332</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p class="font-claude-response-body whitespace-normal break-words">Welcome to episode 332 of The Cloud Pod – where the forecast is always cloudy! It’s Thanksgiving week, which can only mean one thing: AWS Re:Invent predictions! In this special episode, Justin, Jonathan, Ryan, and Matt engage in the annual tradition of drafting their best guesses for what AWS will announce at the biggest cloud conference of the year. Justin is the reigning champion (probably because he actually reads the show notes), but with a reverse snake draft order determined by dice roll, anything could happen. Will Werner announce his retirement? Is Cognito finally getting a much-needed overhaul? And just how many times will “AI” be uttered on stage? Grab your turkey and let’s get predicting!</p>
<h2 class="font-claude-response-heading text-text-100 mt-1 -mb-0.5">Titles we almost went with this week:</h2>
<ul class="[&amp;:not(:last-child)_ul]:pb-1 [&amp;:not(:last-child)_ol]:pb-1 list-disc space-y-2.5 pl-7">
<li class="whitespace-normal break-words"> Roll For Initiative: The Re:Invent Prediction Draft</li>
<li class="whitespace-normal break-words"> Justin’s Winning Streak: A Study in Actually Doing Your Homework</li>
<li class="whitespace-normal break-words"> Serverless GPUs and Broken Dreams: Our Re:Invent Wishlist</li>
<li class="whitespace-normal break-words"> Shooting in the Dark: AWS Predictions Edition</li>
<li class="whitespace-normal break-words"> We’re Never Good at This, But Here We Go Again</li>
<li class="whitespace-normal break-words"> Vegas Odds: What Happens at Re:Invent, Gets Predicted Wrong</li>
</ul>

<h2 class="font-claude-response-heading text-text-100 mt-1 -mb-0.5">AWS Re:Invent Predictions 2025</h2>
<p class="font-claude-response-body whitespace-normal break-words">The annual prediction draft is here! Draft order was determined by dice roll: Jonathan first, followed by Ryan, Justin, and Matt in last position. As always, it’s a reverse order format, with points awarded for each correct prediction announced during the Tuesday, Wednesday, and Thursday keynotes.</p>
<h3 class="font-claude-response-subheading text-text-100 mt-1 -mb-1.5">Jonathan’s Predictions</h3>
<ol class="[&amp;:not(:last-child)_ul]:pb-1 [&amp;:not(:last-child)_ol]:pb-1 list-decimal space-y-2.5 pl-7">
<li class="whitespace-normal break-words"><strong>Serverless GPU Support</strong> – An extension to Lambda or a different service that provides on-demand serverless GPU/inference capability. Likely with requirements for pre-warmed provisioned instances.</li>
<li class="whitespace-normal break-words"><strong>Agentic Platform for Continuous AI Agents</strong> – A service that allows agents to run continuously with goals or instructions, performing actions periodically or on-demand in the real world. Think: running agents on a schedule that can check conditions and take automated actions.</li>
<li class="whitespace-normal break-words"><strong>Werner Vogels Retirement Announcement</strong> – Werner will announce that this is his last Re:Invent keynote and that he is retiring.</li>
</ol>
<h3 class="font-claude-response-subheading text-text-100 mt-1 -mb-1.5">Ryan’s Predictions</h3>
<ol class="[&amp;:not(:last-child)_ul]:pb-1 [&amp;:not(:last-child)_ol]:pb-1 list-decimal space-y-2.5 pl-7">
<li class="whitespace-normal break-words"><strong>New Trainium 3 Chips, Inferentia, and Graviton Chips</strong> – New generation of AWS custom silicon across training, inference, and general compute.</li>
<li class="whitespace-normal break-words"><strong>Expanded Model Availability in Bedrock</strong> – AWS will significantly expand the number of models available in Bedrock, potentially via partnerships or integrations with additional providers.</li>
<li class="whitespace-normal break-words"><strong>Major Refresh to AWS Organizations</strong> – UI-based or functionality refresh providing better visibility into SCPs, OU mappings, and stack sets across organizations.</li>
</ol>
<h3 class="font-claude-response-subheading text-text-100 mt-1 -mb-1.5">Justin’s Predictions</h3>
<ol class="[&amp;:not(:last-child)_ul]:pb-1 [&amp;:not(:last-child)_ol]:pb-1 list-decimal space-y-2.5 pl-7">
<li class="whitespace-normal break-words"><strong>New Nova Model with Multi-modal Support</strong> – Launch of Nova Premier or Nova Sonic with multi-modal capabilities, bringing Amazon’s foundational model to the next level.</li>
<li class="whitespace-normal break-words"><strong>OpenAI Partnership Announcement</strong> – AWS and OpenAI will announce a strategic partnership, potentially bringing OpenAI models to Bedrock (likely announced on stage).</li>
<li class="whitespace-normal break-words"><strong>Advanced Agentic AI Capabilities for Security Hub</strong> – Enhanced features for Security Hub adding Agentic AI to help automate SOC team operations.</li>
</ol>
<h3 class="font-claude-response-subheading text-text-100 mt-1 -mb-1.5">Matt’s Predictions</h3>
<ol class="[&amp;:not(:last-child)_ul]:pb-1 [&amp;:not(:last-child)_ol]:pb-1 list-decimal space-y-2.5 pl-7">
<li class="whitespace-normal break-words"><strong>Model Router for Bedrock</strong> – A service to route LLM queries to different AI models, simplifying the process of testing and selecting models for different use cases.</li>
<li class="whitespace-normal break-words"><strong>Well-Architected Framework Expansion</strong> – New lenses or significant updates to the Well-Architected Framework beyond the existing Generative AI and Sustainability lenses.</li>
<li class="whitespace-normal break-words"><strong>End User Authentication That Doesn’t Suck</strong> – A new or significantly revamped end-user authentication service (essentially Cognito 2.0) that actually works well for client portals.</li>
</ol>

<h2 class="font-claude-response-heading text-text-100 mt-1 -mb-0.5">Tiebreaker: How Many Times Will “AI” or “Artificial Intelligence” Be Said On Stage?</h2>
<p class="font-claude-response-body whitespace-normal break-words">If we end in a tie (or nobody gets any predictions correct, which is historically possible), we go to the tiebreaker!</p>



Host
Guess




Matt
200


Justin
160


Ryan
99


Jonathan
1




<h2 class="font-claude-response-heading text-text-100 mt-1 -mb-0.5">Honorable Mentions</h2>
<p class="font-claude-response-body whitespace-normal break-words">Ideas that didn’t make the cut but might just surprise us:</p>
<p class="font-claude-response-body whitespace-normal break-words"><strong>Jonathan:</strong></p>
<ul class="[&amp;:not(:last-child)_ul]:pb-1 [&amp;:not(:last-child)_ol]:pb-1 list-disc space-y-2.5 pl-7">
<li class="whitespace-normal break-words">Mathematical proof/verification that text was generated by Amazon’s LLMs (watermarking for AI output)</li>
<li class="whitespace-normal break-words">Marketplace for AI work – publish and monetize AI-based tools with Amazon handling billing</li>
<li class="whitespace-normal break-words">New consumer device to accompany Nova models (smarter Alexa replacement with local inference)</li>
</ul>
<p class="font-claude-response-body whitespace-normal break-words"><strong>Ryan:</strong></p>
<ul class="[&amp;:not(:last-child)_ul]:pb-1 [&amp;:not(:last-child)_ol]:pb-1 list-disc space-y-2.5 pl-7">
<li class="whitespace-normal break-words">FinOps AI recommender for model usage and cost optimization</li>
<li class="whitespace-normal break-words">Savings plans or committed use discounts for Bedrock use cases</li>
</ul>
<p class="font-claude-response-body whitespace-normal break-words"><strong>Matt:</strong></p>
<ul class="[&amp;:not(:last-child)_ul]:pb-1 [&amp;:not(:last-child)_ol]:pb-1 list-disc space-y-2.5 pl-7">
<li class="whitespace-normal break-words">Sustainability/green dashboard improvements</li>
<li class="whitespace-normal break-words">AI-specific features for Aurora or DSQL</li>
</ul>
<p class="font-claude-response-body whitespace-normal break-words"><strong>Justin:</strong></p>
<ul class="[&amp;:not(:last-child)_ul]:pb-1 [&amp;:not(:last-child)_ol]:pb-1 list-disc space-y-2.5 pl-7">
<li class="whitespace-normal break-words">Big S3 vectors announcement and integration to Bedrock</li>
<li class="whitespace-normal break-words">FinOps service for Kubernetes</li>
<li class="whitespace-normal break-words">Amazon Q Developer with autonomous coding agents</li>
<li class="whitespace-normal break-words">New GPU architecture combining training/inference/Graviton capabilities</li>
<li class="whitespace-normal break-words">Amazon Bedrock model marketplace for revenue share on fine-tuned models</li>
</ul>

<h2 class="font-claude-response-heading text-text-100 mt-1 -mb-0.5">Quick Hits From the Episode</h2>
<ul class="[&amp;:not(:last-child)_ul]:pb-1 [&amp;:not(:last-child)_ol]:pb-1 list-disc space-y-2.5 pl-7">
<li class="whitespace-normal break-words"><strong>00:02</strong> – Is it really Re:Invent already? The existential crisis begins.</li>
<li class="whitespace-normal break-words"><strong>01:44</strong> – Jonathan reveals why Justin always wins: “Because you read the notes.”</li>
<li class="whitespace-normal break-words"><strong>02:54</strong> – Matt hasn’t been to a Re:Invent session since Image Builder launched… eight years ago.</li>
<li class="whitespace-normal break-words"><strong>05:03</strong> – Jonathan comes in hot with serverless GPU support prediction.</li>
<li class="whitespace-normal break-words"><strong>06:57</strong> – The inference vs. training cost debate – where’s the real ROI?</li>
<li class="whitespace-normal break-words"><strong>09:30</strong> – Matt’s picks get systematically destroyed by earlier drafters.</li>
<li class="whitespace-normal break-words"><strong>14:09</strong> – The OpenAI partnership prediction causes draft chaos.</li>
<li class="whitespace-normal break-words"><strong>16:24</strong> – Jonathan drops the Werner retirement bombshell.</li>
<li class="whitespace-normal break-words"><strong>19:12</strong> – Justin’s Security Hub prediction: “Please automate the SOC teams.”</li>
<li class="whitespace-normal break-words"><strong>19:46</strong> – Everyone hates Cognito. Matt’s prediction resonates with the universe.</li>
<li class="whitespace-normal break-words"><strong>21:47</strong> – Tiebreaker time: Jonathan goes with 1 out of pure spite.</li>
<li class="whitespace-normal break-words"><strong>24:08</strong> – Honorable mentions include mathematical AI verification and a marketplace for AI work.</li>
</ul>

<h2 class="font-claude-response-heading text-text-100 mt-1 -mb-0.5">Re:Invent Tips (From People Who Aren’t Going)</h2>
<p class="font-claude-response-body whitespace-normal break-words">Since none of us are attending this year, here’s what we remember from the good old days:</p>
<ul class="[&amp;:not(:last-child)_ul]:pb-1 [&amp;:not(:last-child)_ol]:pb-1 list-disc space-y-2.5 pl-7">
<li class="whitespace-normal break-words"><strong>Chalk Talks</strong> remain highly respected and valuable for deep technical content</li>
<li class="whitespace-normal break-words"><strong>Labs and hands-on sessions</strong> are worth your time more than keynotes you can watch online</li>
<li class="whitespace-normal break-words"><strong>Networking</strong> on the expo floor and in hallways is where the real value happens</li>
<li class="whitespace-normal break-words"><strong>Don’t try to see everything</strong> – focus on what matters to your work</li>
<li class="whitespace-normal break-words"><strong>Stay hydrated</strong> – Vegas is dry and conferences are exhausting</li>
</ul>

<h2 class="font-claude-response-heading text-text-100 mt-1 -mb-0.5">Closing</h2>
<p class="font-claude-response-body whitespace-normal break-words">And that is the week in the cloud! We’re taking Thanksgiving week off, so there won’t be an episode during Re:Invent. We’ll record late that week and have a dedicated Re:Invent recap episode the following week. If you’re heading to Las Vegas, have a great time and let us know how it goes!</p>
<p class="font-claude-response-body whitespace-normal break-words">Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at <a class="underline" href="https://www.thecloudpod.net">theCloudPod.net</a> or tweet at us with the hashtag <strong>#theCloudPod</strong></p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2248333/c1e-rodobw3vd7f0wpw1-wwpr3jx7id5x-ua5nu7.mp3" length="14936834"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 332 of The Cloud Pod – where the forecast is always cloudy! It’s Thanksgiving week, which can only mean one thing: AWS Re:Invent predictions! In this special episode, Justin, Jonathan, Ryan, and Matt engage in the annual tradition of drafting their best guesses for what AWS will announce at the biggest cloud conference of the year. Justin is the reigning champion (probably because he actually reads the show notes), but with a reverse snake draft order determined by dice roll, anything could happen. Will Werner announce his retirement? Is Cognito finally getting a much-needed overhaul? And just how many times will “AI” be uttered on stage? Grab your turkey and let’s get predicting!
Titles we almost went with this week:

 Roll For Initiative: The Re:Invent Prediction Draft
 Justin’s Winning Streak: A Study in Actually Doing Your Homework
 Serverless GPUs and Broken Dreams: Our Re:Invent Wishlist
 Shooting in the Dark: AWS Predictions Edition
 We’re Never Good at This, But Here We Go Again
 Vegas Odds: What Happens at Re:Invent, Gets Predicted Wrong


AWS Re:Invent Predictions 2025
The annual prediction draft is here! Draft order was determined by dice roll: Jonathan first, followed by Ryan, Justin, and Matt in last position. As always, it’s a reverse order format, with points awarded for each correct prediction announced during the Tuesday, Wednesday, and Thursday keynotes.
Jonathan’s Predictions

Serverless GPU Support – An extension to Lambda or a different service that provides on-demand serverless GPU/inference capability. Likely with requirements for pre-warmed provisioned instances.
Agentic Platform for Continuous AI Agents – A service that allows agents to run continuously with goals or instructions, performing actions periodically or on-demand in the real world. Think: running agents on a schedule that can check conditions and take automated actions.
Werner Vogels Retirement Announcement – Werner will announce that this is his last Re:Invent keynote and that he is retiring.

Ryan’s Predictions

New Trainium 3 Chips, Inferentia, and Graviton Chips – New generation of AWS custom silicon across training, inference, and general compute.
Expanded Model Availability in Bedrock – AWS will significantly expand the number of models available in Bedrock, potentially via partnerships or integrations with additional providers.
Major Refresh to AWS Organizations – UI-based or functionality refresh providing better visibility into SCPs, OU mappings, and stack sets across organizations.

]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2248333/c1a-k5d5-gp976vjjid6r-r6ceuz.jpg"></itunes:image>
                                                                            <itunes:duration>00:31:08</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2248333/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[331: Claude Gets a $30 Billion Azure Wardrobe and Two New Best Friends]]>
                </title>
                <pubDate>Thu, 27 Nov 2025 19:06:35 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2248422</guid>
                                    <link>https://tcpfm.castos.com/episodes/331-claude-gets-a-30-billion-azure-wardrobe-and-two-new-best-friends</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 331 of The Cloud Pod, where the forecast is always cloudy! Jonathan, Ryan, Matt, and Justin (for a little bit, anyway) are in the studio today to bring you all the latest in cloud and AI news. This week, we’re looking at our Ignite predictions (that side gig as internet psychics isn’t looking too good) undersea cables (our fave!), plus datacenters and more. Plus Claude and Azure make a 30 billion dollar deal! Take a break from turkey and avoiding politics, and let’s take a trip into the clouds!   </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> GPT-5.1 Gets a Shell Tool Because Apparently We Haven’t Learned Anything From Sci-Fi Movies</li>
<li> The Great Ingress Egress: NGINX Controller Waves Goodbye After Years of Volunteer Burnout</li>
<li> Queue the Applause: Lambda SQS Mapping Gets a Serious Speed Boost</li>
<li> SELECT * FROM future WHERE SQL meets AI without the prompt drama</li>
<li> MFA or GTFO: Microsoft’s 99.6% Phishing-Resistant Authentication Achievement</li>
<li> JWT Another Thing ALB Can Do: OAuth Validation Moves to the Load Balancer</li>
<li> Google’s Emerging Threats Center: Because Manually Checking 12 Months of Logs Sounds Terrible</li>
<li> EventBridge Gets a Drag-and-Drop Makeover: No More Schema Drama</li>
<li> Permission Denied: How Granting Access Took Down the Internet</li>
</ul>
<p>
</p><h2>Follow Up </h2>
<p>00:51 Ignite Predictions – The Results </p>
<p>Matt (Who is in charge of sound effects, so be aware) </p>
<ol>
<li style="font-weight:400;">ACM Competitor – True SSL competitive product</li>
<li style="font-weight:400;">AI announcement in Security AI Agent (Copilot for Sentinel) – sort of (½) </li>
<li style="font-weight:400;">Azure DevOps Announcement</li>
</ol>
<p>Justin</p>
<ol>
<li style="font-weight:400;">New Cobalt and Mai Gen 2 or similar – Check</li>
<li style="font-weight:400;">Price Reduction on OpenAI &amp; Significant Prompt Caching </li>
<li style="font-weight:400;">Microsoft Foundational LLM to compete with OpenAI – </li>
</ol>
<p>Jonathan</p>
<ol>
<li style="font-weight:400;">The general availability of new, smaller, and more power-efficient Azure Local hardware form factors</li>
<li style="font-weight:400;">Declarative AI on Fabric: This represents a move towards a declarative model, where users state the desired outcome, and the AI agent system determines the steps needed to achieve it within the Fabric ecosystem.</li>
<li style="font-weight:400;">Advanced Cost Management: Granular dashboards to track the token and compute consumption per agent or per transaction, enabling businesses to forecast costs and set budgets for their agent workforce.</li>
</ol>
<p>How many times will they say Copilot:</p>
<p>The word “Copilot” is mentioned 46 to 71 times in the video.</p>
<p>Jonathan 45</p>
<p>Justin: 35</p>
<p>Matt: 40</p>
<h2>General News</h2>
<p>05:13 <a href="https://blog.cloudflare.com/18-november-2025-outage/?mkt_tok=NzEzLVhTQy05MTgAAAGeOJjevyQ0IlOLcPRqQDQW4uEMmHNfu4JzoArTvzIdZzMiJ9HX_9Hnem8EL2oietYsvRbz2F_SE_rUtfE7Bzbqmcmb6fAs5b4bG8_uxxYvysvRlAG8cmX6/">Cloudflare outage on November 18, 2025</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.cloudflare.com/">Cloudflar</a>e experienced its worst outage since 2019 on November 18, 2025, lasting approximately three hours and affecting core traffic routing across its entire network. </li>
<li style="font-weight:400;">The incident was triggered by a database permissions change that caused a <a href="https://www.cloudflare.com/application-services/products/bot-management/">Bot Management</a> feature file to double in size, exceeding hardcoded limits in their proxy software and causing system panics that resulted in 5xx errors for customers.</li>
<li style="font-weight:400;">The root cause reveals a cascading failure pattern, where a ClickHouse database query began returning duplicate column metadata after permission changes. </li>
<li style="font-weight:400;">This resulted in a significant i...</li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:00) - The Cloud Pod</li><li>(00:01:04) - Matchbox: Microsoft's AI Announcement</li><li>(00:05:04) - Cloudflare's Worst Outage Since 2019</li><li>(00:07:32) - GPT 5.1 Release</li><li>(00:11:21) - ChatGPT Launches Group Chat</li><li>(00:14:53) - Microsoft Teams: Working in Teams with Copilot</li><li>(00:16:16) - Gemini 3.0 Pro Launch at Google AI Conference</li><li>(00:18:51) - Microsoft, Nvidia to Develop Cloud Models for Anthropic</li><li>(00:22:45) - Ingress NGINX Controller to Be Retired</li><li>(00:25:05) - Cloudflare Expands AI into the Edge with a Replicate</li><li>(00:29:31) - AWS Lambda: Provisioned Mode for SQS</li><li>(00:32:31) - Amazon EventBridge Expands Schema Aware with New Rule Builder</li><li>(00:34:37) - Application Load Balancers support JWT Token Verification</li><li>(00:37:51) - How Protective Reroute Improves Network Resilience</li><li>(00:40:26) - Google Security Operations Launches Emerging Threat Center</li><li>(00:46:48) - Google to Invest $7 Million in Subsea Cable Networks</li><li>(00:50:17) - Microsoft's Azure AI SuperFactory</li><li>(00:53:43) - Azure DB for Postgres Announces Private Preview</li><li>(00:57:04) - Microsoft Defender for Cloud Integrates with GitHub Advanced Security</li><li>(01:00:09) - Azure introduces Smart Tiering for Blob Storage</li><li>(01:06:29) - How to lay a fiber cable in your house</li><li>(01:10:02) - Microsoft's AI Agent Development Announcement</li><li>(01:16:21) - How to Manage Ideas in the AI World</li><li>(01:22:18) - The Project Narrative in the Machine Learning Code</li><li>(01:23:38) - Week in Cloud: The Cloud Pod</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 331 of The Cloud Pod, where the forecast is always cloudy! Jonathan, Ryan, Matt, and Justin (for a little bit, anyway) are in the studio today to bring you all the latest in cloud and AI news. This week, we’re looking at our Ignite predictions (that side gig as internet psychics isn’t looking too good) undersea cables (our fave!), plus datacenters and more. Plus Claude and Azure make a 30 billion dollar deal! Take a break from turkey and avoiding politics, and let’s take a trip into the clouds!   
Titles we almost went with this week

 GPT-5.1 Gets a Shell Tool Because Apparently We Haven’t Learned Anything From Sci-Fi Movies
 The Great Ingress Egress: NGINX Controller Waves Goodbye After Years of Volunteer Burnout
 Queue the Applause: Lambda SQS Mapping Gets a Serious Speed Boost
 SELECT * FROM future WHERE SQL meets AI without the prompt drama
 MFA or GTFO: Microsoft’s 99.6% Phishing-Resistant Authentication Achievement
 JWT Another Thing ALB Can Do: OAuth Validation Moves to the Load Balancer
 Google’s Emerging Threats Center: Because Manually Checking 12 Months of Logs Sounds Terrible
 EventBridge Gets a Drag-and-Drop Makeover: No More Schema Drama
 Permission Denied: How Granting Access Took Down the Internet


Follow Up 
00:51 Ignite Predictions – The Results 
Matt (Who is in charge of sound effects, so be aware) 

ACM Competitor – True SSL competitive product
AI announcement in Security AI Agent (Copilot for Sentinel) – sort of (½) 
Azure DevOps Announcement

Justin

New Cobalt and Mai Gen 2 or similar – Check
Price Reduction on OpenAI & Significant Prompt Caching 
Microsoft Foundational LLM to compete with OpenAI – 

Jonathan

The general availability of new, smaller, and more power-efficient Azure Local hardware form factors
Declarative AI on Fabric: This represents a move towards a declarative model, where users state the desired outcome, and the AI agent system determines the steps needed to achieve it within the Fabric ecosystem.
Advanced Cost Management: Granular dashboards to track the token and compute consumption per agent or per transaction, enabling businesses to forecast costs and set budgets for their agent workforce.

How many times will they say Copilot:
The word “Copilot” is mentioned 46 to 71 times in the video.
Jonathan 45
Justin: 35
Matt: 40
General News
05:13 Cloudflare outage on November 18, 2025

Cloudflare experienced its worst outage since 2019 on November 18, 2025, lasting approximately three hours and affecting core traffic routing across its entire network. 
The incident was triggered by a database permissions change that caused a Bot Management feature file to double in size, exceeding hardcoded limits in their proxy software and causing system panics that resulted in 5xx errors for customers.
The root cause reveals a cascading failure pattern, where a ClickHouse database query began returning duplicate column metadata after permission changes. 
This resulted in a significant i...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[331: Claude Gets a $30 Billion Azure Wardrobe and Two New Best Friends]]>
                </itunes:title>
                                    <itunes:episode>331</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 331 of The Cloud Pod, where the forecast is always cloudy! Jonathan, Ryan, Matt, and Justin (for a little bit, anyway) are in the studio today to bring you all the latest in cloud and AI news. This week, we’re looking at our Ignite predictions (that side gig as internet psychics isn’t looking too good) undersea cables (our fave!), plus datacenters and more. Plus Claude and Azure make a 30 billion dollar deal! Take a break from turkey and avoiding politics, and let’s take a trip into the clouds!   </p>
<h3>Titles we almost went with this week</h3>
<ul>
<li> GPT-5.1 Gets a Shell Tool Because Apparently We Haven’t Learned Anything From Sci-Fi Movies</li>
<li> The Great Ingress Egress: NGINX Controller Waves Goodbye After Years of Volunteer Burnout</li>
<li> Queue the Applause: Lambda SQS Mapping Gets a Serious Speed Boost</li>
<li> SELECT * FROM future WHERE SQL meets AI without the prompt drama</li>
<li> MFA or GTFO: Microsoft’s 99.6% Phishing-Resistant Authentication Achievement</li>
<li> JWT Another Thing ALB Can Do: OAuth Validation Moves to the Load Balancer</li>
<li> Google’s Emerging Threats Center: Because Manually Checking 12 Months of Logs Sounds Terrible</li>
<li> EventBridge Gets a Drag-and-Drop Makeover: No More Schema Drama</li>
<li> Permission Denied: How Granting Access Took Down the Internet</li>
</ul>
<p>
</p><h2>Follow Up </h2>
<p>00:51 Ignite Predictions – The Results </p>
<p>Matt (Who is in charge of sound effects, so be aware) </p>
<ol>
<li style="font-weight:400;">ACM Competitor – True SSL competitive product</li>
<li style="font-weight:400;">AI announcement in Security AI Agent (Copilot for Sentinel) – sort of (½) </li>
<li style="font-weight:400;">Azure DevOps Announcement</li>
</ol>
<p>Justin</p>
<ol>
<li style="font-weight:400;">New Cobalt and Mai Gen 2 or similar – Check</li>
<li style="font-weight:400;">Price Reduction on OpenAI &amp; Significant Prompt Caching </li>
<li style="font-weight:400;">Microsoft Foundational LLM to compete with OpenAI – </li>
</ol>
<p>Jonathan</p>
<ol>
<li style="font-weight:400;">The general availability of new, smaller, and more power-efficient Azure Local hardware form factors</li>
<li style="font-weight:400;">Declarative AI on Fabric: This represents a move towards a declarative model, where users state the desired outcome, and the AI agent system determines the steps needed to achieve it within the Fabric ecosystem.</li>
<li style="font-weight:400;">Advanced Cost Management: Granular dashboards to track the token and compute consumption per agent or per transaction, enabling businesses to forecast costs and set budgets for their agent workforce.</li>
</ol>
<p>How many times will they say Copilot:</p>
<p>The word “Copilot” is mentioned 46 to 71 times in the video.</p>
<p>Jonathan 45</p>
<p>Justin: 35</p>
<p>Matt: 40</p>
<h2>General News</h2>
<p>05:13 <a href="https://blog.cloudflare.com/18-november-2025-outage/?mkt_tok=NzEzLVhTQy05MTgAAAGeOJjevyQ0IlOLcPRqQDQW4uEMmHNfu4JzoArTvzIdZzMiJ9HX_9Hnem8EL2oietYsvRbz2F_SE_rUtfE7Bzbqmcmb6fAs5b4bG8_uxxYvysvRlAG8cmX6/">Cloudflare outage on November 18, 2025</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.cloudflare.com/">Cloudflar</a>e experienced its worst outage since 2019 on November 18, 2025, lasting approximately three hours and affecting core traffic routing across its entire network. </li>
<li style="font-weight:400;">The incident was triggered by a database permissions change that caused a <a href="https://www.cloudflare.com/application-services/products/bot-management/">Bot Management</a> feature file to double in size, exceeding hardcoded limits in their proxy software and causing system panics that resulted in 5xx errors for customers.</li>
<li style="font-weight:400;">The root cause reveals a cascading failure pattern, where a ClickHouse database query began returning duplicate column metadata after permission changes. </li>
<li style="font-weight:400;">This resulted in a significant increase in the feature file, from approximately 60 features to over 200, which exceeded the preallocated memory limit of 200 features in their Rust-based <a href="https://blog.cloudflare.com/20-percent-internet-upgrade/">FL2</a> proxy code. </li>
<li style="font-weight:400;">The team initially suspected a DDoS attack due to fluctuating symptoms caused by the bad configuration file being generated every five minutes as the database cluster was gradually updated.</li>
<li style="font-weight:400;">The outage impacted multiple Cloudflare services, including their CDN, Workers KV, Access, and even their own dashboard login system through Turnstile dependencies. </li>
<li style="font-weight:400;">Customers on the older FL proxy engine did not see errors but received incorrect bot scores of zero, potentially causing false positives for those using bot blocking rules.</li>
<li style="font-weight:400;">Cloudflare’s remediation plan includes treating internal configuration files with the same validation rigor as user input, implementing more global kill switches for features, and preventing error reporting systems from consuming excessive resources during incidents. </li>
<li style="font-weight:400;">The company acknowledged this as unacceptable for their position in the Internet ecosystem and committed to architectural improvements to prevent similar failures.</li>
</ul>
<p>06:41  Justin – “Definitely a bad outage, but I appreciate that they owned it, and owned it hard… especially considering they were front page news.” </p>
<h2>AI Is Going Great, or How ML Makes Money </h2>
<p>07:27 <a href="https://openai.com/index/gpt-5-1-for-developers">Introducing GPT-5.1 for developers | OpenAI</a></p>
<ul>
<li style="font-weight:400;">OpenAI has released GPT-5.1 in their API platform with adaptive reasoning that dynamically adjusts thinking time based on task complexity, resulting in 2-3x faster performance on simple tasks while maintaining frontier intelligence. </li>
<li style="font-weight:400;">The model includes a new “no reasoning” mode (reasoning_effort set to ‘none’) that delivers 20% better low-latency tool calling performance compared to GPT-5 minimal reasoning, making it suitable for latency-sensitive applications while supporting web search and improved parallel tool calling.</li>
<li style="font-weight:400;">GPT-5.1 introduces extended prompt caching with 24-hour retention (up from minutes), maintaining the existing 90% cost reduction for cached tokens with no additional storage charges. </li>
<li style="font-weight:400;">Early adopters report the model uses approximately half the tokens of competitors at similar quality levels, with companies like <a href="https://www.bamfunds.com/">Balyasny Asset Management</a> seeing agents run 50% faster while exceeding GPT-5 accuracy.</li>
<li style="font-weight:400;">The release includes two new <a href="https://platform.openai.com/docs/guides/tools-apply-patch">developer tools</a> in the Responses API: apply_patch for structured code editing using diffs without JSON escaping, and a shell tool that allows the model to propose and execute command-line operations in a controlled plan-execute loop. GPT-5.1 achieves 76.3% on <a href="https://openai.com/index/introducing-swe-bench-verified/">SWE-bench Verified</a> and shows 7% improvement on diff editing benchmarks according to early testing partners like <a href="https://cline.bot/">Cline</a> and <a href="https://www.augmentcode.com/">Augment Code</a>.</li>
<li style="font-weight:400;">OpenAI is also releasing specialized gpt-5.1-codex and gpt-5.1-codex-mini models optimized specifically for long-running agentic coding tasks, while maintaining the same pricing and rate limits as GPT-5. 
<ul>
<li style="font-weight:400;">If you didn’t catch it in the podcast, Justin HATES this. Hates. It. All the hate. </li>
</ul>
</li>
<li style="font-weight:400;">The company has committed to not deprecating GPT-5 in the API and will provide advanced notice if deprecation plans change.</li>
<li style="font-weight:400;"><a href="https://platform.openai.com/docs/pricing">Pricing and rate limits</a> are the same at GPT-5. </li>
</ul>
<p>9:31  Ryan – “I didn’t really like GPT-5, so I don’t have high expectations, but as these things enhance, I’ve found  using different models for different use cases has some advantages, so maybe I’ll find the case for this one.” </p>
<p>11:31 <a href="https://openai.com/index/group-chats-in-chatgpt">Piloting group chats in ChatGPT | OpenAI</a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/">OpenAI</a> is piloting group chat functionality in ChatGPT, starting with users in Japan, New Zealand, South Korea, and Taiwan across all subscription tiers (<a href="https://chatgpt.com/pricing">Free, Go, Plus, and Pro</a>). </li>
<li style="font-weight:400;">The feature allows up to 20 people to collaborate in a shared conversation with ChatGPT, with responses powered by <a href="https://help.openai.com/en/articles/11909943-gpt-51-in-chatgpt">GPT-5.1 Auto</a> that selects the optimal model based on the prompt and the user’s subscription level.</li>
<li style="font-weight:400;">ChatGPT has been trained with new social behaviors for group contexts, including deciding when to respond or stay quiet based on conversation flow, reacting with emojis, and referencing profile photos for personalized image generation. </li>
<li style="font-weight:400;">Users can mention “ChatGPT” explicitly to trigger a response, and custom instructions can be set per group chat to control tone and personality.</li>
<li style="font-weight:400;">Privacy controls separate group chats from personal conversations, with personal ChatGPT memory not shared or used in group contexts. </li>
<li style="font-weight:400;">Users must accept invitations to join, can see all participants, and can leave at any time, with group creators having special removal privileges.</li>
<li style="font-weight:400;">The feature includes safeguards for users under 18, automatically reducing sensitive content exposure for all group members when a minor is present. 
<ul>
<li style="font-weight:400;">Parents can disable group chats entirely through <a href="https://help.openai.com/en/articles/12315553-parental-controls-on-chatgpt-faq">parental controls</a>, providing additional oversight for younger users.</li>
</ul>
</li>
<li style="font-weight:400;">Rate limits apply only to ChatGPT responses (not user-to-user messages) and count against the subscription tier of the person ChatGPT is responding to. </li>
<li style="font-weight:400;">The feature supports search, image and file uploads, image generation, and dictation, making it functional for both personal planning and workplace collaboration scenarios.</li>
</ul>
<p>12:41  Jonathan – “I’d rather actually have group chats enabled if kids are going to use it because at least you have witnesses to the conversation at that point.”</p>
<p>16:38 <a href="https://blog.google/products/gemini/gemini-3/">Gemini 3: Introducing the latest Gemini AI model from Google</a></p>
<ul>
<li style="font-weight:400;">Google launches <a href="https://deepmind.google/models/gemini/">Gemini 3</a> Pro in preview across its product suite, including the <a href="https://blog.google/products/gemini/gemini-3-gemini-app">Gemini app</a>, <a href="https://blog.google/technology/developers/gemini-3-developers">AI Studio</a>, <a href="https://cloud.google.com/blog/products/ai-machine-learning/gemini-3-is-available-for-enterprise">Vertex AI</a>, and a new <a href="http://blog.google/products/search/gemini-3-search-ai-mode">AI Mode in Search</a> with generative UI capabilities. </li>
<li style="font-weight:400;">The model achieves a 1501 Elo score on LMArena leaderboard and demonstrates 91.9% on GPQA Diamond, with a 1 million token context window for processing <a href="https://blog.google/technology/ai/google-gemini-ai/">multimodal inputs</a> including text, images, video, audio and code.</li>
<li style="font-weight:400;">Gemini 3 Deep Think mode offers enhanced reasoning performance, scoring 41.0% on Humanity’s Last Exam and 45.1% on ARC-AGI-2 with code execution. </li>
<li style="font-weight:400;">Google is providing early access to safety testers before rolling out to Google AI Ultra subscribers in the coming weeks, following comprehensive safety evaluations per their Frontier Safety Framework.</li>
<li style="font-weight:400;">Google introduces <a href="http://antigravity.google/">Antigravity</a>, a new agentic development platform that integrates Gemini 3 Pro with Gemini 2.5 Computer Use for browser control and Gemini 2.5 Image for editing. </li>
<li style="font-weight:400;">The platform enables autonomous agent workflows with direct access to editor, terminal, and browser, scoring 54.2% on Terminal-Bench 2.0 and 76.2% on SWE-bench Verified for coding agent capabilities.</li>
<li style="font-weight:400;">The model shows improved long-horizon planning by topping Vending-Bench 2 leaderboard and delivers enhanced <a href="https://blog.google/technology/google-deepmind/google-gemini-ai-update-december-2024/#ceo-message">agentic capabilities</a> through Gemini Agent for Google AI Ultra subscribers. </li>
<li style="font-weight:400;">Gemini 3 demonstrates 72.1% on SimpleQA Verified for factual accuracy and 1487 Elo on WebDev Arena for web development tasks, with availability in third-party platforms including Cursor, GitHub, JetBrains, and Replit.</li>
</ul>
<p>18:24  Ryan – “I look forward to trying this. My initial attempts with Gemini 2.5 did not go well, but I found a sort of sweet spot in using it for planning and documentation. It’s still much better at coding than any other model that I’ve used. So cool, I look forward to using this.”</p>
<p>19:14 <a href="https://blogs.microsoft.com/blog/2025/11/18/microsoft-nvidia-and-anthropic-announce-strategic-partnerships/">Microsoft, NVIDIA, and Anthropic announce strategic partnerships – The </a><a href="https://blogs.microsoft.com/blog/2025/11/18/microsoft-nvidia-and-anthropic-announce-strategic-partnerships/">Official Microsoft Blog</a></p>
<ul>
<li style="font-weight:400;">Continuing the messy breakups…</li>
<li style="font-weight:400;">Anthropic commits to $30 billion in <a href="https://blogs.microsoft.com/blog/tag/microsoft-azure/">Azure</a> compute capacity, and up to one gigawatt of additional capacity, making this one of the largest cloud infrastructure commitments in AI history. </li>
<li style="font-weight:400;">This positions Azure as Anthropic’s primary scaling platform for Claude models.</li>
<li style="font-weight:400;"><a href="https://blogs.microsoft.com/blog/tag/nvidia/">NVIDIA</a> and Anthropic are establishing their first deep technology partnership focused on co-design and engineering optimization. </li>
<li style="font-weight:400;">Anthropic will optimize Claude models for NVIDIA Grace Blackwell and Vera Rubin systems, while NVIDIA will tune future architectures specifically for Anthropic workloads to improve performance, efficiency, and total cost of ownership.</li>
<li style="font-weight:400;">Claude models, including <a href="https://www.anthropic.com/claude/sonnet">Sonnet 4.5</a>, <a href="https://www.anthropic.com/claude/opus">Opus 4.1</a>, and <a href="https://www.anthropic.com/claude/haiku">Haiku 4.5</a>, are now available through Microsoft <a href="https://azure.microsoft.com/en-us/blog/introducing-anthropics-claude-models-in-microsoft-foundry-bringing-frontier-intelligence-to-azure/">Foundry</a> on Azure, making Claude the only frontier model accessible across all three major cloud platforms (AWS, Azure, GCP). </li>
<li style="font-weight:400;">Azure enterprise customers gain expanded model choice beyond OpenAI offerings.</li>
<li style="font-weight:400;">Microsoft commits to maintaining Claude integration across its entire Copilot family, including <a href="https://github.com/features/copilot">GitHub Copilot</a>, <a href="https://www.microsoft.com/en-us/microsoft-365-copilot">Microsoft 365 Copilot</a>, and <a href="https://www.microsoft.com/en-us/microsoft-365-copilot/microsoft-copilot-studio/">Copilot Studio</a>.</li>
<li style="font-weight:400;">This ensures developers and enterprise users can leverage Claude capabilities within existing Microsoft productivity and development workflows.</li>
<li style="font-weight:400;">NVIDIA and Microsoft are investing up to $10 billion and $5 billion, respectively, in <a href="https://blogs.microsoft.com/blog/tag/anthropic/">Anthropic</a> as part of the partnership. So yes, that’s a lot of money going back and forth. </li>
<li style="font-weight:400;">The combined $15 billion investment represents substantial backing for Anthropic’s continued development and positions all three companies to benefit from Claude’s growth trajectory.</li>
</ul>
<p>21:57  Jonathan – “I’m wondering what Anthropic’s plan is – what they’re working on in the background – because they have just taken a huge amount of capacity from AWS and their new data center in Northern Indiana, and now another 30 billion in Azure Compute? I guess they’re still building models every day… that’s a lot of money flying around.” </p>
<h2>Cloud Tools  </h2>
<p>23:17  <a href="https://www.kubernetes.dev/blog/2025/11/12/ingress-nginx-retirement/">Ingress NGINX Retirement: What You Need to Know | Kubernetes </a><a href="https://www.kubernetes.dev/blog/2025/11/12/ingress-nginx-retirement/">Contributors</a></p>
<ul>
<li style="font-weight:400;"><a href="https://github.com/kubernetes/ingress-nginx/">Ingress NGINX</a>, one of the most popular Kubernetes ingress controllers that has powered billions of requests worldwide, is being <a href="https://github.com/kubernetes-retired/">retired</a> in March 2026 due to unsustainable maintenance burden and mounting technical debt. </li>
<li style="font-weight:400;">The project has struggled for years with only one or two volunteer maintainers working after hours, and despite its widespread use in hosted platforms and enterprise clusters, efforts to find additional support have failed.</li>
<li style="font-weight:400;">The retirement stems from security concerns around features that were once considered flexible but are now viewed as vulnerabilities, particularly the snippets annotations that allowed arbitrary NGINX configuration. </li>
<li style="font-weight:400;">The <a href="https://github.com/kubernetes/committee-security-response">Kubernetes Security Response Committee</a> and SIG Network exhausted all options to make the project sustainable before making this difficult decision to prioritize user safety over continuing an undermaintained critical infrastructure component.</li>
<li style="font-weight:400;">Users should immediately begin <a href="https://gateway-api.sigs.k8s.io/guides/">migrating to Gateway API</a>, the modern replacement for <a href="https://kubernetes.io/docs/concepts/services-networking/ingress/">Ingress</a> that addresses many of the architectural issues that plagued Ingress NGINX. Existing deployments will continue to function and installation artefacts will remain available, but after March 2026, there will be zero security patches, bug fixes, or updates of any kind.</li>
<li style="font-weight:400;">Alternative <a href="https://kubernetes.io/docs/concepts/services-networking/ingress-controllers/">ingress controllers</a> are plentiful and listed in <a href="https://kubernetes.io/docs/concepts/services-networking/ingress-controllers/">Kubernetes documentation</a>, including cloud-provider-specific options and vendor-supported solutions. </li>
<li style="font-weight:400;">Users can check if they are affected by running a simple kubectl command to look for pods with the ingress-nginx selector across all namespaces.</li>
<li style="font-weight:400;">This retirement highlights a critical open source sustainability problem where massively popular infrastructure projects can fail despite widespread adoption when companies benefit from the software but do not contribute maintainer resources back to the community.</li>
</ul>
<p>24:39  Justin – “I’m actually surprised NGINX didn’t want to pick this up; it seems like an obvious move for F5 to pick up and maintain the Ingress NGINX controller. But what do I know?” </p>
<p>25:46 <a href="https://blog.cloudflare.com/replicate-joins-cloudflare/">Replicate is joining Cloudflare</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.cloudflare.com/">Cloudflare</a> acquires <a href="https://replicate.com/">Replicate</a>, bringing its 50,000-plus model catalog and fine-tuning capabilities to Workers AI. </li>
<li style="font-weight:400;">This consolidates model discovery, deployment, and inference into a single platform backed by Cloudflare’s global network.</li>
<li style="font-weight:400;">The acquisition addresses the operational complexity of running AI models by combining Replicate’s <a href="https://github.com/replicate/cog">Cog</a> containerization tool with Cloudflare’s serverless infrastructure. </li>
<li style="font-weight:400;">Developers can now deploy custom models and fine-tune without managing GPU hardware or dependencies.</li>
<li style="font-weight:400;">Existing Replicate APIs will continue functioning without interruption while gaining Cloudflare’s network performance. </li>
<li style="font-weight:400;">Workers AI users get access to proprietary models like <a href="https://openai.com/gpt-5/">GPT-5</a> and <a href="https://www.anthropic.com/claude/sonnet">Claude Sonnet</a> through Replicate’s unified API alongside open-source options.</li>
<li style="font-weight:400;">The integration extends beyond inference to include AI Gateway for observability and cost analytics, plus native connections to Cloudflare’s data stack, including R2 storage and Vectorize database. </li>
<li style="font-weight:400;">This creates an end-to-end platform for building AI applications with state management and real-time capabilities.</li>
<li style="font-weight:400;">Replicate’s community features for sharing models, publishing fine-tunes, and experimentation will remain central to the platform. </li>
<li style="font-weight:400;">The acquisition positions Cloudflare to compete more directly with hyperscaler AI offerings by combining model variety with edge deployment.</li>
</ul>
<p>27:09  Ryan – “Cloudflare has been doing kind of amazing things at the edge, which is kind of neat. We’ve had serverless and functions for a while, and definitely options out there that provide much better performance. It’s kind of neat. They’re well-positioned to do that.”</p>
<p>28:02 <a href="https://www.harness.io/blog/kubecon-2025-recap">KubeCon NA 2025 Recap: The Dawn of the AI Native Era | Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://kccncna2025.sched.com/">KubeCon 2025</a> marked the industry shift from cloud native to AI native, with CNCF launching the <a href="https://www.cncf.io/announcements/2025/11/11/cncf-launches-certified-kubernetes-ai-conformance-program-to-standardize-ai-workloads-on-kubernetes/">Kubernetes AI Conformance Program</a> to standardize how AI and ML workloads run across clouds and hardware accelerators like GPUs and TPUs. </li>
<li style="font-weight:400;">The live demo showed Dynamic Resource Allocation making accelerators first-class citizens in Kubernetes, signaling that AI infrastructure standardization is now a community priority.</li>
<li style="font-weight:400;"><a href="https://www.harness.io/">Harness</a> showcased Agentic AI capabilities that transform traditional CI/CD pipelines into intelligent, adaptive systems that learn and optimize delivery automatically. 
<ul>
<li style="font-weight:400;">Their booth demonstrated 17 integrated products spanning CI, CD, IDP, IaCM, security, testing, and FinOps, with particular emphasis on AI-powered pipeline creation and visual workflow design that caught significant attendee interest.</li>
</ul>
</li>
<li style="font-weight:400;">Security emerged as a critical theme with demonstrations of zero-CVE malware attacks that bypass traditional vulnerability scanners by compromising the build chain itself. 
<ul>
<li style="font-weight:400;">The solution path involves supply chain attestation using SLSA, policy-as-code enforcement, and artifact signing with Sigstore, which Harness demonstrated as native capabilities in their platform.</li>
</ul>
</li>
<li style="font-weight:400;">Apple introduced <a href="https://opensource.apple.com/projects/containerization/">Apple Containerization</a>, a framework running Linux containers directly on macOS using lightweight microVMs that boot minimal Linux kernels in under a second. 
<ul>
<li style="font-weight:400;">This combines VM-level security with container speed, creating safer local development environments that could reshape how developers work on Mac hardware.</li>
</ul>
</li>
<li style="font-weight:400;">The conference emphasized that AI native infrastructure requires intelligent scheduling, deeper observability, and verified agent identity using SPIFFE/SPIRE, with multiple sessions showing practical implementations at scale from companies like Yahoo, managing 8,000 nodes, and Spotify handling a million infrastructure resources.</li>
</ul>
<p>29:51  Justin – “Everyone has moved on from Kubernetes as the hotness; now it’s all AI, so what are people working on in the AI space?”</p>
<h2>AWS </h2>
<p>30:27 <a href="https://aws.amazon.com/blogs/aws/aws-lambda-enhances-sqs-processing-with-new-provisioned-mode-3x-faster-scaling-16x-higher-capacity/">AWS Lambda enhances event processing with provisioned mode for SQS </a><a href="https://aws.amazon.com/blogs/aws/aws-lambda-enhances-sqs-processing-with-new-provisioned-mode-3x-faster-scaling-16x-higher-capacity/">event-source mapping</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/lambda">AWS Lambda</a> now offers provisioned mode for <a href="https://aws.amazon.com/sqs/">SQS</a> <a href="https://docs.aws.amazon.com/lambda/latest/dg/with-sqs.html">event source mapping</a>, providing 3x faster scaling and 16x higher concurrency (up to 20,000 concurrent executions) compared to the standard polling mode. </li>
<li style="font-weight:400;">This addresses customer demands for better control over event processing during traffic spikes, particularly for financial services and gaming companies requiring sub-second latency.</li>
<li style="font-weight:400;">The new provisioned mode uses dedicated event pollers that customers can configure with minimum and maximum values, where each poller handles up to 1 MB/sec throughput, 10 concurrent invokes, or 10 SQS API calls per second. </li>
<li style="font-weight:400;">Setting a minimum number of pollers maintains baseline capacity for immediate response to traffic surges, while the maximum prevents downstream system overload.</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/lambda/pricing/">Pricing</a> is based on Event Poller Units (EPUs) charged for the number of pollers provisioned and their duration, with a minimum of 2 event pollers required per event source mapping. </li>
<li style="font-weight:400;">Each EPU supports up to 1 MB per second throughput capacity, though AWS has not published specific per-EPU pricing on the announcement.</li>
<li style="font-weight:400;">The feature is available now in all commercial AWS Regions and can be configured through the <a href="https://aws.amazon.com/console/">AWS Console</a>, <a href="https://aws.amazon.com/cli/">CLI</a>, or <a href="https://aws.amazon.com/developer/tools/">SDKs</a>. </li>
<li style="font-weight:400;">Monitoring is handled through CloudWatch metrics, specifically the ProvisionedPollers metric that tracks active event pollers in one-minute windows.</li>
<li style="font-weight:400;">This capability enables applications to handle up to 2 GBps of aggregate traffic while automatically scaling down to the configured minimum during low-traffic periods for cost optimization. </li>
<li style="font-weight:400;">The enhanced scaling detects growing backlogs within seconds and adjusts poller count dynamically between configured limits.</li>
</ul>
<p>31:36 Ryan – “Where was this 5 years ago when we were maintaining a logging platform? This would have been very nice!” </p>
<p>33:30 <a href="https://aws.amazon.com/about-aws/whats-new/2025/11/eventbridge-enhanced-visual-rule-builder/">Amazon EventBridge introduces enhanced visual rule builder</a>  </p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/eventbridge/latest/userguide/eb-what-is.html">EventBridge</a> launches a new visual rule builder that integrates the Schema Registry with a drag-and-drop canvas, allowing developers to discover and subscribe to events from over 200 AWS services and custom applications without referencing individual service documentation. </li>
<li style="font-weight:400;">The schema-aware interface helps reduce syntax errors when creating event filter patterns and rules.</li>
<li style="font-weight:400;">The enhanced builder includes a comprehensive event catalog with readily available sample payloads and schemas, eliminating the need to hunt through documentation for event structures. 
<ul>
<li style="font-weight:400;">This addresses a common pain point: developers previously had to manually locate and understand event formats across different AWS services.</li>
</ul>
</li>
<li style="font-weight:400;">Available now in all regions where Schema Registry is launched at no additional cost beyond standard EventBridge usage charges. 
<ul>
<li style="font-weight:400;">The feature is accessible through the EventBridge console and aims to reduce development time for event-driven architectures.</li>
</ul>
</li>
<li style="font-weight:400;">The visual builder particularly benefits teams building complex event-driven applications that need to filter and route events from multiple sources. </li>
<li style="font-weight:400;">By providing schema validation upfront, it helps catch configuration errors before deployment rather than during runtime.</li>
</ul>
<p>34:46 Matt – “I definitely – back in the day – had lots of fun with EventBridge, and trying to make sure I got the schemas right for every frame when you’re trying to trigger one thing from another. So not having to deal with that mess is exponentially better. You know, at this point, though, I feel like I would just tell AI to tell me what the scheme was and solve the problem that way.”</p>
<p>35:43 <a href="https://aws.amazon.com/about-aws/whats-new/2025/11/application-load-balancer-jwt-verification">Application loadbalancer support client credential flow with JWT </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/11/application-load-balancer-jwt-verification">verification</a> </p>
<ul>
<li style="font-weight:400;">ALB now handles JWT token verification natively at the load balancer layer, eliminating the need for custom authentication code in backend applications. This offloads OAuth 2.0 token validation, including signature verification, expiration checks, and claims validation, directly to the load balancer, reducing complexity in microservices architectures.</li>
<li style="font-weight:400;">The feature supports Client Credentials Flow and other OAuth 2.0 flows, making it particularly useful for machine-to-machine and service-to-service authentication scenarios. Organizations can now centralize token validation at the edge rather than implementing it repeatedly across multiple backend services.</li>
<li style="font-weight:400;">This capability is available immediately in all AWS regions where ALB operates, with no additional ALB feature charges beyond standard load balancer pricing. Customers pay only for the existing ALB hourly rates and Load Balancer Capacity Units (LCUs) consumed.</li>
<li style="font-weight:400;">The implementation reads JWTs from request headers and validates against configured JSON Web Key Sets (JWKS) endpoints, supporting integration with identity providers like Auth0, Okta, and AWS Cognito. </li>
<li style="font-weight:400;">Failed validation results in configurable HTTP error responses before requests reach backend targets.</li>
<li style="font-weight:400;">This addresses a common pain point in API gateway and microservices deployments, where each service previously needed its own token validation logic. </li>
<li style="font-weight:400;">The centralized approach reduces code duplication and potential security inconsistencies across service boundaries.</li>
</ul>
<p>38:40 Jonathan – “Maybe this is kind of a sign that Cognito is not gaining the popularity they wanted. Because effectively, you could re-spin this announcement as Auth0 and Okta are now first-class citizens when it comes to authentication through API Gateway and ALB.”</p>
<h2>GCP</h2>
<p>39:10 <a href="https://cloud.google.com/blog/products/networking/how-protective-reroute-improves-network-resilience/">How Protective ReRoute improves network resilience | Google Cloud Blog</a> </p>
<ul>
<li style="font-weight:400;"><a href="https://research.google/pubs/improving-network-availability-with-protective-reroute/">Google Cloud’s Protective ReRoute</a> (PRR) shifts network failure recovery from centralized routers to distributed endpoints, allowing hosts to detect packet loss and immediately reroute traffic to alternate paths. </li>
<li style="font-weight:400;">This host-based approach has reduced inter-datacenter outages from slow network convergence by up to 84 percent since deployment five years ago, with recovery times measured in single-digit multiples of round-trip time rather than seconds or minutes.</li>
<li style="font-weight:400;">PRR works by having hosts continuously monitor path health using TCP retransmission timeouts, then modifying IPv6 flow-label headers to signal the network to use alternate paths when failures occur. Google contributed this IPv6 flow-label modification mechanism to the Linux kernel version 4.20 and later, making it available as open source technology for the broader community.</li>
<li style="font-weight:400;">The feature is particularly critical for AI and ML training workloads, where even brief network interruptions can cause expensive job failures and restarts costing millions in compute time. </li>
<li style="font-weight:400;">Large-scale distributed training across multiple GPUs and TPUs requires the ultra-reliable data distribution that PRR provides to prevent communication pattern disruptions.</li>
<li style="font-weight:400;">Google Cloud customers can use PRR in two modes: hypervisor mode, which automatically protects cross-datacenter traffic without guest OS changes, or guest mode for the fastest recovery, requiring Linux kernel 4.20 plus, TCP applications, and IPv6 traffic, or gVNIC driver for IPv4.</li>
<li style="font-weight:400;"><a href="https://cloud.google.com/compute/docs/networking/tcp-optimization-for-network-performance-in-gcp-and-hybrid#use-prr-for-network-resiliency">Documentation</a> is available at cloud.google.com/compute/docs/networking for enabling guest-mode PRR on critical workloads.</li>
<li style="font-weight:400;">The architecture treats the network as a highly parallel system where reliability increases exponentially with available paths rather than degrading serially through forwarding stages. </li>
<li style="font-weight:400;">This approach capitalizes on Google’s network path diversity to protect real-time applications, frequent short-lived connections, and data integrity scenarios where packet loss causes corruption beyond just throughput reduction.</li>
</ul>
<p>40:57 Ryan – “I was trying to think how I would even implement something like this in guest mode because it breaks my head. It seems pretty cool, and I’m sure that from an underlying technology at the infrastructure level, from the Google network, it sounds pretty neat. But it’s also the coordination of that failover seems very complex. And I would worry.”</p>
<p>41:54 <a href="https://cloud.google.com/blog/products/identity-security/introducing-the-emerging-threats-center-in-google-security-operations/">Introducing the Emerging Threats Center in Google Security Operations | </a><a href="https://cloud.google.com/blog/products/identity-security/introducing-the-emerging-threats-center-in-google-security-operations/">Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/security/products/security-operations">Google Security Operations</a> launches the Emerging Threats Center, a <a href="https://gemini.google.com/app">Gemini</a>-powered detection engineering system that automatically generates security rules when new threat campaigns emerge from <a href="https://cloud.google.com/blog/products/identity-security/introducing-google-threat-intelligence-actionable-threat-intelligence-at-google-scale-at-rsa/">Google Threat Intelligence</a>, <a href="https://mandiant.com/">Mandiant</a>, and <a href="https://www.virustotal.com/">VirusTotal</a>. </li>
<li style="font-weight:400;">The system addresses a key pain point where 59% of security leaders report difficulty deriving actionable intelligence from threat data, typically requiring days or weeks of manual work to assess organizational exposure.</li>
<li style="font-weight:400;">The platform provides two critical capabilities for security teams during major threat events: it automatically searches the previous 12 months of security telemetry for campaign-related indicators of compromise and detection rule matches, while also confirming active protection through campaign-specific detections. 
<ul>
<li style="font-weight:400;">This eliminates the manual cross-referencing process that traditionally occurs when zero-day vulnerabilities emerge.</li>
</ul>
</li>
<li style="font-weight:400;">Under the hood, the system uses an agentic workflow where Gemini ingests threat intelligence from Mandiant incident response and Google’s global visibility, generates synthetic event data mimicking adversary tactics, tests existing detection rules for coverage gaps, and automatically drafts new rules when gaps are found. Human security analysts maintain final approval before deployment, transforming detection engineering from a best-effort manual process into a systematic automated workflow.</li>
<li style="font-weight:400;">The Emerging Threats Center is available today for licensed Google Security Operations customers, though specific pricing details were not disclosed in the announcement. </li>
<li style="font-weight:400;">Organizations with high-volume security operations like <a href="https://www.fiserv.com/">Fiserv</a> are already using the behavioral detection capabilities to move beyond single indicators toward systematic adversary behavior detection.</li>
</ul>
<p>44:40 Jonathan – “I see this as very much a CrowdStrike-type AI solution for Google Cloud, in a way. Looking at the data, you’re identifying emerging threats, which is what CrowdStrike’s sales point really is, and then implementing controls to help quench that.” </p>
<p>47:56 <a href="https://cloud.google.com/blog/products/networking/introducing-dhivaru-new-subsea-cable/">Introducing Dhivaru and two new connectivity hubs | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google is investing in Dhivaru, a new Trans-Indian Ocean subsea cable connecting the Maldives, Christmas Island, and Oman, extending the <a href="https://cloud.google.com/blog/products/infrastructure/bosun-australia-connect-initiative-for-indo-pacific-connectivity?e=48754805">Australia Connect initiative</a> to improve regional connectivity. </li>
<li style="font-weight:400;">The cable system aims to support growing AI service demand like <a href="https://deepmind.google/models/gemini/flash/">Gemini 2.5 Flash</a> and <a href="https://cloud.google.com/vertex-ai">Vertex AI</a> by providing resilient infrastructure across the Indian Ocean region.</li>
<li style="font-weight:400;">The announcement includes two new connectivity hubs in the Maldives and Christmas Island that will provide three core capabilities: cable switching for automatic traffic rerouting during faults, content caching to reduce latency by storing popular content locally, and colocation services offering rack space to carriers and local companies. 
<ul>
<li style="font-weight:400;">These hubs are positioned to serve Africa, the Middle East, South Asia, and Oceania with improved reliability.</li>
</ul>
</li>
<li style="font-weight:400;">Google emphasizes the energy efficiency of subsea cables compared to traditional data centers, noting that connectivity hubs require significantly less power since they focus on networking and localized storage rather than compute-intensive AI and cloud workloads. </li>
<li style="font-weight:400;">The company is exploring ways to use power demand from these hubs to accelerate local investment in sustainable energy generation in smaller locations.</li>
<li style="font-weight:400;">The connectivity hubs will provide strategic benefits by minimizing the distance data travels before switching paths, which improves resilience and reduces downtime for services across the region. 
<ul>
<li style="font-weight:400;">This infrastructure investment aims to strengthen local economies while supporting Google’s objective of serving content from locations closer to users and customers.</li>
</ul>
</li>
<li style="font-weight:400;">The project represents Google’s continued infrastructure expansion to meet long-term demand driven by AI adoption rates that are outpacing predictions, with partnerships including Ooredoo Maldives and Dhiraagu supporting the Maldives hub deployment.</li>
</ul>
<p>49:38 Matthew – “I had to look up one connectivity hub, which is literally just a small little data center that just kind of handles basic networking and storage – and nothing fancy, which is interesting that they’re putting the two connectivity hubs. They’re dropping these hubs where all their cables terminate. So they are able to cache stuff at each location, which is always interesting.”</p>
<h2>Azure</h2>
<p>51:46  <a href="https://aka.ms/AAyjgcy">Infinite scale: The architecture behind the Azure AI superfactory – The </a><a href="https://aka.ms/AAyjgcy">Official Microsoft Blog</a> </p>
<ul>
<li style="font-weight:400;">Microsoft announces its second <a href="https://blogs.microsoft.com/blog/2025/09/18/inside-the-worlds-most-powerful-ai-datacenter/">Fairwater datacenter in Atlanta</a>, connecting it to the Wisconsin site and existing Azure infrastructure to create what they call a planet-scale AI superfactory. </li>
<li style="font-weight:400;">The facility uses a flat network architecture to integrate hundreds of thousands of NVIDIA GB200 and GB300 GPUs into a unified supercomputer for training frontier AI models.</li>
<li style="font-weight:400;">The datacenter achieves 140kW per rack power density through closed-loop liquid cooling that uses water equivalent to 20 homes annually and is designed to last 6-plus years without replacement. </li>
<li style="font-weight:400;">The two-story building design minimizes cable lengths between GPUs to reduce latency, while the site secures 4×9 availability power at 3×9 cost by relying on resilient grid power instead of traditional backup systems.</li>
<li style="font-weight:400;">Each rack houses up to 72 NVIDIA Blackwell GPUs connected via NVLink with 1.8TB GPU-to-GPU bandwidth and 14TB pooled memory per GPU. 
<ul>
<li style="font-weight:400;">The facility uses a two-tier Ethernet-based backend network with 800Gbps GPU-to-GPU connectivity running on SONiC to avoid vendor lock-in and reduce costs compared to proprietary solutions.</li>
</ul>
</li>
<li style="font-weight:400;">Microsoft deployed a dedicated AI WAN backbone with over 120,000 new fiber miles across the US last year to connect Fairwater sites and other Azure datacenters. 
<ul>
<li style="font-weight:400;">This allows workloads to span multiple geographic locations and enables dynamic allocation between training, fine-tuning, reinforcement learning, and synthetic data generation based on specific requirements.</li>
</ul>
</li>
<li style="font-weight:400;">The architecture addresses the challenge that large training jobs now exceed single-facility power and space constraints by creating fungibility across sites. </li>
<li style="font-weight:400;">Customers can segment traffic across scale-up networks within sites and scale-out networks between sites, maximizing GPU utilization across the combined system rather than being limited to a single datacenter.</li>
</ul>
<p>55:25 <a href="https://azure.microsoft.com/en-us/updates?id=529806">Private Preview: Azure HorizonDB </a></p>
<ul>
<li style="font-weight:400;"><a href="https://azure.microsoft.com/en-us/products/horizondb">Azure HorizonDB</a> for <a href="https://www.postgresql.org/">PostgreSQL</a> enters private preview as Microsoft’s performance-focused database offering, featuring autoscaling storage up to 128 TB and compute scaling to 3,072 vCores. </li>
<li style="font-weight:400;">The service claims up to 3 times faster performance compared to open-source PostgreSQL, positioning it as a competitor to <a href="https://docs.aws.amazon.com/AmazonRDS/latest/AuroraUserGuide/CHAP_AuroraOverview.html">AWS Aurora</a> and <a href="https://cloud.google.com/products/alloydb">Google Cloud AlloyDB</a> in the managed PostgreSQL space.</li>
<li style="font-weight:400;">The 128 TB storage ceiling represents a substantial increase over Azure’s existing PostgreSQL offerings, addressing enterprise workloads that previously required sharding or migration to other platforms. </li>
<li style="font-weight:400;">This storage capacity combined with the high vCore count targets large-scale OLTP and analytical workloads that need both horizontal and vertical scaling options.</li>
<li style="font-weight:400;">Microsoft appears to be building HorizonDB as a separate service line rather than an upgrade to existing Azure Database for PostgreSQL Flexible Server, suggesting different architecture and pricing models. 
<ul>
<li style="font-weight:400;">Organizations currently using Azure Database for PostgreSQL will need to evaluate migration paths and cost implications when the service reaches general availability.</li>
</ul>
</li>
<li style="font-weight:400;">The private preview status means limited customer access and no published pricing information yet. </li>
<li style="font-weight:400;">Enterprises interested in testing HorizonDB should expect typical private preview constraints, including potential feature changes, regional limitations, and SLA restrictions before general availability.</li>
</ul>
<p>57:35 Jonathan – “So it sounds like they’ve pretty much built what Amazon did with the Aurora, separating the storage from the compute to let them scale independently.” </p>
<p>59:10 <a href="https://azure.microsoft.com/en-us/updates?id=526876">Public Preview: Microsoft Defender for Cloud + GitHub Advanced Security</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/defender-for-cloud/defender-for-cloud-introduction">Microsoft Defender for Cloud</a> now integrates natively with <a href="https://docs.github.com/en/get-started/learning-about-github/about-github-advanced-security">GitHub Advanced Security</a> in public preview, creating a unified security workflow that spans from source code repositories through production cloud environments. </li>
<li style="font-weight:400;">This integration allows security teams and developers to work within a single platform rather than switching between separate tools for code scanning and cloud protection.</li>
<li style="font-weight:400;">The solution addresses the full application lifecycle security challenge by connecting GitHub’s code-level vulnerability detection with Defender for Cloud’s runtime protection capabilities. </li>
<li style="font-weight:400;">Organizations using both GitHub and Azure can now correlate security findings from development through deployment, reducing the gap between DevOps and SecOps teams.</li>
<li style="font-weight:400;">This preview targets cloud-native application teams who need consistent security policies across their CI/CD pipeline and production workloads. The integration is particularly relevant for organizations already invested in the Microsoft and GitHub ecosystem, as it leverages existing tooling rather than requiring additional third-party solutions.</li>
<li style="font-weight:400;">The announcement provides limited details on pricing structure, though organizations should expect costs to align with existing Defender for Cloud and GitHub Advanced Security licensing models. </li>
<li style="font-weight:400;">Specific regional availability and rollout timeline details were not included in the brief announcement.</li>
</ul>
<p>1:00:35 Matthew – “It seems like it has a lot of potential, but without the pricing and Windows for Defender as a CPM, I feel like – for me –  it lacks some features, when I’ve tried to use it. They’re going in the right direction; I don’t think they’re there at the end product yet.”</p>
<p>1:03:05 <a href="https://azure.microsoft.com/en-us/updates?id=526188">Public Preview: Smart Tier account level tiering (Azure Blob Storage and </a><a href="https://azure.microsoft.com/en-us/updates?id=526188">ADLS</a> </p>
<ul>
<li style="font-weight:400;">Azure introduces <a href="https://learn.microsoft.com/en-us/azure/storage/blobs/access-tiers-smart">Smart Tier for Blob Storage</a> and<a href="https://learn.microsoft.com/en-us/azure/storage/blobs/data-lake-storage-introduction"> ADLS Gen2</a>, which automatically moves data between hot, cool, and archive tiers based on access patterns without manual intervention. </li>
<li style="font-weight:400;">This eliminates the need for lifecycle management policies and reduces the operational overhead of managing storage costs across large data estates.</li>
<li style="font-weight:400;">The feature works at the account level rather than requiring per-container or per-blob configuration, making it simpler to deploy across entire storage accounts. Organizations with unpredictable access patterns or mixed workloads will benefit most, as the system continuously optimizes placement without predefined rules.</li>
<li style="font-weight:400;">Smart Tier monitors blob access patterns and automatically transitions objects to lower-cost tiers when appropriate, then moves them back to hot storage when access frequency increases. </li>
<li style="font-weight:400;">This differs from traditional lifecycle policies that rely on age-based rules and cannot respond dynamically to actual usage.</li>
<li style="font-weight:400;">The public preview allows customers to test the automated tiering without committing to production workloads, though specific pricing details for the Smart Tier feature itself were not disclosed in the announcement. Standard Azure Blob Storage tier pricing applies, with the hot tier being the most expensive and the archive tier offering the lowest storage costs but higher retrieval fees.</li>
<li style="font-weight:400;">This capability targets customers managing large volumes of data with variable access patterns, particularly those in analytics, backup, and archival scenarios where manual tier management becomes impractical at scale. </li>
<li style="font-weight:400;">The integration with ADLS Gen2 makes it relevant for big data and analytics workloads running on Azure.</li>
</ul>
<p>1:05:18 Jonathan – “So they’ve always had the tiering, but now they’re providing an easy button for you based on access patterns.” </p>
<p>1:13:04 <a href="https://blogs.microsoft.com/blog/2025/11/18/from-idea-to-deployment-the-complete-lifecycle-of-ai-on-display-at-ignite-2025/">From idea to deployment: The complete lifecycle of AI on display at Ignite </a></p>
<p>   <a href="https://blogs.microsoft.com/blog/2025/11/18/from-idea-to-deployment-the-complete-lifecycle-of-ai-on-display-at-ignite-2025/">2025 – The Official Microsoft Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://ignite.microsoft.com/en-US/home">Microsoft Ignite</a> 2025 introduces three intelligence layers for AI development: Work IQ connects Microsoft 365 data and user patterns, Fabric IQ unifies analytical and operational data under a shared business model, and Foundry IQ provides a managed knowledge system routing across multiple data sources. </li>
<li style="font-weight:400;">These layers work together to give AI agents business context rather than requiring custom integrations for each data source.</li>
<li style="font-weight:400;">Microsoft Agent Factory offers a single metered plan for building and deploying agents across<a href="https://www.microsoft.com/en-us/microsoft-365/blog/?p=280039"> Microsoft 365 Copilot</a> and Copilot Studio without upfront licensing requirements. </li>
<li style="font-weight:400;">The program includes access to AI Forward Deployed Engineers and role-based training, targeting organizations that want to build custom agents but lack internal AI expertise or want to avoid complex provisioning processes.</li>
<li style="font-weight:400;">Microsoft Agent 365 provides centralized observability, management, and security for AI agents regardless of whether they were built with Microsoft platforms, open-source frameworks, or third-party tools. With IDC projecting 1.3 billion AI agents by 2028, this addresses the governance gap where unmanaged agents become shadow IT, integrating Defender, Entra, Purview, and Microsoft 365 admin center for agent lifecycle management.</li>
<li style="font-weight:400;">Work IQ now exposes APIs for developers to build custom agents that leverage the intelligence layer’s understanding of user workflows, relationships, and content patterns. This allows organizations to extend Microsoft 365 Copilot capabilities into their own applications while maintaining the native integration advantages rather than relying on third-party connectors.</li>
<li style="font-weight:400;">The announcements position Microsoft as providing end-to-end AI infrastructure from the datacenter to the application layer, with particular emphasis on making agent development accessible to frontline workers rather than limiting it to specialized AI teams. No specific pricing details were provided for the new services beyond the mention of metered plans for Agent Factory.</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2248422/c1e-1xdxb5z685hwo251-47m3jn7ot00o-p1mc2y.mp3" length="162508140"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 331 of The Cloud Pod, where the forecast is always cloudy! Jonathan, Ryan, Matt, and Justin (for a little bit, anyway) are in the studio today to bring you all the latest in cloud and AI news. This week, we’re looking at our Ignite predictions (that side gig as internet psychics isn’t looking too good) undersea cables (our fave!), plus datacenters and more. Plus Claude and Azure make a 30 billion dollar deal! Take a break from turkey and avoiding politics, and let’s take a trip into the clouds!   
Titles we almost went with this week

 GPT-5.1 Gets a Shell Tool Because Apparently We Haven’t Learned Anything From Sci-Fi Movies
 The Great Ingress Egress: NGINX Controller Waves Goodbye After Years of Volunteer Burnout
 Queue the Applause: Lambda SQS Mapping Gets a Serious Speed Boost
 SELECT * FROM future WHERE SQL meets AI without the prompt drama
 MFA or GTFO: Microsoft’s 99.6% Phishing-Resistant Authentication Achievement
 JWT Another Thing ALB Can Do: OAuth Validation Moves to the Load Balancer
 Google’s Emerging Threats Center: Because Manually Checking 12 Months of Logs Sounds Terrible
 EventBridge Gets a Drag-and-Drop Makeover: No More Schema Drama
 Permission Denied: How Granting Access Took Down the Internet


Follow Up 
00:51 Ignite Predictions – The Results 
Matt (Who is in charge of sound effects, so be aware) 

ACM Competitor – True SSL competitive product
AI announcement in Security AI Agent (Copilot for Sentinel) – sort of (½) 
Azure DevOps Announcement

Justin

New Cobalt and Mai Gen 2 or similar – Check
Price Reduction on OpenAI & Significant Prompt Caching 
Microsoft Foundational LLM to compete with OpenAI – 

Jonathan

The general availability of new, smaller, and more power-efficient Azure Local hardware form factors
Declarative AI on Fabric: This represents a move towards a declarative model, where users state the desired outcome, and the AI agent system determines the steps needed to achieve it within the Fabric ecosystem.
Advanced Cost Management: Granular dashboards to track the token and compute consumption per agent or per transaction, enabling businesses to forecast costs and set budgets for their agent workforce.

How many times will they say Copilot:
The word “Copilot” is mentioned 46 to 71 times in the video.
Jonathan 45
Justin: 35
Matt: 40
General News
05:13 Cloudflare outage on November 18, 2025

Cloudflare experienced its worst outage since 2019 on November 18, 2025, lasting approximately three hours and affecting core traffic routing across its entire network. 
The incident was triggered by a database permissions change that caused a Bot Management feature file to double in size, exceeding hardcoded limits in their proxy software and causing system panics that resulted in 5xx errors for customers.
The root cause reveals a cascading failure pattern, where a ClickHouse database query began returning duplicate column metadata after permission changes. 
This resulted in a significant i...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2248422/c1a-k5d5-9j3mx819smkp-lmanvy.jpg"></itunes:image>
                                                                            <itunes:duration>01:24:29</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2248422/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[330: AWS Proves the Internet Really Is a Series of Tubes Under the Ocean]]>
                </title>
                <pubDate>Fri, 21 Nov 2025 04:15:10 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2235457</guid>
                                    <link>https://tcpfm.castos.com/episodes/330-aws-proves-the-internet-really-is-a-series-of-tubes-under-the-ocean</link>
                                <description>
                                            <![CDATA[<h3>Welcome to episode 329 of The Cloud Pod, where the forecast is always cloudy (and if you’re in California, rainy too!) Justin and Matt have taken a break from Ark building activities to bring you this week’s episode, packed with all the latest in cloud and AI news, including undersea cables (our favorite!) FinOps, Ignite predictions, and so much more! Grab your umbrellas and let’s get started! </h3>
<h3>Titles we almost went with this week</h3>
<ul>
<li> Fastnet and Furious: AWS Lays 320 Terabits of Cable Across the Atlantic</li>
<li> No More kubectl apply –pray: AWS Backup Takes the Stress Out of EKS Recovery</li>
<li> AWS Gets Swift with Lambda: No Taylor Version Required</li>
<li> Breaking Up Is Hard to Do: Microsoft Splits Teams from Office</li>
<li> FinOps and Behold: Google Automates Your Cloud Budget Nightmares</li>
<li> AMD Turin Around GCP’s Price-Performance with N4D VMs</li>
<li> Azure Gets Territorial: Your Data Stays Put Whether It Likes It or Not</li>
<li> AWS Finally Answers “Is It Available in My Region?” Before You Build It </li>
<li> Getting to the Bare Metal of Things: Google’s Axion Goes Commando</li>
<li> Azure Ultra Disk Gets Ultra Serious About Latency</li>
<li> Container Size Matters: Azure Expands ACI to 240 GB Memory </li>
<li> Google Containerises Chaos: Agent Sandbox Keeps Your AI from Going Rogue</li>
<li> AWS Prints Money While Amazon Prints Pink Slips: Q3 Earnings Beat</li>
</ul>
<h2>Follow Up </h2>
<p>02:08 <a href="https://www.cnbc.com/2025/09/12/microsoft-avoids-big-fine-as-eu-accepts-deal-to-unbundle-teams.html">Microsoft sidesteps hefty EU fine with Teams unbundling deal</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.cnbc.com/quotes/MSFT/">Microsoft</a> avoids a potentially substantial EU antitrust fine by agreeing to unbundle Teams from the Office 365 and Microsoft 365 suites for a period of seven years. </li>
<li style="font-weight:400;">The settlement follows a 2023 complaint from <a href="https://www.cnbc.com/quotes/CRM/">Salesforce</a>-owned <a href="https://slack.com/signin">Slack</a> alleging anticompetitive bundling practices that harmed rival collaboration tools.</li>
<li style="font-weight:400;">The commitments require Microsoft to offer Office and <a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=kboceT-0pHWx4PDaHrOusVRLz4zMU2zRruTG9sH1FJnq-wLquMDhc68lD__u5nZ-4Sp4ku5pigBBLW3mDmXPldYdAnyw3V9QDuCMiaDRKfRXWu2ZMlIEVCeI-sQMsGIB.jeNNdhdnXC2JraaZ5AbV4w&amp;eddgt=WEy_w4Lbe8uALW5JMPcI5A%3D%3D&amp;rut=a0d8e68004f5210b309c654de88c5ba3cbe824619f458e4f603232f5c232e603&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8GuA3gopnGhKWHqo7ubRHfjVUCUxKi_VAgMZiJe1Za15R7mN_HRECCBhqDKzpNkN_HERf-a68o3uQzJWT1X1vFutsSXDtw0GlmUtYocn-sPuNp_aDGxWZAZ9rSwh49fKQSM2eifREds33_Zz8lOI93RoqjS3W2aFEsLNnjDQRj4msG6zl97iP5L_GXJrUEtfbG1JMIdf16FeD9tHlW3Uq-eiHLgZ50ere6BD6eV_vmVzJnnPgjbgmE5VVjNNYt9H8eX7BTQFy_Q-ip_nU7X_FGlQcHj4M26n1cd-EZGxnJ4qBie_SEJMzl1G7dMx9DyCF6nAGCptU-DazwdDnTxiE2pCK6LmAIe1pikTb8zNeQD1dGIYXOMGGzmBPCr6BbFxjGVy2oSiCOyGNbhfPuGXb1YlRa3KrLz_PtzWz5SOjF935eH6LtHyOFUd487_4vFOnrgS9hqjGgkYA4CwJzjf1L7uAB3xOFpMywE-Ng9hRhP_px3Sb0prNMzpdNkwThCnAVwvcUpINseBOUbnAPL9LKnhugFvcc1lw6FBqaI5UOM9ORGR0lmtaI4r0y7L6EtNSqXKp2z-FhzKWWOwunaanO3cE4sRlGHfR0ZRsTgxqp6li3mrIwQ-jddtKi-0fdQuy-W5kE0NJ1qAiScjolEFPvn7zsNQ2xiLhiHBMxjO1RQM37-wVmPEftGXucwvTD0HVEBBEWyWszKrP7DexpSwne5wyzdAUA_YnOnidozbHgePR-miyDlrRY6j5hSm-GIPKNgPf6HkK_i4p-mFCUa6FAqTVNck%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0NDAlMjZjYW1wJTNkMTc0MTQyJTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDI1MjYlMjZjcml0ZXJpYWlkJTNka3dkLTc5MzcxNjEwMDc2NjI5JTNhbG9jLTcxMjI4JTI2Y2FtcGFpZ25pZCUzZDU5MDM2MTI0NyUyNmxvY3BoeSUzZDc5NzU4JTI2YWRncm91cGlkJTNkMTI2OTkzNzU5NTgwODY2NSUyNmNpZCUzZDc5MzcxMjA4MDIxMDQ1JTI2a2R2JTNkYyUyNmtleHQlM2QlMjZrcGclM2QlMjZrcGlkJTNkJTI2cXVlcnlTdHIlM2RNaWNyb3NvZnQlMjUyMDM2NSUyNnVybCUzZGh0dHBzJTNhJTJmJTJmd3d3Lm1pY3Jvc29mdC5jb20lMmZlbi11cyUyZm1pY3Jvc29mdC0zNjUlMm..."></a></li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:00) - The Cloud Pod: When You Can't Even Sit Down</li><li>(00:01:37) - Nice Job Last Week With Jonathan and Elise</li><li>(00:02:03) - Microsoft Settles Competition Lawsuit Over Teams</li><li>(00:04:47) - Amazon, Google Cloud Deliver Record Earnings</li><li>(00:08:13) - Microsoft Q1 Fiscal 2026 Earnings</li><li>(00:09:06) - Azure Q4 Update, Microsoft</li><li>(00:09:45) - Azure Front Door Incident Follow Up</li><li>(00:13:53) - Azure Conference Prediction</li><li>(00:14:52) - Microsoft Ignite 2017: What Do You Want From SSL?</li><li>(00:16:28) - Microsoft's Next-Gen AI Accelerator</li><li>(00:17:32) - Top Tech News: Apple's AI Announcement</li><li>(00:19:12) - Microsoft's Azure DevOps Announcement, and More</li><li>(00:20:59) - How Many Times Will They Say Co-Pilot in This Present</li><li>(00:21:54) - Microsoft, Chat AI, and More</li><li>(00:26:12) - IBM Cloud Ability Governance and Kubecast 3.0</li><li>(00:28:06) - Amazon Rolls Out New Fastnet Cable</li><li>(00:29:32) -  AWS Cloud Planning Tool: Capabilities by Region</li><li>(00:34:04) - Kubernetes: Agent Sandbox for AI</li><li>(00:35:52) - Google's Ironwood TPU and Axion VM</li><li>(00:37:38) - Google Cloud: FinOps Tooling in the Future</li><li>(00:39:10) - Azure 3.8: Continuous Delivery & Cost Management</li><li>(00:42:29) - Will the MCP help with deployment?</li><li>(00:44:20) - Microsoft UltraDisk Gets Performance and Cost Update</li><li>(00:46:46) - Azure Container Instances now supports 31 VCPUs and 240</li><li>(00:48:04) - Azure 10.2: Geo Priority Replication</li><li>(00:49:22) - Cloud Podcast: Predicting the Keynote</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 329 of The Cloud Pod, where the forecast is always cloudy (and if you’re in California, rainy too!) Justin and Matt have taken a break from Ark building activities to bring you this week’s episode, packed with all the latest in cloud and AI news, including undersea cables (our favorite!) FinOps, Ignite predictions, and so much more! Grab your umbrellas and let’s get started! 
Titles we almost went with this week

 Fastnet and Furious: AWS Lays 320 Terabits of Cable Across the Atlantic
 No More kubectl apply –pray: AWS Backup Takes the Stress Out of EKS Recovery
 AWS Gets Swift with Lambda: No Taylor Version Required
 Breaking Up Is Hard to Do: Microsoft Splits Teams from Office
 FinOps and Behold: Google Automates Your Cloud Budget Nightmares
 AMD Turin Around GCP’s Price-Performance with N4D VMs
 Azure Gets Territorial: Your Data Stays Put Whether It Likes It or Not
 AWS Finally Answers “Is It Available in My Region?” Before You Build It 
 Getting to the Bare Metal of Things: Google’s Axion Goes Commando
 Azure Ultra Disk Gets Ultra Serious About Latency
 Container Size Matters: Azure Expands ACI to 240 GB Memory 
 Google Containerises Chaos: Agent Sandbox Keeps Your AI from Going Rogue
 AWS Prints Money While Amazon Prints Pink Slips: Q3 Earnings Beat

Follow Up 
02:08 Microsoft sidesteps hefty EU fine with Teams unbundling deal

Microsoft avoids a potentially substantial EU antitrust fine by agreeing to unbundle Teams from the Office 365 and Microsoft 365 suites for a period of seven years. 
The settlement follows a 2023 complaint from Salesforce-owned Slack alleging anticompetitive bundling practices that harmed rival collaboration tools.
The commitments require Microsoft to offer Office and ]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[330: AWS Proves the Internet Really Is a Series of Tubes Under the Ocean]]>
                </itunes:title>
                                    <itunes:episode>330</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<h3>Welcome to episode 329 of The Cloud Pod, where the forecast is always cloudy (and if you’re in California, rainy too!) Justin and Matt have taken a break from Ark building activities to bring you this week’s episode, packed with all the latest in cloud and AI news, including undersea cables (our favorite!) FinOps, Ignite predictions, and so much more! Grab your umbrellas and let’s get started! </h3>
<h3>Titles we almost went with this week</h3>
<ul>
<li> Fastnet and Furious: AWS Lays 320 Terabits of Cable Across the Atlantic</li>
<li> No More kubectl apply –pray: AWS Backup Takes the Stress Out of EKS Recovery</li>
<li> AWS Gets Swift with Lambda: No Taylor Version Required</li>
<li> Breaking Up Is Hard to Do: Microsoft Splits Teams from Office</li>
<li> FinOps and Behold: Google Automates Your Cloud Budget Nightmares</li>
<li> AMD Turin Around GCP’s Price-Performance with N4D VMs</li>
<li> Azure Gets Territorial: Your Data Stays Put Whether It Likes It or Not</li>
<li> AWS Finally Answers “Is It Available in My Region?” Before You Build It </li>
<li> Getting to the Bare Metal of Things: Google’s Axion Goes Commando</li>
<li> Azure Ultra Disk Gets Ultra Serious About Latency</li>
<li> Container Size Matters: Azure Expands ACI to 240 GB Memory </li>
<li> Google Containerises Chaos: Agent Sandbox Keeps Your AI from Going Rogue</li>
<li> AWS Prints Money While Amazon Prints Pink Slips: Q3 Earnings Beat</li>
</ul>
<h2>Follow Up </h2>
<p>02:08 <a href="https://www.cnbc.com/2025/09/12/microsoft-avoids-big-fine-as-eu-accepts-deal-to-unbundle-teams.html">Microsoft sidesteps hefty EU fine with Teams unbundling deal</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.cnbc.com/quotes/MSFT/">Microsoft</a> avoids a potentially substantial EU antitrust fine by agreeing to unbundle Teams from the Office 365 and Microsoft 365 suites for a period of seven years. </li>
<li style="font-weight:400;">The settlement follows a 2023 complaint from <a href="https://www.cnbc.com/quotes/CRM/">Salesforce</a>-owned <a href="https://slack.com/signin">Slack</a> alleging anticompetitive bundling practices that harmed rival collaboration tools.</li>
<li style="font-weight:400;">The commitments require Microsoft to offer Office and <a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=kboceT-0pHWx4PDaHrOusVRLz4zMU2zRruTG9sH1FJnq-wLquMDhc68lD__u5nZ-4Sp4ku5pigBBLW3mDmXPldYdAnyw3V9QDuCMiaDRKfRXWu2ZMlIEVCeI-sQMsGIB.jeNNdhdnXC2JraaZ5AbV4w&amp;eddgt=WEy_w4Lbe8uALW5JMPcI5A%3D%3D&amp;rut=a0d8e68004f5210b309c654de88c5ba3cbe824619f458e4f603232f5c232e603&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8GuA3gopnGhKWHqo7ubRHfjVUCUxKi_VAgMZiJe1Za15R7mN_HRECCBhqDKzpNkN_HERf-a68o3uQzJWT1X1vFutsSXDtw0GlmUtYocn-sPuNp_aDGxWZAZ9rSwh49fKQSM2eifREds33_Zz8lOI93RoqjS3W2aFEsLNnjDQRj4msG6zl97iP5L_GXJrUEtfbG1JMIdf16FeD9tHlW3Uq-eiHLgZ50ere6BD6eV_vmVzJnnPgjbgmE5VVjNNYt9H8eX7BTQFy_Q-ip_nU7X_FGlQcHj4M26n1cd-EZGxnJ4qBie_SEJMzl1G7dMx9DyCF6nAGCptU-DazwdDnTxiE2pCK6LmAIe1pikTb8zNeQD1dGIYXOMGGzmBPCr6BbFxjGVy2oSiCOyGNbhfPuGXb1YlRa3KrLz_PtzWz5SOjF935eH6LtHyOFUd487_4vFOnrgS9hqjGgkYA4CwJzjf1L7uAB3xOFpMywE-Ng9hRhP_px3Sb0prNMzpdNkwThCnAVwvcUpINseBOUbnAPL9LKnhugFvcc1lw6FBqaI5UOM9ORGR0lmtaI4r0y7L6EtNSqXKp2z-FhzKWWOwunaanO3cE4sRlGHfR0ZRsTgxqp6li3mrIwQ-jddtKi-0fdQuy-W5kE0NJ1qAiScjolEFPvn7zsNQ2xiLhiHBMxjO1RQM37-wVmPEftGXucwvTD0HVEBBEWyWszKrP7DexpSwne5wyzdAUA_YnOnidozbHgePR-miyDlrRY6j5hSm-GIPKNgPf6HkK_i4p-mFCUa6FAqTVNck%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0NDAlMjZjYW1wJTNkMTc0MTQyJTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDI1MjYlMjZjcml0ZXJpYWlkJTNka3dkLTc5MzcxNjEwMDc2NjI5JTNhbG9jLTcxMjI4JTI2Y2FtcGFpZ25pZCUzZDU5MDM2MTI0NyUyNmxvY3BoeSUzZDc5NzU4JTI2YWRncm91cGlkJTNkMTI2OTkzNzU5NTgwODY2NSUyNmNpZCUzZDc5MzcxMjA4MDIxMDQ1JTI2a2R2JTNkYyUyNmtleHQlM2QlMjZrcGclM2QlMjZrcGlkJTNkJTI2cXVlcnlTdHIlM2RNaWNyb3NvZnQlMjUyMDM2NSUyNnVybCUzZGh0dHBzJTNhJTJmJTJmd3d3Lm1pY3Jvc29mdC5jb20lMmZlbi11cyUyZm1pY3Jvc29mdC0zNjUlMmZidXNpbmVzcyUyZmNvbXBhcmUtYWxsLW1pY3Jvc29mdC0zNjUtYnVzaW5lc3MtcHJvZHVjdHMtYiUzZmVmX2lkJTNkX2tfYzZlN2Y1ODQ2NmU4MTUwYTVjMjE5YzliMDI1YjFlOTdfa18lMjZPQ0lEJTNkQUlEY21tNDc0cXA4ZWxfU0VNX19rX2M2ZTdmNTg0NjZlODE1MGE1YzIxOWM5YjAyNWIxZTk3X2tfJTI2bXNjbGtpZCUzZGM2ZTdmNTg0NjZlODE1MGE1YzIxOWM5YjAyNWIxZTk3%26rlid%3Dc6e7f58466e8150a5c219c9b025b1e97&amp;vqd=4-130069304442190579270335636653026061298&amp;iurl=%7B1%7DIG%3DD6D1B3FDABAC4F28A8B59620D2436870%26CID%3D04829822549E640C319A8E8855D265F1%26ID%3DDevEx%2C5045.1">Microsoft 365</a> suites without Teams at reduced prices, with a 50 percent larger price difference between bundled and unbundled versions. </li>
<li style="font-weight:400;">Customers with long-term licenses can switch to Teams-free suites, addressing concerns about forced adoption of the collaboration platform.</li>
<li style="font-weight:400;">Microsoft must provide interoperability between competing collaboration tools and its products, plus enable data portability from Teams to rival services. </li>
<li style="font-weight:400;">These technical requirements aim to level the playing field for competitors like Slack and Zoom in the European enterprise collaboration market.</li>
<li style="font-weight:400;">The settlement applies specifically to the European Union market and stems from Microsoft’s dominant position in productivity software. </li>
<li style="font-weight:400;">Organizations using Microsoft 365 in the EU will now have a genuine choice in selecting collaboration tools without being locked into Teams through bundling.</li>
<li style="font-weight:400;">This decision sets a precedent for how cloud software vendors can package integrated services, particularly when holding dominant market positions. </li>
<li style="font-weight:400;">The seven-year commitment period and mandatory interoperability requirements could influence how Microsoft and competitors structure product offerings globally.</li>
</ul>
<h2>General News </h2>
<p>08:30 It’s Earnings Time! (Warning: turn down your volume) </p>
<p><a href="https://www.cnbc.com/2025/10/31/amazon-amzn-stock-earnings-revenue-ai-cloud.html">Amazon’s stock soars on earnings, revenue beat, spending guidance</a></p>
<ul>
<li style="font-weight:400;">Yes, we know there’s a little delay in our reporting here, but it’s still important! (To Justin, anyway.) </li>
<li style="font-weight:400;"><a href="https://www.cnbc.com/2025/10/30/aws-q3-2025-earnings-report-amazon-cloud.html">AWS</a> grew revenue 20% year-over-year to $33 billion in Q3, generating $11.4 billion in operating income, which represents two-thirds of Amazon’s total operating profit. </li>
<li style="font-weight:400;">While this growth trails <a href="https://cloud.google.com/">Google Cloud’s</a> 34% and <a href="https://azure.microsoft.com/en-us">Azure’s</a> 40%, AWS maintains its position as the leading cloud infrastructure provider.</li>
<li style="font-weight:400;"><a href="https://www.cnbc.com/quotes/AMZN/">Amazon</a> increased its 2025 capital expenditure forecast to $125 billion, up from $118 billion, with CFO Brian Olsavsky indicating further increases expected in 2026. </li>
<li style="font-weight:400;">This spending exceeds <a href="https://www.google.com/">Google</a>, <a href="https://www.meta.ai/">Meta</a>, and Microsoft’s capex guidance and signals Amazon’s commitment to AI infrastructure despite concerns about missing out on high-profile AI cloud deals.</li>
<li style="font-weight:400;">Amazon’s Q4 revenue guidance of $206-213 billion (midpoint $209.5 billion) exceeded analyst expectations of $208 billion, driven by strong performance in both AWS and the digital advertising business, which grew 24% to $17.7 billion. </li>
<li style="font-weight:400;">The company’s overall revenue reached $180.17 billion, beating estimates of $177.8 billion.</li>
<li style="font-weight:400;">The company announced 14,000 corporate layoffs this week, which CEO Andy Jassy attributed to organizational culture and reducing bureaucratic layers rather than financial pressures or AI automation. </li>
<li style="font-weight:400;">Amazon’s total workforce stands at 1.58 million employees, representing a 2% year-over-year increase despite the cuts.</li>
</ul>
<p>06:14  Justin – “There’s a lot of investors starting to question some of the dollars being spent on (AI). It’s feeling very .com boom-y. Let’s not do that again.”</p>
<p>06:46 <a href="https://www.cnbc.com/2025/10/30/alphabet-goog-stock-earnings-ai-spend.html">Alphabet stock jumps 4% after strong earnings results, boost in AI spend</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.cnbc.com/quotes/GOOGL/">Alphabet</a> increased AI infrastructure spending guidance to $91-93 billion for the year, up from $85 billion previously, driven by strong <a href="https://www.cnbc.com/2025/10/29/google-expects-significant-increase-in-capex-in-2026-execs-say.html">Google Cloud</a> demand. </li>
<li style="font-weight:400;">CEO Sundar Pichai reported a $155 billion backlog for Google Cloud at quarter’s end, with CFO signaling significant capex increases expected in 2026.</li>
<li style="font-weight:400;">Google Cloud contributed to Alphabet’s first-ever $100 billion revenue quarter, with total <a href="https://www.cnbc.com/2025/10/29/alphabet-google-q3-earnings.html">Q3 revenue</a> reaching $102.35 billion and beating analyst expectations by $2.5 billion. </li>
<li style="font-weight:400;">The company’s earnings of $3.10 per share significantly exceeded the $2.33 analyst consensus.</li>
<li style="font-weight:400;">Google Search revenue grew 15% year-over-year to $56.56 billion, indicating that AI integration in search is proving to be an opportunity rather than a threat to the core business. </li>
<li style="font-weight:400;">Analysts noted this addresses previous concerns about AI disrupting Google’s search dominance.</li>
<li style="font-weight:400;">Wall Street analysts raised price targets substantially following the results, with <a href="https://www.cnbc.com/2025/10/30/alphabet-is-soaring-after-its-latest-earnings-report-what-wall-street-analysts-are-saying.html">Goldman Sachs increasing from $288 to $330</a> and <a href="https://www.cnbc.com/quotes/JPM/">JPMorgan</a> raising from $300 to $340. </li>
<li style="font-weight:400;"><a href="https://www.cnbc.com/quotes/DB/">Deutsche Bank</a> characterized the earnings as having virtually no negative aspects across any business segment.</li>
</ul>
<p>08:03  Matt – “The 15 % of revenue for Google search year over year feels like a massive growth, but I still don’t really understand how they track that. It’s not like there’s 15 % more people using Google than before, but that’s the piece I don’t really understand still.”</p>
<p>08:27 <a href="https://www.cnbc.com/2025/10/29/microsoft-msft-q1-2026-earnings-report.html">Microsoft (MSFT) Q1 2026 earnings report</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.cnbc.com/quotes/MSFT/">Microsoft</a> Azure revenue grew 40% year-over-year in <a href="https://www.prnewswire.com/news-releases/microsoft-cloud-and-ai-strength-drives-first-quarter-results-302598900.html">Q1 fiscal 2026</a>, beating analyst expectations of 38.2% growth and driving the Intelligent Cloud segment to $30.9 billion in total revenue. </li>
<li style="font-weight:400;">The company’s <a href="https://www.cnbc.com/ai-artificial-intelligence/">AI</a> infrastructure investments continue to pay off as <a href="https://www.cnbc.com/2025/10/29/microsoft-hit-with-azure-365-outage-ahead-of-quarterly-earnings.html">Azure cloud services</a> reached over $75 billion in annual revenue for fiscal 2025.</li>
<li style="font-weight:400;">Microsoft took a $3.1 billion accounting hit to net income this quarter related to its OpenAI investment, equivalent to 41 cents per 41-cent-per-share impact on earnings. </li>
<li style="font-weight:400;">Despite this, the company still beat earnings expectations at $3.72 per share versus the expected $3.67, with overall revenue reaching $77.67 billion.</li>
<li style="font-weight:400;"><a href="https://www.cnbc.com/2025/10/27/openai-spending-spree-wall-street-focus-on-capex-in-big-tech-earnings-.html">Capital expenditure</a> spending came in at $34.9 billion for the quarter, and CFO Amy Hood indicated that capex growth will accelerate throughout fiscal 2026 rather than slow down as previously suggested. </li>
<li style="font-weight:400;">This aggressive infrastructure spending caused the stock to drop 4% in after-hours trading despite the strong revenue performance.</li>
<li style="font-weight:400;">Microsoft now holds a 27% stake in OpenAI’s for-profit entity worth approximately $135 billion, following the company’s <a href="https://www.cnbc.com/2025/10/28/open-ai-for-profit-microsoft.html">restructuring announcement</a>. </li>
<li style="font-weight:400;">This formalized partnership structure clarifies the relationship between the two companies as Azure continues to serve as the primary infrastructure platform for OpenAI’s services.</li>
<li style="font-weight:400;">The quarter’s results were overshadowed by a significant Azure and Microsoft 365 outage that occurred on the same day as earnings, affecting various websites and gaming services for several hours. Microsoft expects full recovery by evening, but the timing highlights ongoing reliability concerns as the company scales its cloud infrastructure.</li>
</ul>
<p>09:27 <a href="https://azure.status.microsoft/status/history/?trackingId=YKYN-BWZ">Azure Front Door RCA</a></p>
<ul>
<li style="font-weight:400;">What happened: Azure Front Door and CDN experienced an 8+ hour outage (Oct 29-30, 2025), causing connection timeouts and DNS failures across numerous Azure and Microsoft services, including Azure Portal, Microsoft 365, Entra ID, and many others.</li>
<li style="font-weight:400;">Root cause: A valid customer configuration change exposed a latent bug when processed across different control plane versions, creating incompatible metadata that crashed data plane services. </li>
<li style="font-weight:400;">The crash occurred asynchronously (~5 minutes delayed), allowing it to pass through safety checks undetected.</li>
<li style="font-weight:400;">Why it spread globally: The defective configuration propagated to all edge sites within 4 minutes (15:39 UTC) and was mistakenly saved as the “Last Known Good” snapshot before crashes began appearing at 15:41 UTC, making rollback impossible.</li>
<li style="font-weight:400;">Recovery approach: Rather than reverting to the corrupted LKG, Microsoft manually removed problematic configurations and performed a careful phased redeployment across all edge sites, completing full mitigation by 00:05 UTC (~8.5 hours total).</li>
<li style="font-weight:400;">Prevention measures: Microsoft has completed synchronous config processing, added pre-canary validation stages, reduced recovery time from 4.5 hours to 1 hour, and is working on traffic isolation and further improvements through mid-2026.</li>
<li style="font-weight:400;">Are you interested in the video version of this information? You can find that <a href="https://aka.ms/air/YKYN-BWZ">here</a>. </li>
</ul>
<p>14:23 PREDICTIONS FOR <a href="https://ignite.microsoft.com/en-US">IGNITE</a></p>
<p>Matt</p>
<ol>
<li style="font-weight:400;">ACM Competitor – True SSL competitive product</li>
<li style="font-weight:400;">AI announcement in Security AI Agent (Copilot for Sentinel)</li>
<li style="font-weight:400;">Azure DevOps Announcement</li>
</ol>
<p>Justin</p>
<ol>
<li style="font-weight:400;">New Cobalt and Mai Gen 2 or similar</li>
<li style="font-weight:400;">Price Reduction on OpenAI &amp; Significant Prompt Caching</li>
<li style="font-weight:400;">Microsoft Foundational LLM to compete with OpenAI</li>
</ol>
<p>Jonathan (who isn’t here)</p>
<ol>
<li style="font-weight:400;">The general availability of new, smaller, and more power-efficient Azure Local hardware form factors</li>
<li style="font-weight:400;">Declarative AI on Fabric: This represents a move towards a declarative model, where users state the desired outcome, and the AI agent system determines the steps needed to achieve it within the Fabric ecosystem.</li>
<li style="font-weight:400;">Advanced Cost Management: Granular dashboards to track the token and compute consumption per agent or per transaction, enabling businesses to forecast costs and set budgets for their agent workforce.</li>
</ol>
<p>How many times will they say Copilot: </p>
<ul>
<li>Jonathan</li>
<li>Justin: 35</li>
<li>Matt: 40</li>
</ul>
<p>Honorable Claude:</p>
<ul>
<li style="font-weight:400;">Claude for Azure AI</li>
<li style="font-weight:400;">Autonomous Agent Platform</li>
</ul>
<p>23:00  Matt – “</p>
<h2>Cloud Tools  </h2>
<p>26:47 <a href="https://siliconangle.com/2025/11/03/apptio-expands-finops-tools-cloud-cost-control/">Apptio expands its FinOps tools for cloud cost control – SiliconANGLE</a></p>
<ul>
<li style="font-weight:400;">IBM-owned <a href="https://www.apptio.com/">Apptio</a> launches <a href="https://www.apptio.com/company/news/press-releases/apptio-unveils-next-generation-finops-solutions-designed-to-redefine-how-cloud-leaders-manage-and-optimize-investments-in-the-ai-era/">Cloudability Governance with Terraform integration</a> to provide real-time cost estimation and policy compliance at deployment time. </li>
<li style="font-weight:400;">Platform engineers can now see cost impacts before deploying infrastructure through version control systems like GitHub, addressing the problem where 55% of business leaders lack adequate visibility into technology spending ROI.</li>
<li style="font-weight:400;"><a href="https://www.apptio.com/blog/ibm-kubecost-3-0-faster-smarter-and-built-for-scale/">Kubecost 3.0</a> adds GPU-specific monitoring capabilities through <a href="https://docs.nvidia.com/datacenter/dcgm/latest/gpu-telemetry/dcgm-exporter.html">Nvidia’s Data Center GPU Manager exporter</a>, providing utilization and memory metrics critical for AI workloads. </li>
<li style="font-weight:400;">The container-agnostic platform works across on-premises and cloud Kubernetes environments, with bidirectional integration into Cloudability’s FinOps suite for unified cost visibility.</li>
<li style="font-weight:400;">The platform addresses common tagging blind spots by automatically identifying resource initiators and applying ownership tags when teams forget. It also supports synthetic tags that map to business units, processing trillions of rows of cost data monthly to detect over-provisioning and committed instance discount opportunities.</li>
<li style="font-weight:400;">AI workload acceleration has increased the velocity of cloud spending rather than creating new blind spots, with GPU costs potentially reaching thousands of dollars per hour. </li>
<li style="font-weight:400;">Real-time visibility becomes essential when infrastructure costs can scale this rapidly, making proactive cost governance more important than reactive monitoring.</li>
<li style="font-weight:400;">The Terraform integration positions Apptio to intercept infrastructure deployments before they happen, shifting FinOps from reactive cost analysis to proactive cost prevention. </li>
<li style="font-weight:400;">This represents a meaningful evolution in cloud cost management by embedding financial controls directly into the infrastructure provisioning workflow.</li>
</ul>
<p>33:03  Matt – “I’ve set these up in my pipelines before… It’s always nice to see, and it’s good if you’re launching net new, but for general PR, it’s just more noise.  It kind of needed these tools.” </p>
<h2>AWS </h2>
<p>28:44 <a href="https://www.aboutamazon.com/news/aws/transatlantic-subsea-cable-us-ireland-fastnet-aws">AWS rolls out Fastnet subsea cable connecting the U.S. and Ireland</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.aboutamazon.com/what-we-do/amazon-web-services">AWS</a> announces Fastnet, a dedicated transatlantic subsea cable connecting Maryland to County Cork, Ireland, with 320+ terabits per second capacity when operational in 2028. </li>
<li style="font-weight:400;">The system uses unique landing points away from traditional cable corridors to provide route diversity and network resilience for AWS customers running cloud and AI workloads.</li>
<li style="font-weight:400;">The cable features advanced optical switching branching unit technology that allows future topology changes and can redirect data to new landing points as network demands evolve. This architecture specifically targets growing AI traffic loads and integrates directly with AWS services like CloudFront and Global Accelerator for rapid data rerouting.</li>
<li style="font-weight:400;">AWS’s centralized traffic monitoring system provides complete visibility across the global network and implements millions of daily optimizations to route customer traffic along the most performant paths. </li>
<li style="font-weight:400;">This differs from public internet routing, where individual devices make decisions with limited network visibility, helping avoid congestion before it impacts applications.</li>
<li style="font-weight:400;">The infrastructure investment includes Community Benefit Funds for both Maryland’s Eastern Shore and County Cork to support local initiatives, including STEM education, workforce development, and sustainability programs. </li>
<li style="font-weight:400;">AWS worked with local organizations and residents from project inception to align the deployment with community priorities.</li>
<li style="font-weight:400;">With this addition, AWS’s global fiber network now spans over 9 million kilometers of terrestrial and subsea cabling across 38 regions and 120 availability zones. The automated network management tools resolve 96 percent of network events without human intervention through services like Elastic Load Balancing and CloudWatch.</li>
</ul>
<p>29:24  Matt – “The speed of this is ridiculous. 320 plus terabytes per second – that is a lot of data to go at once!” </p>
<p>30:20 <a href="https://aws.amazon.com/blogs/aws/introducing-aws-capabilities-by-region-for-easier-regional-planning-and-faster-global-deployments/">Introducing AWS Capabilities by Region for easier Regional planning and f</a><a href="https://aws.amazon.com/blogs/aws/introducing-aws-capabilities-by-region-for-easier-regional-planning-and-faster-global-deployments/">aster global deployments | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;">AWS launched <a href="https://builder.aws.com/capabilities/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">Capabilities by Region</a>, a new planning tool that lets you compare service availability, API operations, <a href="https://aws.amazon.com/cloudformation/">CloudFormation</a> resources, and EC2 instance types across multiple AWS <a href="https://docs.aws.amazon.com/glossary/latest/reference/glos-chap.html#region?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">Regions</a> simultaneously. </li>
<li style="font-weight:400;">The tool addresses a common customer pain point by providing visibility into which AWS features are available in different Regions and includes forward-looking roadmap information showing planned launch quarters.</li>
<li style="font-weight:400;">The tool helps solve practical deployment challenges like ensuring compliance with data residency requirements, planning disaster recovery architectures, and avoiding costly rework from discovering Regional limitations mid-project. You can filter results to show only common features available across all selected Regions, making it easier to design portable architectures.</li>
<li style="font-weight:400;">Beyond the web interface, AWS made the Regional capability data accessible through the AWS Knowledge MCP Server, enabling automation of Region expansion planning and integration into CI/CD pipelines. </li>
<li style="font-weight:400;">The MCP server is publicly accessible at no cost without requiring an AWS account, though it is subject to rate limits.</li>
<li style="font-weight:400;">The tool provides detailed visibility into infrastructure components, including specific EC2 instance types like Graviton-based and GPU-enabled variants, helping you verify whether specialized compute resources are available in target Regions before committing to an architecture. </li>
<li style="font-weight:400;">This level of granularity extends to CloudFormation resource types and individual API operations for services like<a href="https://docs.aws.amazon.com/amazondynamodb/latest/developerguide/Introduction.html"> DynamoDB</a> and <a href="https://docs.aws.amazon.com/apigateway/latest/developerguide/welcome.html">API Gateway</a>.</li>
</ul>
<p>30:36  Justin – “Thank you. I’ve wanted this for a long time. You put it in a really weird UI choice, but I do appreciate that it’s there.” </p>
<p>32:10 <a href="https://aws.amazon.com/blogs/aws/secure-eks-clusters-with-the-new-support-for-amazon-eks-in-aws-backup/">Secure EKS clusters with the new support for Amazon EKS in AWS Backup </a><a href="https://aws.amazon.com/blogs/aws/secure-eks-clusters-with-the-new-support-for-amazon-eks-in-aws-backup/">| AWS News Blog</a></p>
<ul>
<li style="font-weight:400;">AWS Backup <a href="https://docs.aws.amazon.com/aws-backup/latest/devguide/assigning-resources-console.html?trk=7c8639c6-87c6-47d6-9bd0-a5812eecb848&amp;sc_channel=el">now supports Amazon EKS clusters</a>, providing centralized backup and restore capabilities for both Kubernetes configurations and persistent data stored in <a href="https://aws.amazon.com/ebs/?trk=7c8639c6-87c6-47d6-9bd0-a5812eecb848&amp;sc_channel=el">EBS</a>, <a href="https://aws.amazon.com/efs/?trk=7c8639c6-87c6-47d6-9bd0-a5812eecb848&amp;sc_channel=el">EFS</a>, and <a href="https://aws.amazon.com/pm/serv-s3/?trk=7c8639c6-87c6-47d6-9bd0-a5812eecb848&amp;sc_channel=el">S3</a>. This eliminates the need for custom scripts or third-party tools that previously required complex maintenance across multiple clusters.</li>
<li style="font-weight:400;">The service includes policy-based automation for protecting single or multiple <a href="https://aws.amazon.com/pm/eks/?trk=7c8639c6-87c6-47d6-9bd0-a5812eecb848&amp;sc_channel=el">EKS</a> clusters with immutable backups to meet compliance requirements. During restore operations, <a href="https://aws.amazon.com/backup/?trk=7c8639c6-87c6-47d6-9bd0-a5812eecb848&amp;sc_channel=el">AWS Backup</a> can now provision a new EKS cluster automatically based on previous configuration settings, removing the requirement to pre-provision target infrastructure.</li>
<li style="font-weight:400;">Restore operations are non-destructive, meaning they apply only the delta between backup and source rather than overwriting existing data or Kubernetes versions. Customers can restore full clusters, individual namespaces to existing clusters, or specific persistent storage resources if partial backup failures occur.</li>
<li style="font-weight:400;">The feature is available in all AWS commercial regions except China and in AWS GovCloud US, where both AWS Backup and Amazon EKS are supported. </li>
<li style="font-weight:400;">Pricing follows standard AWS Backup rates based on backup storage consumed and data transfer, with costs varying by region and storage tier.</li>
<li style="font-weight:400;">Salesforce highlighted the business impact, noting that losing a Kubernetes control plane due to software bugs or accidental deletion can be catastrophic without proper backup capabilities. This native integration addresses a critical resiliency gap for organizations running production EKS workloads at scale.</li>
</ul>
<p>33:07  Matt – “It’s the namespace level that they can deploy or backup and restore to that, to me, is great. I could see this being a SaaS company that runs their application in Kubernetes, and they have a namespace per customer, and having that ability to have that single customer backed up and be able to restore that is fantastic. So while it sounds like a minor release, if you’re in the Kubernetes ecosystem, it will just make your life better.”</p>
<p>33:53 <a href="https://aws.amazon.com/blogs/opensource/jupyter-deploy-create-a-jupyterlab-application-with-real-time-collaboration-in-the-cloud-in-minutes/">Jupyter Deploy: Create a JupyterLab application with real-time </a><a href="https://aws.amazon.com/blogs/opensource/jupyter-deploy-create-a-jupyterlab-application-with-real-time-collaboration-in-the-cloud-in-minutes/">collaboration in the cloud in minutes | AWS Open Source Blog</a></p>
<ul>
<li style="font-weight:400;">Jupyter Deploy is an open source CLI tool from AWS that lets small teams and startups deploy a fully configured JupyterLab environment to the cloud in minutes, solving the problem of expensive enterprise deployment frameworks. </li>
<li style="font-weight:400;">The tool automatically sets up <a href="https://aws.amazon.com/ec2/">EC2</a> instances with HTTPS encryption, GitHub OAuth authentication, real-time collaboration features, and a custom domain without requiring manual console configuration.</li>
<li style="font-weight:400;">The CLI uses infrastructure-as-code templates with Terraform to provision AWS resources, making it simple to upgrade instance types for GPU workloads, add storage volumes, or manage team access through a single command. Users can easily scale from a basic t3.medium instance to GPU-accelerated instances when they need more compute power for deep learning tasks.</li>
<li style="font-weight:400;">Real-time collaboration is a key differentiator, allowing multiple team members to work simultaneously in the same JupyterLab environment after authenticating through GitHub, eliminating the security and access limitations of running Jupyter locally on laptops. The tool includes cost management features like the ability to stop instances when not in use while preserving state and file systems.</li>
<li style="font-weight:400;">The project is vendor-neutral and extensible, with AWS planning to add Kubernetes templates for Amazon EKS and welcoming community contributions for other cloud providers, OAuth providers, and deployment patterns. </li>
<li style="font-weight:400;">Templates are distributed as Python libraries that the CLI automatically discovers, making it easy for the community to create and share new deployment configurations. </li>
</ul>
<p>34:51  Justin – “A lot of people, especially in their AI workloads, they don’t want to use SageMaker for that necessarily; they want their own deployment of a cluster. And so there was just some undifferentiated heavy lifting that was happening, and so I think this helps address some of that.”</p>
<h2>GCP</h2>
<p>35:09 <a href="https://cloud.google.com/blog/products/containers-kubernetes/agentic-ai-on-kubernetes-and-gke/">Agentic AI on Kubernetes and GKE | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.cloud.google.com/kubernetes-engine/docs/concepts/sandbox-pods">Agent Sandbox</a> is a new Kubernetes primitive designed specifically for running <a href="https://cloud.google.com/blog/products/containers-kubernetes/google-bytedance-and-red-hat-improve-ai-on-kubernetes?e=48754805">AI agents that need to execute code or use computer interfaces</a>, providing kernel-level isolation through gVisor and Kata Containers. </li>
<li style="font-weight:400;">This addresses the security challenge of AI agents making autonomous decisions about tool usage, where traditional application security models fall short.</li>
<li style="font-weight:400;">On GKE, Agent Sandbox delivers sub-second latency for isolated agent workloads through pre-warmed sandbox pools, representing up to 90% improvement over cold starts. </li>
<li style="font-weight:400;">The managed implementation leverages GKE Sandbox and <a href="https://cloud.google.com/blog/products/containers-kubernetes/container-optimized-compute-delivers-autoscaling-for-autopilot?e=48754805">container-optimized compute</a> for horizontal scaling of thousands of ephemeral sandbox environments.</li>
<li style="font-weight:400;">Pod Snapshots is a GKE-exclusive feature in limited preview that enables checkpoint and restore of running pods, reducing startup times from minutes to seconds for both CPU and GPU workloads. </li>
<li style="font-weight:400;">This allows teams to snapshot idle sandboxes and suspend them to save compute costs while maintaining the ability to quickly restore them to a specific state.</li>
<li style="font-weight:400;">The project includes a Python SDK designed for AI engineers to manage sandbox lifecycles without requiring deep infrastructure expertise, while still providing Kubernetes administrators with operational control. Agent Sandbox is available as an open source CNCF project and can be deployed on GKE today, with documentation at agent-sandbox.sigs.k8s.io.</li>
<li style="font-weight:400;">Primary use cases include agentic AI systems that need to execute generated code safely, reinforcement learning environments requiring rapid provisioning of isolated compute, and computer use scenarios where agents interact with terminals or browsers. The isolation model prevents potential data exfiltration or damage to production systems from non-deterministic agent behavior.</li>
</ul>
<p>36:49  Matt – “Anything that can make these environments, especially if they are ephemeral, scale up and down better so you’re not burning time and capacity on your GPUs – that are not cheap – is definitely useful. So it’d be a nice little money saver along the way.”</p>
<p>37:09 <a href="https://cloud.google.com/blog/products/compute/ironwood-tpus-and-new-axion-based-vms-for-your-ai-workloads/">Ironwood TPUs and new Axion-based VMs for your AI workloads | Google </a><a href="https://cloud.google.com/blog/products/compute/ironwood-tpus-and-new-axion-based-vms-for-your-ai-workloads/">Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google announces Ironwood, its seventh-generation TPU, delivering 10X peak performance improvement over TPU v5p and 4X better performance per chip than TPU v6e for both training and inference workloads. </li>
<li style="font-weight:400;">The system scales up to 9,216 chips in a superpod with 9.6 Tb/s interconnect speeds and 1.77 petabytes of shared HBM, featuring Optical Circuit Switching for automatic failover. Anthropic plans to access up to <a href="https://www.googlecloudpresscorner.com/2025-10-23-Anthropic-to-Expand-Use-of-Google-Cloud-TPUs-and-Services">1 million TPUs</a> and reports that the performance gains will help scale Claude efficiently.</li>
<li style="font-weight:400;">New Axion-based N4A instances enter preview, offering up to 2X better price-performance than comparable x86 VMs for general-purpose workloads like microservices, databases, and data preparation. </li>
<li style="font-weight:400;">C4A metal, Google’s first Arm-based bare metal instance, will launch in preview soon for specialized workloads requiring dedicated physical servers. Early customers report 30% performance improvements for video transcoding at Vimeo and 60% better price-performance for data processing at ZoomInfo.</li>
<li style="font-weight:400;">Google positions Ironwood and Axion as complementary solutions for the age of inference, where agentic workflows require coordination between ML acceleration and general-purpose compute. </li>
<li style="font-weight:400;">The <a href="https://cloud.google.com/solutions/ai-hypercomputer">AI Hypercomputer</a> platform integrates both with enhanced software, including GKE Cluster Director for TPU fleet management, <a href="https://maxtext.readthedocs.io/en/latest/">MaxText</a> improvements for training optimization, and <a href="https://cloud.google.com/blog/products/compute/in-q3-2025-ai-hypercomputer-adds-vllm-tpu-and-more">vLLM</a> support for switching between GPUs and TPUs. According to IDC, AI Hypercomputer customers achieved 353% three-year ROI and 28% lower IT costs on average.</li>
<li style="font-weight:400;">The announcement emphasizes system-level co-design across hardware, networking, and software, building on Google’s custom silicon history, including TPUs that enabled the Transformer architecture eight years ago. Ironwood uses advanced liquid cooling deployed at a gigawatt scale with 99.999% fleet-wide uptime since 2020, while the Jupiter data center network connects multiple superpods into clusters of hundreds of thousands of TPUs. </li>
<li style="font-weight:400;">Customers can sign up for Ironwood, N4A, and C4A metal preview access through Google Cloud forms.</li>
</ul>
<p>38:57 <a href="https://cloud.google.com/blog/topics/cost-management/automate-financial-governance-policies-using-workload-manager/">Automate financial governance policies using Workload Manager | Google </a><a href="https://cloud.google.com/blog/topics/cost-management/automate-financial-governance-policies-using-workload-manager/">Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google has enhanced Workload Manager to automate FinOps cost governance policies across GCP organizations, allowing teams to codify financial rules using Open Policy Agent Rego and run continuous compliance scans. </li>
<li style="font-weight:400;">The tool includes <a href="https://cloud.google.com/workload-manager/docs/reference/best-practices-general">predefined rules</a> for common cost management scenarios like enforcing resource labels, lifecycle policies on Cloud Storage buckets, and data retention settings, with results exportable to BigQuery for analysis and visualization in Looker Studio.</li>
<li style="font-weight:400;">The pricing update is significant, with Google reducing Workload Manager costs by up to 95 percent for certain scenarios and introducing a small free tier for testing. 
<ul>
<li style="font-weight:400;">This makes large-scale automated policy scanning more economical compared to manual auditing processes that can take weeks or months while costs accumulate.</li>
</ul>
</li>
<li style="font-weight:400;">The automation addresses configuration drift where systems deviate from established cost policies, enabling teams to define rules once and scan entire organizations, specific folders, or individual projects on schedules ranging from hourly to monthly. Integration with notification channels, including email, Slack, and PagerDuty, ensures policy violations reach the appropriate teams for remediation.</li>
<li style="font-weight:400;">Organizations can use custom rules from the <a href="https://github.com/GoogleCloudPlatform/workload-manager/tree/main/rules">GitHub repository</a> or leverage hundreds of Google-authored best practice rules covering FinOps, security, reliability, and operations. </li>
<li style="font-weight:400;">The BigQuery export capability provides historical compliance tracking and supports showback reporting for cost allocation across teams and business units.</li>
</ul>
<p>40:06 Matt – “Having that very quick, rapid response to know that something changed and you need to go look at it before you get a 10 million dollar bill is critical.” </p>
<h2>Azure</h2>
<p>41:50 <a href="https://azure.microsoft.com/en-us/updates?id=526881">Generally Available: Azure MCP Server</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/developer/azure-mcp-server/get-started">Azure MCP Server</a> provides a standardized way for AI agents and developers to interact with Azure services through the Model Context Protocol. </li>
<li style="font-weight:400;">This creates a consistent interface layer across services like AKS, <a href="https://learn.microsoft.com/en-us/azure/container-apps/overview">Azure Container Apps</a>, App Service, <a href="https://azure.microsoft.com/en-us/products/cosmos-db/?ef_id=_k_421aea2a4b721b17f96c83358a597e7e_k_&amp;OCID=AIDcmm5edswduu_SEM__k_421aea2a4b721b17f96c83358a597e7e_k_&amp;msclkid=421aea2a4b721b17f96c83358a597e7e">Cosmos DB</a>, SQL Database, and <a href="https://ai.azure.com/">AI Foundry</a>, reducing the need to learn individual service APIs.</li>
<li style="font-weight:400;">The MCP implementation allows developers to build AI agents that can programmatically manage and query Azure resources using natural language or structured commands. 
<ul>
<li style="font-weight:400;">This bridges the gap between conversational AI interfaces and cloud infrastructure management, enabling scenarios like automated resource provisioning or intelligent troubleshooting assistants.</li>
</ul>
</li>
<li style="font-weight:400;">The server architecture provides secure, authenticated access to Azure services while maintaining standard Azure RBAC controls. 
<ul>
<li style="font-weight:400;">This means AI agents operate within existing security boundaries and permissions frameworks rather than requiring separate authentication mechanisms.</li>
</ul>
</li>
<li style="font-weight:400;">Primary use cases include DevOps automation, intelligent cloud management tools, and AI-powered development assistants that need direct Azure service integration. Organizations building copilots or agent-based workflows can now connect to Azure infrastructure without custom API integration work for each service.</li>
<li style="font-weight:400;">The feature is generally available across Azure regions where the underlying services operate. Pricing follows standard Azure service consumption models for the resources accessed through MCP, with no additional charge for the MCP Server interface itself.</li>
</ul>
<p>42:50 Matt – “So I like the idea of this, and I like it for troubleshooting and stuff like this, but the idea of using it to provision resources terrifies me. Maybe in development environments, ‘Hey, I’m setting up a three-tier web application, spin me up what I need.’ But if you’re doing this for a company, I really worry about speaking in natural language, and consistently getting the same result to spin up resources.”</p>
<p>45:50 <a href="https://azure.microsoft.com/en-us/blog/the-new-era-of-azure-ultra-disk-experience-the-next-generation-of-mission-critical-block-storage/">A new era and new features in Azure Ultra Disk</a></p>
<ul>
<li style="font-weight:400;"><a href="https://azure.microsoft.com/en-us/blog/azure-ultra-disk-storage-microsoft-s-service-for-your-most-i-o-demanding-workloads/">Azure Ultra Disk</a> receives substantial performance and cost optimization updates focused on mission-critical workloads. 
<ul>
<li style="font-weight:400;">The service now delivers an 80% reduction in P99.9 and outlier latency, plus a 30% improvement in average latency, making it suitable for transaction logs and I/O-intensive applications that previously required local SSDs or Write Accelerator.</li>
</ul>
</li>
<li style="font-weight:400;">New flexible provisioning model enables significant cost savings with workloads on small disks, saving up to 50% and large disks up to 25%. </li>
<li style="font-weight:400;">Customers can now independently adjust capacity, IOPS, and throughput with more granular control, allowing a financial database example to reduce Ultra Disk spending by 22% while maintaining required performance levels.</li>
<li style="font-weight:400;">Instant Access Snapshot feature enters public preview for Ultra Disk and Premium SSD v2, eliminating traditional wait times for snapshot readiness. New disks created from these snapshots hydrate up to 10x faster with minimal read latency impact during hydration, enabling rapid recovery and replication for business continuity scenarios.</li>
<li style="font-weight:400;">Ultra Disk now supports <a href="https://learn.microsoft.com/en-us/azure/virtual-machines/ebdsv5-ebsv5-series">Azure Boost VMs, including Ebdsv5 series</a> (GA with up to 400,000 IOPS and 10GB/s) and <a href="https://techcommunity.microsoft.com/blog/sapapplications/new-mbv3-size-standard-m416bs-v3-general-availability/4439103">Memory Optimized Mbv3 VM Standard_M416bs_v3</a> (GA with up to 550,000 IOPS and 10GB/s). </li>
<li style="font-weight:400;">Additional Azure Boost VM announcements are planned for 2025 Ignite with further performance improvements for remote block storage.</li>
<li style="font-weight:400;">Recent feature additions include live resize capability, encryption at host support, Azure Site Recovery and VM Backup integration, and shared disk capability for SCSI Persistent Reservations. </li>
<li style="font-weight:400;">Third-party backup and disaster recovery services now support Ultra Disk for customers with existing tooling preferences.</li>
</ul>
<p>47:38 Matt – “There wasn’t any encryption at the host level? Clearly I make bad life choices being in Azure, but not THAT bad of choices.” </p>
<p>48:21 <a href="https://techcommunity.microsoft.com/blog/appsonazureblog/announcing-general-availability-of-larger-container-sizes-on-azure-container-ins/4463863">Announcing General Availability of Larger Container Sizes on Azure </a><a href="https://techcommunity.microsoft.com/blog/appsonazureblog/announcing-general-availability-of-larger-container-sizes-on-azure-container-ins/4463863">Container Instances | Microsoft Community Hub</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/container-instances/container-instances-overview">Azure Container Instances</a> now supports container sizes up to 31 vCPUs and 240 GB of memory for standard containers, expanding from the previous 4 vCPUs and 16 GB limits. </li>
<li style="font-weight:400;">This applies across standard containers, confidential containers, virtual network-enabled containers, and AKS virtual nodes, though confidential containers max out at 180 GB memory.</li>
<li style="font-weight:400;">The larger sizes target data-intensive workloads like real-time fraud detection, predictive maintenance, collaborative analytics in healthcare, and high-performance computing tasks such as climate modeling and genomic research. Organizations can now run fewer, larger containers instead of managing multiple smaller instances, simplifying scaling operations.</li>
<li style="font-weight:400;">Customers must request quota approval through Azure Support before deploying containers exceeding 4 vCPUs and 16 GB, then can deploy via Azure Portal, CLI, PowerShell, ARM templates, or Bicep. The serverless nature maintains ACI’s pay-per-use pricing model, though specific costs for larger SKUs are not detailed in the announcement.</li>
<li style="font-weight:400;">This positions ACI as a more viable alternative to managed Kubernetes for workloads that need substantial compute resources but don’t require full orchestration complexity. The enhancement particularly benefits scenarios where confidential computing is required, as those containers can now scale to 31 vCPUs with 180 GB memory while maintaining security boundaries.</li>
</ul>
<p>49:40 <a href="https://azure.microsoft.com/en-us/updates?id=522059">Generally Available: Geo/Object Priority Replication for Azure Blob</a></p>
<ul>
<li style="font-weight:400;">Geo Priority Replication is now generally available for <a href="https://azure.microsoft.com/en-us/products/storage/blobs/?ef_id=_k_73adb87671131ba7de5cd169b2b6613b_k_&amp;OCID=AIDcmm5edswduu_SEM__k_73adb87671131ba7de5cd169b2b6613b_k_&amp;msclkid=73adb87671131ba7de5cd169b2b6613b">Azure Blob Storage</a>, providing accelerated data replication between primary and secondary regions for GRS and GZRS storage accounts with an SLA-backed guarantee. This addresses a longstanding customer request for predictable replication timing in geo-redundant storage scenarios.</li>
<li style="font-weight:400;">The feature specifically targets customers with compliance requirements or business continuity needs that demand faster recovery point objectives (RPO) for their geo-replicated data. Organizations in regulated industries like finance and healthcare can now better meet data availability requirements with measurable replication performance.</li>
<li style="font-weight:400;">This enhancement works within the existing GRS and GZRS storage account types, meaning customers can enable it on current deployments without migrating to new account types. The SLA backing represents a shift from best-effort replication to guaranteed performance metrics for secondary region data synchronization.</li>
<li style="font-weight:400;">The announcement appears truncated with incomplete SLA details, but the core value proposition centers on reducing the uncertainty around when data becomes available in secondary regions during normal operations. This matters for disaster recovery planning, where organizations need to calculate realistic RPO values rather than relying on variable replication times.</li>
<li style="font-weight:400;">Pricing details were not included in the announcement, though this feature likely carries additional costs beyond standard GRS or GZRS storage rates, given the performance guarantees involved. Customers should review Azure pricing documentation for specific cost implications before enabling geo priority replication.</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2235457/c1e-v0z0c7446jfodw79-gp91qd0kix3w-7tv11m.mp3" length="97001008"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 329 of The Cloud Pod, where the forecast is always cloudy (and if you’re in California, rainy too!) Justin and Matt have taken a break from Ark building activities to bring you this week’s episode, packed with all the latest in cloud and AI news, including undersea cables (our favorite!) FinOps, Ignite predictions, and so much more! Grab your umbrellas and let’s get started! 
Titles we almost went with this week

 Fastnet and Furious: AWS Lays 320 Terabits of Cable Across the Atlantic
 No More kubectl apply –pray: AWS Backup Takes the Stress Out of EKS Recovery
 AWS Gets Swift with Lambda: No Taylor Version Required
 Breaking Up Is Hard to Do: Microsoft Splits Teams from Office
 FinOps and Behold: Google Automates Your Cloud Budget Nightmares
 AMD Turin Around GCP’s Price-Performance with N4D VMs
 Azure Gets Territorial: Your Data Stays Put Whether It Likes It or Not
 AWS Finally Answers “Is It Available in My Region?” Before You Build It 
 Getting to the Bare Metal of Things: Google’s Axion Goes Commando
 Azure Ultra Disk Gets Ultra Serious About Latency
 Container Size Matters: Azure Expands ACI to 240 GB Memory 
 Google Containerises Chaos: Agent Sandbox Keeps Your AI from Going Rogue
 AWS Prints Money While Amazon Prints Pink Slips: Q3 Earnings Beat

Follow Up 
02:08 Microsoft sidesteps hefty EU fine with Teams unbundling deal

Microsoft avoids a potentially substantial EU antitrust fine by agreeing to unbundle Teams from the Office 365 and Microsoft 365 suites for a period of seven years. 
The settlement follows a 2023 complaint from Salesforce-owned Slack alleging anticompetitive bundling practices that harmed rival collaboration tools.
The commitments require Microsoft to offer Office and ]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2235457/c1a-k5d5-7zxv5no3a7mk-jtl27u.jpg"></itunes:image>
                                                                            <itunes:duration>00:50:27</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2235457/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[329: Azure Front Door: Please Use the Side Entrance]]>
                </title>
                <pubDate>Wed, 12 Nov 2025 07:35:45 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2203823</guid>
                                    <link>https://tcpfm.castos.com/episodes/329-azure-front-door-please-use-the-side-entrance</link>
                                <description>
                                            <![CDATA[<h3>Welcome to episode 329 of The Cloud Pod, where the forecast is always cloudy! Justin, Jonathan, and special guest Elise are in the studio to bring you all the latest in AI and cloud news, including – you guessed it – more outages, and more OpenAI team-ups. We’ve also got GPUs, K8 news, and Cursor updates. Let’s get started! </h3>
<h3>Titles we almost went with this week</h3>
<ul>
<li>Azure Front Door: Please Use the Side Entrance – el -jb</li>
<li>Azure and NVIDIA: A Match Made in GPU Heaven – mk</li>
<li>Azure Goes Down Under the Weight of Its Own Configuration – el</li>
<li>GitHub Turns Your Copilot Subscription Into an All-You-Can-Eat Agent Buffet – mk, el</li>
<li>Microsoft Goes Full Blackwell: No Regrets, Just GPUs</li>
<li>Jules Verne Would Be Proud: Google’s CLI Goes 20,000 Bugs Under the Codebase</li>
<li>RAG to Riches: AWS Makes Retrieval Augmented Generation Turnkey</li>
<li>Kubectl Gets a Gemini Twin: Google Teaches AI to Speak Kubernetes</li>
<li>I’m Not a Robot: Azure WAF Finally Learns to Ask the Important Questions</li>
<li>OpenAI Puts 38 Billion Eggs in Amazon’s Basket: Multi-Cloud Gets Complicated</li>
<li>The Root Cause They’ll Never Root Out: Why Attrition Stays Off the RCA</li>
<li>Google’s New Extension Lets You Deploy Kubernetes by Just Asking Nicely</li>
<li>Cursor 2.0: Now With More Agents Than a Hollywood Talent Agency</li>
</ul>
<h2>Follow Up </h2>
<p>04:46 <a href="https://www.zdnet.com/article/massive-azure-outage-is-over-but-problems-linger-heres-what-happened/">Massive Azure outage is over, but problems linger – here’s what happened | </a><a href="https://www.zdnet.com/article/massive-azure-outage-is-over-but-problems-linger-heres-what-happened/">ZDNET</a> </p>
<ul>
<li style="font-weight:400;"><a href="https://azure.microsoft.com/en-us/pricing/purchase-options/azure-account/search?ef_id=_k_a70c9a4bd2dc19933a74f35fd7b642dd_k_&amp;OCID=AIDcmm5edswduu_SEM__k_a70c9a4bd2dc19933a74f35fd7b642dd_k_&amp;msclkid=a70c9a4bd2dc19933a74f35fd7b642dd">Azure</a> experienced a global outage on October 29, affecting all regions simultaneously, unlike the recent <a href="https://www.computerworld.com/article/4082890/the-aws-outage-post-mortem-is-more-revealing-in-what-it-doesnt-say.html">AWS outage</a> that was limited to a single region. </li>
<li style="font-weight:400;">The incident lasted approximately eight hours from noon to 8 PM ET, impacting major services including <a href="https://www.office.com/">Microsoft 365</a>, <a href="https://www.microsoft.com/en-us/microsoft-teams/group-chat-software">Teams</a>, <a href="https://www.xbox.com/en-US/live">Xbox Live</a>, and critical infrastructure for Alaska Airlines, Vodafone UK, and Heathrow Airport, among others.</li>
<li style="font-weight:400;">The root cause was an inadvertent tenant configuration change in <a href="https://learn.microsoft.com/en-us/azure/frontdoor/front-door-overview">Azure Front Door</a> that bypassed safety validations due to a software defect. Microsoft’s protection mechanisms failed to catch the erroneous deployment, allowing invalid configurations to propagate across the global fleet and cause HTTP timeouts, server errors, and elevated packet loss at network edges.</li>
<li style="font-weight:400;">Recovery required rolling back to the last known good configuration and gradually rebalancing traffic across nodes to prevent overload conditions. </li>
<li style="font-weight:400;">Some customers experienced lingering issues even after the official recovery time, with Microsoft temporarily blocking configuration changes to Azure Front Door while completing the restoration process.</li>
<li style="font-weight:400;">The incident highlights concentration risk in cloud infrastructure, as this marks the second major cloud provider outage in October 2025. </li>
<li style="font-weight:400;">Despite Azure revenue growing 40 percent in the latest quarterly report, Microsoft’s stock declined in after-hours trading as the company acknowledged capaci...</li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:00) - Azure Front Door</li><li>(00:01:07) - Microsoft Azure's Front Door Outage: Update!</li><li>(00:04:09) - Amazon AWS and OpenAI Announce Multi-Year Strategic Partnership</li><li>(00:09:21) - OpenAI vs. Nvidia: Which One Will Win?</li><li>(00:12:09) - Google removes Gemini AI models from AI Studio</li><li>(00:20:40) - The New York Times' political model</li><li>(00:21:35) - GitHub's Agent HQ: Orchestrating Multiple Agents with</li><li>(00:25:53) - Cursor Launches Multi-Agent Interface with Composer</li><li>(00:33:49) - Conversations with an AI</li><li>(00:37:13) - Amazon.com Releases MCP Proxy for AWS</li><li>(00:40:35) - Cloud Cost Management Tool</li><li>(00:41:18) - ECS Now Supports Built-in Linear and Canary Deployments</li><li>(00:44:27) - Amazon Route 53 Resolver now supports AWS Private Link</li><li>(00:47:46) - Mount Points for S3</li><li>(00:52:08) - Google Cloud's New Log Analytics Query Builder</li><li>(00:54:40) - Google's Gemini CLI Adds Kubernetes to DevOps</li><li>(00:58:13) - Google Launches Joules Extension for Gnome CLI</li><li>(01:04:20) - Google Cloud: GA of Cost Anomaly Detection</li><li>(01:09:07) - Microsoft and Nvidia expand AI partnership with Azure</li><li>(01:11:23) - California data centers: How expensive is electricity?</li><li>(01:13:02) - Microsoft: Azure Cloud: 1.2 Million Tokens a Second,</li><li>(01:19:25) - Azure WAF: Capture Challenges for Bot Traffic</li><li>(01:22:10) - Azure: Instant Access to Snapshots for SSD & Ultra Disk</li><li>(01:27:47) - Week in Cloud: The Cloud Podcast</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 329 of The Cloud Pod, where the forecast is always cloudy! Justin, Jonathan, and special guest Elise are in the studio to bring you all the latest in AI and cloud news, including – you guessed it – more outages, and more OpenAI team-ups. We’ve also got GPUs, K8 news, and Cursor updates. Let’s get started! 
Titles we almost went with this week

Azure Front Door: Please Use the Side Entrance – el -jb
Azure and NVIDIA: A Match Made in GPU Heaven – mk
Azure Goes Down Under the Weight of Its Own Configuration – el
GitHub Turns Your Copilot Subscription Into an All-You-Can-Eat Agent Buffet – mk, el
Microsoft Goes Full Blackwell: No Regrets, Just GPUs
Jules Verne Would Be Proud: Google’s CLI Goes 20,000 Bugs Under the Codebase
RAG to Riches: AWS Makes Retrieval Augmented Generation Turnkey
Kubectl Gets a Gemini Twin: Google Teaches AI to Speak Kubernetes
I’m Not a Robot: Azure WAF Finally Learns to Ask the Important Questions
OpenAI Puts 38 Billion Eggs in Amazon’s Basket: Multi-Cloud Gets Complicated
The Root Cause They’ll Never Root Out: Why Attrition Stays Off the RCA
Google’s New Extension Lets You Deploy Kubernetes by Just Asking Nicely
Cursor 2.0: Now With More Agents Than a Hollywood Talent Agency

Follow Up 
04:46 Massive Azure outage is over, but problems linger – here’s what happened | ZDNET 

Azure experienced a global outage on October 29, affecting all regions simultaneously, unlike the recent AWS outage that was limited to a single region. 
The incident lasted approximately eight hours from noon to 8 PM ET, impacting major services including Microsoft 365, Teams, Xbox Live, and critical infrastructure for Alaska Airlines, Vodafone UK, and Heathrow Airport, among others.
The root cause was an inadvertent tenant configuration change in Azure Front Door that bypassed safety validations due to a software defect. Microsoft’s protection mechanisms failed to catch the erroneous deployment, allowing invalid configurations to propagate across the global fleet and cause HTTP timeouts, server errors, and elevated packet loss at network edges.
Recovery required rolling back to the last known good configuration and gradually rebalancing traffic across nodes to prevent overload conditions. 
Some customers experienced lingering issues even after the official recovery time, with Microsoft temporarily blocking configuration changes to Azure Front Door while completing the restoration process.
The incident highlights concentration risk in cloud infrastructure, as this marks the second major cloud provider outage in October 2025. 
Despite Azure revenue growing 40 percent in the latest quarterly report, Microsoft’s stock declined in after-hours trading as the company acknowledged capaci...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[329: Azure Front Door: Please Use the Side Entrance]]>
                </itunes:title>
                                    <itunes:episode>329</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<h3>Welcome to episode 329 of The Cloud Pod, where the forecast is always cloudy! Justin, Jonathan, and special guest Elise are in the studio to bring you all the latest in AI and cloud news, including – you guessed it – more outages, and more OpenAI team-ups. We’ve also got GPUs, K8 news, and Cursor updates. Let’s get started! </h3>
<h3>Titles we almost went with this week</h3>
<ul>
<li>Azure Front Door: Please Use the Side Entrance – el -jb</li>
<li>Azure and NVIDIA: A Match Made in GPU Heaven – mk</li>
<li>Azure Goes Down Under the Weight of Its Own Configuration – el</li>
<li>GitHub Turns Your Copilot Subscription Into an All-You-Can-Eat Agent Buffet – mk, el</li>
<li>Microsoft Goes Full Blackwell: No Regrets, Just GPUs</li>
<li>Jules Verne Would Be Proud: Google’s CLI Goes 20,000 Bugs Under the Codebase</li>
<li>RAG to Riches: AWS Makes Retrieval Augmented Generation Turnkey</li>
<li>Kubectl Gets a Gemini Twin: Google Teaches AI to Speak Kubernetes</li>
<li>I’m Not a Robot: Azure WAF Finally Learns to Ask the Important Questions</li>
<li>OpenAI Puts 38 Billion Eggs in Amazon’s Basket: Multi-Cloud Gets Complicated</li>
<li>The Root Cause They’ll Never Root Out: Why Attrition Stays Off the RCA</li>
<li>Google’s New Extension Lets You Deploy Kubernetes by Just Asking Nicely</li>
<li>Cursor 2.0: Now With More Agents Than a Hollywood Talent Agency</li>
</ul>
<h2>Follow Up </h2>
<p>04:46 <a href="https://www.zdnet.com/article/massive-azure-outage-is-over-but-problems-linger-heres-what-happened/">Massive Azure outage is over, but problems linger – here’s what happened | </a><a href="https://www.zdnet.com/article/massive-azure-outage-is-over-but-problems-linger-heres-what-happened/">ZDNET</a> </p>
<ul>
<li style="font-weight:400;"><a href="https://azure.microsoft.com/en-us/pricing/purchase-options/azure-account/search?ef_id=_k_a70c9a4bd2dc19933a74f35fd7b642dd_k_&amp;OCID=AIDcmm5edswduu_SEM__k_a70c9a4bd2dc19933a74f35fd7b642dd_k_&amp;msclkid=a70c9a4bd2dc19933a74f35fd7b642dd">Azure</a> experienced a global outage on October 29, affecting all regions simultaneously, unlike the recent <a href="https://www.computerworld.com/article/4082890/the-aws-outage-post-mortem-is-more-revealing-in-what-it-doesnt-say.html">AWS outage</a> that was limited to a single region. </li>
<li style="font-weight:400;">The incident lasted approximately eight hours from noon to 8 PM ET, impacting major services including <a href="https://www.office.com/">Microsoft 365</a>, <a href="https://www.microsoft.com/en-us/microsoft-teams/group-chat-software">Teams</a>, <a href="https://www.xbox.com/en-US/live">Xbox Live</a>, and critical infrastructure for Alaska Airlines, Vodafone UK, and Heathrow Airport, among others.</li>
<li style="font-weight:400;">The root cause was an inadvertent tenant configuration change in <a href="https://learn.microsoft.com/en-us/azure/frontdoor/front-door-overview">Azure Front Door</a> that bypassed safety validations due to a software defect. Microsoft’s protection mechanisms failed to catch the erroneous deployment, allowing invalid configurations to propagate across the global fleet and cause HTTP timeouts, server errors, and elevated packet loss at network edges.</li>
<li style="font-weight:400;">Recovery required rolling back to the last known good configuration and gradually rebalancing traffic across nodes to prevent overload conditions. </li>
<li style="font-weight:400;">Some customers experienced lingering issues even after the official recovery time, with Microsoft temporarily blocking configuration changes to Azure Front Door while completing the restoration process.</li>
<li style="font-weight:400;">The incident highlights concentration risk in cloud infrastructure, as this marks the second major cloud provider outage in October 2025. </li>
<li style="font-weight:400;">Despite Azure revenue growing 40 percent in the latest quarterly report, Microsoft’s stock declined in after-hours trading as the company acknowledged capacity constraints in meeting AI and cloud demands.</li>
<li style="font-weight:400;">Affected Azure services included App Service, Azure SQL Database, Microsoft Entra ID, Container Registry, Azure Databricks, and approximately 15 other core platform services. Microsoft has implemented additional validation and rollback controls to prevent similar configuration deployment failures, though the full post-incident report remains pending.</li>
</ul>
<p>07:06  Matt – “The fact that you’re plus one week and still can’t actually make changes or even do simple things like purge a cache makes me think this is a lot bigger on the backend than they let on at the beginning.”</p>
<h2>AI Is Going Great – Or How ML Makes Money</h2>
<p>08:30 <a href="https://openai.com/index/aws-and-openai-partnership">AWS and OpenAI announce multi-year strategic partnership | OpenAI</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/">AWS</a> and <a href="https://openai.com/">OpenAI</a> formalized a 38 billion dollar multi-year partnership providing OpenAI immediate access to hundreds of thousands of NVIDIA GPUs (GB200s and GB300s) clustered via <a href="https://aws.amazon.com/ec2/ultraservers/">Amazon EC2 UltraServers</a>, with capacity deployment targeted by the end of 2026. </li>
<li style="font-weight:400;">The infrastructure supports both <a href="https://www.google.com/aclk?sa=L&amp;ai=DChsSEwi7kvXn3uuQAxXPIkQIHcVrBpgYACICCAEQABoCZHo&amp;co=1&amp;ase=2&amp;gclid=CjwKCAiA2svIBhB-EiwARWDPjsFMRHxa6-ZKAVcUXbMFzVTNeajn1_hhTNNay1cfOafWb0A6_P2VQRoCSjcQAvD_BwE&amp;cid=CAASWeRoYivTEXIlACcnBrHYk-cgaWj51Ahj4vkxOfOVwL9d38DJUEhrvcVMFgc-0rwpCM0odbiHD8codtCCG_RZ-Ss3tUoTTcjWn0ZssU_naeCHZLUPLJkn_TT1&amp;cce=2&amp;category=acrcp_v1_32&amp;sig=AOD64_25ZH7Uvv72DbmzPfTtBVg9NtgrBQ&amp;q&amp;nis=4&amp;adurl&amp;ved=2ahUKEwjn5u_n3uuQAxVrHEQIHZuwJtgQ0Qx6BAgMEAE">ChatGPT</a> inference serving and next-generation model training with the ability to scale to tens of millions of CPUs for agentic workloads.</li>
<li style="font-weight:400;">The partnership builds on existing integration where OpenAI’s open weight foundation models became available on <a href="https://aws.amazon.com/bedrock/'">Amazon Bedrock</a> earlier this year, making OpenAI one of the most popular model providers on the platform. Thousands of customers, including Thomson Reuters, Peloton, and Verana Health, are already using these models for agentic workflows, coding, and scientific analysis.</li>
<li style="font-weight:400;">AWS positions this as validation of their large-scale AI infrastructure capabilities, noting they have experience running clusters exceeding 500,000 chips with the security, reliability, and scale required for frontier model development. </li>
<li style="font-weight:400;">The low-latency network architecture of <a href="https://aws.amazon.com/ec2/ultraservers/">EC2 UltraServers</a> enables optimal performance for interconnected GPU systems.</li>
<li style="font-weight:400;">This represents a significant shift in OpenAI’s infrastructure strategy, moving substantial compute workloads to AWS while maintaining its existing Microsoft Azure relationship. </li>
<li style="font-weight:400;">The seven-year commitment timeline with continued growth provisions indicates long-term capacity planning for increasingly compute-intensive AI model development.</li>
</ul>
<p>09:53  Elise – “It sort of feels like OpenAI has a strategic partnership with everyone right now, so I’m sure this will help them, just like everything else that they have done will help them. We’re banking a lot on OpenAI being very successful.” </p>
<p>17:11 <a href="https://arstechnica.com/google/2025/11/google-removes-gemma-models-from-ai-studio-after-gop-senators-complaint/">Google removes Gemma models from AI Studio after GOP senators </a><a href="https://arstechnica.com/google/2025/11/google-removes-gemma-models-from-ai-studio-after-gop-senators-complaint/">complaint – Ars Technica</a></p>
<ul>
<li style="font-weight:400;">Google removed its open <a href="https://deepmind.google/models/gemma/">Gemma AI models</a> from <a href="https://www.google.com/aclk?sa=L&amp;ai=DChsSEwikl5jn3-uQAxX9B7wBHR_LH6QYACICCAEQARoCZHo&amp;ae=2&amp;co=1&amp;ase=2&amp;gclid=CjwKCAiA2svIBhB-EiwARWDPjp3rdfjCYYuxe0lDICUQcp0H43bB4Fdiw8m0hS2xdB-LOljcO6Xd9hoC5jwQAvD_BwE&amp;cid=CAASWeRoaCMk5aBDAvOFCz6f5PjfntoEsx1i0BE_Z15iW4sP7hUp5bowUxRIHe5_P5V4nAmxB-KD2dxIhLOZIMI0oM7MNYNNs-o3sJvSfAu-jSwsX3RTzPey9UM-&amp;cce=2&amp;category=acrcp_v1_71&amp;sig=AOD64_3oQmVAGRjz1yLEorKGOpxu8nFoYw&amp;q&amp;nis=4&amp;adurl&amp;ved=2ahUKEwignpLn3-uQAxUOIEQIHSYpKTAQ0Qx6BAgOEAE">AI Studio</a> following a complaint from <a href="https://www.blackburn.senate.gov/2025/10/technology/blackburn-demands-answers-from-google-after-gemma-manufactured-fake-criminal-allegations-against-her">Senator Marsha Blackburn</a>, who reported the model hallucinated false sexual misconduct allegations against her when prompted with leading questions. </li>
<li style="font-weight:400;">The model allegedly fabricated detailed false claims and generated fake news article links, demonstrating the <a href="https://arstechnica.com/google/2025/10/unexpectedly-a-deer-briefly-entered-the-family-room-living-with-gemini-home/">persistent hallucination problem</a> across generative AI systems.</li>
<li style="font-weight:400;">The removal only affects non-developer access through AI Studio’s user interface, where model behavior tweaking tools could increase hallucination likelihood. </li>
<li style="font-weight:400;">Developers can still access Gemma through the API and download models for local development, suggesting Google is limiting casual experimentation rather than pulling the model entirely.</li>
<li style="font-weight:400;">This incident highlights the ongoing challenge of AI hallucinations in production systems, which no AI firm has successfully eliminated despite mitigation efforts. </li>
<li style="font-weight:400;">Google’s response indicates a shift toward restricting open model access when inflammatory outputs could result from user prompting, potentially setting a precedent for how cloud providers handle politically sensitive AI failures.</li>
<li style="font-weight:400;">The timing follows congressional hearings where Google defended its hallucination mitigation practices, with the company’s representative acknowledging these issues are widespread across the industry. </li>
<li style="font-weight:400;">This creates a tension between open model availability and liability concerns when models generate defamatory content, particularly affecting cloud-based AI platforms.</li>
</ul>
<p>23:00  Matt – “That’s everything on the internet, though. When Wikipedia first came out and you started using it, we were told you can’t reference Wikipedia, because who knows what was put on there…you can’t blindly trust.”  </p>
<h2>Cloud Tools  </h2>
<p>26:53 <a href="https://github.blog/news-insights/company-news/welcome-home-agents/?utm_source=MSFT-day1-blog&amp;utm_medium=social&amp;utm_campaign=universe25">Introducing Agent HQ: Any agent, any way you work – The GitHub Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://github.com/">GitHub</a> launches <a href="https://github.blog/news-insights/company-news/welcome-home-agents/">Agent HQ</a> as a unified platform to orchestrate multiple AI coding agents from <a href="https://www.anthropic.com/">Anthropic</a>, <a href="https://openai.com/">OpenAI</a>, Google, <a href="https://cognition.ai/">Cognition</a>, and <a href="https://x.ai/">xAI</a> directly within GitHub and <a href="https://code.visualstudio.com/">VS Code</a>, all included in paid Copilot subscriptions. </li>
<li style="font-weight:400;">This eliminates the fragmented experience of juggling different AI tools across separate interfaces and subscriptions.</li>
<li style="font-weight:400;"><a href="https://github.blog/changelog/2025-10-28-a-mission-control-to-assign-steer-and-track-copilot-coding-agent-tasks/?utm_source=blog-day1-recap-mission-control-cta&amp;utm_medium=blog&amp;utm_campaign=universe25">Mission Control</a> provides a single command center across GitHub, VS Code, mobile, and CLI to assign work to different agents in parallel, track their progress, and manage agent identities and permissions just like human team members. </li>
<li style="font-weight:400;">The system maintains familiar Git primitives like pull requests and issues while adding granular controls over when CI runs on agent-generated code.</li>
<li style="font-weight:400;">VS Code gets Plan Mode for building step-by-step task approaches with clarifying questions before code generation, plus <a href="http://agents.md">AGENTS.md</a> files for creating custom agents with specific rules like preferred logging frameworks or testing patterns. </li>
<li style="font-weight:400;">It’s the only editor supporting the full Model Context Protocol specification with one-click access to the <a href="https://code.visualstudio.com/docs/copilot/customization/mcp-servers?utm_source=blog-day1-recap-mcp-registry-in-vs-code-cta&amp;utm_medium=blog&amp;utm_campaign=universe25">GitHub MCP Registry</a> for integrating tools like Stripe, Figma, and Sentry.</li>
<li style="font-weight:400;"><a href="https://github.blog/changelog/2025-10-28-github-code-quality-in-public-preview/?utm_source=blog-day1-recap-code-quality-cta&amp;utm_medium=blog&amp;utm_campaign=universe25">GitHub Code Quality</a> in public preview now provides org-wide visibility into code maintainability and reliability, with Copilot automatically reviewing its own generated code before developers see it to catch technical debt early. </li>
<li style="font-weight:400;">Enterprise admins get a new control plane for governing AI access, setting security policies, and viewing <a href="https://github.blog/changelog/2025-10-28-copilot-usage-metrics-dashboard-and-api-in-public-preview/?utm_source=blog-day1-recap-copilot-metrics-dashboard-cta&amp;utm_medium=blog&amp;utm_campaign=universe25">Copilot usage metrics</a> across the organization.</li>
<li style="font-weight:400;">The platform keeps developers on GitHub’s existing compute infrastructure, whether using GitHub Actions or self-hosted runners, avoiding vendor lock-in while OpenAI Codex becomes available this week in VS Code Insiders for Copilot Pro+ users as the first partner agent.</li>
</ul>
<p>27:20  Jonathan- “I’m like the different interfaces; they all bring something a little different.” </p>
<p>31:55 <a href="https://arstechnica.com/ai/2025/10/cursor-introduces-its-coding-model-alongside-multi-agent-interface/">Cursor introduces its coding model alongside multi-agent interface – Ars </a>:<a href="https://arstechnica.com/ai/2025/10/cursor-introduces-its-coding-model-alongside-multi-agent-interface/">Technica</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cursor.com/">Cursor</a> launches <a href="https://cursor.com/blog/2-0">version 2.0</a> of its IDE with <a href="https://cursor.com/blog/2-0">Composer</a>, its first competitive in-house coding model built using reinforcement learning and mixture-of-experts architecture. </li>
<li style="font-weight:400;">The company claims Composer is 4x faster than similarly intelligent models while maintaining competitive intelligence levels with frontier models from OpenAI, Google, and Anthropic.</li>
<li style="font-weight:400;">The new multi-agent interface in <a href="https://cursor.com/blog/2-0">Cursor 2.0</a> allows developers to run multiple AI agents in parallel for coding tasks, expanding beyond the single-agent workflow that has been standard in AI-assisted development environments. 
<ul>
<li style="font-weight:400;">This represents a shift toward more complex, distributed AI assistance within the IDE.</li>
</ul>
</li>
<li style="font-weight:400;">Cursor’s internal benchmarking shows Composer prioritizes speed over raw intelligence, outperforming competitors significantly in tokens per second while slightly underperforming the best frontier models in intelligence metrics. 
<ul>
<li style="font-weight:400;">This positions it as a practical option for developers who need faster code generation and iteration cycles.</li>
</ul>
</li>
<li style="font-weight:400;">The IDE maintains its<a href="https://code.visualstudio.com/"> Visual Studio Code</a> foundation while deepening LLM integration for what Cursor calls vibe coding, where AI assistance is more directly embedded in the development workflow. </li>
<li style="font-weight:400;">Previously, Cursor relied entirely on third-party models, making this its first attempt at vertical integration in the AI coding assistant space.</li>
</ul>
<p>33:03  Elise- “Cursor had an agent built, and I thought it was ok, but it was wrong a lot. The 2.0 agent seems fabulous, comparatively, and a lot faster.” </p>
<h2>AWS </h2>
<p>43:25 <a href="https://aws.amazon.com/about-aws/whats-new/2025/10/model-context-protocol-proxy-available/">The Model Context Protocol (MCP) Proxy for AWS is now generally </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/10/model-context-protocol-proxy-available/">available</a></p>
<ul>
<li style="font-weight:400;">AWS has released the Model Context Protocol (MCP) Proxy for AWS, a client-side proxy that enables MCP clients to connect to remote AWS-hosted MCP servers using AWS SigV4 authentication. </li>
<li style="font-weight:400;">The proxy works with popular AI development tools like <a href="https://aws.amazon.com/q/developer/">Amazon Q Developer</a> CLI, <a href="https://cursor.com/">Cursor</a>, and <a href="https://kiro.dev/">Kiro</a>, allowing developers to integrate AWS service interactions directly into their agentic AI workflows.</li>
<li style="font-weight:400;">The proxy enables developers to access AWS resources like S3 buckets and RDS tables through MCP servers while maintaining AWS security standards through SigV4 authentication. </li>
<li style="font-weight:400;">It includes built-in safety controls such as read-only mode to prevent accidental changes, configurable retry logic for reliability, and logging capabilities for troubleshooting issues.</li>
<li style="font-weight:400;">The MCP Proxy bridges the gap between local AI development tools and AWS-hosted MCP servers, particularly those built on Amazon Bedrock AgentCore Gateway or Runtime. </li>
<li style="font-weight:400;">This allows AI agents and developers to extend their workflows to include AWS service interactions without manually handling authentication and protocol communications.</li>
<li style="font-weight:400;">Installation options are flexible, supporting deployment from source, Python package managers, or containers, making it straightforward to integrate with existing MCP-supported development environments. </li>
<li style="font-weight:400;">The proxy is open-source and available now through the AWS GitHub repository at https://github.com/aws/mcp-proxy-for-aws with no additional cost beyond standard AWS service usage.</li>
</ul>
<p>44:10  Matt – “This is a nice little tool to help with production…and easier stepping stone than having to build all this stuff yourself.” </p>
<p>47:07 <a href="https://aws.amazon.com/about-aws/whats-new/2025/10/amazon-ecs-built-in-linear-canary-deployments">Amazon ECS now supports built-in Linear and Canary deployments</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/ecs/">Amazon ECS</a> now includes <a href="https://docs.aws.amazon.com/AmazonECS/latest/developerguide/deployment-type-linear.html">native linear</a> and <a href="https://docs.aws.amazon.com/AmazonECS/latest/developerguide/canary-deployment.html">canary deployment</a> strategies alongside existing blue/green deployments, eliminating the need for external tools like <a href="http://v">AWS CodeDeploy</a> for gradual traffic shifting. </li>
<li style="font-weight:400;">Linear deployments shift traffic in equal percentage increments with configurable step sizes and bake times, while canary deployments route a small percentage to the new version before completing the shift.</li>
<li style="font-weight:400;">The feature integrates with <a href="https://aws.amazon.com/cloudwatch/">CloudWatch</a> alarms for automatic rollback detection and supports deployment lifecycle hooks for custom validation steps. </li>
<li style="font-weight:400;">Both strategies include a post-deployment bake time that keeps the old revision running after full traffic shift, enabling quick rollback without downtime if issues emerge.</li>
<li style="font-weight:400;">Available now in all commercial AWS regions where ECS operates, the deployment strategies work with <a href="https://docs.aws.amazon.com/elasticloadbalancing/latest/application/introduction.html">Application Load Balancer</a> and ECS Service Connect configurations. </li>
<li style="font-weight:400;">Customers can implement these strategies through Console, SDK, CLI, CloudFormation, CDK, and Terraform for both new and existing ECS services without additional cost beyond standard ECS pricing.</li>
<li style="font-weight:400;">This brings ECS deployment capabilities closer to parity with Kubernetes native deployment options and reduces dependency on CodeDeploy for teams running containerized workloads. </li>
<li style="font-weight:400;">The built-in approach simplifies deployment pipelines for organizations that previously needed separate deployment orchestration tools.</li>
</ul>
<p>48:45  Jonathan – “I always wonder why they haven’t built these things previously, and I guess it was possible through CodeDeploy, but if it was possible through CodeDeploy, then why add it to ECS now? I feel like we kind of get this weird sprawl.” </p>
<p>50:35<strong> <a href="https://aws.amazon.com/about-aws/whats-new/2026/10/amazon-route53-resolver-supports-aws-privatelink">Amazon Route 53 Resolver now supports AWS PrivateLink</a></strong></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/Route53/latest/DeveloperGuide/resolver.html">Route 53 Resolver</a> now supports <a href="https://aws.amazon.com/privatelink/">AWS PrivateLink</a>, allowing customers to manage DNS resolution features entirely over Amazon’s private network without traversing the public internet. </li>
<li style="font-weight:400;">This includes all Resolver capabilities like endpoints, DNS Firewall, Query Logging, and Outposts integration.</li>
<li style="font-weight:400;">The integration addresses security and compliance requirements for organizations that need to keep all AWS API calls within private networks. Operations like creating, deleting, and editing Resolver configurations can now be performed through VPC endpoints instead of public endpoints.</li>
<li style="font-weight:400;">Available immediately in all regions where Route 53 Resolver operates, including <a href="https://aws.amazon.com/govcloud-us/">AWS GovCloud</a> (US) regions. </li>
<li style="font-weight:400;">No additional feature announcements for pricing were mentioned, so standard Route 53 Resolver pricing applies, plus <a href="http://v">PrivateLink</a> endpoint costs (typically $0.01 per hour per AZ plus data processing charges).</li>
<li style="font-weight:400;">Primary use case targets enterprises with strict network isolation policies, particularly in regulated industries like finance and healthcare, where DNS management traffic must remain on private networks. </li>
<li style="font-weight:400;">This complements existing hybrid DNS architectures using Resolver endpoints for on-premises connectivity.</li>
</ul>
<p>51:04  Jonathan – “Good for anyone who wanted this!” </p>
<p>54:05 <a href="https://aws.amazon.com/about-aws/whats-new/2025/11/mountpoint-amazon-s3-csi-driver-monitoring-capability">Mountpoint for Amazon S3 and Mountpoint for Amazon S3 CSI driver add </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/11/mountpoint-amazon-s3-csi-driver-monitoring-capability">monitoring capability</a></p>
<ul>
<li style="font-weight:400;"><a href="https://github.com/awslabs/mountpoint-s3">Mountpoint for Amazon S3</a> now emits near real-time metrics using the OpenTelemetry Protocol, allowing customers to monitor operations through <a href="https://aws.amazon.com/cloudwatch/">CloudWatch</a>, <a href="https://prometheus.io/">Prometheus</a>, and <a href="https://grafana.com/">Grafana</a> instead of parsing log files manually. </li>
<li style="font-weight:400;">This addresses a significant operational gap for teams running data-intensive workloads that mount S3 buckets as file systems on EC2 instances or Kubernetes clusters.</li>
<li style="font-weight:400;">The new monitoring capability provides granular metrics, including request counts, latency, and error types at the EC2 instance level, enabling proactive troubleshooting of issues like permission errors or performance bottlenecks. Customers can now set up alerts and dashboards using standard observability tools rather than building custom log parsing solutions.</li>
<li style="font-weight:400;">Integration works through CloudWatch agent or OpenTelemetry collector, making it compatible with existing monitoring infrastructure that many organizations already have deployed. The feature is available immediately for both the standalone <a href="https://aws.amazon.com/s3/features/mountpoint/">Mountpoint</a> client and the Mountpoint for Amazon S3 CSI driver used in Kubernetes environments.</li>
<li style="font-weight:400;">This update is particularly relevant for machine learning workloads, data analytics pipelines, and containerized applications that treat S3 as a file system and need visibility into storage layer performance. Setup instructions are available in the Mountpoint GitHub repository with configuration examples for common observability platforms.</li>
</ul>
<h2>GCP</h2>
<p>58:31 <a href="https://cloud.google.com/blog/products/management-tools/new-log-analytics-query-builder-simplifies-writing-sql-code/">New Log Analytics query builder simplifies writing SQL code | Google </a><a href="https://cloud.google.com/blog/products/management-tools/new-log-analytics-query-builder-simplifies-writing-sql-code/">Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/">Google Cloud</a> has released the <a href="https://cloud.google.com/logging/docs/log-analytics">Log Analytics</a> query builder to general availability, providing a UI-based interface that generates SQL queries automatically for users who need to analyze logs without deep SQL expertise. </li>
<li style="font-weight:400;">The tool addresses the common challenge of extracting insights from nested JSON payloads in log data, which typically requires complex SQL functions like JSON_VALUE and JSON_EXTRACT that many DevOps engineers and SREs find time-consuming to write.</li>
<li style="font-weight:400;">The query builder includes intelligent schema discovery that automatically detects and suggests JSON fields and values from your datasets, along with a real-time SQL preview so users can see the generated code and switch to manual editing when needed. 
<ul>
<li style="font-weight:400;">Key capabilities include search across all fields, automatic aggregations and grouping, and one-click visualization to dashboards, making it practical for incident troubleshooting and root cause analysis workflows.</li>
</ul>
</li>
<li style="font-weight:400;">Google plans to expand the feature with cross-project log scopes, trace data integration for joining logs and traces, query saving and history, and natural language to SQL conversion using Gemini AI. </li>
<li style="font-weight:400;">The query builder works with existing Log Analytics pricing, which is based on the amount of data scanned during queries, similar to BigQuery’s on-demand pricing model.</li>
<li style="font-weight:400;">The tool integrates directly with Google Cloud’s observability stack, allowing users to query logs alongside BigQuery datasets and other telemetry types in a single interface. </li>
<li style="font-weight:400;">This consolidation reduces context switching for teams managing complex distributed systems across multiple GCP services and projects.</li>
</ul>
<p>1:00:01 Jonathan- “I think this is where everything is going. Why spend half an hour crafting a perfect SQL query…when you can have it figure it all out for you.” </p>
<p>1:01:12 <a href="https://cloud.google.com/blog/products/containers-kubernetes/gke-and-gemini-cli-work-better-together/">GKE and Gemini CLI work better together | Google Cloud Blog</a>  </p>
<ul>
<li style="font-weight:400;">Google has open-sourced a GKE extension for <a href="https://github.com/google-gemini/gemini-cli">Gemini CLI</a> that integrates Kubernetes Engine operations directly into the command-line AI agent. The extension works as both a <a href="https://github.com/GoogleCloudPlatform/gke-mcp">Gemini CLI extension</a> and a Model Context Protocol server compatible with any MCP client, allowing developers to manage GKE clusters using natural language commands instead of verbose kubectl syntax.</li>
<li style="font-weight:400;">The integration provides three main capabilities: GKE-specific context resources for more natural prompting, pre-built slash command prompts for complex workflows, and direct access to GKE tools, including Cloud Observability integration. Installation requires a single command for Gemini CLI users, with separate instructions available for other MCP clients.</li>
<li style="font-weight:400;">The primary use case targets ML engineers deploying inference models on GKE who need help selecting appropriate models and accelerators based on business requirements like latency targets. </li>
<li style="font-weight:400;">Gemini CLI can automatically discover compatible models, recommend accelerators, and generate deployable Kubernetes manifests through conversational interaction rather than manual configuration.</li>
<li style="font-weight:400;">This builds on Gemini CLI’s extension architecture that bundles MCP servers, context files, and custom commands into packages that teach the AI agent how to use specific tools. </li>
<li style="font-weight:400;">The GKE extension represents Google’s effort to make Kubernetes operations more accessible through AI assistance, particularly for teams managing AI workload deployments.</li>
<li style="font-weight:400;">The announcement includes no pricing details as both Gemini CLI and the GKE extension are open source projects, though standard GKE cluster costs and any Gemini API usage charges would still apply during operation.</li>
</ul>
<p>1:02:10 Matt – “Anything to make Kubernetes easier to manage, I’m on board for it.” </p>
<p>1:05:06 <a href="https://cloud.google.com/blog/topics/developers-practitioners/master-multi-tasking-with-the-jules-extension-for-gemini-cli/">Master multi-tasking with the Jules extension for Gemini CLI | Google </a><a href="https://cloud.google.com/blog/topics/developers-practitioners/master-multi-tasking-with-the-jules-extension-for-gemini-cli/">Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google has launched the <a href="https://github.com/gemini-cli-extensions/jules">Jules extension for Gemini CLI</a>, which acts as an autonomous coding assistant that handles background tasks like bug fixes, security patches, and dependency updates while developers focus on primary work. </li>
<li style="font-weight:400;"><a href="https://jules.google/">Jules</a> operates asynchronously using the /jules command, working in isolated environments to address multiple issues in parallel and creating branches for review.</li>
<li style="font-weight:400;">The extension integrates with other Gemini CLI extensions to create automated workflows, including the <a href="https://github.com/gemini-cli-extensions/security">Security extension</a> for vulnerability analysis and remediation, and the <a href="https://github.com/gemini-cli-extensions/observability">Observability extension</a> for crash investigation and automated unit test generation. 
<ul>
<li style="font-weight:400;">This modular approach allows developers to chain together different capabilities for comprehensive task automation.</li>
</ul>
</li>
<li style="font-weight:400;">Jules addresses common developer productivity drains by handling routine maintenance tasks that typically interrupt deep work sessions. The tool can process multiple GitHub issues simultaneously, each in its own environment, and prepares fixes for human review rather than automatically committing changes.</li>
<li style="font-weight:400;">The extension is available now as an open source project on GitHub at github.com/gemini-cli-extensions/jules, with no pricing information provided, as it appears to be a free developer tool. </li>
<li style="font-weight:400;">Google is building an ecosystem<a href="https://github.com/gemini-cli-extensions/jules"> of Gemini CLI extensions</a> that can be combined with Jules for various development workflows.</li>
</ul>
<p>1:06:16 Jonathan – “Google obviously listens to their customers because it was only half an hour ago when I said something like this would be pretty useful.”</p>
<p>1:11:36 <a href="https://cloud.google.com/blog/topics/cost-management/announcing-ga-of-cost-anomaly-detection/">Announcing GA of Cost Anomaly Detection | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google’s <a href="https://cloud.google.com/blog/topics/cost-management/introducing-cost-anomaly-detection?e=48754805">Cost Anomaly Detection</a> has reached general availability with AI-powered alerts now enabled by default for all GCP customers across all projects, including new ones. </li>
<li style="font-weight:400;">The service automatically monitors spending patterns and sends alerts to Billing Administrators when unusual cost spikes are detected, with no configuration required.</li>
<li style="font-weight:400;">The GA release introduces AI-generated anomaly thresholds that adapt to each customer’s historical spending patterns, reducing alert noise by flagging only significant, unexpected deviations. </li>
<li style="font-weight:400;">Customers can override these intelligent baselines with custom values if needed, and the system now supports both absolute-dollar thresholds and percentage-based deviation filters to accommodate projects of different sizes and sensitivities.</li>
<li style="font-weight:400;">The improved algorithm solves the cold start problem that previously required six months of spending history, now providing immediate anomaly protection for brand new accounts and projects from day one. 
<ul>
<li style="font-weight:400;">This addresses a key limitation from the public preview phase and ensures comprehensive cost monitoring regardless of account age.</li>
</ul>
</li>
<li style="font-weight:400;">Cost Anomaly Detection remains free as part of GCP’s cost management toolkit and integrates with Cloud Budgets to create a layered approach for preventing, detecting, and containing runaway cloud spending. </li>
<li style="font-weight:400;">The anomaly dashboard provides root cause analysis to help teams quickly understand and address cost spikes when they occur.</li>
<li style="font-weight:400;">Interested in pricing details? Check out the billing console <a href="https://pantheon.corp.google.com/billing">here</a>. </li>
</ul>
<p>1:14:01 Elise – “I just wonder, there’s so many third-party companies that specialize in this kind of thing. So I wonder if they realized that they could just do a little bit better.”</p>
<h2>Azure</h2>
<p>1:16:37 <a href="https://azure.microsoft.com/en-us/blog/building-the-future-together-microsoft-and-nvidia-announce-ai-advancements-at-gtc-dc/">Building the future together: Microsoft and NVIDIA announce AI </a><a href="https://azure.microsoft.com/en-us/blog/building-the-future-together-microsoft-and-nvidia-announce-ai-advancements-at-gtc-dc/">advancements at GTC DC | Microsoft Azure Blog</a></p>
<ul>
<li style="font-weight:400;">Microsoft and NVIDIA are expanding their AI partnership with several infrastructure and model updates. </li>
<li style="font-weight:400;">Azure Local now supports <a href="https://www.nvidia.com/en-us/data-center/rtx-pro-6000-blackwell-server-edition/">NVIDIA RTX PRO 6000 Blackwell Server Edition GPU</a>s, enabling organizations to run AI workloads at the edge with cloud-like management through Azure Arc, targeting healthcare, retail, manufacturing, and government sectors requiring data residency and low-latency processing.</li>
<li style="font-weight:400;"><a href="https://ai.azure.com/">Azure AI Foundry</a> adds <a href="https://www.nvidia.com/en-us/ai-data-science/foundation-models/nemotron/">NVIDIA Nemotron</a> models for agentic AI and enterprise reasoning, plus <a href="http://v">NVIDIA Cosmos models</a> for physical AI applications like robotics and autonomous vehicles. </li>
<li style="font-weight:400;">Microsoft also introduced <a href="https://trellis3d.net/">TRELLIS for 3D asset generation</a>, all deployable as NVIDIA NIM microservices with enterprise-grade security and scalability.</li>
<li style="font-weight:400;">Microsoft deployed the first production-scale cluster of<a href="https://www.google.com/aclk?sa=L&amp;ai=DChsSEwjk6cKX7-uQAxXsJkQIHWQ_FbMYACICCAEQABoCZHo&amp;co=1&amp;ase=2&amp;gclid=CjwKCAiA2svIBhB-EiwARWDPjuVNIV09qJoz7W15_4A2pJNPq5hRnWWfyHDb-5v3MBJ8s8SO5kVleBoCj34QAvD_BwE&amp;cid=CAASWeRovAmO3AW68YQjmrGeAxo235DdWPAPNR5GldM3jdLQOmnVYxxfQWaO4qtOLoV4FYi-VKch039Xv8iK9A29S4uwUQ8HW5w7x5mvJi-K1dvYZ-UNgaHZrrV6&amp;cce=2&amp;category=acrcp_v1_32&amp;sig=AOD64_27AlXMxzv8665XQBoHRyxOB1gstw&amp;q&amp;nis=4&amp;adurl&amp;ved=2ahUKEwj477iX7-uQAxVQIEQIHTrcAscQ0Qx6BAgNEAQ"> NVIDIA GB300 NVL72</a> systems with over 4,600 Blackwell Ultra GPUs in the new NDv6 GB300 VM series. </li>
<li style="font-weight:400;">Each rack delivers 130 TB/s of NVLink bandwidth and up to 136 kW of compute power, designed for training and deploying frontier models with integrated liquid cooling and Azure Boost for accelerated I/O.</li>
<li style="font-weight:400;">Also, NVIDIA Run:ai is now available on Azure Marketplace, providing GPU orchestration and workload management across Azure NC and ND series instances. The platform integrates with AKS, Azure Machine Learning, and Azure AI Foundry to help enterprises dynamically allocate GPU resources, reduce costs, and improve utilization across teams.</li>
<li style="font-weight:400;"><a href="https://azure.microsoft.com/en-us/products/kubernetes-service">Azure Kubernetes Service</a> now supports <a href="https://developer.nvidia.com/dynamo">NVIDIA Dynamo framework</a> on ND GB200-v6 VMs, demonstrating 1.2 million tokens per second with the gpt-oss 120b model. </li>
<li style="font-weight:400;">Microsoft reports up to 15x throughput improvement over Hopper generation for reasoning models, with deployment guides available for production implementations.</li>
</ul>
<p>1:21:53 Jonathan – “That’s a really good salesy number to quote, though, 1.2 million tokens a second – that’s great, but that’s not an individual user. One individual user will not get 1.2 million tokens a second out of any model. That is, at full capacity with as many users running inference as possible on that cluster. The total generation output might be 1.2 million tokens a second, which is still phenomenal, but as far as the actual user experience, you know, if you were a business that wanted really fast inference, you’re not going to get 1.2 million tokens a second.”</p>
<p>1:23:26 <a href="https://azure.microsoft.com/en-us/updates?id=520822">Public Preview: Azure Functions zero-downtime deployments with rolling </a>Updates<a href="https://azure.microsoft.com/en-us/updates?id=520822"> in Flex Consumption</a> </p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/azure-functions/flex-consumption-plan">Azure Functions in the Flex Consumption</a> plan now supports rolling updates for zero-downtime deployments through a simple configuration change. </li>
<li style="font-weight:400;">This eliminates the need for forceful instance restarts during code or configuration updates, allowing the platform to gracefully transition workloads across instances.</li>
<li style="font-weight:400;">Rolling updates work by gradually replacing old instances with new ones while maintaining active request handling, similar to deployment strategies used in container orchestration platforms. </li>
<li style="font-weight:400;">This brings enterprise-grade deployment capabilities to serverless functions without requiring additional infrastructure management.</li>
<li style="font-weight:400;">The capability is currently in public preview for the <a href="https://www.reddit.com/r/AZURE/comments/1ggmp5f/azure_functions_flexconsumption_plan/">Flex Consumption plan</a> specifically, which is Azure’s newer consumption-based pricing model that offers more flexibility than the traditional Consumption plan. </li>
<li style="font-weight:400;"><a href="https://www.google.com/aclk?sa=L&amp;ai=DChsSEwjq7Je78OuQAxXuJUQIHSV2ECsYACICCAEQABoCZHo&amp;ae=2&amp;aspm=1&amp;co=1&amp;ase=2&amp;gclid=CjwKCAiA2svIBhB-EiwARWDPjrer1CmiXO1RNt6xgiv0JQyZdtjs9qCb9VTPT0nNI3MxQhrWaMEhmBoC1MoQAvD_BwE&amp;cid=CAASWeRo_b52yyOEqkYrK8Le97h0U3i-Yn5XDAuz55X8bxKFKDf82cn71Ok55PdlR8a0aeN32K1lRmKeeh1CXEO0e9zbRHGSQ2RsmpCHAsa6Fo3rwq3r8VmurwHz&amp;cce=2&amp;category=acrcp_v1_35&amp;sig=AOD64_0RVTN7fyV14OU9e0TM7xRs4kW82A&amp;q&amp;nis=4&amp;adurl&amp;ved=2ahUKEwiSh5G78OuQAxVjKkQIHUXwOlcQ0Qx6BAgMEAE">Pricing</a> follows the standard Flex Consumption model based on execution time and memory usage, with no additional cost for the rolling update feature itself.</li>
</ul>
<p>1:24:42 Matt – “It’s a nice quality of life feature that they’re adding to everything. It’s in preview, though, so don’t deploy production workloads leveraging this.” </p>
<p>1:25:06 <a href="https://www.shankuehn.io/post/the-azure-payg-api-shift-what-s-actually-changing-and-why-it-matters">The Azure PAYG API Shift: What’s Actually Changing (and Why It Matters)</a> </p>
<ul>
<li style="font-weight:400;">Microsoft is <a href="https://learn.microsoft.com/en-us/azure/cost-management-billing/automate/get-usage-details-legacy-customer">deprecating the legacy Consumption API for Azure Pay-As-You-Go</a> cost data retrieval and replacing it with two modern approaches: the <a href="https://learn.microsoft.com/en-us/rest/api/cost-management/exports">Cost Details API</a> for Enterprise and Microsoft Customer Agreement subscriptions, and the <a href="https://learn.microsoft.com/en-us/rest/api/cost-management/exports">Exports API for PAYG</a> and Visual Studio subscriptions. </li>
<li style="font-weight:400;">This shifts from a pull model, where teams constantly query APIs, to a subscribe model where Azure delivers cost data directly to Azure Storage Accounts as CSV files.</li>
<li style="font-weight:400;">The change addresses significant scalability and consistency issues with the old API that struggled with throttling, inconsistent schemas across different subscription types, and handling large enterprise-scale datasets. </li>
<li style="font-weight:400;">The new APIs support FOCUS-compliant schemas, include reservations and savings plans data in single exports, and integrate better with Power BI and <a href="https://azure.microsoft.com/en-us/products/data-factory">Azure Data Factory</a> for <a href="https://www.finops.org/">FinOps</a> automation.</li>
<li style="font-weight:400;">FinOps teams need to audit existing scripts that call the Microsoft.Commerce/UsageAggregates endpoint and migrate to storage-based data ingestion instead of direct API calls. </li>
<li style="font-weight:400;">While the legacy endpoint remains live but unsupported, Microsoft strongly recommends immediate migration, though the deprecation timeline may extend based on customer adoption rates.</li>
<li style="font-weight:400;">The practical impact for cloud teams is more reliable cost data pipelines with fewer failed jobs, predictable scheduled exports eliminating API throttling issues, and consistent field mappings across all subscription types. </li>
<li style="font-weight:400;">Teams should review Microsoft’s field mapping reference documentation, as column names have changed between the old and new APIs.</li>
<li style="font-weight:400;">PAYG customers currently must use the Exports API with storage-based retrieval, though Microsoft plans to eventually extend Cost Details API support to PAYG subscriptions. </li>
<li style="font-weight:400;">The transition requires updating data flow architecture but provides an opportunity to standardize FinOps processes across different Azure billing models.</li>
</ul>
<p>1:27:12 Matt – “A year or two ago, we did an analysis at my day job, and we were trying to figure out the savings plan’s amount if we buy X amount, how much do we need to buy everything along those lines. And we definitely ran into like throttling issues, and it was just bombing out on us at a few points, and a lot of weird loops we had to do because the format just didn’t make sense with moderate stuff. It’s a great way. I would suggest you move not because they’re trying to get rid of it, but because it will make your life better.”</p>
<p>1:28:05 <a href="https://azure.microsoft.com/en-us/updates?id=512751">Generally Available: Azure WAF CAPTCHA Challenge for Azure Front </a><a href="https://azure.microsoft.com/en-us/updates?id=512751">Door</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.google.com/aclk?sa=L&amp;ai=DChsSEwir6P3o8euQAxVoIUQIHTJQEK8YACICCAEQAxoCZHo&amp;ae=2&amp;aspm=1&amp;co=1&amp;ase=2&amp;gclid=CjwKCAiA2svIBhB-EiwARWDPjkQF1dTjZ4wtxZjIksL0OQwn3pbuzqnIZizuZVPcbR6gSp-Smt-JuxoCkRcQAvD_BwE&amp;cid=CAASWeRoPhcueGzwONHck8B0aZBw9Ecs0CC0bpSPCzcaqX4wUXEpElrwLU0cYaCVxvz1YkHU5tUjaT7miZ9TniwBev3IvyCDSWSLN21YXstRXibbZQXCEyWL2KCb&amp;cce=2&amp;category=acrcp_v1_35&amp;sig=AOD64_38lKCNCEc-M_RonU47OA2_QQeX5g&amp;q&amp;nis=4&amp;adurl&amp;ved=2ahUKEwiN5_bo8euQAxXgN0QIHWTuFy4Q0Qx6BAgMEAE">Azure WAF</a> now includes CAPTCHA challenge capabilities for Front Door deployments, allowing organizations to distinguish between legitimate users and automated bot traffic. </li>
<li style="font-weight:400;">This addresses common threats like credential stuffing, web scraping, and DDoS attacks that traditional WAF rules may miss.</li>
<li style="font-weight:400;">The CAPTCHA feature integrates directly into <a href="https://learn.microsoft.com/en-us/azure/frontdoor/front-door-overview">Azure Front Door</a>‘s WAF policy engine, enabling administrators to trigger challenges based on custom rules, rate limits, or anomaly detection patterns. </li>
<li style="font-weight:400;">Organizations can configure CAPTCHA thresholds and exemptions without requiring changes to backend application code.</li>
<li style="font-weight:400;">This capability targets e-commerce sites, financial services, and any web application experiencing bot-driven abuse or account takeover attempts. </li>
<li style="font-weight:400;">The CAPTCHA challenge adds a human verification layer that complements existing WAF protections like <a href="https://owasp.org/www-project-modsecurity-core-rule-set/">OWASP rule sets</a> and custom security policies.</li>
<li style="font-weight:400;">Pricing follows the standard Azure Front Door WAF model with per-policy charges plus request-based fees, though specific CAPTCHA-related costs were not detailed in the announcement. </li>
<li style="font-weight:400;">Organizations already using Front Door Premium can enable this feature through policy configuration updates.</li>
<li style="font-weight:400;">The general availability means this protection is now production-ready across all Azure regions where Front Door operates, removing the need for third-party CAPTCHA services or custom bot mitigation solutions for many Azure customers.</li>
<li style="font-weight:400;">We just wonder what we’re going to replace re: Captcha with when AI can click the button like a human can. </li>
</ul>
<p>1:31:04 <a href="https://azure.microsoft.com/en-us/updates?id=520805">Public Preview: Instant Access Snapshots for Azure Premium SSD v2 and </a> <a href="https://azure.microsoft.com/en-us/updates?id=520805">Ultra Disk Storage</a></p>
<ul>
<li style="font-weight:400;">Azure now offers <a href="https://learn.microsoft.com/en-us/azure/virtual-machines/disks-instant-access-snapshots">Instant Access Snapshots</a> in public preview for <a href="https://learn.microsoft.com/en-us/azure/virtual-machines/disks-deploy-premium-v2">Premium SSD v2</a> and <a href="https://learn.microsoft.com/en-us/azure/virtual-machines/disks-enable-ultra-ssd">Ultra Disks</a>, eliminating the traditional wait time for snapshot restoration. Previously, customers had to wait for snapshots to fully hydrate before using restored disks, but this feature allows immediate disk restoration with high performance right after snapshot creation.</li>
<li style="font-weight:400;">This capability addresses a critical operational need for enterprises running high-performance workloads on Azure’s fastest storage tiers. </li>
<li style="font-weight:400;">Premium SSD v2 and Ultra Disks are typically used for mission-critical databases, <a href="https://www.google.com/aclk?sa=L&amp;ai=DChsSEwjV5Iu18uuQAxUAIEQIHZ4uOKUYACICCAEQAhoCZHo&amp;ae=2&amp;co=1&amp;ase=2&amp;gclid=CjwKCAiA2svIBhB-EiwARWDPjtuJqQU6zM5tWOFheuugNuUkTNkkyaiQa7Lz-gnlRWfkad1lODOC4xoCqzIQAvD_BwE&amp;cid=CAASWeRocEYcuHVEVvrhJjNQxxPXrCZSOrunEDYYFiF04seZVbWzCSqjR_LG2OagcsK1RY9op6CzeZeHN4u2GZLNkHGtgS95XdWZWpLAP1twfviBElne2cSmEJ0H&amp;cce=2&amp;category=acrcp_v1_71&amp;sig=AOD64_08_dRbCx0geFrBelK364O9UVRV-A&amp;q&amp;nis=4&amp;adurl&amp;ved=2ahUKEwjbh4a18uuQAxXcEUQIHVAHCh0Q0Qx6BAgNEAE">SAP HANA</a>, and other latency-sensitive applications where downtime during recovery operations directly impacts business operations.</li>
<li style="font-weight:400;">The feature reduces recovery time objectives for disaster recovery and backup scenarios, particularly valuable for customers who need rapid failover capabilities. Organizations can now create point-in-time copies and immediately spin up test environments or recover from failures without the performance penalty of background hydration processes.</li>
<li style="font-weight:400;">This positions Azure’s premium storage offerings more competitively against <a href="https://docs.aws.amazon.com/ebs/latest/userguide/ebs-snapshots.html">AWS’s EBS snapshots</a> with fast snapshot restore and Google Cloud’s instant snapshots. </li>
<li style="font-weight:400;">The preview status means customers should test thoroughly before production use, and Microsoft has not yet announced general availability timing or any pricing changes specific to this snapshot capability.</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2203823/c1e-2okobm679os8op5j-6zqzxkwpf9xg-ucqxd6.mp3" length="171079338"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 329 of The Cloud Pod, where the forecast is always cloudy! Justin, Jonathan, and special guest Elise are in the studio to bring you all the latest in AI and cloud news, including – you guessed it – more outages, and more OpenAI team-ups. We’ve also got GPUs, K8 news, and Cursor updates. Let’s get started! 
Titles we almost went with this week

Azure Front Door: Please Use the Side Entrance – el -jb
Azure and NVIDIA: A Match Made in GPU Heaven – mk
Azure Goes Down Under the Weight of Its Own Configuration – el
GitHub Turns Your Copilot Subscription Into an All-You-Can-Eat Agent Buffet – mk, el
Microsoft Goes Full Blackwell: No Regrets, Just GPUs
Jules Verne Would Be Proud: Google’s CLI Goes 20,000 Bugs Under the Codebase
RAG to Riches: AWS Makes Retrieval Augmented Generation Turnkey
Kubectl Gets a Gemini Twin: Google Teaches AI to Speak Kubernetes
I’m Not a Robot: Azure WAF Finally Learns to Ask the Important Questions
OpenAI Puts 38 Billion Eggs in Amazon’s Basket: Multi-Cloud Gets Complicated
The Root Cause They’ll Never Root Out: Why Attrition Stays Off the RCA
Google’s New Extension Lets You Deploy Kubernetes by Just Asking Nicely
Cursor 2.0: Now With More Agents Than a Hollywood Talent Agency

Follow Up 
04:46 Massive Azure outage is over, but problems linger – here’s what happened | ZDNET 

Azure experienced a global outage on October 29, affecting all regions simultaneously, unlike the recent AWS outage that was limited to a single region. 
The incident lasted approximately eight hours from noon to 8 PM ET, impacting major services including Microsoft 365, Teams, Xbox Live, and critical infrastructure for Alaska Airlines, Vodafone UK, and Heathrow Airport, among others.
The root cause was an inadvertent tenant configuration change in Azure Front Door that bypassed safety validations due to a software defect. Microsoft’s protection mechanisms failed to catch the erroneous deployment, allowing invalid configurations to propagate across the global fleet and cause HTTP timeouts, server errors, and elevated packet loss at network edges.
Recovery required rolling back to the last known good configuration and gradually rebalancing traffic across nodes to prevent overload conditions. 
Some customers experienced lingering issues even after the official recovery time, with Microsoft temporarily blocking configuration changes to Azure Front Door while completing the restoration process.
The incident highlights concentration risk in cloud infrastructure, as this marks the second major cloud provider outage in October 2025. 
Despite Azure revenue growing 40 percent in the latest quarterly report, Microsoft’s stock declined in after-hours trading as the company acknowledged capaci...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2203823/c1a-k5d5-6zqzxk7xt2d2-dpvsaw.jpg"></itunes:image>
                                                                            <itunes:duration>01:28:56</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2203823/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[328: Shhh… It’s a Secret Region!]]>
                </title>
                <pubDate>Wed, 05 Nov 2025 23:48:49 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2193961</guid>
                                    <link>https://tcpfm.castos.com/episodes/328-shhh-its-a-secret-region</link>
                                <description>
                                            <![CDATA[<h3>Welcome to episode 328 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Matt are on board today to bring you all the latest news in cloud and AI, including secret regions (this one has the aliens), ongoing discussions between Microsoft and OpenAI, and updates to Nova, SQL, and OneLake -and even the latest installment of Cloud Journeys.  Let’s get started! </h3>
<h3>Titles we almost went with this week</h3>
<ul>
<li> CloudWatch’s New Feature: Because Nobody Likes Writing Incident Reports at 3 AM</li>
<li> DNS: Did Not Survive – The Great US-EAST-1 Outage of 2025</li>
<li> 404 DevOps Not Found: The AWS Automation Adventure mk</li>
<li> When Your DevOps Team Gets Replaced by AI and Then Everything Crashes</li>
<li> Database Migrations Get the ChatGPT Treatment: Just Vibe Your Schema Changes</li>
<li> AWS DevOps Team Gets the AI Treatment: 40% Fewer Humans, 100% More Questions</li>
<li> Breaking Up is Hard to Compute: Microsoft and OpenAI Redefine Their Relationship</li>
<li> AWS Goes Full Scope: Now Tracking Your Cloud’s Carbon from Cradle to Gate</li>
<li> Platform Engineering: When Your Golden Path Leads to a Dead End</li>
<li> DynamoDB’s DNS Disaster: How a Race Condition Raced Through AWS</li>
<li> AI Takes Over AWS DevOps Jobs, Servers Take Unscheduled Vacation</li>
<li> PostgreSQL Scaling Gets a 30-Second Makeover While AWS Takes a Coffee Break</li>
<li> The Domino Effect: When DynamoDB Drops, Everything Drops</li>
<li> RAG to Riches: Amazon Nova Learns to Cite Its Sources</li>
<li> AWS Finally Tells You When Your EC2 Instance Can’t Keep Up With Your Storage Ambitions</li>
<li> AWS Nova Gets Grounded: No More Hallucinating About Reality</li>
<li> One API to Rule Them All: OneLake’s Storage Compatibility Play</li>
<li> OpenAI gets to pay Alimony</li>
<li> Database schema deployments are totally a vibe</li>
<li> AWS will tell you how not green you are today, now in 3 scopes</li>
</ul>
<h2>General News </h2>
<p>02:00 <a href="https://www.fastly.com/blog/ddos-in-september">DDoS in September | Fastly</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.fastly.com/">Fastly</a>‘s September DDoS report reveals a notable 15.5 million requests per second attack that lasted over an hour, demonstrating how modern application-layer attacks can sustain extreme throughput with real HTTP requests rather than simple pings or amplification techniques.</li>
<li style="font-weight:400;">Attack volume in September dropped to 61% of August levels, with data suggesting a correlation between school schedules and attack frequency: lower volumes coincide with school breaks, while higher volumes occur when schools are in session.</li>
<li style="font-weight:400;">Media &amp; Entertainment companies faced the highest median attack sizes, followed by Education and High Technology sectors, with 71% of September’s peak attack day attributed to a single enterprise media company.</li>
<li style="font-weight:400;">The sustained 15 million RPS attack originated from a single cloud-provider ASN, using sophisticated daemons that mimicked browser behavior, making detection more challenging than typical DDoS patterns.</li>
<li style="font-weight:400;">Organizations should evaluate whether their incident response runbooks can handle hour-long attacks at 15+ million RPS, as these sustained high-throughput attacks require automated mitigation rather than manual intervention.</li>
<li style="font-weight:400;">Listen, we’re not inviting a DDoS attack, but also…we’ll just turn off the website, so there’s that. </li>
</ul>
<h2>AI Is Going Great – Or How ML Makes Money</h2>
<p>04:41 <a href="https://blog.google/technology/developers/introducing-vibe-coding-in-google-ai-studio/">Google AI Studio updates: More control, less friction</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aistudio.google.com/">Google AI Studio</a> introduces “<a href="https://aistudio.google.com/vibe-code">vibe coding</a>” – a new AI-powered develo...</li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:00) - AWS vs. Azure: When Will Both Companies Have Outages</li><li>(00:02:07) - DDoS Attacks Rise in September</li><li>(00:04:43) - Google AI Studio Introduces Vibe Coding</li><li>(00:09:20) - OpenAI's Company Knowledge for Chat GPT</li><li>(00:13:59) - Microsoft and OpenAI Strike a New Deal</li><li>(00:17:19) - Amazon Nova: General Availability of WebGrounding</li><li>(00:18:58) - Athena Health Reporting's AI-Powered Database Migration Author</li><li>(00:20:56) - Amazon Reportedly Replaces 40% of DevOps Staff With AI</li><li>(00:23:58) - Amazon's DynamoDB Outage</li><li>(00:28:11) - CloudWatch: Automated Incident Reporting with Scope 3</li><li>(00:33:24) - Amazon's Secret West Region</li><li>(00:39:31) -  EC2: EBS IOPS exceeded and Volume level</li><li>(00:42:52) - Google Cloud Parameter Manager</li><li>(00:46:37) - Azure Key Vault vs AWS SSM: Feature Flag Management</li><li>(00:48:32) - Citadel Cross-Site Interconnect with Google Cloud Platform</li><li>(00:51:52) - BigTable Storage: Limited-Access Storage in Preview</li><li>(00:54:38) - Google Cloud: 4x Max Nvidia NVL70 Instance</li><li>(00:56:58) - Nvidia GB300 Envel 72 Instances</li><li>(00:58:35) - Azure databases for PostgreSQL now with High Availability ( HA)</li><li>(01:00:11) - OneLake + Fabric: What Could Go Wrong?</li><li>(01:01:40) - 8 Platform Engineering Anti-Patterns</li><li>(01:05:01) - The Second Anti-Pattern: Lack of Product Mindset</li><li>(01:08:02) - 2. Give the team some ownership of the platform</li><li>(01:11:56) - Building a Successful Platform: Tracking the Wrong Metrics</li><li>(01:13:34) - Don't Copy the Kubernetes Platform</li><li>(01:16:08) - 7 Pitfalls of Over Engineering on Day 1</li><li>(01:19:14) - Platform Engineering: The Product Management Process</li><li>(01:20:59) - This Week in the Cloud: Platform Engineering</li><li>(01:21:41) - Next Week In The Cloud: Trip to the Bay</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 328 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Matt are on board today to bring you all the latest news in cloud and AI, including secret regions (this one has the aliens), ongoing discussions between Microsoft and OpenAI, and updates to Nova, SQL, and OneLake -and even the latest installment of Cloud Journeys.  Let’s get started! 
Titles we almost went with this week

 CloudWatch’s New Feature: Because Nobody Likes Writing Incident Reports at 3 AM
 DNS: Did Not Survive – The Great US-EAST-1 Outage of 2025
 404 DevOps Not Found: The AWS Automation Adventure mk
 When Your DevOps Team Gets Replaced by AI and Then Everything Crashes
 Database Migrations Get the ChatGPT Treatment: Just Vibe Your Schema Changes
 AWS DevOps Team Gets the AI Treatment: 40% Fewer Humans, 100% More Questions
 Breaking Up is Hard to Compute: Microsoft and OpenAI Redefine Their Relationship
 AWS Goes Full Scope: Now Tracking Your Cloud’s Carbon from Cradle to Gate
 Platform Engineering: When Your Golden Path Leads to a Dead End
 DynamoDB’s DNS Disaster: How a Race Condition Raced Through AWS
 AI Takes Over AWS DevOps Jobs, Servers Take Unscheduled Vacation
 PostgreSQL Scaling Gets a 30-Second Makeover While AWS Takes a Coffee Break
 The Domino Effect: When DynamoDB Drops, Everything Drops
 RAG to Riches: Amazon Nova Learns to Cite Its Sources
 AWS Finally Tells You When Your EC2 Instance Can’t Keep Up With Your Storage Ambitions
 AWS Nova Gets Grounded: No More Hallucinating About Reality
 One API to Rule Them All: OneLake’s Storage Compatibility Play
 OpenAI gets to pay Alimony
 Database schema deployments are totally a vibe
 AWS will tell you how not green you are today, now in 3 scopes

General News 
02:00 DDoS in September | Fastly

Fastly‘s September DDoS report reveals a notable 15.5 million requests per second attack that lasted over an hour, demonstrating how modern application-layer attacks can sustain extreme throughput with real HTTP requests rather than simple pings or amplification techniques.
Attack volume in September dropped to 61% of August levels, with data suggesting a correlation between school schedules and attack frequency: lower volumes coincide with school breaks, while higher volumes occur when schools are in session.
Media & Entertainment companies faced the highest median attack sizes, followed by Education and High Technology sectors, with 71% of September’s peak attack day attributed to a single enterprise media company.
The sustained 15 million RPS attack originated from a single cloud-provider ASN, using sophisticated daemons that mimicked browser behavior, making detection more challenging than typical DDoS patterns.
Organizations should evaluate whether their incident response runbooks can handle hour-long attacks at 15+ million RPS, as these sustained high-throughput attacks require automated mitigation rather than manual intervention.
Listen, we’re not inviting a DDoS attack, but also…we’ll just turn off the website, so there’s that. 

AI Is Going Great – Or How ML Makes Money
04:41 Google AI Studio updates: More control, less friction

Google AI Studio introduces “vibe coding” – a new AI-powered develo...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[328: Shhh… It’s a Secret Region!]]>
                </itunes:title>
                                    <itunes:episode>328</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<h3>Welcome to episode 328 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Matt are on board today to bring you all the latest news in cloud and AI, including secret regions (this one has the aliens), ongoing discussions between Microsoft and OpenAI, and updates to Nova, SQL, and OneLake -and even the latest installment of Cloud Journeys.  Let’s get started! </h3>
<h3>Titles we almost went with this week</h3>
<ul>
<li> CloudWatch’s New Feature: Because Nobody Likes Writing Incident Reports at 3 AM</li>
<li> DNS: Did Not Survive – The Great US-EAST-1 Outage of 2025</li>
<li> 404 DevOps Not Found: The AWS Automation Adventure mk</li>
<li> When Your DevOps Team Gets Replaced by AI and Then Everything Crashes</li>
<li> Database Migrations Get the ChatGPT Treatment: Just Vibe Your Schema Changes</li>
<li> AWS DevOps Team Gets the AI Treatment: 40% Fewer Humans, 100% More Questions</li>
<li> Breaking Up is Hard to Compute: Microsoft and OpenAI Redefine Their Relationship</li>
<li> AWS Goes Full Scope: Now Tracking Your Cloud’s Carbon from Cradle to Gate</li>
<li> Platform Engineering: When Your Golden Path Leads to a Dead End</li>
<li> DynamoDB’s DNS Disaster: How a Race Condition Raced Through AWS</li>
<li> AI Takes Over AWS DevOps Jobs, Servers Take Unscheduled Vacation</li>
<li> PostgreSQL Scaling Gets a 30-Second Makeover While AWS Takes a Coffee Break</li>
<li> The Domino Effect: When DynamoDB Drops, Everything Drops</li>
<li> RAG to Riches: Amazon Nova Learns to Cite Its Sources</li>
<li> AWS Finally Tells You When Your EC2 Instance Can’t Keep Up With Your Storage Ambitions</li>
<li> AWS Nova Gets Grounded: No More Hallucinating About Reality</li>
<li> One API to Rule Them All: OneLake’s Storage Compatibility Play</li>
<li> OpenAI gets to pay Alimony</li>
<li> Database schema deployments are totally a vibe</li>
<li> AWS will tell you how not green you are today, now in 3 scopes</li>
</ul>
<h2>General News </h2>
<p>02:00 <a href="https://www.fastly.com/blog/ddos-in-september">DDoS in September | Fastly</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.fastly.com/">Fastly</a>‘s September DDoS report reveals a notable 15.5 million requests per second attack that lasted over an hour, demonstrating how modern application-layer attacks can sustain extreme throughput with real HTTP requests rather than simple pings or amplification techniques.</li>
<li style="font-weight:400;">Attack volume in September dropped to 61% of August levels, with data suggesting a correlation between school schedules and attack frequency: lower volumes coincide with school breaks, while higher volumes occur when schools are in session.</li>
<li style="font-weight:400;">Media &amp; Entertainment companies faced the highest median attack sizes, followed by Education and High Technology sectors, with 71% of September’s peak attack day attributed to a single enterprise media company.</li>
<li style="font-weight:400;">The sustained 15 million RPS attack originated from a single cloud-provider ASN, using sophisticated daemons that mimicked browser behavior, making detection more challenging than typical DDoS patterns.</li>
<li style="font-weight:400;">Organizations should evaluate whether their incident response runbooks can handle hour-long attacks at 15+ million RPS, as these sustained high-throughput attacks require automated mitigation rather than manual intervention.</li>
<li style="font-weight:400;">Listen, we’re not inviting a DDoS attack, but also…we’ll just turn off the website, so there’s that. </li>
</ul>
<h2>AI Is Going Great – Or How ML Makes Money</h2>
<p>04:41 <a href="https://blog.google/technology/developers/introducing-vibe-coding-in-google-ai-studio/">Google AI Studio updates: More control, less friction</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aistudio.google.com/">Google AI Studio</a> introduces “<a href="https://aistudio.google.com/vibe-code">vibe coding</a>” – a new AI-powered development experience that generates working multi-modal apps from natural language prompts without requiring API key management or manual service integration.</li>
<li style="font-weight:400;">The platform now automatically connects appropriate models and APIs based on app descriptions, supporting capabilities like Veo for video generation, Nano Banana for image editing, and Google Search for source verification.</li>
<li style="font-weight:400;">New <a href="https://www.youtube.com/watch?v=FyTB1vmgM00">Annotation Mode</a> enables visual app modifications by highlighting UI elements and describing changes in plain language rather than editing code directly</li>
<li style="font-weight:400;">The updated <a href="https://aistudio.google.com/apps?source=showcase">App Gallery</a> provides visual examples of <a href="https://gemini.google.com/">Gemini</a>-powered applications with instant preview, starter code access, and remix capabilities for rapid prototyping</li>
<li style="font-weight:400;">Users can add personal API keys to continue development when free-tier quotas are exhausted, with automatic switching back to the free tier upon renewal.</li>
<li style="font-weight:400;">Are you a visual learner? You can check out their YouTube tutorial playlist <a href="https://www.youtube.com/playlist?list=PLOU2XLYxmsIKkEa_-KTPF9DZ0IyHJ7V1H">here</a>. </li>
</ul>
<p>05:39  Justin – “So, there are still API keys – they made it sound like there wasn’t, but there is. You just don’t have to manage them until you’ve consumed your free tier.”  </p>
<p>09:35  <a href="https://go.theregister.com/feed/www.theregister.com/2025/10/24/openai_chatgpt_company_knowledge/">OpenAI takes aim at Microsoft 365 Copilot • The Register</a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/">OpenAI</a> launched “<a href="https://openai.com/index/introducing-company-knowledge/">company knowledge</a>” for <a href="https://chatgpt.com/business/business-plan">ChatGPT Business</a>, <a href="https://chatgpt.com/business/enterprise">Enterprise</a>, and <a href="https://openai.com/index/introducing-chatgpt-edu/">Edu</a> plans, enabling direct integration with corporate data sources, including <a href="https://slack.com/signin">Slack</a>, <a href="https://www.microsoft.com/microsoft-365/sharepoint/collaboration">SharePoint</a>, <a href="https://www.google.com/drive/">Google Drive</a>, <a href="https://www.microsoft.com/en-us/microsoft-teams/log-in">Teams</a>, and <a href="https://www.outlook.com/">Outlook</a>; notably excluding <a href="https://onedrive.live.com/login/en-us/">OneDrive</a>, which could impact Microsoft-heavy organizations.</li>
<li style="font-weight:400;">The feature requires manual activation for each conversation and lacks capabilities like web search, image generation, or graph creation when enabled, unlike <a href="https://m365.cloud.microsoft/?omkt=en-001&amp;source=post_page---------------------------">Microsoft 365 Copilot</a>‘s deeper integration across Office applications.</li>
<li style="font-weight:400;">ChatGPT Business pricing at $25/user/month undercuts Microsoft 365 Copilot’s <a href="https://www.microsoft.com/en-us/microsoft-365-copilot/pricing">$30/month fee</a>, potentially offering a more cost-effective enterprise AI assistant option with stronger brand recognition. (5 bucks is 5 bucks, right?) </li>
<li style="font-weight:400;">Security implementation includes individual authentication per connector, encryption of all data, no training on corporate data, and an <a href="https://help.openai.com/en/articles/9261474-compliance-api-for-enterprise-customers">Enterprise Compliance API</a> for conversation log review and regulatory reporting.</li>
<li style="font-weight:400;">Data residency and processing locations vary by connector, with no clear documentation from OpenAI, requiring organizations to verify compliance requirements before deployment.</li>
<li style="font-weight:400;">We kind of think we’ve heard of this before…</li>
</ul>
<p>11:05  Ryan – “And it’s a huge problem. It’s been a huge problem that people have been trying to solve for a long time.”  </p>
<p>14:23 <a href="https://blogs.microsoft.com/blog/2025/10/28/the-next-chapter-of-the-microsoft-openai-partnership/">The next chapter of the Microsoft–OpenAI partnership – The Official </a><a href="https://blogs.microsoft.com/blog/2025/10/28/the-next-chapter-of-the-microsoft-openai-partnership/">Microsoft Blog</a></p>
<ul>
<li style="font-weight:400;">Welp, the divorce has reached a (sort of) amicable alimony agreement. </li>
<li style="font-weight:400;">Microsoft and OpenAI have restructured their partnership with Microsoft, now holding approximately 27% stake in OpenAI’s new public benefit corporation, which is now valued at $135 billion, while maintaining exclusive <a href="https://learn.microsoft.com/en-us/azure/api-management/api-management-key-concepts">Azure API</a> access and IP rights until <a href="https://www.zdnet.com/article/what-is-artificial-general-intelligence/">AG</a>I is achieved.</li>
<li style="font-weight:400;">The agreement introduces an independent expert panel to verify AGI declarations and extends Microsoft’s IP rights for models and products through 2032, including post-AGI models with safety guardrails, though research IP expires by 2030 or AGI verification.</li>
<li style="font-weight:400;">OpenAI gains significant operational flexibility, including the ability to develop non-API products with third parties on any cloud provider, release open weight models meeting capability criteria, and serve US government national security customers on any cloud infrastructure.</li>
<li style="font-weight:400;">Microsoft can now independently pursue AGI development alone or with partners, and if using <a href="https://blogs.microsoft.com/blog/2025/10/28/the-next-chapter-of-the-microsoft-openai-partnership/">OpenAI’s IP pre-AGI</a>, must adhere to compute thresholds significantly larger than current leading model training systems.</li>
<li style="font-weight:400;">OpenAI has committed to purchasing $250 billion in Azure services while Microsoft loses its right of first refusal as OpenAI’s compute provider, signaling a shift toward more independent operations for both companies.</li>
</ul>
<p>Con’t <a href="https://openai.com/index/next-chapter-of-microsoft-openai-partnership/">The next chapter of the Microsoft–OpenAI partnership | OpenAI</a></p>
<ul>
<li style="font-weight:400;">Microsoft’s investment in OpenAI is now valued at approximately $135 billion, representing roughly 27% ownership on a diluted basis, while OpenAI transitions to a public benefit corporation structure.</li>
<li style="font-weight:400;">The partnership introduces an independent expert panel to verify when OpenAI achieves AGI, with Microsoft’s IP rights for models and products extended through 2032, including post-AGI models with safety guardrails.</li>
<li style="font-weight:400;">OpenAI gains significant flexibility, including the ability to develop non-API products with third parties on any cloud provider, release open weight models meeting capability criteria, and provide API access to US government national security customers on any cloud.</li>
<li style="font-weight:400;">Microsoft can now independently pursue AGI development alone or with partners, while OpenAI has committed to purchasing an additional $250 billion in Azure services, but Microsoft no longer has the right of first refusal as a compute provider.</li>
<li style="font-weight:400;">The revenue-sharing agreement continues until AGI verification, but payments will be distributed over a longer timeframe, while Microsoft retains exclusive rights to <a href="https://platform.openai.com/docs/models">OpenAI’s frontier models</a> and Azure API exclusivity until AGI is achieved.</li>
</ul>
<p>15:59  Justin – “Once AGI is achieved is an interesting choice… I wonder how Microsoft believes that’s gonna happen very soon, and OpenAI doesn’t, that’s why they’re willing to agree on that term; it’s interesting. Again, it has to be independently verified by a partner, so OpenAI can’t just come out and say, ‘we’ve created AGI,’ then, into a legal dispute – it has to be agreed upon by others. So that’s all very interesting.”</p>
<p>17:45 <a href="https://aws.amazon.com/blogs/aws/build-more-accurate-ai-applications-with-amazon-nova-web-grounding/">Build more accurate AI applications with Amazon Nova Web Grounding | </a><a href="https://aws.amazon.com/blogs/aws/build-more-accurate-ai-applications-with-amazon-nova-web-grounding/">AWS News Blog</a></p>
<ul>
<li style="font-weight:400;">AWS announces general availability of Web Grounding for <a href="https://aws.amazon.com/blogs/aws/amazon-nova-premier-our-most-capable-model-for-complex-tasks-and-teacher-for-model-distillation/">Amazon Nova Premier</a>, a built-in <a href="https://aws.amazon.com/what-is/retrieval-augmented-generation/?trk=ac97e39c-d115-4d4a-b3fe-c695e0c9a7ee&amp;sc_channel=el">RAG</a> tool that automatically retrieves and cites current web information during inference. </li>
<li style="font-weight:400;">The feature eliminates the need to build custom RAG pipelines while reducing hallucinations through automatic source attribution and verification.</li>
<li style="font-weight:400;">Web Grounding operates as a system tool within the <a href="https://docs.aws.amazon.com/bedrock/latest/userguide/conversation-inference-examples.html">Bedrock Converse API</a>, allowing Nova models to intelligently determine when to query external sources based on prompt context. </li>
<li style="font-weight:400;">Developers simply add nova_grounding to the toolConfig parameter, and the model handles retrieval, integration, and citation of public web sources automatically.</li>
<li style="font-weight:400;">The feature is currently available only in US East N. Virginia for <a href="https://aws.amazon.com/blogs/aws/amazon-nova-premier-our-most-capable-model-for-complex-tasks-and-teacher-for-model-distillation/">Nova Premier</a>, with Ohio and Oregon regions coming soon, and support for other Nova models planned. </li>
<li style="font-weight:400;">Additional costs apply beyond standard model inference pricing, detailed on the <a href="https://aws.amazon.com/bedrock/pricing/?trk=ac97e39c-d115-4d4a-b3fe-c695e0c9a7ee&amp;sc_channel=el">Amazon Bedrock pricing page</a>.</li>
<li style="font-weight:400;">Primary use cases include knowledge-based chat assistants requiring current information, content generation tools needing fact-checking, research applications synthesizing multiple sources, and customer support where accuracy and verifiable citations are essential. </li>
<li style="font-weight:400;">The reasoning traces in responses allow developers to follow the model’s decision-making process.</li>
<li style="font-weight:400;">The implementation provides a turnkey alternative to custom RAG architectures, particularly valuable for developers who want to focus on application logic rather than managing complex information retrieval systems while maintaining transparency through automatic source attribution.</li>
</ul>
<p>18:36  Justin – “This is the first time I’ve heard anything about Nova in months, so, good to know?” </p>
<h2>Cloud Tools  </h2>
<p>19:34 I<a href="https://www.harness.io/blog/introducing-ai-powered-database-migration-authoring">ntroducing-ai-powered-database-migration-authoring</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.harness.io/">Harness</a> introduces AI-powered database migration authoring that lets developers describe schema changes in plain English, like “create a table named animals with columns for genus_species,” and automatically generates production-ready SQL migrations with rollback scripts and Git integration.</li>
<li style="font-weight:400;">The tool addresses the “<a href="https://www.harness.io/blog/the-ai-velocity-paradox">AI Velocity Paradox</a>” where 63% of organizations ship code faster with AI, but 72% have suffered production incidents from AI-generated code – by extending AI automation to database changes, which remain a manual bottleneck in most CI/CD pipelines.</li>
<li style="font-weight:400;">Built on Harness’s Software Delivery Knowledge Graph and MCP Server, it analyzes current schemas, generates backward-compatible migrations, validates for compliance, and integrates with existing policy-as-code governance – making it more than just a generic SQL generator.</li>
<li style="font-weight:400;"><a href="https://www.harness.io/products/database-devops">Database DevOps</a> is one of Harness’s fastest-growing modules, with customers like Athenahealth reporting they saved months of engineering effort compared to Liquibase Pro or homegrown solutions while getting better governance and visibility.</li>
<li style="font-weight:400;">This positions databases as first-class citizens in CI/CD pipelines rather than the traditional midnight deployment bottleneck, allowing DBAs to maintain oversight through automated approvals while developers can finally move database changes at DevOps speed.</li>
</ul>
<p>20:44  Ryan – “Given how hard this is for humans to do, I look forward to AI doing this better.” </p>
<h2>AWS </h2>
<p>21:38 <a href="https://80.lv/articles/amazon-allegedly-replaced-40-of-aws-devops-workers-with-ai-days-before-crash">Amazon Allegedly Replaced 40% of AWS DevOps With AI Days Before </a><a href="https://80.lv/articles/amazon-allegedly-replaced-40-of-aws-devops-workers-with-ai-days-before-crash">Crash</a></p>
<ul>
<li style="font-weight:400;">An unverified <a href="https://blog.stackademic.com/aws-just-fired-40-of-its-devops-team-then-let-ai-take-their-jobs-d9db9d298bfa">report</a> claims Amazon replaced 40% of AWS DevOps staff with AI systems capable of automatically fixing IAM permissions, rebuilding VPC configurations, and rolling back failed <a href="https://duckduckgo.com/y.js?ad_domain=amazon.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=F-QhhGQltokwW7hRZ8xaPod72ZGJ2kGZxoO2FhUD2NVVTsE0PmgukmM5_bnlTQ2BYpJVmIH8H7q3_MWJrYnOsyHIROKS7FDMbesJXQg83_cx3D9X3xQ2JzcE1PiJk_og.A9VWsocZAD1h_407Y28Dsg&amp;eddgt=_fRUtMyQW3CJhW7ZRwqfPQ%3D%3D&amp;rut=4ff80f5e08b2447afa20a844d1bcc2a25682dfbe959ec99d7a9ad4fbf6a22a6f&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8DC81zmhDPF0K7bk1CLMBGjVUCUxN6VKaZO4yJQoz7T9lGAXDD4cW-nZPqtqDBhiCNTgIJSVjh4o6koDuHi9Jife5b_Puu72Xthqm_vZgi-PpvVs6JJ49QF2xOPOU1c7-L2dLzuByauDtoRfPV8V-80IP0fYA7w8aC05lFbrg-YSDQf70HfMAbAGIUgUEGjcdI1QmfSL6UcZjJsXfAT8R5-J6uq7xEtuQ0vOfmuDtBVvfvl-jEZK8Pr69JF7G8nuRyVGd9Nyhlgx0Dw1ruAUrrJmAOVbAmkM2w45hBdnfQcbuIcakIdXrE2DXp3WT_kgvkHLVGAcz0VCY0eeBnJQvRgo8OLQl5DD0whdyiBDkIl4juANtNi1JO-afZ3iyeED2ApzSKkAqCV66N14_ZYdW8R9fopm2CBmjaVacCuLBnKN3VK1wCNY2Mgfy3A9qCyGE0OtU5MHAIICrEilon5EFnhzpIZwblcyJUabu-68Cihw7ETbfeCLkFioVg1xeSFfx75XZAbUMJq5qwAfDt8TS7Z42nH36mFW7ebJ6xsna2rvbOfbFM5ALqvrPUvRpu5GvbioydVNLOtgQSnn8YXKdiYNYgzd8fq__GGHG2l44NZip-nR6Gb4HeoRjKKXgnlFZRBLEDWbaEZe2sSjIAp12R5FMz-Sf2A-ohtsqcsLkmXFZ2h1SikzeL91w7JN6DDMaZQUaR8yPrvwr-Ya0BdVatIdwAwht_8admJe8HnPpt2g-YdIToHiFZmDbe_dex0Mup_kQ_F7mvP9EGb_P1ucaXFby2do%26u%3DaHR0cHMlM2ElMmYlMmZwaXhlbC5ldmVyZXN0dGVjaC5uZXQlMmY0NDIyJTJmY3ElM2Zldl9zaWQlM2QxMCUyNmV2X2xuJTNkbGFtYmRhJTI2ZXZfbHR4JTNkJTI2ZXZfbHglM2Rrd2QtNzE2NzUxODAwODk5MjYlM2Fsb2MtMTkwJTI2ZXZfY3J4JTNkNzE2NzQ2NTEzMDk2MzIlMjZldl9tdCUzZGUlMjZldl9kdmMlM2RjJTI2ZXZfcGh5JTNkNzkzODclMjZldl9sb2MlM2QlMjZldl9jeCUzZDQ4MjUwMzExNSUyNmV2X2F4JTNkMTE0Njc5MTYzNDI5MDY2NSUyNmV2X2V4JTNkJTI2ZXZfZWZpZCUzZDZjZjE5YzE2ZDA2NzFiNDVmZWU1ZjUxYzEyNDNlZWRmJTNhRyUzYXMlMjZ1cmwlM2RodHRwcyUyNTNBJTI1MkYlMjUyRmF3cy5hbWF6b24uY29tJTI1MkZwbSUyNTJGbGFtYmRhJTI1MkYlMjUzRnRyayUyNTNENjFmMWFiMzAtOWNhZi00NGQwLWE1NmQtNWFhYjYzM2Y0YWQ3JTI1MjZzY19jaGFubmVsJTI1M0RwcyUyNTI2c19rd2NpZCUyNTNEQUwhNDQyMiExMCE3MTY3NDY1MTMwOTYzMiEhISE3MTY3NTE4MDA4OTkyNiEhNDgyNTAzMTE1ITExNDY3OTE2MzQyOTA2NjUlMjUyNmVmX2lkJTI1M0Q2Y2YxOWMxNmQwNjcxYjQ1ZmVlNWY1MWMxMjQzZWVkZiUyNTNBRyUyNTNBcyUyNm1zY2xraWQlM2Q2Y2YxOWMxNmQwNjcxYjQ1ZmVlNWY1MWMxMjQzZWVkZg%26rlid%3D6cf19c16d0671b45fee5f51c1243eedf&amp;vqd=4-266450751659073941330548974144759362426&amp;iurl=%7B1%7DIG%3DC0927D5D94874C5D872D52DD34E6CA8E%26CID%3D3C4FC36568F9687B0B42D5FE692669B6%26ID%3DDevEx%2C5045.1">Lambda</a> deployments, just days before their widely reported on crash. </li>
<li style="font-weight:400;">AWS has not confirmed this, and skepticism remains high, however.</li>
<li style="font-weight:400;">The timing coincides with a recent AWS outage that impacted major services, including <a href="https://www.snapchat.com/">Snapchat</a>, McDonald’s app, Roblox, and Fortnite, raising questions about automation’s role in system reliability and incident response.</li>
<li style="font-weight:400;"><a href="https://www.reuters.com/business/retail-consumer/amazons-aws-cloud-computing-unit-cuts-least-hundreds-jobs-sources-say-2025-07-17/">AWS officially laid off hundreds of employees in July 2025</a> (and more just recently), but the alleged 40% DevOps reduction would represent a significant shift toward AI-driven infrastructure management if true.</li>
<li style="font-weight:400;">The incident highlights growing concerns about cloud service concentration risk, as both this AWS outage and the 2023 CrowdStrike incident demonstrate how single points of failure can impact thousands of businesses globally.</li>
<li style="font-weight:400;">For AWS customers, this raises practical questions about the balance between automation efficiency and human oversight in critical infrastructure operations, particularly for disaster recovery and complex troubleshooting scenarios.</li>
</ul>
<p>22:19  Justin – “In general, Amazon has been doing a lot of layoffs. There’s been a lot of brain drain. I don’t know that they’ve automated 40% of the DevOps staff with AI systems…so this one seems a little rumor-y and speculative, but I did find it fun that people were trying to blame AI for Amazon’s woes last week.” </p>
<p>24:41 <a href="https://aws.amazon.com/message/101925/">Summary of the Amazon DynamoDB Service Disruption in Northern </a><a href="https://aws.amazon.com/message/101925/">Virginia (US-EAST-1) Region</a></p>
<ul>
<li style="font-weight:400;"><a href="https://duckduckgo.com/y.js?ad_domain=amazon.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=r-ou5kDNQkymTEepm4ztXcIPR6BeK1CT9_BV_F1J5F-pbRSWSkRJkGyazgSjd6XecDXaFwTEM9Ux25NbHac988Y5oOKPsrcahV0Wi8MlMzj5WF-jys-ggnJXMftBOipH.jSUi98APKOvUmhQ8rR4ODQ&amp;eddgt=2REkXq20md4G4A5tjGUahw%3D%3D&amp;rut=5fe16590abc86b9c9c3f3ba081ab82ba75322bc68c87aee4fb6d2ce4a18a3bfc&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8fh8l_bpOTaFh7A7X-DwkpDVUCUzQklp_dM5DkOPfyBfishuLqDUbU-8p8Ti7rhfkn80aSOCtgHUN8LcOsJRH4t3PtIi-m0eZzwut0WjzmyUC7mC7sbpkXss-iMk54aKHP71rz6HAbzCWjBTWLIsn6LrvkKc4Pogk87kS6kIdBfIX8eh1MuhNqHZISHZbQvxktAGbmu5LBfrDBZtmdi__IAegSH6h_zu6cuoQBiuRC-62Zwvg_9x1KfS0eOaMALX4Pm3Q5fbBD_iZ9hehM_9q9K5jFExciy2KbEuqV90o-RORDHvuPG60SSie1ukRFgdObJizlg421VO9WHfCXI-neQFir7K2DmitPiB_fPn8O2o5oE-e5btA3IrpVtDztR4D5TbJuGNe-li5a7tL9FiuKDINlh9J5mr3IXWTfGs7Y_I3yCuVltFojfl6cmiWezlCuZpvSCc42H_4nahbE_acveK2t_3DeW5F09UYbKmwG4a_i2wZaQ6uh68Src4Ua7GJkIuZVgWGPXHufin-GJjFm7gc1wwl6IgAngdLXC1v3ETRS8vJaP_tGsUSNxYem0Uw_mKlJCrChahpWJ8nZxjpzuuLUAEWQg_hTM2uG2QgTDonBcGBhRIcmV87ZpixEpMR2WXqIAZaFm57LqrUSeUPL5uJ9XMtj3-vr7u3S8jRCpn4dcX2cMGxrSd6PekZGc8KR_h7z97D1L4Sj8nCwpgxgOTko0o6vWHaYFaky4XZ0JfxvOOK8CiC_5M5MT1DURmqql3Aq2a8agqVMz5mowwp0-TMMfI%26u%3DaHR0cHMlM2ElMmYlMmZwaXhlbC5ldmVyZXN0dGVjaC5uZXQlMmY0NDIyJTJmY3ElM2Zldl9zaWQlM2QxMCUyNmV2X2xuJTNkZHluYW1vZGIlMjZldl9sdHglM2QlMjZldl9seCUzZGt3ZC03MTQ2OTAwNTk3Njk0NSUzYWxvYy0xOTAlMjZldl9jcnglM2Q3MTQ2ODQ4MzE0NDk3OSUyNmV2X210JTNkZSUyNmV2X2R2YyUzZGMlMjZldl9waHklM2Q4MDQxOCUyNmV2X2xvYyUzZCUyNmV2X2N4JTNkNDgyMjEwMzkxJTI2ZXZfYXglM2QxMTQzNDkzMDc0MjIzNjcwJTI2ZXZfZXglM2QlMjZldl9lZmlkJTNkYjQ4YzkxNGM3YTkwMTJmMjM1NzlmODQ2Yjg3MmYxNjklM2FHJTNhcyUyNnVybCUzZGh0dHBzJTI1M0ElMjUyRiUyNTJGYXdzLmFtYXpvbi5jb20lMjUyRnBtJTI1MkZkeW5hbW9kYiUyNTJGJTI1M0Z0cmslMjUzRGYyY2Y0NDNjLTZkY2MtNGQzYy1iODA5LWM4MDQ3NzhiNzMzMyUyNTI2c2NfY2hhbm5lbCUyNTNEcHMlMjUyNnNfa3djaWQlMjUzREFMITQ0MjIhMTAhNzE0Njg0ODMxNDQ5NzkhISEhNzE0NjkwMDU5NzY5NDUhITQ4MjIxMDM5MSExMTQzNDkzMDc0MjIzNjcwJTI1MjZlZl9pZCUyNTNEYjQ4YzkxNGM3YTkwMTJmMjM1NzlmODQ2Yjg3MmYxNjklMjUzQUclMjUzQXMlMjZtc2Nsa2lkJTNkYjQ4YzkxNGM3YTkwMTJmMjM1NzlmODQ2Yjg3MmYxNjk%26rlid%3Db48c914c7a9012f23579f846b872f169&amp;vqd=4-168157558398445922298540892694062669655&amp;iurl=%7B1%7DIG%3DF4DFA0D1AA22413E988D7523CDAA2B0B%26CID%3D26969A4910CB66891BF08CD2114367CE%26ID%3DDevEx%2C5046.1">DynamoDB</a> experienced a 2.5-hour outage in US-EAST-1 due to a race condition in its DNS management system that resulted in empty DNS records, affecting all services dependent on DynamoDB, including <a href="https://duckduckgo.com/y.js?ad_domain=amazon.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=8auIy8zqouKytoLe4L9_Pg1AQT8nVmQF1yh_Ed9JnwCMjXxh_JctKmU3kp6xC9nS-TaFcySlerg_pvbkDK4g3R0PFKWGgW41OwwVCjmiZHQ_SKOW4KVQgGy-LR4Tmmt9.uCPz30b-k0kiWfzkgtpsFg&amp;eddgt=Xsc_icYZv8PQPm7aeqGXqw%3D%3D&amp;rut=e6c61238d030c3b43ad9cefdd78f84e401103909a11b7d9bcc48018fa45c3d5e&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8rjLmfwmKnOuAG4iV9PctsDVUCUzFrU03m5xo4__nPUAQx7Z-967petqChwMY7OZB8nVsW0sGNo7Y0_37vYRqDWYaye4E4xCmEHRNK85-4jV3np3OGeC3uSmlibP0e58RWb8kux9SN7I229YYltG6YvmBboLKvCAqJxnBovwlhNaG2tvnjqnhvHwwsFzgkKGpUhDR5qnSA0LFt8kbTd9O06TF3auXb29XlwHrPJ-V4AEHyPR0KrhLkDF1thtiGB5R_4VIdMhvHDZofEcxWUli9x1CW-Yt9o9KTqF7SlJ1BQPbTD0c0iUnW4wNw6XMpkEeXvBUPm5qXlplxWw4wrxG85x0Iqu_G-SmcuRVa0QilvvWj_c9TKA3BnutuD5FX2P9qMmWYmsazyF5UTka_OKZoO8n3KODg5wE8NQzwX8FTv4MzLkH3B1TVcTBKQ2r2aMJ_Ob4P1JE2b4ON5B0wfG9DopheEtCfwAf28TvGXB1k5VMst1-lr786H1U4ynks62oqgKMdstXt2prInn0ASi6qqGMKeIQyhcsMPmqy5iDaDLP-23xovubCgYXUahtMmYuLwD38ZToQzd-ENi7f2ZQ-XsFAozA7AtZR7VXK_3o-OE-I3S9d7QM8cfBn2dkZdQdOO7-PzFxOfB4-DR0w1Z2YbB3_lYTY9_g-zfCUJKb5JTCZ-0RFJtGlIlsA_FvEBjaUL74ywtApTeXZXmP0Au1FAKR6Z-inX76WQh6-AvmQnujeFE0kyegr2m41v3UMiK_vDRjmyxF7u0FtSFLEqPyEHTzXdQ%26u%3DaHR0cHMlM2ElMmYlMmZwaXhlbC5ldmVyZXN0dGVjaC5uZXQlMmY0NDIyJTJmY3ElM2Zldl9zaWQlM2QxMCUyNmV2X2xuJTNkYW1hem9uJTI1MjB3ZWIlMjUyMHNlcnZpY2VzJTI1MjBlYzIlMjUyMHdlYmhvc3RpbmclMjZldl9sdHglM2QlMjZldl9seCUzZGt3ZC03MTMzMTU2NDA4Njk3MSUzYWxvYy0xOTAlMjZldl9jcnglM2Q3MTMzMTA0Mzk5MDA1NSUyNmV2X210JTNkZSUyNmV2X2R2YyUzZGMlMjZldl9waHklM2Q4MDU2MCUyNmV2X2xvYyUzZCUyNmV2X2N4JTNkNDgyMjEwMzkzJTI2ZXZfYXglM2QxMTQxMjk0MDUwNDg1MDc2JTI2ZXZfZXglM2QlMjZldl9lZmlkJTNkOGVmN2ZlYjU5YmUxMWExYjJkODQ5N2JmNThiNDZjMWIlM2FHJTNhcyUyNnVybCUzZGh0dHBzJTI1M0ElMjUyRiUyNTJGYXdzLmFtYXpvbi5jb20lMjUyRnBtJTI1MkZlYzIlMjUyRiUyNTNGdHJrJTI1M0QyMTVjNDU0OS1mYWFjLTRjNTItOTZhNy02OWQ5ZTE0OWY4YmIlMjUyNnNjX2NoYW5uZWwlMjUzRHBzJTI1MjZzX2t3Y2lkJTI1M0RBTCE0NDIyITEwITcxMzMxMDQzOTkwMDU1ISEhITcxMzMxNTY0MDg2OTcxISE0ODIyMTAzOTMhMTE0MTI5NDA1MDQ4NTA3NiUyNTI2ZWZfaWQlMjUzRDhlZjdmZWI1OWJlMTFhMWIyZDg0OTdiZjU4YjQ2YzFiJTI1M0FHJTI1M0FzJTI2bXNjbGtpZCUzZDhlZjdmZWI1OWJlMTFhMWIyZDg0OTdiZjU4YjQ2YzFi%26rlid%3D8ef7feb59be11a1b2d8497bf58b46c1b&amp;vqd=4-300947642358475397040362265414124639131&amp;iurl=%7B1%7DIG%3D9B42EA8802D04047A81C6E8145E53D8B%26CID%3D17EBB2BBFCEA6245240FA420FD5F633E%26ID%3DDevEx%2C5046.1">EC2</a>, <a href="http://v">Lambda</a>, and <a href="https://aws.amazon.com/redshift/">Redshift</a>.</li>
<li style="font-weight:400;">The cascading failure pattern showed how tightly coupled AWS services are – EC2 instance launches failed for 14 hours because DynamoDB’s outage prevented lease renewals between EC2’s DropletWorkflow Manager and physical servers.</li>
<li style="font-weight:400;">Network Load Balancers experienced connection errors from 5:30 AM to 2:09 PM due to health check failures caused by EC2’s network state propagation delays, demonstrating how infrastructure dependencies can create extended recovery times.</li>
<li style="font-weight:400;">AWS has disabled the automated DNS management system globally and will implement velocity controls and improved throttling mechanisms before re-enabling, highlighting the challenge of balancing automation with resilience.</li>
<li style="font-weight:400;">The incident reveals architectural vulnerabilities in multi-service dependencies – services like Redshift in all regions failed IAM authentication due to hardcoded dependencies on US-EAST-1, suggesting the need for better regional isolation.</li>
</ul>
<p>26:31  Matt – “It’s a good write-up to show that look, even these large cloud providers that have these massive systems and have redundancy upon redundancy upon redundancy – it’s all software under the hood. Software will eventually have a bug in it. And this just happens to be a really bad bug that took down half the internet.”</p>
<p>28:30 <a href="https://aws.amazon.com/about-aws/whats-new/2025/10/amazon-cloudwatch-incident-report/">Amazon CloudWatch introduces interactive incident reporting</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/AmazonCloudWatch/latest/monitoring/WhatIsCloudWatch.html">CloudWatch</a> now automatically generates post-incident analysis reports by correlating telemetry data, investigation inputs, and actions taken during an investigation, reducing report creation time from hours to minutes.</li>
<li style="font-weight:400;">Reports include executive summaries, event timelines, impact assessments, and actionable recommendations, helping teams identify patterns and implement preventive measures for better operational resilience.</li>
<li style="font-weight:400;">The feature integrates directly with <a href="https://docs.aws.amazon.com/AmazonCloudWatch/latest/monitoring/Investigations.html">CloudWatch investigations</a>, capturing operational telemetry and service configurations automatically without manual data collection or correlation.</li>
<li style="font-weight:400;">Currently available in 12 AWS regions, including US East, Europe, and Asia Pacific, with no specific pricing mentioned – likely included in existing CloudWatch investigation costs.</li>
<li style="font-weight:400;">This addresses a common pain point where teams spend significant time manually creating incident reports instead of focusing on root cause analysis and prevention strategies.</li>
</ul>
<p>31:00 <a href="https://aws.amazon.com/blogs/aws/aws-customer-carbon-footprint-tool-now-includes-scope-3-emissions/">Customer Carbon Footprint Tool Expands: Additional emissions categories </a><a href="https://aws.amazon.com/blogs/aws/aws-customer-carbon-footprint-tool-now-includes-scope-3-emissions/">including Scope 3 are now available | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;">AWS <a href="https://aws.amazon.com/aws-cost-management/aws-customer-carbon-footprint-tool/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">Customer Carbon Footprint Tool</a> now includes Scope 3 emissions data covering fuel/energy-related activities, IT hardware lifecycle emissions, and building/equipment impacts, giving customers a complete view of their carbon footprint beyond just direct operational emissions.</li>
<li style="font-weight:400;">The tool provides both location-based and market-based emission calculations with 38 months of historical data recalculated using the new methodology, accessible through the <a href="https://aws.amazon.com/about-aws/whats-new/2025/01/customer-carbon-footprint-tool-dedicated-page/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">AWS Billing console</a> with CSV export and integration options for QuickSight visualization.</li>
<li style="font-weight:400;">Scope 3 emissions are amortized over asset lifecycles (6 years for IT hardware, 50 years for buildings) to fairly distribute embodied carbon across operational lifetime, with all calculations independently verified following <a href="https://ghgprotocol.org/">GHG Protocol standards</a>.</li>
<li style="font-weight:400;">Early access customers like Salesforce, SAP, and Pinterest report that the granular regional data and Scope 3 visibility help them move beyond industry averages to make targeted carbon reduction decisions based on actual infrastructure emissions.</li>
<li style="font-weight:400;">The tool remains free to use within the AWS Billing and Cost Management console, providing emissions data in metric tons of CO2 equivalent (MTCO2e) to help organizations track progress toward sustainability goals and compliance reporting requirements.</li>
</ul>
<p>32:45  Matt – “This is a difficult problem to solve. Once you have scope three, it’s all your indirect costs. So, I think if I remember correctly, scope one is your actual server, scope two is power, and then scope three is all the things that have to get included to generate your power and your servers, which includes shipping, et cetera. So getting all that, it’s not an easy task to do. Even when I look at the numbers, I don’t know what these mean half the time when I have to look at them. I’m like, we’re going down. That seems positive.”</p>
<p>33:59 <a href="https://aws.amazon.com/about-aws/whats-new/2025/10/aws-secret-west-region-is-now-available">AWS Secret-West Region is now available</a></p>
<ul>
<li style="font-weight:400;">AWS launches <a href="http://v">Secret-West</a>, its second region capable of handling Secret-level U.S. classified workloads, expanding beyond the existing Secret-East region to provide geographic redundancy for intelligence and defense agencies operating in the western United States.</li>
<li style="font-weight:400;">The region meets stringent <a href="https://www.dni.gov/index.php/what-we-do/ic-related-menus/ic-related-links/intelligence-community-directives">Intelligence Community Directive (ICD) 503</a> and DoD Security Requirements Guide Impact Level 6 requirements, enabling government agencies to process and analyze classified data with multiple Availability Zones for high availability and disaster recovery.</li>
<li style="font-weight:400;">This expansion allows agencies to deploy latency-sensitive classified workloads closer to western U.S. operations while maintaining multi-region resiliency, addressing a critical gap in classified cloud infrastructure outside the eastern United States.</li>
<li style="font-weight:400;">AWS continues to operate in a specialized market segment with limited competition, as few cloud providers can meet the security clearance and infrastructure requirements necessary for Secret-level classification hosting.</li>
<li style="font-weight:400;">Pricing information is not publicly available due to the classified nature of the service; interested government agencies must contact AWS directly through their secure channels to discuss access and costs.</li>
</ul>
<p> Agent Coulson – “Welcome to level 7.”</p>
<p>38:24 <a href="https://aws.amazon.com/about-aws/whats-new/2025/10/aws-transfer-family-changing-idp-type">AWS Transfer Family now supports changing identity provider type on a </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/10/aws-transfer-family-changing-idp-type">server</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/aws-transfer-family/">AWS Transfer Family</a> now allows changing identity provider types (service managed, Active Directory, or custom IdP) on existing SFTP, FTPS, and FTP servers without service interruption, eliminating the need to recreate servers during authentication migrations.</li>
<li style="font-weight:400;">This feature enables zero-downtime authentication migrations for organizations transitioning between identity providers or consolidating authentication systems, particularly useful for companies undergoing mergers or updating compliance requirements.</li>
<li style="font-weight:400;">The capability is available across all AWS regions where Transfer Family operates, with no additional pricing beyond standard Transfer Family costs, which start at $0.30 per protocol per hour.</li>
<li style="font-weight:400;">Organizations can now adapt their file transfer authentication methods dynamically as business needs evolve, such as switching from basic service-managed users to enterprise Active Directory integration without disrupting ongoing file transfers.</li>
<li style="font-weight:400;">Implementation details and migration procedures are documented in the Transfer Family User Guide <a href="https://docs.aws.amazon.com/transfer/latest/userguide/what-is-aws-transfer-family.html">here</a>.</li>
</ul>
<p>39:26  Ryan – “Any kind of configuration change that requires you to destroy and recreate isn’t fun. I do believe that we should architect for such things and be able to redirect things with DNS traffic (which never goes wrong), never causes anyone any problems. But, it is terrible when that happens, because even when it works, you’re sort of nervously doing it the entire time.”</p>
<p>40:24 <a href="https://aws.amazon.com/about-aws/whats-new/2025/10/amazon-cloudwatch-metrics-monitor-ec2-instances-i-o-performance">New Amazon CloudWatch metrics to monitor EC2 instances exceeding I/O </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/10/amazon-cloudwatch-metrics-monitor-ec2-instances-i-o-performance">performance</a></p>
<ul>
<li style="font-weight:400;">AWS introduces <a href="https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/ebs-optimized.html">Instance EBS IOPS</a> Exceeded Check and Instance EBS Throughput <a href="https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/viewing_metrics_with_cloudwatch.html#ebs-metrics-nitro">Exceeded Check metrics</a> that return binary values (0 or 1) to indicate when EC2 instances exceed their EBS-optimized performance limits, helping identify bottlenecks without manual calculation.</li>
<li style="font-weight:400;">These metrics enable automated responses through <a href="https://docs.aws.amazon.com/AmazonCloudWatch/latest/monitoring/AlarmThatSendsEmail.html">CloudWatch alarms</a>, such as triggering instance resizing or type changes when I/O limits are exceeded, reducing manual intervention for performance optimization.</li>
<li style="font-weight:400;">Available at no additional cost with 1-minute granularity for all <a href="https://docs.aws.amazon.com/ec2/latest/instancetypes/ec2-nitro-instances.html">Nitro-based EC2 instances</a> with attached EBS volumes across all commercial AWS regions, including <a href="https://aws.amazon.com/govcloud-us/">GovCloud</a> and China.</li>
<li style="font-weight:400;">Addresses a common blind spot where applications experience degraded performance due to exceeding instance-level I/O limits rather than volume-level limits, which many users overlook when troubleshooting. (Yes, we’re all guilty of this.) </li>
<li style="font-weight:400;">Particularly useful for database workloads and high-throughput applications where understanding whether the bottleneck is at the instance or volume level is critical for right-sizing decisions.</li>
</ul>
<p>41:20  Matt – “This would have solved a lot of headaches when GP3 came out…”</p>
<p> GCP</p>
<p>43:53 <a href="https://cloud.google.com/blog/products/identity-security/a-practical-guide-to-google-clouds-parameter-manager/">A practical guide to Google Cloud’s Parameter Manager | Google Cloud </a><a href="https://cloud.google.com/blog/products/identity-security/a-practical-guide-to-google-clouds-parameter-manager/">Blog</a></p>
<ul>
<li style="font-weight:400;">Google Cloud <a href="https://cloud.google.com/secret-manager/parameter-manager/docs/overview">Parameter Manager</a> provides centralized configuration management that separates application settings from code, supporting JSON, YAML, and unformatted data with built-in format validation for JSON and YAML types</li>
<li style="font-weight:400;">The service integrates with <a href="https://cloud.google.com/security/products/secret-manager">Secret Manager</a> through a __REF__ syntax that allows parameters to securely reference secrets like API keys and passwords, with regional compliance enforcement ensuring secrets can only be referenced by parameters in the same region</li>
<li style="font-weight:400;">Parameter Manager uses versioning for configuration snapshots, enabling safe rollbacks and preventing unintended breaking changes to deployed applications while supporting use cases like A/B testing, feature flags, and regional configurations</li>
<li style="font-weight:400;">Both Parameter Manager and Secret Manager offer monthly free tiers, though specific pricing details aren’t provided in the announcement; the service requires granting IAM permissions for parameters to access referenced secrets</li>
<li style="font-weight:400;">Key benefits include eliminating hard-coded configurations, supporting multi-region deployments with region-specific settings, and enabling dynamic configuration updates without code changes for applications across various industries</li>
</ul>
<p>44:22 Justin – “ I’m a very heavy user of parameter store on AWS. I love it, and you should all use it for any of your dynamic configuration, especially if you’re moving containers between environments. This is the bee’s knees in my opinion.”</p>
<p>49:39 <a href="https://cloud.google.com/blog/products/networking/cross-site-interconnect-now-ga-simplifies-l2-connectivity/">Cross-Site Interconnect, now GA, simplifies L2 connectivity | Google Cloud </a><a href="https://cloud.google.com/blog/products/networking/cross-site-interconnect-now-ga-simplifies-l2-connectivity/">Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.cloud.google.com/network-connectivity/docs/interconnect/concepts/cross-site-overview">Cross-Site Interconnect</a> is now GA, providing managed Layer 2 connectivity between data centers using Google’s global network infrastructure, eliminating the need for complex multi-vendor setups and reducing capital expenditures for WAN connectivity.</li>
<li style="font-weight:400;">The service offers consumption-based pricing with no setup fees or long-term commitments, allowing customers to scale bandwidth dynamically and pay only for what they use, though specific pricing details weren’t provided in the announcement.</li>
<li style="font-weight:400;">Built on Google’s 3.2 million kilometers of fiber and 34 subsea cables (and you know how much we love a good undersea cable). </li>
<li style="font-weight:400;">Cross-Site Interconnect provides a 99.95% SLA that includes protection against cable cuts and maintenance windows, with automatic failover and proactive monitoring across 100s of Cloud Interconnect PoPs.</li>
<li style="font-weight:400;">Financial services and telecommunications providers are early adopters, with Citadel reporting stable performance during their pilot program, highlighting use cases for low-latency trading, disaster recovery, and dynamic bandwidth augmentation for AI/ML workloads.</li>
<li style="font-weight:400;">As a transparent Layer 2 service, it enables MACsec encryption between remote routers with customer-controlled keys, while providing programmable APIs for infrastructure-as-code workflows and real-time monitoring of latency, packet loss, and bandwidth utilization.</li>
</ul>
<p>50:57 Ryan – “I mean, I like this just because of the heavy use of infrastructure as code availability. Some of these deep-down network services across the clouds don’t really provide that; it’s all just sort of click ops or a support case. So this is kind of neat. And I do like that you can dynamically configure this and stand it up / turn it down pretty quickly.”</p>
<p>53:12 <a href="https://cloud.google.com/blog/products/databases/introducing-bigtable-tiered-storage/">Introducing Bigtable tiered storage | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/bigtable/">Bigtable</a> introduces <a href="https://cloud.google.com/bigtable/docs/tiered-storage">tiered storage</a> that automatically moves data older than a configurable threshold from SSD to infrequent access storage, reducing storage costs by up to 85% while maintaining API compatibility and data accessibility through the same interface.</li>
<li style="font-weight:400;">The infrequent access tier provides<a href="https://cloud.google.com/bigtable/docs/choosing-ssd-hdd"> 540% more storage capacity</a> per node compared to <a href="https://cloud.google.com/bigtable/pricing">SSD-only nodes</a>, enabling customers to retain historical data for compliance and analytics without manual archiving or separate systems.</li>
<li style="font-weight:400;">Time-series workloads from manufacturing, automotive, and IoT benefit most – sensor data, EV battery telemetry, and factory equipment logs can keep recent data on SSD for real-time operations while moving older data to cheaper storage automatically based on age policies.</li>
<li style="font-weight:400;">Integration with Bigtable SQL allows querying across both tiers, and logical views enable controlled access to historical data for reporting without full table permissions, simplifying data governance for large datasets.</li>
<li style="font-weight:400;">Currently in preview with pricing at approximately $0.026/GB/month for infrequent access storage compared to $0.17/GB/month for SSD storage, representing significant savings for organizations storing hundreds of terabytes of historical operational data.</li>
</ul>
<p>54:31 Ryan – “To illustrate that I’m still a cloud guy at heart, whenever I’m in an application and I’m loading data and I go back – like I want to see a year’s data –  and it takes that extra 30 seconds to load, I actually get happy, because I know what they’re doing on the backend.” </p>
<p>56:05 <a href="https://cloud.google.com/blog/products/compute/now-shipping-a4x-max-vertex-ai-training-and-more/">Now Shipping A4X Max, Vertex AI Training and more | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google launches <a href="https://cloud.google.com/blog/products/compute/now-shipping-a4x-max-vertex-ai-training-and-more">A4X Max instances</a> powered by <a href="https://www.nvidia.com/en-us/data-center/gb300-nvl72/">NVIDIA GB300 NVL72</a> with 72 <a href="https://developer.nvidia.com/blog/inside-nvidia-blackwell-ultra-the-chip-powering-the-ai-factory-era/">Blackwell Ultra GPUs</a> and <a href="https://www.nvidia.com/en-us/data-center/grace-cpu/">36 Grace CPUs</a>, delivering 2x network bandwidth compared to A4X and 4x better LLM training performance versus A3 H100-based VMs. The system features 1.4 exaflops per NVL72 system and can scale to clusters twice as large as A4X deployments.</li>
<li style="font-weight:400;">GKE now supports <a href="https://github.com/google/dranet">DRANET</a> (Dynamic Resource Allocation Kubernetes Network Driver) in production, starting with A4X Max, providing topology-aware scheduling of GPUs and RDMA network cards to boost bus bandwidth for distributed AI workloads. </li>
<li style="font-weight:400;">This improves cost efficiency through better VM utilization by optimizing connectivity between RDMA devices and GPUs.</li>
<li style="font-weight:400;"><a href="https://cloud.google.com/kubernetes-engine/docs/concepts/about-gke-inference-gateway">GKE Inference Gateway</a> integrates with <a href="https://developer.nvidia.com/nemo-guardrails">NVIDIA NeMo Guardrails</a> to add safety controls for production AI deployments, preventing models from engaging in undesirable topics or responding to malicious prompts. </li>
<li style="font-weight:400;">The integration combines model-aware routing and autoscaling with enterprise-grade security features.</li>
<li style="font-weight:400;"><a href="https://cloud.google.com/model-garden">Vertex AI Model Garden</a> will support <a href="https://developer.nvidia.com/nemotron">NVIDIA Nemotron models</a> as NIM microservices, starting with Llama Nemotron Super v1.5, allowing developers to deploy open-weight models with granular control over machine types, regions, and VPC security policies. </li>
<li style="font-weight:400;"><a href="https://cloud.google.com/vertex-ai/docs/training/overview">Vertex AI Training</a> now includes curated recipes built on <a href="https://docs.nvidia.com/nemo-framework/index.html">NVIDIA NeMo Framework</a> and <a href="https://github.com/NVIDIA-NeMo/RL">NeMo-RL</a> with managed <a href="https://slurm.schedmd.com/overview.html">Slurm</a> environments and automated resiliency features for large-scale model development.</li>
<li style="font-weight:400;">A4X Max is available in preview through Google Cloud sales representatives and leverages Cluster Director for lifecycle management, topology-aware placement, and integration with Managed Lustre storage. </li>
<li style="font-weight:400;">Pricing details were not disclosed in the announcement.</li>
</ul>
<p>57:41 Justin – “That’s a lot of cool hardware stuff that I do not understand.” </p>
<h2>Azure</h2>
<p>58:38 <a href="https://azure.microsoft.com/en-us/blog/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads/">NVIDIA GB300 NVL72: Next-generation AI infrastructure at scale | Microsoft </a><a href="https://azure.microsoft.com/en-us/blog/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads/">Azure Blog</a></p>
<ul>
<li style="font-weight:400;">Microsoft deployed the first production cluster with over 4,600 <a href="https://www.nvidia.com/en-us/data-center/gb300-nvl72/">NVIDIA GB300 NVL72 systems</a> featuring Blackwell Ultra GPUs, enabling AI model training in weeks instead of months and supporting models with hundreds of trillions of parameters</li>
<li style="font-weight:400;">The ND GB300 v6 VMs deliver 1,440 petaflops of FP4 performance per rack with 72 GPUs, 37TB of fast memory, and 130TB/second NVLink bandwidth, specifically optimized for reasoning models, agentic AI, and multimodal generative AI workloads</li>
<li style="font-weight:400;">Azure implemented <a href="https://www.nvidia.com/en-us/networking/products/infiniband/quantum-x800/">800 Gbps NVIDIA Quantum-X800 InfiniBand</a> networking with full fat-tree architecture and SHARP acceleration, doubling effective bandwidth by performing computations in-switch for improved large-scale training efficiency</li>
<li style="font-weight:400;">The infrastructure uses standalone heat exchanger units and new power distribution models to handle high-density GPU clusters, with Microsoft planning to scale to hundreds of thousands of Blackwell Ultra GPUs across global datacenters</li>
<li style="font-weight:400;">OpenAI and Microsoft are already using these clusters for frontier model development, with the platform becoming the standard for organizations requiring supercomputing-scale AI infrastructure (pricing is not specified in the announcement).</li>
</ul>
<p>59:55 Ryan – “Companies looking for scale – companies with a boatload of money.” </p>
<p>1:00:23 <a href="https://azure.microsoft.com/en-us/updates?id=502004">Generally Available: Near-zero downtime scaling for HA-enabled Azure </a><a href="https://azure.microsoft.com/en-us/updates?id=502004">Database for PostgreSQL servers </a></p>
<ul>
<li style="font-weight:400;"><a href="https://azure.microsoft.com/en-us/products/postgresql/">Azure Database for PostgreSQL</a> servers with high availability can now scale in under 30 seconds compared to the previous 2-10 minute window, reducing downtime by over 90% for database scaling operations.</li>
<li style="font-weight:400;">This feature targets production workloads that require continuous availability during infrastructure changes, particularly benefiting e-commerce platforms, financial services, and SaaS applications that cannot afford extended maintenance windows.</li>
<li style="font-weight:400;">The near-zero downtime scaling works specifically with <a href="https://learn.microsoft.com/en-us/azure/postgresql/flexible-server/how-to-configure-high-availability">HA-enabled PostgreSQL</a> instances, leveraging Azure’s high availability architecture to perform seamless compute and storage scaling without disrupting active connections.</li>
<li style="font-weight:400;">While pricing remains unchanged from standard PostgreSQL rates, the reduced downtime translates to lower operational costs by minimizing revenue loss during scaling events and reducing the need for complex maintenance scheduling.</li>
<li style="font-weight:400;">This enhancement positions <a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=GSWCcql5-pTHt8J0hvbwH3V3dBpEVYof_s7FXMdG5HTZQGJXPBuRVZCjgibM7oPmOllqbvoCqRLHuXzw4xaz3xoMX-s3RyvB4yoytV0Cu31QYhNp7GLRKCtfPoTfSalJ.iuwAdv5NSIUMGBpSq7lZAw&amp;eddgt=9HGyz-2CICqIm_yfOf8Qfw%3D%3D&amp;rut=82443545837ddb76772d56ddb401ab932bbdbb315e9a733d3179c2f36bfb0b22&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De88u5ZIeCW0KX3iLxVs_6cVzVUCUww0b-aRscdVABmcSDu_sXj1FvNWBeQRa5Tf_p759wkazkeFzo4AOTrEQ9FnsMtODWujyYvIhyBpspewU2evhqmqTD6pNTrq4qOMqz2rSCiLRYXUXNQpWyvQbWv1nSfzEQMB0OA2IhnlPorn7DP1jJ1YogA-vZ5zyrFokIqzvVLF02gSYTxQ57GukovJ2ipcyNE-5d_eoJJF0uuqhdGQ3A_ClvT-Ao-bmDnlrYMtDfE8A6x7n62NhCSw1fWZ7KVOQNQx53iVkLQpBl1960Q7SzP3w-5ycbDDHpFkyw13cfkZwLjsr9EbTQUutDloSl2BS8USqeFr_iFYpEecWoKuDn3i1w9RnzhQTwa376n912eFMkmLYMgc5H-xEQLRnofBN29eTaJVJXFV6pcPWUOqxMpwEJhW--2bNpIorw2H3dg-tFIf6k-HhcsngBdvaTzb_2_KlbilbHcQcRxHVz3xBqjXUCgUL6jxHhLajjVTUwwafn0HoUEjqZ3UvOq8tQ11bpydEcYyiAg0m-HeZklufJaMqYVs6m9gsLmCPNn3CGs2PhWG-yXcrwdIvVhE_Zw1aIW4-L4mI0if2dXUkkdCZMfc0tNYEPe5Ly4q65E1nah6ndhsiolho8-eoQlv4DGWbGSzmcjrdNVq5SW0pHKEiWCqI5pfWyP0fL6oOqNVUyODUCOtbdTSshzhTc7v53JKcbKByiolnjrVFlL5H4m9GCAA4bzVV5TVf_gxMZg7VaJJ4RBfNtUe6ikMemVYXLInsc%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0MzklMjZjYW1wJTNkMTY4ODA3JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDE5MjElMjZjcml0ZXJpYWlkJTNka3dkLTc4ODIxODQ2NDc2OTQ2JTNhbG9jLTE5MCUyNmNhbXBhaWduaWQlM2Q1OTAyMzczNTglMjZsb2NwaHklM2Q4MDU0OCUyNmFkZ3JvdXBpZCUzZDEyNjExNDE0ODk3MDI3ODUlMjZjaWQlM2Q3ODgyMTQ0ODU3MjY5NiUyNmtkdiUzZGMlMjZrZXh0JTNkJTI2a3BnJTNkJTI2a3BpZCUzZCUyNnF1ZXJ5U3RyJTNkQXp1cmUlMjUyMFBvc3RncmVTUUwlMjZ1cmwlM2RodHRwcyUzYSUyZiUyZmF6dXJlLm1pY3Jvc29mdC5jb20lMmZlbi11cyUyZnByb2R1Y3RzJTJmcG9zdGdyZXNxbCUyZiUzZmVmX2lkJTNkX2tfZTE3NzAxZWY1MDc4MTU0NDZkZWU2OGU1YmM1MDk5ZTVfa18lMjZPQ0lEJTNkQUlEY21tNWVkc3dkdXVfU0VNX19rX2UxNzcwMWVmNTA3ODE1NDQ2ZGVlNjhlNWJjNTA5OWU1X2tfJTI2bXNjbGtpZCUzZGUxNzcwMWVmNTA3ODE1NDQ2ZGVlNjhlNWJjNTA5OWU1%26rlid%3De17701ef507815446dee68e5bc5099e5&amp;vqd=4-325821295192677621425969527632279731022&amp;iurl=%7B1%7DIG%3D876A78F95500421B812CB0FC57E63DED%26CID%3D0C02B6A4B1546B572DA4A03FB0EE6A34%26ID%3DDevEx%2C5046.1">Azure PostgreSQL</a> competitively against <a href="https://aws.amazon.com/rds/">AWS RDS</a> and <a href="https://cloud.google.com/sql">Google Cloud SQL</a>, which still require longer downtime windows for similar scaling operations on their managed PostgreSQL offerings.</li>
</ul>
<p>1:01:16 Matt – “They’ve had this for forever on Azure SQL, which is their Microsoft SQL platform, so it doesn’t surprise me. It surprised me more that this was already a two-to-10-minute window to scale. Seems crazy for a production HA service.”</p>
<p>1:02:10 <a href="https://blog.fabric.microsoft.com/en-GB/blog/onelake-apis-bring-your-apps-and-build-new-ones-with-familiar-blob-and-adls-apis/">OneLake APIs: Bring your apps and build new ones with familiar Blob and </a><a href="https://blog.fabric.microsoft.com/en-GB/blog/onelake-apis-bring-your-apps-and-build-new-ones-with-familiar-blob-and-adls-apis/">ADLS APIs | Microsoft Fabric Blog | Microsoft Fabric</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/fabric/onelake/onelake-overview">OneLake</a> now supports <a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=lmjcuEp4DOWN0XlzUgJheRw9RUv9vLnq273uGOoAziYV8JEn9kiFBLb-Hthy1O6Sjvv2ZoGqdjOgP2cNihsOH5SxFz_RnuPH3p945ztFyXhpE-ZyYWx8mqlxfy3-5NOh.0BrHw0NpcnlbVPqUHxQaZA&amp;eddgt=AZUqFtpJoI-OFglmFMfTSQ%3D%3D&amp;rut=daa2c84fbe774d47c8cd6a3079c7484b612a79851a56c500cdf362a0830960e5&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8biP4xGzqXEnBvJGPV0JhujVUCUzajqIqxTZZzLxB-980YB5ynkUalWCAeeYKFw1WgBqsKgIyLBe5mKSk-LwxLT5MvZmnskUeci2FGcnLqGqJ5UynavmTW5Lay5cVxaJVHXJtbZQ9-kxDGEdZCzivWMYbhVvBMKJS0C3QqJMSyLzHUGOZAUy7qE5CIQ5podAkCM7bL6XEn-iLxwMfVjmGsUmqg19qnDwbXNF4G4a4DWdBB-Z-DoclGKmvPsVa9Kag2LF7DAy5i3OlA0pMlIhKBoOfFP8Otmx-RfLAKaj3S9rt3432kQptnsK_4mLiZkJzhE6FwLjXJK-t7ZfejLc7cB_W1UCOxCEIgiWPRnCAwB3oDTWGgL1obxsNd9fX_NwB57e_uhMJ29iD6-qd0oAB2JOyb9uQaP4F5stAP9N3YirH69e2XMdbMYlgkbsFbQN6mWqawtnwnvk-70TTdkF2oOyMR11nXBj5uERNb_BeDOeDwAaweQiId6_FqbJmpsx9qSfrY7d_vHPgt3yWqijlLAe76tL18zCVi2Rlzer0W06DHxqVbROYJpdw_pBZg3E2H1bUojK3pfM2Lww5zofCHycSj13SV0tLJh38k0mGZJILER5J8EXbQ_95MfLfFZ2QlKVNSrn-XNZwqvfyRZJ-G2SLR-gULO8h_OTEKKV60atZJulehJQybtRiLHxonmNhTMf-tnQvO0c1CTimLEQ6OJCr1pqCm8HyAUt5d-cODoCT0Ut0-x_LDTOz7KlibTDWWhpeRAwFuIVboau9j9zR6wgPaJ4%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0MzklMjZjYW1wJTNkMTY4ODA3JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDE5MjElMjZjcml0ZXJpYWlkJTNka3dkLTc5NzgzOTE4OTkxODg4JTNhbG9jLTE5MCUyNmNhbXBhaWduaWQlM2Q1OTAyMzczNTglMjZsb2NwaHklM2Q4MDU2MCUyNmFkZ3JvdXBpZCUzZDEyNzY1MzQ2NTI2ODc4NTUlMjZjaWQlM2Q3OTc4MzUyMDQyNTU4OSUyNmtkdiUzZGMlMjZrZXh0JTNkJTI2a3BnJTNkJTI2a3BpZCUzZCUyNnF1ZXJ5U3RyJTNkQXp1cmUlMjUyMEJsb2IlMjUyMFN0b3JhZ2UlMjZ1cmwlM2RodHRwcyUzYSUyZiUyZmF6dXJlLm1pY3Jvc29mdC5jb20lMmZlbi11cyUyZnByb2R1Y3RzJTJmc3RvcmFnZSUyZmJsb2JzJTJmJTNmZWZfaWQlM2Rfa19jYWUzZDIyY2E3ZDUxZTA2NTFlYWJiMjY0YTU1ZGM5N19rXyUyNk9DSUQlM2RBSURjbW01ZWRzd2R1dV9TRU1fX2tfY2FlM2QyMmNhN2Q1MWUwNjUxZWFiYjI2NGE1NWRjOTdfa18lMjZtc2Nsa2lkJTNkY2FlM2QyMmNhN2Q1MWUwNjUxZWFiYjI2NGE1NWRjOTc%26rlid%3Dcae3d22ca7d51e0651eabb264a55dc97&amp;vqd=4-134709429749097944832056379083226239425&amp;iurl=%7B1%7DIG%3D7C9779127ECA4D788821D3B578291F86%26CID%3D2817625679046D2C17E974CD78986CB5%26ID%3DDevEx%2C5046.1">Azure Blob Storage</a> and <a href="https://learn.microsoft.com/en-us/rest/api/storageservices/data-lake-storage-gen2">ADLS APIs</a>, allowing existing applications to connect to Microsoft Fabric’s unified data lake without code changes – just swap endpoints to onelake.dfs.fabric.microsoft.com or <a href="http://onelake.blob.fabric.microsoft.com">onelake.blob.fabric.microsoft.com</a>. What could go wrong? </li>
<li style="font-weight:400;">This API compatibility eliminates migration barriers for organizations with existing Azure Storage investments, enabling immediate use of tools like <a href="https://azure.microsoft.com/products/storage/storage-explorer/">Azure Storage Explorer</a> with OneLake while preserving existing scripts and workflows</li>
<li style="font-weight:400;">The feature targets enterprises looking to consolidate data lakes without rewriting applications, particularly those using C# SDKs or requiring DFS operations for hierarchical data management</li>
<li style="font-weight:400;">Microsoft provides an end-to-end guide demonstrating open mirroring to replicate on-premises data to <a href="https://learn.microsoft.com/en-us/fabric/onelake/table-apis/delta-table-apis-overview">OneLake Delta tables</a>, positioning this as a bridge between traditional storage and Fabric’s analytics ecosystem</li>
<li style="font-weight:400;">No specific pricing mentioned for <a href="https://learn.microsoft.com/fabric/onelake/onelake-apis-in-action">OneLake API access</a> – costs likely follow standard Fabric capacity pricing model based on compute and storage consumption</li>
</ul>
<h2>Cloud Journey </h2>
<p>1:03:47 <a href="https://www.infoworld.com/article/4064273/8-platform-engineering-anti-patterns.html">8 platform engineering anti-patterns | InfoWorld</a></p>
<ul>
<li style="font-weight:400;">Platform engineering initiatives are failing at an alarming rate because teams treat the visual portal as the entire platform rather than building solid backend APIs and orchestration first. The 2024 DORA Report found that dedicated platform engineering teams actually decreased throughput by 8% and change stability by 14%, showing that implementation mistakes have serious downstream consequences.</li>
<li style="font-weight:400;">The biggest mistake organizations make is copying approaches from large companies like Spotify without considering ROI for their scale. Mid-size companies invest the same effort as enterprises with thousands of developers but see minimal returns, making reference architectures often impractical for solving real infrastructure abstraction challenges.</li>
<li style="font-weight:400;">Successful platform adoption requires shared ownership where developers can contribute plugins and customizations rather than top-down mandates. Spotify achieves 100% employee adoption of their internal Backstage by allowing engineers to build their own plugins like Soundcheck, proving that developer autonomy drives platform usage.</li>
<li style="font-weight:400;">Organizations must survey specific user subsets because Java developers, QA testers, and SREs have completely different requirements from an internal developer platform. Tracking surface metrics like onboarded users misses the point when platforms should measurably improve time to market, reduce costs, and increase innovation rather than just showing DORA metrics.</li>
<li style="font-weight:400;">Simply rebranding operations teams as platform engineering without a cultural shift and product mindset creates more toil than it reduces. Platforms need to be treated as products requiring continuous improvement, user research, internal marketing, and incremental development, starting with basic CI/CD touchpoints rather than attempting to solve every problem on day one.</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2193961/c1e-2okobmzno9sm26j0-pkvndrpwfo07-cjctc5.mp3" length="100786816"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 328 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Matt are on board today to bring you all the latest news in cloud and AI, including secret regions (this one has the aliens), ongoing discussions between Microsoft and OpenAI, and updates to Nova, SQL, and OneLake -and even the latest installment of Cloud Journeys.  Let’s get started! 
Titles we almost went with this week

 CloudWatch’s New Feature: Because Nobody Likes Writing Incident Reports at 3 AM
 DNS: Did Not Survive – The Great US-EAST-1 Outage of 2025
 404 DevOps Not Found: The AWS Automation Adventure mk
 When Your DevOps Team Gets Replaced by AI and Then Everything Crashes
 Database Migrations Get the ChatGPT Treatment: Just Vibe Your Schema Changes
 AWS DevOps Team Gets the AI Treatment: 40% Fewer Humans, 100% More Questions
 Breaking Up is Hard to Compute: Microsoft and OpenAI Redefine Their Relationship
 AWS Goes Full Scope: Now Tracking Your Cloud’s Carbon from Cradle to Gate
 Platform Engineering: When Your Golden Path Leads to a Dead End
 DynamoDB’s DNS Disaster: How a Race Condition Raced Through AWS
 AI Takes Over AWS DevOps Jobs, Servers Take Unscheduled Vacation
 PostgreSQL Scaling Gets a 30-Second Makeover While AWS Takes a Coffee Break
 The Domino Effect: When DynamoDB Drops, Everything Drops
 RAG to Riches: Amazon Nova Learns to Cite Its Sources
 AWS Finally Tells You When Your EC2 Instance Can’t Keep Up With Your Storage Ambitions
 AWS Nova Gets Grounded: No More Hallucinating About Reality
 One API to Rule Them All: OneLake’s Storage Compatibility Play
 OpenAI gets to pay Alimony
 Database schema deployments are totally a vibe
 AWS will tell you how not green you are today, now in 3 scopes

General News 
02:00 DDoS in September | Fastly

Fastly‘s September DDoS report reveals a notable 15.5 million requests per second attack that lasted over an hour, demonstrating how modern application-layer attacks can sustain extreme throughput with real HTTP requests rather than simple pings or amplification techniques.
Attack volume in September dropped to 61% of August levels, with data suggesting a correlation between school schedules and attack frequency: lower volumes coincide with school breaks, while higher volumes occur when schools are in session.
Media & Entertainment companies faced the highest median attack sizes, followed by Education and High Technology sectors, with 71% of September’s peak attack day attributed to a single enterprise media company.
The sustained 15 million RPS attack originated from a single cloud-provider ASN, using sophisticated daemons that mimicked browser behavior, making detection more challenging than typical DDoS patterns.
Organizations should evaluate whether their incident response runbooks can handle hour-long attacks at 15+ million RPS, as these sustained high-throughput attacks require automated mitigation rather than manual intervention.
Listen, we’re not inviting a DDoS attack, but also…we’ll just turn off the website, so there’s that. 

AI Is Going Great – Or How ML Makes Money
04:41 Google AI Studio updates: More control, less friction

Google AI Studio introduces “vibe coding” – a new AI-powered develo...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2193961/c1a-k5d5-okj07910uvjk-mda6zn.jpg"></itunes:image>
                                                                            <itunes:duration>01:24:00</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2193961/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[327: AWS Finally Admits Kubernetes is Hard, Makes Robots Do It Instead]]>
                </title>
                <pubDate>Thu, 30 Oct 2025 00:26:49 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2175583</guid>
                                    <link>https://tcpfm.castos.com/episodes/327-aws-finally-admits-kubernetes-is-hard-makes-robots-do-it-instead</link>
                                <description>
                                            <![CDATA[<h3>Welcome to episode 327 of The Cloud Pod, where the forecast is always cloudy! Justin, Matt, and Ryan are here to bring you all the latest news (and a few rants) in the worlds of Cloud and AI. I’m sure all our readers are aware of the AWS outage last week, as it was in all the news everywhere. But we’ve also got some new AI models (including Sora in case you’re low on really crappy videos the youths might like), plus EKS, Kubernetes, Vertex AI, and more. Let’s get started! </h3>
<h3>Titles we almost went with this week</h3>
<ul>
<li> Oracle and Azure Walk Into a Cloud Bar: Nobody Gets ETL’d</li>
<li> When DNS Goes Down, So Does Your Monday: AWS Takes Half the Internet on a Coffee Break</li>
<li> 404 Cloud Not Found: AWS Proves Even the Internet’s Phone Book Can Get Lost</li>
<li> DNS: Definitely Not Staffed – How AWS Lost Its Way When It Lost Its People</li>
<li> When Larry Met Satya: A Cloud Love Story</li>
<li> Azure Finally Answers ‘Dude, Where’s My Data?’ with Storage Discovery</li>
<li> Breaking: Microsoft Discovers AI Training Uses More Power Than a Small Country</li>
<li> 404 Engineers Not Found – AWS Learns the Hard Way That People Are Its Most Critical Infrastructure</li>
<li> Azure Storage Discovery: Finding Your Data Needles in the Cloud Haystack</li>
<li> EKS Auto Mode: Because Even Your Clusters Deserve Cruise Control</li>
<li>Azure Gets Reel: Microsoft Adds Video Generation to AI Foundry</li>
<li> The Great Token Heist: Vertex AI Steals 90% Off Your Gemini Bills</li>
<li> Cache Me If You Can: Vertex AI’s Token-Saving Feature</li>
<li> IaC Just Got a Manager – And It’s Not Your Boss </li>
<li> From Musk to Microsoft: Grok 4 Makes the Great Cloud Migration</li>
<li> No Harness.. You are not going to make IACM happen</li>
<li> Microsoft Drafts a Solution to Container Creation Chaos</li>
<li> PowerShell to the People: Azure Simplifies the Great Gateway Migration</li>
<li> IP There Yet? Azure’s Scripts Keep Your Address While You Upgrade</li>
</ul>
<h2>Follow Up</h2>
<p>00:53 Glacier Deprecation Email</p>
<ul>
<li style="font-weight:400;">Standalone <a href="https://www.google.com/aclk?sa=L&amp;ai=DChsSEwiJpNXOqciQAxXEMUQIHWq7CcYYACICCAEQABoCZHo&amp;ae=2&amp;co=1&amp;gclid=CjwKCAjw04HIBhB8EiwA8jGNbZpjiDljWhLvdfpOoUheFZ93MUVIFkYwFJo76XdV-rOeNR0yaP-G_hoCMV0QAvD_BwE&amp;cid=CAASlwHkaCDOZWpDhOU_nIDXIuIlztOmD1FPPwdt-WGRCPxRjE77owu-sBW219fUMurgwl1laRG91A5qarS9Ms8w0HvTcf9ZsFgs2mJHPrPxgN1njD5VD0n-VGYsmveGHLFzW8L169tqux_f1sYUYaNNV3YRBQycwR60fZPbUMz1gJpJjxEO2hjVm2MjBAiF1rlHbgq61qqG0pk0&amp;cce=2&amp;sig=AOD64_06wnqZPVtdTTB9Zk51k3qRWbc2zA&amp;q&amp;adurl&amp;ved=2ahUKEwjWrtDOqciQAxVSJEQIHesNF6kQ0Qx6BAgZEAE">Amazon Glacier</a> service (vault-based with separate APIs) will stop accepting new customers as of December 15, 2025. </li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/s3/storage-classes/glacier/">S3 Glacier storage classes</a> (Instant Retrieval, Flexible Retrieval, Deep Archive) are completely unaffected and continue normally</li>
<li style="font-weight:400;">Existing Glacier customers can keep using it forever – no forced migration required. </li>
<li style="font-weight:400;">AWS is essentially consolidating around S3 as the unified storage platform, rather than maintaining two separate archival services.</li>
<li style="font-weight:400;">The standalone service will enter maintenance mode, meaning there will be no new features, but the service will remain operational.</li>
<li style="font-weight:400;">Migration to S3 Glacier is optional but recommended for better integration, lower costs, and more features. (Justin assures us it is actually slightly cheaper, so there’s that.) </li>
</ul>
<h2>General News </h2>
<p>02:24 <a href="https://www.geekwire.com/2025/f5-discloses-major-security-breach-linked-to-nation-state-hackers/?utm_source=GeekWire+Newsletters&amp;utm_campaign=2615c13f30-daily-digest-email&amp;utm_medium=email&amp;utm_term=0_4e93fc7dfd-2615c13f30-233353605&amp;mc_cid=2615c13f30&amp;mc_eid=0..."></a></p>
<h3>Chapters</h3>
<ul><li>(00:00:00) - Azure vs. GCP</li><li>(00:00:59) - Amazon's Glacier Storage Deprecation, and More</li><li>(00:02:33) - Big IP Software Breach: Worrisome</li><li>(00:04:56) - Claude Code Gets a Web Version</li><li>(00:11:45) - Infrastructure as Code Management: Annoying Sales Pitch</li><li>(00:14:26) - AWS: US East 1 Outage Causes Chaos</li><li>(00:23:17) - EC2 Capacity Manager</li><li>(00:25:39) - EC2 Auto-Mode for Kubernetes 1.29</li><li>(00:28:44) - Amazon. EC2: CPU Optimization for License Included Instances</li><li>(00:30:55) - AWS Systems Manager Patch Manager: Improved Security Protection</li><li>(00:35:14) - Amazon ECS CLI Agent Orchestrator</li><li>(00:40:37) - Google Cloud: BigQuery Update, New GPUs</li><li>(00:46:11) - Google Cloud: Management of Suences in Vertex & AI SDK</li><li>(00:47:58) - Gemini Code Assist on GitHub Enterprise</li><li>(00:52:09) - Vertex AI Context Caching</li><li>(00:54:25) - Cloud Armor Announces New Features</li><li>(00:57:05) - Microsoft Firewall: New Capacity Metric</li><li>(00:59:55) - Microsoft's Azure API Management introduces carbon aware features</li><li>(01:04:14) - Azure Storage Discovery</li><li>(01:07:45) - Two new AI models available in Azure AI Foundry</li><li>(01:08:54) - Azure: Application Gateway V1 to V2 Migration Scripts</li><li>(01:12:43) - Oracle's AI Agent Studio Expands</li><li>(01:14:05) - Week in the Cloud</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 327 of The Cloud Pod, where the forecast is always cloudy! Justin, Matt, and Ryan are here to bring you all the latest news (and a few rants) in the worlds of Cloud and AI. I’m sure all our readers are aware of the AWS outage last week, as it was in all the news everywhere. But we’ve also got some new AI models (including Sora in case you’re low on really crappy videos the youths might like), plus EKS, Kubernetes, Vertex AI, and more. Let’s get started! 
Titles we almost went with this week

 Oracle and Azure Walk Into a Cloud Bar: Nobody Gets ETL’d
 When DNS Goes Down, So Does Your Monday: AWS Takes Half the Internet on a Coffee Break
 404 Cloud Not Found: AWS Proves Even the Internet’s Phone Book Can Get Lost
 DNS: Definitely Not Staffed – How AWS Lost Its Way When It Lost Its People
 When Larry Met Satya: A Cloud Love Story
 Azure Finally Answers ‘Dude, Where’s My Data?’ with Storage Discovery
 Breaking: Microsoft Discovers AI Training Uses More Power Than a Small Country
 404 Engineers Not Found – AWS Learns the Hard Way That People Are Its Most Critical Infrastructure
 Azure Storage Discovery: Finding Your Data Needles in the Cloud Haystack
 EKS Auto Mode: Because Even Your Clusters Deserve Cruise Control
Azure Gets Reel: Microsoft Adds Video Generation to AI Foundry
 The Great Token Heist: Vertex AI Steals 90% Off Your Gemini Bills
 Cache Me If You Can: Vertex AI’s Token-Saving Feature
 IaC Just Got a Manager – And It’s Not Your Boss 
 From Musk to Microsoft: Grok 4 Makes the Great Cloud Migration
 No Harness.. You are not going to make IACM happen
 Microsoft Drafts a Solution to Container Creation Chaos
 PowerShell to the People: Azure Simplifies the Great Gateway Migration
 IP There Yet? Azure’s Scripts Keep Your Address While You Upgrade

Follow Up
00:53 Glacier Deprecation Email

Standalone Amazon Glacier service (vault-based with separate APIs) will stop accepting new customers as of December 15, 2025. 
S3 Glacier storage classes (Instant Retrieval, Flexible Retrieval, Deep Archive) are completely unaffected and continue normally
Existing Glacier customers can keep using it forever – no forced migration required. 
AWS is essentially consolidating around S3 as the unified storage platform, rather than maintaining two separate archival services.
The standalone service will enter maintenance mode, meaning there will be no new features, but the service will remain operational.
Migration to S3 Glacier is optional but recommended for better integration, lower costs, and more features. (Justin assures us it is actually slightly cheaper, so there’s that.) 

General News 
02:24 ]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[327: AWS Finally Admits Kubernetes is Hard, Makes Robots Do It Instead]]>
                </itunes:title>
                                    <itunes:episode>327</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<h3>Welcome to episode 327 of The Cloud Pod, where the forecast is always cloudy! Justin, Matt, and Ryan are here to bring you all the latest news (and a few rants) in the worlds of Cloud and AI. I’m sure all our readers are aware of the AWS outage last week, as it was in all the news everywhere. But we’ve also got some new AI models (including Sora in case you’re low on really crappy videos the youths might like), plus EKS, Kubernetes, Vertex AI, and more. Let’s get started! </h3>
<h3>Titles we almost went with this week</h3>
<ul>
<li> Oracle and Azure Walk Into a Cloud Bar: Nobody Gets ETL’d</li>
<li> When DNS Goes Down, So Does Your Monday: AWS Takes Half the Internet on a Coffee Break</li>
<li> 404 Cloud Not Found: AWS Proves Even the Internet’s Phone Book Can Get Lost</li>
<li> DNS: Definitely Not Staffed – How AWS Lost Its Way When It Lost Its People</li>
<li> When Larry Met Satya: A Cloud Love Story</li>
<li> Azure Finally Answers ‘Dude, Where’s My Data?’ with Storage Discovery</li>
<li> Breaking: Microsoft Discovers AI Training Uses More Power Than a Small Country</li>
<li> 404 Engineers Not Found – AWS Learns the Hard Way That People Are Its Most Critical Infrastructure</li>
<li> Azure Storage Discovery: Finding Your Data Needles in the Cloud Haystack</li>
<li> EKS Auto Mode: Because Even Your Clusters Deserve Cruise Control</li>
<li>Azure Gets Reel: Microsoft Adds Video Generation to AI Foundry</li>
<li> The Great Token Heist: Vertex AI Steals 90% Off Your Gemini Bills</li>
<li> Cache Me If You Can: Vertex AI’s Token-Saving Feature</li>
<li> IaC Just Got a Manager – And It’s Not Your Boss </li>
<li> From Musk to Microsoft: Grok 4 Makes the Great Cloud Migration</li>
<li> No Harness.. You are not going to make IACM happen</li>
<li> Microsoft Drafts a Solution to Container Creation Chaos</li>
<li> PowerShell to the People: Azure Simplifies the Great Gateway Migration</li>
<li> IP There Yet? Azure’s Scripts Keep Your Address While You Upgrade</li>
</ul>
<h2>Follow Up</h2>
<p>00:53 Glacier Deprecation Email</p>
<ul>
<li style="font-weight:400;">Standalone <a href="https://www.google.com/aclk?sa=L&amp;ai=DChsSEwiJpNXOqciQAxXEMUQIHWq7CcYYACICCAEQABoCZHo&amp;ae=2&amp;co=1&amp;gclid=CjwKCAjw04HIBhB8EiwA8jGNbZpjiDljWhLvdfpOoUheFZ93MUVIFkYwFJo76XdV-rOeNR0yaP-G_hoCMV0QAvD_BwE&amp;cid=CAASlwHkaCDOZWpDhOU_nIDXIuIlztOmD1FPPwdt-WGRCPxRjE77owu-sBW219fUMurgwl1laRG91A5qarS9Ms8w0HvTcf9ZsFgs2mJHPrPxgN1njD5VD0n-VGYsmveGHLFzW8L169tqux_f1sYUYaNNV3YRBQycwR60fZPbUMz1gJpJjxEO2hjVm2MjBAiF1rlHbgq61qqG0pk0&amp;cce=2&amp;sig=AOD64_06wnqZPVtdTTB9Zk51k3qRWbc2zA&amp;q&amp;adurl&amp;ved=2ahUKEwjWrtDOqciQAxVSJEQIHesNF6kQ0Qx6BAgZEAE">Amazon Glacier</a> service (vault-based with separate APIs) will stop accepting new customers as of December 15, 2025. </li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/s3/storage-classes/glacier/">S3 Glacier storage classes</a> (Instant Retrieval, Flexible Retrieval, Deep Archive) are completely unaffected and continue normally</li>
<li style="font-weight:400;">Existing Glacier customers can keep using it forever – no forced migration required. </li>
<li style="font-weight:400;">AWS is essentially consolidating around S3 as the unified storage platform, rather than maintaining two separate archival services.</li>
<li style="font-weight:400;">The standalone service will enter maintenance mode, meaning there will be no new features, but the service will remain operational.</li>
<li style="font-weight:400;">Migration to S3 Glacier is optional but recommended for better integration, lower costs, and more features. (Justin assures us it is actually slightly cheaper, so there’s that.) </li>
</ul>
<h2>General News </h2>
<p>02:24 <a href="https://www.geekwire.com/2025/f5-discloses-major-security-breach-linked-to-nation-state-hackers/?utm_source=GeekWire+Newsletters&amp;utm_campaign=2615c13f30-daily-digest-email&amp;utm_medium=email&amp;utm_term=0_4e93fc7dfd-2615c13f30-233353605&amp;mc_cid=2615c13f30&amp;mc_eid=04fad859c0">F5 discloses major security breach linked to nation-state hackers – </a><a href="https://www.geekwire.com/2025/f5-discloses-major-security-breach-linked-to-nation-state-hackers/?utm_source=GeekWire+Newsletters&amp;utm_campaign=2615c13f30-daily-digest-email&amp;utm_medium=email&amp;utm_term=0_4e93fc7dfd-2615c13f30-233353605&amp;mc_cid=2615c13f30&amp;mc_eid=04fad859c0">GeekWire</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.f5.com/">F5</a> disclosed that <a href="https://www.securityweek.com/chinese-hackers-leveraged-legacy-f5-big-ip-appliance-for-persistence/">nation-state hackers maintained persistent access to their internal systems over the summer of 2024</a>, stealing portions of BIG-IP source code and vulnerability details before containment in August.</li>
<li style="font-weight:400;">The breach compromised product development and engineering systems, but did not affect customer CRM data, financial systems, or F5’s software supply chain, according to independent security audits.</li>
<li style="font-weight:400;">F5 has released security patches for BIG-IP, F5OS, and BIG-IP Next products and is providing threat-hunting guides to help customers monitor for suspicious activity.</li>
<li style="font-weight:400;">This represents the first publicly disclosed breach of F5’s internal systems, notable given that F5 handles traffic for 80% of Fortune Global 500 companies through its load-balancing and security services.</li>
<li style="font-weight:400;">The incident highlights supply chain security concerns, as attackers targeted source code and vulnerability information, rather than customer data, potentially seeking ways to exploit F5 products deployed across enterprise networks.</li>
</ul>
<p>03:12  Justin – “A little concerning on this one, mostly because F5 is EVERYWHERE.” </p>
<h2>AI is Going Great – Or How ML Makes Money </h2>
<p>04:55 <a href="https://arstechnica.com/ai/2025/10/claude-code-gets-a-web-version-but-its-the-new-sandboxing-that-really-matters/">Claude Code gets a web version—but it’s the new sandboxing that really </a><a href="https://arstechnica.com/ai/2025/10/claude-code-gets-a-web-version-but-its-the-new-sandboxing-that-really-matters/">matters – Ars Technica</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/">Anthropic</a> launched<a href="https://www.anthropic.com/news/claude-code-on-the-web"> web and mobile interfaces</a> for <a href="https://www.claude.com/product/claude-code">Claude Code</a>, their CLI-based AI coding assistant, with the web version supporting direct access to GitHub repositories and the ability to process general instructions, such as “add real-time inventory tracking to the dashboard.”</li>
<li style="font-weight:400;">The web interface introduces multi-session support, allowing developers to run and switch between multiple coding sessions simultaneously through a left-side panel, plus the ability to provide mid-task corrections without canceling and restarting</li>
<li style="font-weight:400;">A new <a href="https://docs.claude.com/en/docs/claude-code/sandboxing'">sandboxing runtime</a> has been implemented to improve security and reduce friction, moving away from the previous approach where Claude Code required permission for most changes and steps during execution</li>
<li style="font-weight:400;">The mobile version is currently limited to iOS and is in an earlier development stage compared to the web interface, indicating a phased rollout approach</li>
<li style="font-weight:400;">This positions Claude Code as a more accessible alternative to traditional CLI-only AI coding tools, potentially expanding its reach to developers who prefer web-based interfaces over command-line environments</li>
</ul>
<p>05:51  Ryan – “I haven’t had a chance to play with the web version, but I am interested in it just because I found the terminal interface limiting, but I also feel like a lot of the value is in that local sort of execution and not in the sandbox. A lot of the tasks I do are internal and require access to either company resources or private networks, or the kind of thing where you’re not going to get that from a publicly hosted sandbox environment.”</p>
<p>08:36 <a href="https://azure.microsoft.com/en-us/updates?id=503268">Open Source: Containerization Assist MCP Server </a></p>
<ul>
<li style="font-weight:400;"><a href="https://github.com/Azure/containerization-assist">Containerization Assist</a> automates the tedious process of creating Dockerfiles and Kubernetes manifests, eliminating manual errors that plague developers during the containerization process</li>
<li style="font-weight:400;">Built on <a href="https://learn.microsoft.com/en-us/azure/aks/draft">AKS Draft’s</a> proven foundation, this open-source tool goes beyond basic AI coding assistants by providing a complete containerization platform rather than just code suggestions.</li>
<li style="font-weight:400;">The tool addresses a critical pain point where developers waste hours writing boilerplate container configurations and debugging deployment issues caused by manual mistakes. (Listener beware, Justin mini rant here.) </li>
<li style="font-weight:400;">As an open-source MCP (Model Context Protocol) server, it integrates seamlessly with existing development workflows while leveraging Microsoft’s containerization expertise from Azure Kubernetes Service. (Expertise is a stretch.) </li>
<li style="font-weight:400;">This launch signals Microsoft’s commitment to simplifying Kubernetes adoption by removing the steep learning curve associated with container orchestration and manifest creation – or you could just use a pass. </li>
</ul>
<p>09:47  Matt – “The piece I did like about this is that it integrated in as an optional feature, kind of the trivia and the security thing. So it’s not just setting it up, but they integrated the next steps of security code scanning. It’s not Microsoft saying, you know, hey, it’s standard … they are building security in, hopefully.”</p>
<h2>Cloud Tools </h2>
<p>33:09 <a href="https://www.harness.io/blog/meet-iacm">IaC is Great, But Have You Met IaCM?</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.harness.io/tag/infrastructure-as-code-management/?tag=true">IaCM (Infrastructure as Code Management)</a> extends traditional IaC by adding lifecycle management capabilities, including state management, policy enforcement, and drift detection to handle the complexity of infrastructure at scale.</li>
<li style="font-weight:400;">Key features include centralized state file management with version control, module and provider registries for reusable components, and automated policy enforcement to ensure compliance without slowing down teams.</li>
<li style="font-weight:400;">The platform integrates directly into <a href="https://www.harness.io/tag/continuous-integration/?tag=true">CI</a>/<a href="https://www.harness.io/tag/continuous-delivery/?tag=true">CD</a> workflows with visual PR insights showing cost estimates and infrastructure changes before deployment, solving the problem of unexpected costs and configuration conflicts.</li>
<li style="font-weight:400;">IaCM addresses critical pain points like configuration drift, secret exposure in state files, and resource conflicts when multiple teams work on the same infrastructure simultaneously.</li>
<li style="font-weight:400;">Harness IaCM specifically supports <a href="https://opentofu.org/">OpenTofu</a> and <a href="https://developer.hashicorp.com/terraform">Terraform</a> with features like Variable Sets, Workspace Templates, and Default Pipelines to standardize infrastructure delivery across organizations.</li>
</ul>
<p>13:04  Justin – “So let me boil this down for you. We created our own Terraform Enterprise or Terraform Cloud, but we can’t use that name because it’s copyrighted. So we’re going to try to create a new thing and pretend we invented this – and then try to sell it to you as our new Terraform or OpenTofu replacement for your management tier.”</p>
<h2>HugOps Corner – Previously Known as AWS </h2>
<p>41:08 <a href="https://www.geekwire.com/2025/aws-outage-hits-major-apps-and-services-resurfacing-old-questions-about-cloud-redundancy">AWS outage hits major apps and services, resurfacing old questions about </a></p>
<p><a href="https://www.geekwire.com/2025/aws-outage-hits-major-apps-and-services-resurfacing-old-questions-about-cloud-redundancy">cloud redundancy – GeekWire</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/global-infrastructure/latest/regions/aws-regions.html">AWS US-EAST-1</a> experienced a major outage starting after midnight Pacific on Monday, caused by DNS resolution issues with <a href="https://aws.amazon.com/dynamodb/">DynamoDB</a> that prevented proper address lookup for database services, impacting thousands of applications, including <a href="https://www.facebook.com/">Facebook</a>, <a href="https://www.snapchat.com/">Snapchat</a>, <a href="https://www.coinbase.com/">Coinbase</a>, <a href="https://chatgpt.com/">ChatGPT</a>, and Amazon’s own services.</li>
<li style="font-weight:400;">The outage highlighted ongoing redundancy concerns as many organizations failed to implement proper failover to other regions or cloud providers, despite similar incidents in US-EAST-1 in 2017, 2021, and 2023, raising questions about single-region dependency for critical infrastructure.</li>
<li style="font-weight:400;">AWS identified the root cause as an internal subsystem responsible for monitoring network load balancer health, with core DNS issues resolved by 3:35 AM Pacific, though <a href="https://lambda.ai/">Lambda</a> backlog processing and EC2 instance launch errors persisted through the morning recovery period.</li>
<li style="font-weight:400;">Real-world impacts included<a href="https://www.laguardiaairport.com/"> LaGuardia Airport</a> check-in kiosk failures, causing passenger lines, widespread disruption to financial services (<a href="https://venmo.com/">Venmo</a>, <a href="https://robinhood.com/us/en/">Robinhood</a>), gaming platforms (<a href="https://www.roblox.com/">Roblox</a>, <a href="https://www.fortnite.com/?lang=en-US">Fortnite</a>), and productivity tools (<a href="https://slack.com/">Slack</a>, <a href="https://www.canva.com/">Canva</a>), demonstrating the cascading effects of cloud provider outages.</li>
<li style="font-weight:400;">The incident underscores the importance of multi-region deployment strategies and proper disaster recovery planning for AWS customers, particularly those using US-EAST-1 as their primary region due to its status as AWS’s oldest and largest data center location.</li>
<li style="font-weight:400;">We have a couple of observations: this one took a LONG time to resolve, including hours before the DNS was restored. Maybe they’re out of practice? Maybe it’s a people problem? Hopefully, this isn’t the new norm as some of the talent have been let go/moved on. </li>
</ul>
<p>17:53  Ryan – “If it’s a DNS resolution issue that’s causing a global outage, that’s not exactly straightforward. It’s not just a bug, you know, or a function returning the wrong value, or that you’re looking at global propagation, you’re looking at clients in different places, resolving different things, at the base parts of the internet for functionality. And so it does take a pretty experienced engineer to sort of have that in their heads conceptually in to order to troubleshoot. I wonder if that’s really the cause, where they’re not able to recover as fast. But I also feel like cloud computing has come a long way, and the impact was very widely felt because a lot more people are using AWS as their hosting provider than I think have been in the past. A little bit of everything, I think.”</p>
<p><a href="https://www.geekwire.com/2025/aws-outage-was-not-due-to-a-cyberattack-but-shows-potential-for-far-worse-damage/">AWS outage was not due to a cyberattack — but shows potential for ‘far worse’ </a><a href="https://www.geekwire.com/2025/aws-outage-was-not-due-to-a-cyberattack-but-shows-potential-for-far-worse-damage/">damage – GeekWire</a></p>
<ul>
<li style="font-weight:400;">AWS’s US-EAST-1 region experienced an <a href="https://www.geekwire.com/2025/aws-outage-hits-major-apps-and-services-resurfacing-old-questions-about-cloud-redundancy/">outage</a> due to an internal monitoring subsystem failure affecting network load balancers, impacting major services including Facebook, Coinbase, and LaGuardia Airport check-in systems. </li>
<li style="font-weight:400;">The issue was related to DNS resolution problems with DynamoDB, not a cyberattack.</li>
<li style="font-weight:400;">The incident highlights ongoing single-region dependency issues, as US-EAST-1 remains AWS’s largest region and has caused similar widespread disruptions in 2017, 2021, and 2023. Many organizations still lack proper multi-region failover despite repeated outages from this location.</li>
<li style="font-weight:400;">Industry experts warn that the outage demonstrates vulnerability to potential targeted attacks on cloud infrastructure monoculture. The concentration of services on single providers creates systemic risk similar to agricultural monoculture, where one failure can cascade widely.</li>
<li style="font-weight:400;">The failure occurred at the control-plane level, suggesting AWS should implement more aggressive isolation of critical networking components. This may accelerate enterprise adoption of multi-cloud and multi-region architectures as baseline resilience requirements.</li>
<li style="font-weight:400;">AWS resolved the issue within hours but the incident reinforces that even major cloud providers remain vulnerable to cascading failures when core monitoring and health check systems malfunction, affecting downstream services across their infrastructure.</li>
</ul>
<p><a href="https://www.theregister.com/2025/10/20/aws_outage_amazon_brain_drain_corey_quinn/">Today is when Amazon’s brain drain finally caught up with AWS • The </a><a href="https://www.theregister.com/2025/10/20/aws_outage_amazon_brain_drain_corey_quinn/">Register</a></p>
<ul>
<li style="font-weight:400;">AWS experienced a major outage on October 20, 2025 in US-EAST-1 region caused by DNS resolution failures for DynamoDB endpoints, taking 75 minutes just to identify the root cause and impacting banking, gaming, social media, and government services across much of the internet.</li>
<li style="font-weight:400;">The incident highlights concerns about AWS’s talent retention, with 27,000+ Amazon layoffs between 2022-2025 and internal documents showing 69-81% regretted attrition, suggesting loss of senior engineers who understood complex failure modes and had institutional knowledge of AWS systems.</li>
<li style="font-weight:400;">DynamoDB’s role as a foundational service meant the DNS failure created cascading impacts across multiple AWS services, demonstrating the risk of centralized dependencies in cloud architectures and the importance of regional redundancy for critical workloads.</li>
<li style="font-weight:400;">AWS’s status page showed “all is well” for the first 75 minutes of the outage, continuing a pattern of slow incident communication that AWS has acknowledged as needing improvement in multiple previous post-mortems from 2011, 2012, and 2015.</li>
<li style="font-weight:400;">The article suggests this may be a tipping point where the loss of experienced staff who built these systems is beginning to impact AWS’s legendary operational excellence, with predictions that similar incidents may become more frequent as institutional knowledge continues to leave.</li>
</ul>
<p>-And that’s an end to Hugops. Moving on to the rest of AWS-</p>
<p>23:58 <a href="https://aws.amazon.com/blogs/aws/monitor-analyze-and-manage-capacity-usage-from-a-single-interface-with-amazon-ec2-capacity-manager/">Monitor, analyze, and manage capacity usage from a single interface with \</a><a href="https://aws.amazon.com/blogs/aws/monitor-analyze-and-manage-capacity-usage-from-a-single-interface-with-amazon-ec2-capacity-manager/">Amazon EC2 Capacity Manager | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/ec2/">EC2</a> <a href="https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/capacity-manager.html">Capacity Manager</a> provides a single dashboard to monitor and manage EC2 capacity across all accounts and regions, eliminating the need to collect data from multiple AWS services like <a href="https://aws.amazon.com/aws-cost-management/aws-cost-and-usage-reporting/">Cost and Usage Reports</a>, <a href="https://aws.amazon.com/blogs/aws/category/management-tools/amazon-cloudwatch/">CloudWatch</a>, and EC2 APIs. </li>
<li style="font-weight:400;">Available at no additional cost in all commercial AWS regions.</li>
<li style="font-weight:400;">The service aggregates capacity data with hourly refresh rates for On-Demand Instances, Spot Instances, and Capacity Reservations, displaying utilization metrics by vCPUs, instance counts, or estimated costs based on published On-Demand rates.</li>
<li style="font-weight:400;">Key features include automated identification of underutilized Capacity Reservations with specific utilization percentages by instance type and AZ, plus direct modification capabilities for ODCRs within the same account.</li>
<li style="font-weight:400;">Data exports to S3 extend analytics beyond the 90-day console retention period, enabling long-term capacity trend analysis and integration with existing BI tools or custom reporting systems.</li>
<li style="font-weight:400;">Organizations can enable cross-account visibility through AWS Organizations integration, helping identify optimization opportunities like redistributing reservations between development accounts showing 30% utilization and production accounts exceeding 95%.</li>
</ul>
<p>25:45  Ryan – “This is kind of nice to have it built in and just have it be plug and play – especially when it’s at no cost.” </p>
<p>26:21 <a href="https://aws.amazon.com/blogs/containers/new-amazon-eks-auto-mode-features-for-enhanced-security-network-control-and-performance/">New Amazon EKS Auto Mode features for enhanced security, network </a><a href="https://aws.amazon.com/blogs/containers/new-amazon-eks-auto-mode-features-for-enhanced-security-network-control-and-performance/">control, and performance | Containers</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/eks/auto-mode/">EKS Auto Mod</a>e now supports <a href="https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/capacity-reservation-overview.html">EC2 On-Demand Capacity Reservations</a> and <a href="https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/ec2-capacity-blocks.html">Capacity Blocks for ML</a>, allowing customers to target pre-purchased capacity for AI/ML workloads requiring guaranteed access to specialized instances like P5s. This addresses the challenge of GPU availability for training jobs without over-provisioning.</li>
<li style="font-weight:400;">New networking capabilities include separate pod subnets for isolating infrastructure and application traffic, explicit public IP control for enterprise security compliance, and forward proxy support with custom certificate bundles. These features enable integration with existing enterprise network architectures without complex CNI customizations.</li>
<li style="font-weight:400;">Complete <a href="https://docs.aws.amazon.com/kms/latest/developerguide/overview.html">AWS KMS encryption</a> now covers both ephemeral storage and root volumes using customer-managed keys, addressing security audit findings that previously flagged unencrypted storage. </li>
<li style="font-weight:400;">This eliminates the need for custom AMIs or manual certificate distribution.</li>
<li style="font-weight:400;">Performance improvements include multi-threaded node filtering and intelligent capacity management that can automatically relax instance diversity constraints during capacity shortages. </li>
<li style="font-weight:400;">These optimizations particularly benefit time-sensitive applications and AI/ML workloads requiring rapid scaling.</li>
<li style="font-weight:400;">EKS Auto Mode is available for new clusters or can be enabled on existing <a href="https://aws.amazon.com/blogs/containers/category/compute/amazon-kubernetes-service/">EKS</a> clusters running Kubernetes 1.29+, with migration guides available for teams moving from Managed node groups, <a href="https://karpenter.sh/">Karpenter</a>, or <a href="https://aws.amazon.com/fargate/">Fargate</a>. </li>
<li style="font-weight:400;">Pricing follows standard EKS pricing at $0.10 per cluster per hour plus EC2 instance costs.</li>
</ul>
<p>27:33 Ryan – “This just highlights how terrible it was before.” </p>
<p>29:33 <a href="https://aws.amazon.com/about-aws/whats-new/2025/10/amazon-ec2-optimize-cpu-license-included-instances">Amazon EC2 now supports Optimize CPUs for license-included instances</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/ec2/">EC2</a> now lets customers reduce vCPU counts and disable hyperthreading on Windows Server and SQL Server license-included instances, enabling up to 50% savings on vCPU-based licensing costs while maintaining full memory and IOPS performance.</li>
<li style="font-weight:400;">This feature targets database workloads that need high memory and IOPS but fewer vCPUs – for example, an r7i.8xlarge instance can be reduced from 32 to 16 vCPUs while keeping its 256 GiB memory and 40,000 IOPS.</li>
<li style="font-weight:400;">The CPU optimization extends EC2’s existing Optimize CPUs feature to license-included instances, addressing a common pain point where customers overpay for Microsoft licensing due to fixed vCPU counts.</li>
<li style="font-weight:400;">Available now in all commercial AWS regions and GovCloud regions, with no additional charges beyond the adjusted licensing costs based on the modified vCPU count.</li>
<li style="font-weight:400;">This positions AWS competitively against Azure for SQL Server workloads by offering more granular control over licensing costs, particularly important as organizations migrate legacy database workloads to the cloud.</li>
<li style="font-weight:400;">Interested in CPU options? Check those out <a href="https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/instance-optimize-cpu.html">here</a>. </li>
</ul>
<p>30:20 Justin – “This is a little weird to me, because I thought this already existed.” </p>
<p>31:46 <a href="https://aws.amazon.com/about-aws/whats-new/2025/10/aws-systems-manager-patch-manager-windows/">AWS Systems Manager Patch Manager launches security updates </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/10/aws-systems-manager-patch-manager-windows/">notification for Windows</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/systems-manager/latest/userguide/patch-manager.html">AWS Systems Manager Patch Manager</a> now includes an “AvailableSecurityUpdate” state that identifies Windows security patches available but not yet approved by patch baseline rules, helping prevent accidental exposure from delayed patch approvals.</li>
<li style="font-weight:400;">The feature addresses a specific operational risk where administrators using ApprovalDelay with extended timeframes could unknowingly leave systems vulnerable, with instances marked as Non-Compliant by default when security updates are pending.</li>
<li style="font-weight:400;">Available across all <a href="https://docs.aws.amazon.com/general/latest/gr/ssm.html">AWS Systems Manager regions</a> with no additional charges beyond standard pricing, the feature integrates directly into existing patch baseline configurations through the console at <a href="https://console.aws.amazon.com/systems-manager/patch-manager">https://console.aws.amazon.com/systems-manager/patch-manager.</a></li>
<li style="font-weight:400;">Organizations can customize compliance reporting behavior to maintain existing workflows while gaining visibility into security patch availability across their Windows fleet, particularly useful for enterprises with complex patch approval processes.</li>
<li style="font-weight:400;">The update provides a practical solution for balancing security requirements with operational stability, allowing teams to maintain patch deployment schedules while staying informed about critical security updates awaiting approval.</li>
</ul>
<p>30:20 Ryan – “It sounds like just a quality of life improvement, but it’s something that should be so basic, but isn’t there, right? Which is like Windows patch management is cobbled together and not really managed well, and so you could have a patch available, but the only way to find out that it was available previously to this was to actually go ahead and patch it and then see if it did something. And so now, at least you have a signal on that; you can apply your patches in a way that’s not going to take down your entire service if a patch goes wrong. So this is very nice. I think for people using the Systems Manager patch management, they’re going to be very happy with this.”</p>
<p>35:26 <a href="https://aws.amazon.com/blogs/opensource/introducing-cli-agent-orchestrator-transforming-developer-cli-tools-into-a-multi-agent-powerhouse/">Introducing CLI Agent Orchestrator: Transforming Developer CLI Tools into </a><a href="https://aws.amazon.com/blogs/opensource/introducing-cli-agent-orchestrator-transforming-developer-cli-tools-into-a-multi-agent-powerhouse/">a Multi-Agent Powerhouse | AWS Open Source Blog</a></p>
<ul>
<li style="font-weight:400;">AWS introduces <a href="https://github.com/awslabs/cli-agent-orchestrator">CLI Agent Orchestrator (CAO)</a>, an open source framework that enables multiple AI-powered CLI tools like <a href="https://aws.amazon.com/developer/learning/q-developer-cli/">Amazon Q CLI</a> and <a href="https://www.google.com/search?newwindow=1&amp;client=firefox-b-d&amp;channel=entpr&amp;cs=0&amp;sca_esv=c6427501e1c4b628&amp;sxsrf=AE3TifPP62As7P7T1ecobEjZB1WUO47L2w%3A1759150599212&amp;q=Claude+Code&amp;sa=X&amp;ved=2ahUKEwiDqqrUgv6PAxUfxzgGHfwWObEQxccNegQIAxAB&amp;mstk=AUtExfDKiJPSHH0nxe8XQsV1HwsMjUije8CosiDmNCn511fXf_60yFI-tonoUAUUHw4CAwHdDkqGU7M65IDiaWsW7cKWTYOPX60mx0us1YB-7xfHpJxtUf0ao9Zn8h63-vJW2pWUvU7tnUsmXOQ1X_KksTehxh7rNhPqjyL85YbXKvT2Ldw&amp;csui=3">Claude Code</a> to work together as specialized agents under a supervisor agent, addressing limitations of single-agent approaches for complex enterprise development projects.</li>
<li style="font-weight:400;">CAO uses hierarchical orchestration with tmux session isolation and Model Context Protocol servers to coordinate specialized agents – for example, orchestrating Architecture, Security, Performance, and Test agents simultaneously during mainframe modernization projects.</li>
<li style="font-weight:400;">The framework supports three orchestration patterns (Handoff for synchronous transfers, Assign for parallel execution, Send Message for direct communication) plus scheduled runs using cron-like automation, with all processing occurring locally for security and privacy.</li>
<li style="font-weight:400;">Currently supports <a href="https://aws.amazon.com/blogs/opensource/category/amazon-q/">Amazon Q</a> Developer CLI and Claude Code with planned expansion to <a href="https://developers.openai.com/codex/cli/">OpenAI Codex CLI</a>, <a href="https://www.google.com/aclk?sa=L&amp;ai=DChsSEwjn9O7syMiQAxVPEUQIHQPHKxEYACICCAEQABoCZHo&amp;ae=2&amp;co=1&amp;gclid=CjwKCAjw04HIBhB8EiwA8jGNbaEyl_dr9_xNCEClJBP4MPxcwz2XEY8TjVXW7sgRT3TtB8tHT3UHmxoCmTEQAvD_BwE&amp;cid=CAASlwHkaEI3XmoJ5v97Nt-YqmGhZmLJRRUlhZn6U0gs-jTbkw7LoS1rmiogavtv1LNLy11c4_qGTDxvoyUELxFW-zOTaTZ3CqRxE6VJUh6zTlsR8IE64eT_MAEVS80sOlKj8cBIwmC1y1juvfbtPHCUAO9kkcUXDwVPksGUY73PVuY6_JipmNzp9mlETmrgR8fCrl6o4NZNL2qD&amp;cce=2&amp;sig=AOD64_0hDMVq0-f4KHXda4sEOGLCJseTww&amp;q&amp;adurl&amp;ved=2ahUKEwickOrsyMiQAxUfD0QIHTWvCiMQ0Qx6BAgMEAE">Gemini CLI</a>, <a href="https://qwen.ai/">Qwen CLI</a>, and <a href="https://www.meetaiden.com/">Aiden</a> – no pricing mentioned as it’s open source, available at <a href="http://github.com/awslabs/cli-agent-orchestrator">github.com/awslabs/cli-agent-orchestrator</a>.</li>
<li style="font-weight:400;">Key use cases include multi-service architecture development, enterprise migrations requiring parallel implementation, comprehensive research workflows, and multi-stage quality assurance processes that benefit from coordinated specialist agents.</li>
<li style="font-weight:400;">We definitely appreciate another tool in the Agent Orchestration world. </li>
</ul>
<p>37:46 <a href="https://aws.amazon.com/about-aws/whats-new/2025/10/amazon-ecs-aws-cloudtrail-data-events-insight-api-activities">Amazon ECS now publishes AWS CloudTrail data events for insight into </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/10/amazon-ecs-aws-cloudtrail-data-events-insight-api-activities">API activities</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/ecs/">Amazon ECS</a> now publishes CloudTrail data events for ECS Agent API activities, enabling detailed monitoring of container instance operations, including polling (ecs: Poll), telemetry sessions (ecs: StartTelemetrySession), and managed instance logging (ecs: PutSystemLogEvents).</li>
<li style="font-weight:400;">Security and operations teams gain comprehensive audit trails to detect unusual access patterns, troubleshoot agent communication issues, and understand how container instance roles are utilized for compliance requirements.</li>
<li style="font-weight:400;">The feature uses the new data event resource type AWS::ECS::ContainerInstance and is available for ECS on EC2 in all AWS regions, with <a href="https://aws.amazon.com/ecs/managed-instances/">ECS Managed Instances</a> supported in select regions.</li>
<li style="font-weight:400;">Standard CloudTrail data event charges apply – typically $0.10 per 100,000 events recorded, making this a cost-effective solution for organizations needing detailed container instance monitoring.</li>
<li style="font-weight:400;">This addresses a previous visibility gap in ECS operations, as teams can now track agent-level activities that were previously opaque, improving debugging capabilities and security posture for containerized workloads.</li>
</ul>
<p>39:33  Ryan – “This is definitely something I would use sparingly because the UCS API is agent API chatting. So this seems like it would be very expensive, very fast.”</p>
<h2>GCP</h2>
<p>41:22 <a href="https://cloud.google.com/blog/products/compute/g4-vms-powered-by-nvidia-rtx-6000-blackwell-gpus-are-ga/">G4 VMs powered by NVIDIA RTX 6000 Blackwell GPUs are GA | Google </a><a href="https://cloud.google.com/blog/products/compute/g4-vms-powered-by-nvidia-rtx-6000-blackwell-gpus-are-ga/">Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google Cloud launches <a href="https://cloud.google.com/compute/docs/accelerator-optimized-machines#g4-series">G4 VMs</a> with <a href="https://www.nvidia.com/en-us/products/workstations/professional-desktop-gpus/rtx-pro-6000/">NVIDIA RTX 6000 Blackwell</a> GPUs, offering up to 9x throughput improvement over G2 instances and supporting workloads from AI inference to digital twin simulations with configurations of 1, 2, 4, or 8 GPUs.</li>
<li style="font-weight:400;">The G4 VMs feature enhanced PCIe-based peer-to-peer data paths that deliver up to 168% throughput gains and 41% lower latency for multi-GPU workloads, addressing the bottleneck issues common in serving large generative AI models that exceed single GPU memory limits.</li>
<li style="font-weight:400;">Each GPU provides 96GB of GDDR7 memory (up to 768GB total), native FP4 precision support, and Multi-Instance GPU capability that allows partitioning into 4 isolated instances, enabling efficient serving of models from under 30B to over 100B parameters.</li>
<li style="font-weight:400;"><a href="https://www.nvidia.com/en-us/omniverse/">NVIDIA Omniverse</a> and <a href="https://console.cloud.google.com/marketplace/product/nvidia/nvidia-isaac-sim-development-workstation-windows?project=nvidia-vgpu-public">Isaac Sim</a> are now available on <a href="https://console.cloud.google.com/marketplace/browse?q=Nvidia%20omniverse">Google Cloud Marketplace</a> as turnkey solutions for G4 VMs, enabling immediate deployment of industrial digital twin and robotics simulation applications with full integration across GKE, Vertex AI, Dataproc, and Cloud Run.</li>
<li style="font-weight:400;">G4 VMs are available immediately with broader regional availability than previous GPU offerings, though specific pricing details were not provided in the announcement – customers should contact Google Cloud sales for cost information. (AKA $$$$.)  </li>
</ul>
<p>43:03 <a href="https://cloud.google.com/blog/products/data-analytics/dataproc-23-on-google-compute-engine/">Dataproc 2.3 on Google Compute Engine | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Dataproc 2.3 introduces a lightweight, <a href="https://www.fedramp.gov/">FedRamp</a> High-compliant image that contains only essential Spark and Hadoop components, reducing CVE exposure and meeting <a href="https://cloud.google.com/dataproc/docs/concepts/fedramp-compliance">strict security requirements</a> for organizations handling sensitive data.</li>
<li style="font-weight:400;">Optional components like <a href="https://flink.apache.org/">Flink</a>, <a href="https://hive.apache.org/docs/latest/webhcat/webhcat-base/">Hive WebHCat</a>, and <a href="https://www.sparkcodehub.com/hive/security/hive-ranger-integration">Ranger</a> are now deployed on-demand during cluster creation rather than pre-packaged, keeping clusters lean by default while maintaining full functionality when needed.</li>
<li style="font-weight:400;">Custom images allow pre-installation of required components to reduce cluster provisioning time while maintaining the security benefits of the lightweight base image.</li>
<li style="font-weight:400;">The image supports multiple operating systems, including <a href="https://www.debian.org/download">Debian 12</a>, <a href="https://releases.ubuntu.com/jammy/">Ubuntu 22</a>, and <a href="https://rockylinux.org/">Rocky 9</a>, with deployment as simple as specifying version 2.3 when creating clusters via gcloud CLI.</li>
<li style="font-weight:400;">Google employs automated CVE scanning and patching combined with manual intervention for complex vulnerabilities to maintain compliance standards and security posture.</li>
</ul>
<p>44:14  Ryan – “But on the contrary, like FedRAMP has such tight SLAs for vulnerability management that you don’t have to carry this risk or request an exception because of Google not patching Flink as fast as you would like them to. At least this puts the control at the end user, where they can say, well, I’m not going to use it.”</p>
<p>44:45 <a href="https://cloud.google.com/blog/products/data-analytics/bigquery-studio-gets-improved-console-interface/">BigQuery Studio gets improved console interface | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="http://console.cloud.google.com/bigquery">BigQuery Studio’s</a> new interface introduces an expanded Explorer view that allows users to filter resources by project and type, with a dedicated search function that spans across all BigQuery resources within an organization – addressing the common pain point of navigating through large-scale data projects.</li>
<li style="font-weight:400;">The Reference panel provides context-aware information about tables and schemas directly within the code editor, eliminating the need to switch between tabs or run exploratory queries just to check column names or data types – particularly useful for data analysts writing complex SQL queries.</li>
<li style="font-weight:400;">Google has streamlined the workspace by moving job history to a dedicated tab accessible from the Explorer pane and removing the bottom panel clutter, while also allowing users to control tab behavior with double-click functionality to prevent unwanted tab replacements.</li>
<li style="font-weight:400;">The update includes code generation capabilities where clicking on table elements in the Reference panel automatically inserts query snippets or field names into the editor, reducing manual typing errors and speeding up query development workflows.</li>
<li style="font-weight:400;">This interface refresh targets data analysts, data engineers, and data scientists who need efficient navigation across multiple BigQuery projects and datasets – no pricing changes mentioned as this appears to be a UI update to the existing BigQuery Studio service.</li>
</ul>
<p>46:00  Ryan – “Although I’m a little nervous about having all the BigQuery resources across an organization available on a single console, just because it sounds like a permissions nightmare.” </p>
<p>47:10  <a href="https://cloud.google.com/blog/products/ai-machine-learning/manage-your-prompts-using-vertex-sdk/">Manage your prompts using Vertex SDK | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google launches GA of Prompt Management in <a href="https://cloud.google.com/vertex-ai/docs/python-sdk/use-vertex-ai-python-sdk">Vertex AI SDK</a>, enabling developers to create, version, and manage prompts programmatically through <a href="https://www.python.org/">Python</a> code rather than tracking them in spreadsheets or text files.</li>
<li style="font-weight:400;">The feature provides seamless integration between <a href="https://cloud.google.com/generative-ai-studio">Vertex AI Studio’s</a> visual interface for prompt design and the <a href="https://docs.aws.amazon.com/rekognition/latest/dg/sdk-programmatic-access.html">SDK for programmatic management</a>, with prompts stored as centralized resources within Google Cloud projects for team collaboration.</li>
<li style="font-weight:400;">Enterprise security features include Customer-Managed Encryption Keys (CMEK) and VPC Service Controls (VPCSC) support, addressing compliance requirements for organizations handling sensitive data in their AI applications.</li>
<li style="font-weight:400;">Key use cases include teams building production generative AI applications that need version control, consistent prompt deployment across environments, and the ability to programmatically update prompts without manual code changes.</li>
<li style="font-weight:400;">Pricing follows standard Vertex AI model usage rates with no additional charges for prompt management itself; documentation available at <a href="http://cloud.google.com/vertex-ai/generative-ai/docs/model-reference/prompt-classes">cloud.google.com/vertex-ai/generative-ai/docs/model-reference/prompt-classes</a>.</li>
</ul>
<p>47:43  Justin – “If your prompt has sensitive data in it, I have questions already.” </p>
<p>49:05 <a href="https://cloud.google.com/blog/products/ai-machine-learning/gemini-code-assist-in-github-for-enterprises/">Gemini Code Assist in GitHub for Enterprises | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google launches <a href="https://developers.google.com/gemini-code-assist/docs/overview">Gemini Code Assist</a> for <a href="https://github.com/enterprise/trial">GitHub Enterprise</a>, bringing AI-powered code reviews to enterprise customers using GitHub Enterprise Cloud and on-premises GitHub Enterprise Server. </li>
<li style="font-weight:400;">This addresses the bottleneck where 60.2% of organizations take over a day for code changes to reach production due to manual review processes.</li>
<li style="font-weight:400;">The service provides organization-level controls, including centralized custom style guides and org-wide configuration settings, allowing platform teams to enforce coding standards automatically across all repositories. </li>
<li style="font-weight:400;">Individual teams can still customize repo-level settings while maintaining organizational baselines.</li>
<li style="font-weight:400;">Built under Google Cloud Terms of Service, the enterprise version ensures code prompts and model responses are stateless and not stored, with Google committing not to use customer data for model training without permission. This addresses enterprise security and compliance requirements for AI-assisted development.</li>
<li style="font-weight:400;">Currently in public preview with access through the <a href="https://cloud.google.com/cloud-console">Google Cloud Console</a>, the service includes a higher pull request quota than the individual developer tier. Google is developing additional features, including agentic loop capabilities for automated issue resolution and bug fixing.</li>
<li style="font-weight:400;">This release complements the recently launched <a href="https://github.com/gemini-cli-extensions/code-review">Code Review Gemini CLI Extension</a> for terminal-based AI assistance and represents part of Google’s broader strategy to provide AI assistance across the entire software development lifecycle. </li>
<li style="font-weight:400;">Pricing details are not specified in the announcement.</li>
</ul>
<p>51:08  Ryan – “It’s just sort of the ability to sort of do organizational-wide things is super powerful for these tools, and I’m just sort of surprised that GitHub allows that. It seems like they would have to develop API hooks and externalize that.”</p>
<p>53:19 <a href="https://cloud.google.com/blog/products/ai-machine-learning/vertex-ai-context-caching/">Vertex AI context caching | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/context-cache/context-cache-overview">Vertex AI context caching</a> reduces costs by 90% for repeated content in <a href="https://gemini.google.com/">Gemini</a> models by storing precomputed tokens – implicit caching happens automatically, while explicit caching gives developers control over what content to cache for predictable savings</li>
<li style="font-weight:400;">The feature supports caching from 2,048 tokens up to <a href="https://deepmind.google/models/gemini/pro/">Gemini 2.5 Pro’s</a> 1 million token context window across all modalities (text, PDF, image, audio, video) with both global and regional endpoint support</li>
<li style="font-weight:400;">Key use cases include document processing for financial analysis, customer support chatbots with detailed system instructions, codebase Q&amp;A for development teams, and enterprise knowledge base queries</li>
<li style="font-weight:400;">Implicit caching is enabled by default with no code changes required and clears within 24 hours, while explicit caching charges standard input token rates for initial caching, then a 90% discount on reuse, plus hourly storage fees based on TTL. </li>
<li style="font-weight:400;">Integration with Provisioned Throughput ensures production workloads benefit from caching, and explicit caches support <a href="https://cloud.google.com/storage/docs/encryption/customer-managed-keys">Customer Managed Encryption Keys</a> (CMEK) for additional security compliance</li>
</ul>
<p>54:18  Ryan – “This is awesome. If you have a workload where you’re gonna have very similar queries or prompts and have it return similar data, this is definitely nicer than having to regenerate that every time. They’ve been moving more and more towards this. And I like to see it sort of more at a platform level now, whereas you could sort of implement this – in a weird way – directly in a model, like in a notebook or something. This is more of a ‘turn it on and it works’.”</p>
<p>55:30 <a href="https://cloud.google.com/blog/products/identity-security/cloud-armor-named-strong-performer-in-forrester-wave-new-features-launched/">Cloud Armor named Strong Performer in Forrester WAVE, new features </a><a href="https://cloud.google.com/blog/products/identity-security/cloud-armor-named-strong-performer-in-forrester-wave-new-features-launched/">launched</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/security/products/armor">Cloud Armor</a> introduces <a href="https://cloud.google.com/armor/docs/hierarchical-policies-overview">hierarchical security policies</a> (GA) that enable WAF and DDoS protection at the organization, folder, and project levels, allowing centralized security management across large GCP deployments with consistent policy enforcement.</li>
<li style="font-weight:400;"><a href="https://cloud.google.com/armor/docs/configure-waf#update-waf-inspection-limit">Enhanced WAF inspection capability</a> (preview) expands request body inspection from 8KB to 64KB for all preconfigured rules, improving detection of malicious content hidden in larger payloads while maintaining performance.</li>
<li style="font-weight:400;"><a href="https://cloud.google.com/armor/docs/security-policy-overview">JA4 network fingerprinting support</a> (GA) provides advanced SSL/TLS client identification beyond JA3, offering deeper behavioral insights for threat hunting and distinguishing legitimate traffic from malicious actors.</li>
<li style="font-weight:400;"><a href="https://cloud.google.com/armor/docs/address-groups-overview#organization-scoped-address-group">Organization-scoped address groups</a> (GA) enable IP range list management across multiple security policies and products like <a href="https://cloud.google.com/firewall/docs/about-firewalls">Cloud Next Generation Firewall</a>, reducing configuration complexity and duplicate rules.</li>
<li style="font-weight:400;">Cloud Armor now protects Media CDN with <a href="https://cloud.google.com/media-cdn/docs/overview#armor-support">Network Threat Intelligence and ASN blocking capabilities</a> (GA), defending media assets at the network edge against known malicious IPs and traffic patterns.</li>
</ul>
<p>56:59 Ryan – “These are some pretty advanced features for a cloud platform provided WAF. It’s pretty cool.” </p>
<h2>Azure</h2>
<p>58:44 <a href="https://azure.microsoft.com/en-us/updates?id=516002">Generally Available: Observed capacity metric in Azure Firewall </a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.azure.cn/en-us/firewall/features">Azure Firewall’s</a> new <a href="https://learn.microsoft.com/en-us/azure/firewall/monitor-firewall-reference">observed capacity metric</a> provides real-time visibility into capacity unit utilization, helping administrators track actual scaling behavior versus provisioned capacity for better resource optimization and cost management.</li>
<li style="font-weight:400;">This observability enhancement addresses a common blind spot where teams over-provision firewall capacity due to uncertainty about actual usage patterns, potentially reducing unnecessary Azure spending on unused capacity units.</li>
<li style="font-weight:400;">The metric integrates with <a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=ofZ8VJzl_zMPtcremnMdsxLt_RRgozg6NM_t9P08WG8o8pSwZglNRhhDlTbHZAoAwBbWUAYofZQ0NK8pIFCPeUuNh2oVKQmg909EhPhgms8_btuTy1SRMRGPjSuhQ5T3.r_sE_nL6HnWTtPaXqDA9NQ&amp;eddgt=9NNSFeNYrmHHutHkqr27gQ%3D%3D&amp;rut=a00a4808352a9ba9ab36d9e0c3773a3b4b44a28e08bf533e6ad18109bcc8f5cc&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8cg0D_nd6j9jGDDKxEY06_jVUCUzVkl1aDWDnswkmAiviOnsTXwFB28BjPbG6k9CSDW4PMww-0p--VToF3Aw-SQ5Hcn-socvzSbnQcC4dLLGvPBggUkjtH1Ruj3uk7TTUoeH0HDnlp78HFkNYP-Is49TPdv-JltRchxVKSaFGcDiRwfbkXLu2yOFBNQCZp02w92BJrvYdeiOxHSKQFsc6HdoOx45avlK-mxHDnC2XjcKCl1yZW-2xOHA0khxvm-T6pRH7TNhok4NFzXIPd7aePRhT8SQaYOKEdDVkY2Slly_aP1MEWIWffK48sihQF0pHKWDLMG3Q6Mb9vMkjlxLFp6eoOWIa3EqoaBQjVtNy4FP8KjetnsGq2C6GAtsFO0PN4RaK_rOvvqccmN7zsGlfIeLDDNZeqJSfjLWqmI-5XhUoL-7-0XyPW6F-X0UPdzp-Pv7puXzkKNNHn-PRMgTIo5jU2OJ6VCgxtfjDnmZGo1aPyBL4ZWGjA76sqn5jey-Lq2x8P5WBLae7s8QP9HWivZiI-6cq9TydXvWRIoBIMaQqgZHbrmMtiM9OTYEcXTmES_O9a7OHgNdSfyh98MPLQIi2-oHjsh7Ao6TYMxa_bT3o4fwHhPGPC61k0SlAV4kmQoPqAw4LlAPDUdmOTYgxECEWqD5Sd2NUpPRUFg33abMhsQ4hpvLmJ2vWL00GciRsZSuczTJbQqbPr5i6Dk8PUI01xUvc7nEWJjOhB4cNtLQsqVrwSDmOTNF6MsAK2t3-HTXFHA%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0MzklMjZjYW1wJTNkMTY4ODA3JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDE5MjElMjZjcml0ZXJpYWlkJTNka3dkLTc5MDk2NzI1NzM0NjUzJTNhbG9jLTE5MCUyNmNhbXBhaWduaWQlM2Q1OTAyMzczNTglMjZsb2NwaHklM2Q4MDU2MCUyNmFkZ3JvdXBpZCUzZDEyNjU1Mzk1MzU5Njk4MTMlMjZjaWQlM2Q3OTA5NjMyNjY0OTI4NCUyNmtkdiUzZGMlMjZrZXh0JTNkJTI2a3BnJTNkJTI2a3BpZCUzZCUyNnF1ZXJ5U3RyJTNkQXp1cmUlMjUyME1vbml0b3IlMjZ1cmwlM2RodHRwcyUzYSUyZiUyZmF6dXJlLm1pY3Jvc29mdC5jb20lMmZlbi11cyUyZnByb2R1Y3RzJTJmbW9uaXRvciUyZiUzZm1zY2xraWQlM2Q5MDA5YWU3NzBkODAxOGZmMmQ4NGM3NzZhMzM4YWJkMCUyM292ZXJ2aWV3JTJmJTI2ZWZfaWQlM2Rfa185MDA5YWU3NzBkODAxOGZmMmQ4NGM3NzZhMzM4YWJkMF9rXyUyNk9DSUQlM2RBSURjbW01ZWRzd2R1dV9TRU1fX2tfOTAwOWFlNzcwZDgwMThmZjJkODRjNzc2YTMzOGFiZDBfa18%26rlid%3D9009ae770d8018ff2d84c776a338abd0&amp;vqd=4-303116738650167520374123325213183185711&amp;iurl=%7B1%7DIG%3DB75E27AED59E416B9FB218558942BE50%26CID%3D1612565B1A6968803DE840CF1B8F697B%26ID%3DDevEx%2C5046.1">Azure Monitor</a> and existing alerting systems, enabling proactive capacity planning and automated scaling decisions based on historical utilization trends rather than guesswork.</li>
<li style="font-weight:400;">Target customers include enterprises with variable traffic patterns and managed service providers who need granular visibility into firewall performance across multiple client deployments to optimize resource allocation.</li>
<li style="font-weight:400;">While pricing remains unchanged for Azure Firewall itself (starting at $1.25/hour plus $0.016/GB processed), the metric helps justify right-sizing decisions that could significantly impact monthly costs for organizations running multiple firewall instances.</li>
</ul>
<p><a href="https://azure.microsoft.com/en-us/updates?id=515452">Generally Available: Prescaling in Azure Firewall </a></p>
<ul>
<li style="font-weight:400;"><a href="https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Faka.ms%2Fazfwprescaling&amp;data=05%7C02%7Cv-sanyaibe%40microsoft.com%7C0c4a0ad4827945d1214908de0cda8e68%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C638962329999971929%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&amp;sdata=UftnsFZNhRxr6RWEcr4J688kII%2FfOOTawS2IjGeJvBg%3D&amp;reserved=0">Azure Firewall prescaling</a> allows administrators to reserve capacity units in advance for predictable traffic spikes like holiday shopping seasons or product launches, eliminating the lag time typically associated with auto-scaling firewall resources.</li>
<li style="font-weight:400;">This feature addresses a common pain point where Azure Firewall’s auto-scaling couldn’t respond quickly enough to sudden traffic surges, potentially causing performance degradation during critical business events.</li>
<li style="font-weight:400;">Prescaling integrates with Azure’s existing capacity planning tools and can be configured through Azure Portal, PowerShell, or ARM templates, making it accessible for both manual and automated deployment scenarios.</li>
<li style="font-weight:400;">Target customers include e-commerce platforms, streaming services, and any organization with predictable traffic patterns that require guaranteed firewall throughput during peak periods.</li>
<li style="font-weight:400;">While specific pricing wasn’t detailed in the announcement, prescaling will likely follow Azure Firewall’s existing pricing model where customers pay for provisioned capacity units, with costs varying by region and SKU tier.</li>
<li style="font-weight:400;">When you combine these two announcements, they’re pretty good! </li>
</ul>
<p>1:01:35 <a href="https://azure.microsoft.com/en-us/updates?id=513074">Public Preview: Environmental sustainability features in Azure API </a><a href="https://azure.microsoft.com/en-us/updates?id=513074">Management</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/api-management/api-management-key-concepts">Azure API Management</a> introduces carbon-aware capabilities that allow organizations to route API traffic and adjust policy behavior based on carbon intensity data, helping reduce the environmental impact of API infrastructure operations.</li>
<li style="font-weight:400;">The feature enables developers to implement sustainability-focused policies such as throttling non-critical API calls during high carbon intensity periods or routing traffic to regions with cleaner energy grids.</li>
<li style="font-weight:400;">This aligns with Microsoft’s broader carbon negative commitment by 2030 and provides enterprises with tools to measure and reduce the carbon footprint of their digital services at the API layer.</li>
<li style="font-weight:400;">Target customers include organizations with ESG commitments and sustainability reporting requirements who need granular control over their cloud infrastructure’s environmental impact.</li>
<li style="font-weight:400;">Pricing details are not yet available for the preview, but the feature integrates with existing API Management tiers and will likely follow consumption-based pricing models when generally available.</li>
</ul>
<p>1:02:44 Matt – “So APIMs are one, stupidly expensive. If you have to be on the premier tier, it’s like $2,700 a month. And then if you want HA, you have to have two of them. So whatever they’re doing to the hood is stupidly expensive. If you ever had to deal with the SharePoint, they definitely use them because I’ve hit the same error codes as we provide to customers. On the second side, when you do scale them, you can scale them to be multi-region APIMs in the paired region concept, so in theory, what you can do based on this is route a cheaper or more environmentally efficient one, you could route to your paired region and then have the traffic coming that way.”</p>
<p>1:06:09 <a href="https://azure.microsoft.com/en-us/blog/from-queries-to-conversations-unlock-insights-about-your-data-using-azure-storage-discovery-now-generally-available/">Unlock insights about your data using Azure Storage Discovery</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aka.ms/StorageDiscovery">Azure Storage Discovery</a> is now generally available as a fully managed service that provides enterprise-wide visibility into data estates across <a href="https://azure.microsoft.com/en-us/products/storage/blobs/?ef_id=_k_003c4a49065118a887ff22a77e78d45b_k_&amp;OCID=AIDcmm5edswduu_SEM__k_003c4a49065118a887ff22a77e78d45b_k_&amp;msclkid=003c4a49065118a887ff22a77e78d45b">Azure Blob Storage</a> and <a href="https://learn.microsoft.com/en-us/azure/storage/blobs/data-lake-storage-introduction">Data Lake Storage</a>, helping organizations optimize costs, ensure security compliance, and improve operational efficiency across multiple subscriptions and regions.</li>
<li style="font-weight:400;">The service integrates <a href="https://learn.microsoft.com/en-us/azure/copilot/discover-storage-estate-insights">Microsoft Copilot in Azure</a> to enable natural language queries for storage insights, allowing non-technical users to ask questions like “Show me storage accounts with default access tier as Hot above 1TiB with least transactions” and receive actionable visualizations without coding skills. Because a non-technical person is asking this question. In the ever-wise words of Marcia Brady, “Sure, Jan.” </li>
<li style="font-weight:400;">Key capabilities include 18-month data retention for trend analysis, insights across capacity, activity, security configurations, and errors, with deployment taking less than 24 hours to generate initial insights from 15 days of historical data.</li>
<li style="font-weight:400;">Pricing includes a free tier with basic capacity and configuration insights retained for 15 days, while the standard plan adds advanced activity, error, and security insights with 18-month retention – specific pricing varies by region at <a href="http://azure.microsoft.com/pricing/details/azure-storage-discovery">azure.microsoft.com/pricing/details/azure-storage-discovery</a>.</li>
<li style="font-weight:400;">Target use cases include identifying cost optimization opportunities through access tier analysis, ensuring security best practices by highlighting accounts still using shared access keys, and managing data redundancy requirements across global storage estates.</li>
</ul>
<p>1:08:35 Ryan – “Well, I’ll tell you when I was looking for this report, I had a lot of natural language – and I was shouting it at my computer.” </p>
<p>1:09:52 <a href="https://azure.microsoft.com/en-us/blog/sora-2-now-available-in-azure-ai-foundry/">Sora 2 in Azure AI Foundry: Create videos with responsible AI | Microsoft </a><a href="https://azure.microsoft.com/en-us/blog/sora-2-now-available-in-azure-ai-foundry/">Azure Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://azure.microsoft.com/products/ai-foundry">Azure AI Foundry</a> now offers <a href="https://duckduckgo.com/y.js?ad_domain=vidfly.ai&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=VtAYWJm8O5CUOXzZbBwCzU3f0j7CtyMkb77CmfmbgjB6vCDg9mJU1DhlgS2ZvQ87X5ZLVf8Y7Chvy_pOytaVl54YDaa8KQ88wZ-kboGUVfqLJApZUhklnWtKXkJIJj5y.pza5nuL2NDur5C3-AYZISw&amp;eddgt=s8DW81DM6fpjMYnJj-YL9A%3D%3D&amp;rut=06c547abdf2282c1fab11887d76d46fffc9d75fc2a8dd9d791118ac8fd41c472&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8zULEQBbyimHDS3bSbtC49zVUCUw34NhyCRW3wuhaFSFluiSOJZTLHyPqTdW0ubHd2YLNDqse4osxpO10KiyTru2aZN8dwRTgigmP2ALccem7KPFloDvHIi4BUdcIOQyNzpuj-DAVdSc8eOqZ6qz_X3dbSF1nb8IwDELEPhQPuql8xm-DI2692e55SAiAmP5PpOcCZVPWnFULFKMTs36k2Oaby1GtNKMDT4Oq6lJT14qmCQUT82YGayZEQMYx3F_Fqhh0JayBgJ0ti3muJAFMFUQr45gon7XG6Iiz8oDzKwQDOvq1xBY8JT_UZTbTDYb26If9-yLnsf-vKb0QXRcldQMF39zYEedGViteJ8dRZW466KXrsRYz0mtLBekFwFvMwHKk8eCum8o4L8CzQHLn_wXzFR4fwDKoz-TOSaZKh2LDpVNfqllJOxAanDDVJ58YkxTAdh9OoRq9BYrOVN9q-y5E5lakdecXm47Sez2og8uf1SrMLprAYE-HLW65dD8aGPOFaqnhMdPQK9S7W_ItV-4HquogoM-HOYNvsaJ5CLDQv9elM-m0DU_sOpImRPe8UDnObsQfXQywc4jWAToZDsF33aSD761gLpnH775wlz4YV1tx42sBdlLIENZ5A3y4XRotyDwz6OEUXXbVW7Tymjwgf2lfbAVyJ0rHtiNEl9fNXzjCXaLEqKicOaYh19LFuyxZCT9Q7Z0lUiQXc4sg4Veozvt3mCq7O7_xnS8xr23SbleO0591MIK85xYJckYml26-bw%26u%3DaHR0cHMlM2ElMmYlMmZ2aWRmbHkuYWklMmZzb3JhLTIlMmYlM2Z1dG1fY2FtcGFpZ24lM2RzMjAxJTI2dXRtX3NvdXJjZSUzZGJpbmclMjZ1dG1fdGVybSUzZHNvcmElMjUyMG9wZW4lMjUyMGFpJTI2bXNjbGtpZCUzZDNmYjAyMTQxY2IzZjE1ZWQ5YTk2NDg0MDdhYzc5NjUz%26rlid%3D3fb02141cb3f15ed9a9648407ac79653&amp;vqd=4-58002290821084271246164676224938707476&amp;iurl=%7B1%7DIG%3D35BF926B9628496E8E98EFBD21B74B67%26CID%3D058BF6D89EF66ECD10B2E04C9F436F11%26ID%3DDevEx%2C5046.1">OpenAI’s Sora 2 video generation model</a> in public preview, enabling developers to create videos from text, images, and existing video inputs with synchronized audio in multiple languages.</li>
<li style="font-weight:400;">The platform provides a unified environment combining <a href="https://duckduckgo.com/y.js?ad_domain=vidfly.ai&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=H3Ro7S29tZ9QjstuJcmZ7QbQEiOZlWMrDaYKsoesmYndbK4iEukeCoQJUmLIx_4VUMC461uKlJRmTC66fBmpJ8YYsMshMIXPz1sJnajImoBdO3BaIwOwkBfdWBsi482l.kLXBOijsAVZUtoObezZZNw&amp;eddgt=McOmTFn1-WttmURSgcMDRA%3D%3D&amp;rut=fb538cbdc90e26d9d49e3a16ee76d4ae01b1a107fc392d565b39a9aa98c1bb9d&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8kldc57YDvK5NYofw81JK2DVUCUz2SnlGgRCRxAyFYZlAIVPNI_veNFziAeUiIJ6f8C-Ui2qUWWg69h0VXoDx-1XU5RzV0Q2cDQqjbnOPFYKPuQhAZkcyGn_5-PewE2CvA9HGUPfeqXPxg2g0sc-0dMnSun29AJzukRo5G3pUkzeBHZZmau5fsQi8efepgEuFjSbZmE5GjGWdgdOZwZBgJY9NVAXRY-pVzbBcUWsrE6RnHfpuOwnXG7aCvRf31xLyqPXTWT4Slwn7uUwc472cgqJF0mqsQPjVmoDgFrRDGSo1T_xW3QPVe7AZkLqukeVthEiiOVTw4--NKqtaC95wvqpjPU83WdlCcKPvfZFd4ptW6VK9pMMy4ZSK1o69L3y8gOfdQlmrE35j28dbQ-x1Sk7ZW_KOUzufJ5iz4hKzWiTsFgkA6COoLjdTU-YKdff8J_-9ymLvbylIm4XNCpqodI4zhqYK5ts1KPhfMGulHoVjVizSoCjDsGJWunhAtITUjqSwCfgGzAsmLjljDdy1c4xccpAMjPAq0kQjws2-5cfHo6ikqfjnsVCurw8cTbHxg0x0fwHT3GTT_pa6xbdrFnkS4wlKRQlL2a_ExMpl6BeJG0_wdUH3IwHApW47cKcyRHb43kRzLuc6J28-oCkb49W3vpxLw_UrJ09jr7ITkGfGFTPEfqTbPOjKwjayfhh4JBXQML1KEX2-82-ezj6Xxt0c1uuWowDpuqDddnlV4j1DoC2XO7EFzc4Qd1VdaRqfQ9xTXw%26u%3DaHR0cHMlM2ElMmYlMmZ2aWRmbHkuYWklMmZzb3JhLTIlMmYlM2Z1dG1fY2FtcGFpZ24lM2RzMjAxJTI2dXRtX3NvdXJjZSUzZGJpbmclMjZ1dG1fdGVybSUzZHNvcmElMjUyMDIlMjZtc2Nsa2lkJTNkOWYxN2YxMjRhMzBkMTdjNDM5MWI5ODI1MTEwZTIwOWE%26rlid%3D9f17f124a30d17c4391b9825110e209a&amp;vqd=4-200013414939670704916566999758397719634&amp;iurl=%7B1%7DIG%3D0FC4E682995A4D569F946C42EF604337%26CID%3D2BF79B5F15C86ED31D7C8DCB147D6F8F%26ID%3DDevEx%2C5046.1">Sora 2</a> with other generative models like <a href="https://platform.openai.com/docs/models/gpt-image-1">GPT-image-1</a> and <a href="https://bfl.ai/announcing-flux-1-1-pro-and-the-bfl-api">Black Forest Lab’s Flux 1.1</a>, all backed by Azure’s enterprise security and content filtering for both inputs and outputs.</li>
<li style="font-weight:400;">Key capabilities include realistic physics simulation, detailed camera control, and creative features for marketers, retailers, educators, and creative directors to rapidly prototype and produce video content within existing business workflows.</li>
<li style="font-weight:400;">Sora 2 is currently available via API through Standard Global deployment in <a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=vDRfQV-KnP38W-V28acKnXy-KiZLrRRmgqdRD0XAtidMVDY0erBR79o-tebT_Lut_TE2oIcs2p65GkhVhT8tR7GG6puOWUOiHa-o5VFyimKYX40mxZ19oFyMmihmLBFy.UXsV3mHjwpfBXV7nKCIiww&amp;eddgt=HNfdZwANcLtGzCXmgdLjmg%3D%3D&amp;rut=8c69f5c0522cc692936c8e1d7a635dd4b958353df8d543abb7f0c475ea19a267&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De82MWG4HmXxPRMJ9a1PRYr2DVUCUwJGiGKl_y8a8UgZ_4_69rnmw_dEbmRBq_qXBNgwa-3aD1gU4WTvnrl53vWmoPiX9tERTrUTj5JliZBa1BqDWO0onS68ieM-1C-tqC_1OPYPKwlEjy388-ScsYaO0uBUPHn2O5wbVX1x87SxLP9ZA1q5P1h1K-shk_iWlHJN0imBfeNljyXAbz1IbEdR2uS9mjnWcf8lhB0oGMkFyauv8n7Pdcf0uy3V9xyAn7CEMW3pY_MMKhdpQzZRpMQtBiM8s8LH1p8CBH9ls6GIXEWexCjzlrHbSf6dN4ryySwl45ErMlTybuSyQWbyOd9gTCi7sMavJdk5VRakgZjrUUNE6vu4cJa9tRqblT2QG5xXIGpfS-5nJHcqsz-nZYqaSOGWyW3H-mIHggm_8F_H5gcffKC_4ZbCAQAZQtwVgQkVUIFYt-YVzr0wX95W6o2befUE2VJxwUoTYEHMhqz5KQg_D6rlEfN_I39-n2zj6Ef25ZRQe7ME-y2jvPx4TvBfgaOxeALD75_HuZsnZ4GTGIvQHSi3eqrq8uRmn6n8KS4UgmRTr2mEkzgsHPWDOYQX9qFOykDIDPGEzCbZHg7wA9jYO14PzqWErFfcoxhyy386RoJj1ex8QZAi_K5GqvEimT8FldR5AWhl-ZmxngIf9C96yofB8DvMCYdRkcw-mQYt1IY9tApHU8nYWCA7c04yLpdeAfp4VIBbpIAkSAR_qVcvyWVpz0vpGbmJOQHKr_D-U6INg%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0MzklMjZjYW1wJTNkMTY4ODA3JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDE5MjElMjZjcml0ZXJpYWlkJTNka3dkLTc5MzAyODgzNzYzNDU3JTNhbG9jLTE5MCUyNmNhbXBhaWduaWQlM2Q1OTAyMzczNTglMjZsb2NwaHklM2Q4MDI2NiUyNmFkZ3JvdXBpZCUzZDEyNjg4MzgwNzEwODY2MDAlMjZjaWQlM2Q3OTMwMjQ4NTk5MDQ3NyUyNmtkdiUzZGMlMjZrZXh0JTNkJTI2a3BnJTNkJTI2a3BpZCUzZCUyNnF1ZXJ5U3RyJTNkQXp1cmUlMjUyMEFJJTI1MjBGb3VuZHJ5JTI2dXJsJTNkaHR0cHMlM2ElMmYlMmZhenVyZS5taWNyb3NvZnQuY29tJTJmZW4tdXMlMmZzb2x1dGlvbnMlMmZhaSUyZiUzZm1zY2xraWQlM2QxYjVjYzBiYzA3MWQxM2YzOTkyZmE5YmRiNTRjY2NhZCUyM292ZXJ2aWV3JTJmJTI2ZWZfaWQlM2Rfa18xYjVjYzBiYzA3MWQxM2YzOTkyZmE5YmRiNTRjY2NhZF9rXyUyNk9DSUQlM2RBSURjbW01ZWRzd2R1dV9TRU1fX2tfMWI1Y2MwYmMwNzFkMTNmMzk5MmZhOWJkYjU0Y2NjYWRfa18%26rlid%3D1b5cc0bc071d13f3992fa9bdb54cccad&amp;vqd=4-289791032530740762233233808276956305669&amp;iurl=%7B1%7DIG%3DA2D7FF0B936E4FFF993B2C6A2EA368E6%26CID%3D0F38D3B240A76E5120DEC526411D6FBC%26ID%3DDevEx%2C5046.1">Azure AI Foundry</a>, with pricing details available on the Azure AI Foundry Models page.</li>
<li style="font-weight:400;">Microsoft positions this as part of their responsible AI approach, embedding safety controls and compliance frameworks to help organizations innovate while maintaining governance over generated content.</li>
<li style="font-weight:400;">We’re not big fans of this one. </li>
</ul>
<p>1:10:12 <a href="https://azure.microsoft.com/en-us/blog/grok-4-is-now-available-in-azure-ai-foundry-unlock-frontier-intelligence-and-business-ready-capabilities/">Grok 4 is now available in Microsoft Azure AI Foundry | Microsoft Azure </a><a href="https://azure.microsoft.com/en-us/blog/grok-4-is-now-available-in-azure-ai-foundry-unlock-frontier-intelligence-and-business-ready-capabilities/">Blog</a></p>
<ul>
<li style="font-weight:400;">Microsoft brings <a href="https://x.ai/news/grok-4">xAI’s Grok 4</a> model to <a href="https://azure.microsoft.com/products/ai-foundry">Azure AI Foundry</a>, featuring a 128K-token context window, native tool use, and integrated web search capabilities. The model emphasizes first-principles reasoning with a “think mode” that breaks down complex problems step-by-step, particularly excelling at math, science, and logic puzzles.</li>
<li style="font-weight:400;">Grok 4’s extended context window allows processing of entire code repositories, lengthy research papers, or hundreds of pages of documents in a single query. This eliminates the need to manually chunk large inputs and enables comprehensive analysis across massive datasets without losing context.</li>
<li style="font-weight:400;"><a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=ADTqT-DTmU1m6kCiPQuWF2SX6xiGkoKbRof7OLGfRe1-tOYXoo0w5b3quTN5li4PvcXwRyqVtJyrZx5V5WNJGdWTgfM1RnYSLzc1CoSIobOfwRB94HV5tYYNvWrWOEjg.42sfONMXHP_nny7QKq2Xrg&amp;eddgt=gWsGv1pwiNiAaiYszbg5Hg%3D%3D&amp;rut=9fd4a7c4b8d09110fcd4c3b8bb2218f3b339c430edc4b68ad0449030c626b8b1&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8g8JtH--whL5ckBZADzeJFzVUCUwAcpwJMO6VIfwiRs6Nyp40_jHV7VDjkOqwJvkvR33XYZhotwDRP1mtmez_D6JkPF56Oz5bNtdYrxARg5yhKOw4Ju4-oben57HcOaB3IOQRDp1bmhEksCh25C3xcOduDRDBIFJAPyqCG8WnFN7pMAqvW_cSO0ycl4pqsDcDua-BlTN-Q0ppQIgiszyITK3K3nOZRREfPPGBD3s9AgwAlN9MM6VbfnrEYhfcu_yGlPMOL5GoQ77zcEOtk3uvc-0GHM5d-GBrurI0gh9xJTPOjdzLMsGRvH0nDa0uaeUi8WJ90qGTGmZUph1QrJyTV6QjgK44G4fWZZGYpQc2fj7kd-sX1sN5s1uMfJ5HY7DYlSiTtSp4YeB_jlDnEWjhvijhbOrAy8wFaRPAM-X7DIbe27RAjExZ2I0L-4Tgt8lGyuDs2_S8153JMjvAraaO8LvcjDrNSR1LylGCCIWyapYUNnaF1lgl8wgQlCM6MR5esMlw0ECsUWYyv7wegLptTOb47cZ7BUv9igUr-cltQ7PMKuSWa4dnFs-B8RrCk7If0tYiyi-aGzlIddz-a_d9c45Ro2tF95swN5hO9WrF0Q90OF5m7Hpoc1l4MX-UY5Nu7xVJL_KHaFzLWX4c0I6FufLhfr3u6grbL-zKoQOYB-9WCQAJt0K6skUD91gF86jlVjmksKhV0rkH_BD5dO1MP830AUzfEnBN3PfMwfrIxvHpc3d2xECDkhiG0bxeRG2XsYVW0w%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0MzklMjZjYW1wJTNkMTY4ODA3JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDE5MjElMjZjcml0ZXJpYWlkJTNka3dkLTc5NjQ2NDgwNzIyNTU1JTNhbG9jLTE5MCUyNmNhbXBhaWduaWQlM2Q1OTAyMzczNTglMjZsb2NwaHklM2Q3OTcyNyUyNmFkZ3JvdXBpZCUzZDEyNzQzMzU2Mjk1MjQ0OTMlMjZjaWQlM2Q3OTY0NjA4Mjc5NzU4MiUyNmtkdiUzZGMlMjZrZXh0JTNkJTI2a3BnJTNkJTI2a3BpZCUzZCUyNnF1ZXJ5U3RyJTNkQXp1cmUlMjUyMEFJJTI1MjBDb250ZW50JTI1MjBTYWZldHklMjZ1cmwlM2RodHRwcyUzYSUyZiUyZmF6dXJlLm1pY3Jvc29mdC5jb20lMmZlbi11cyUyZnByb2R1Y3RzJTJmYWktc2VydmljZXMlMmZhaS1jb250ZW50LXNhZmV0eSUyZiUzZmVmX2lkJTNkX2tfMThiYmY5N2RiNDM3MTBmMmI2NjY5MDdjMDY5NWY1OTZfa18lMjZPQ0lEJTNkQUlEY21tNWVkc3dkdXVfU0VNX19rXzE4YmJmOTdkYjQzNzEwZjJiNjY2OTA3YzA2OTVmNTk2X2tfJTI2bXNjbGtpZCUzZDE4YmJmOTdkYjQzNzEwZjJiNjY2OTA3YzA2OTVmNTk2%26rlid%3D18bbf97db43710f2b666907c0695f596&amp;vqd=4-180506487162074869462430080103316469598&amp;iurl=%7B1%7DIG%3DC31998333949429CA327927DD7C8E2CB%26CID%3D0728DD010DA66F413023CB950C136E52%26ID%3DDevEx%2C5046.1">Azure AI Content Safety</a> is enabled by default for Grok 4, addressing enterprise concerns about responsible AI deployment. Microsoft and xAI conducted extensive safety testing and compliance checks over the past month to ensure business-ready protection layers.</li>
<li style="font-weight:400;">Pricing starts at $2 per million input tokens and $10 per million output tokens for Grok 4, with faster variants available at lower costs. </li>
<li style="font-weight:400;">The family includes <a href="https://x.ai/news/grok-4-fast">Grok 4 Fast Reasoning</a> for analytical tasks, <a href="https://naga.ac/models/grok-4-fast-non-reasoning/examples">Fast Non-Reasoning for lightweight operations</a>, and <a href="https://x.ai/news/grok-code-fast-1">Grok Code Fast 1</a> specifically for programming workflows.</li>
<li style="font-weight:400;">The model’s real-time data integration allows it to retrieve and incorporate external information beyond its training data, functioning as an autonomous research assistant. This capability is particularly valuable for tasks requiring current information like market analysis or regulatory updates.</li>
</ul>
<p>1:11:04 <a href="https://azure.microsoft.com/en-us/updates?id=517027">Generally Available: Enhanced cloning and Public IP retention scripts for </a><a href="https://azure.microsoft.com/en-us/updates?id=517027">Azure Application Gateway migration</a></p>
<ul>
<li style="font-weight:400;">Azure releases <a href="https://learn.microsoft.com/en-us/powershell/scripting/install/installing-powershell-on-windows?view=powershell-7.5">PowerShell</a> scripts to help customers migrate from <a href="https://learn.microsoft.com/en-us/azure/application-gateway/migrate-v1-v2#1-enhanced-cloning-script">Application Gateway V1 to V2</a> before the April 2026 retirement deadline, addressing a critical infrastructure transition need.</li>
<li style="font-weight:400;">The enhanced cloning script preserves configurations during migration while the Public IP retention script ensures customers can maintain their existing IP addresses, minimizing disruption to production workloads.</li>
<li style="font-weight:400;">This migration tooling targets enterprises running legacy Application Gateway Standard or WAF SKUs who need to upgrade to Standard_V2 or WAF_V2 for continued support and access to newer features.</li>
<li style="font-weight:400;">The scripts automate what would otherwise be a complex manual migration process, reducing the risk of configuration errors and downtime during the transition.</li>
<li style="font-weight:400;">Customers should begin planning migrations now as the 2026 deadline approaches, with these scripts providing a standardized path forward for maintaining application delivery infrastructure.</li>
<li style="font-weight:400;">You know would be even easier than PowerShell? How about just doing it for them? Too easy? </li>
<li style="font-weight:400;">(Listener alert: This time it’s a Matt rant.) </li>
</ul>
<h2>Oracle </h2>
<p>1:14:59 <a href="https://www.oracle.com/news/announcement/ai-world-oracle-expands-ai-agent-studio-for-fusion-applications-with-new-marketplace-llms-and-vast-partner-network-2025-10-15/">Oracle Expands AI Agent Studio for Fusion Applications with New </a><a href="https://www.oracle.com/news/announcement/ai-world-oracle-expands-ai-agent-studio-for-fusion-applications-with-new-marketplace-llms-and-vast-partner-network-2025-10-15/">Marketplace, LLMs, and Vast Partner Network</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.oracle.com/en/cloud/saas/fusion-ai/aiaas/overview.html">Oracle AI Agent Studio</a> expands with new marketplace LLMs and partner integrations for <a href="https://www.oracle.com/applications/">Fusion Applications</a>, allowing customers to build AI agents using models from <a href="https://www.anthropic.com/">Anthropic</a>, <a href="https://cohere.com/">Cohere</a>, <a href="https://www.meta.com/about/">Meta</a>, and others alongside Oracle’s own models.</li>
<li style="font-weight:400;">The platform enables the creation of AI agents that can automate tasks across <a href="https://docs.oracle.com/en/cloud/saas/index.html">Oracle Fusion Cloud Applications</a>, including ERP, HCM, and CX, with pre-built templates and low-code development tools for business users.</li>
<li style="font-weight:400;">Oracle is partnering with major consulting firms like <a href="https://www.accenture.com/us-en">Accenture</a>, <a href="https://www.deloitte.com/us/en.html">Deloitte</a>, and <a href="https://www.infosys.com/">Infosys</a> to help customers implement AI agents, though this likely means significant professional services costs for most deployments.</li>
<li style="font-weight:400;">The AI agents can handle tasks like expense report processing, supplier onboarding, and customer service inquiries, with Oracle claiming reduced manual work by up to 50% in some use cases.</li>
<li style="font-weight:400;">Pricing details remain unclear, but the service requires Oracle Fusion Applications subscriptions and likely additional fees for LLM usage and agent deployment based on Oracle’s typical pricing model.</li>
</ul>
<p>1:15:45 Ryan – “They’re partnering with these giant firms that will come in with armies of engineers who will build you a thing – and hopefully document it before running away.” </p>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2175583/c1e-7nkns9xmgvu3gvmr-mkwg4mvmijox-lit8gy.mp3" length="144103080"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 327 of The Cloud Pod, where the forecast is always cloudy! Justin, Matt, and Ryan are here to bring you all the latest news (and a few rants) in the worlds of Cloud and AI. I’m sure all our readers are aware of the AWS outage last week, as it was in all the news everywhere. But we’ve also got some new AI models (including Sora in case you’re low on really crappy videos the youths might like), plus EKS, Kubernetes, Vertex AI, and more. Let’s get started! 
Titles we almost went with this week

 Oracle and Azure Walk Into a Cloud Bar: Nobody Gets ETL’d
 When DNS Goes Down, So Does Your Monday: AWS Takes Half the Internet on a Coffee Break
 404 Cloud Not Found: AWS Proves Even the Internet’s Phone Book Can Get Lost
 DNS: Definitely Not Staffed – How AWS Lost Its Way When It Lost Its People
 When Larry Met Satya: A Cloud Love Story
 Azure Finally Answers ‘Dude, Where’s My Data?’ with Storage Discovery
 Breaking: Microsoft Discovers AI Training Uses More Power Than a Small Country
 404 Engineers Not Found – AWS Learns the Hard Way That People Are Its Most Critical Infrastructure
 Azure Storage Discovery: Finding Your Data Needles in the Cloud Haystack
 EKS Auto Mode: Because Even Your Clusters Deserve Cruise Control
Azure Gets Reel: Microsoft Adds Video Generation to AI Foundry
 The Great Token Heist: Vertex AI Steals 90% Off Your Gemini Bills
 Cache Me If You Can: Vertex AI’s Token-Saving Feature
 IaC Just Got a Manager – And It’s Not Your Boss 
 From Musk to Microsoft: Grok 4 Makes the Great Cloud Migration
 No Harness.. You are not going to make IACM happen
 Microsoft Drafts a Solution to Container Creation Chaos
 PowerShell to the People: Azure Simplifies the Great Gateway Migration
 IP There Yet? Azure’s Scripts Keep Your Address While You Upgrade

Follow Up
00:53 Glacier Deprecation Email

Standalone Amazon Glacier service (vault-based with separate APIs) will stop accepting new customers as of December 15, 2025. 
S3 Glacier storage classes (Instant Retrieval, Flexible Retrieval, Deep Archive) are completely unaffected and continue normally
Existing Glacier customers can keep using it forever – no forced migration required. 
AWS is essentially consolidating around S3 as the unified storage platform, rather than maintaining two separate archival services.
The standalone service will enter maintenance mode, meaning there will be no new features, but the service will remain operational.
Migration to S3 Glacier is optional but recommended for better integration, lower costs, and more features. (Justin assures us it is actually slightly cheaper, so there’s that.) 

General News 
02:24 ]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2175583/c1a-k5d5-dmx1zwoxir4q-1upv8q.jpg"></itunes:image>
                                                                            <itunes:duration>01:14:55</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2175583/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[326: Oracle Discovers the Dark Side (And Finally Has Cookies)]]>
                </title>
                <pubDate>Thu, 23 Oct 2025 05:07:07 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2170375</guid>
                                    <link>https://tcpfm.castos.com/episodes/326-oracle-discovers-the-dark-side-and-finally-has-cookies</link>
                                <description>
                                            <![CDATA[<h3>Welcome to episode 326 of The Cloud Pod, where the forecast is always cloudy! Justin and Ryan are your guides to all things cloud and AI this week! We’ve got news from SonicWall (and it’s not great), a host of goodbyes to say over at AWS, Oracle (finally) joins the dark side, and even Slurm – and you don’t even need to ride on a creepy river to experience it. Let’s get started! </h3>
<h3>Titles we almost went with this week</h3>
<ul>
<li> SonicWall’s Cloud Backup Service: From 5% to Oh No, That’s Everyone</li>
<li> AWS Spring Cleaning: 19 Services Get the Boot</li>
<li> The Great AWS Service Purge of 2025</li>
<li> Maintenance Mode: Where Good Services Go to Die</li>
<li> GitHub Gets Assimilated: Resistance to Azure Migration is Futile</li>
<li> Salesforce to Ransomware Gang: You Can’t Always Get What You Want</li>
<li> Kansas City Gets the Need for Speed with 100G Direct Connect. Peter, what are you up too</li>
<li> Gemini Takes the Wheel: Google’s AI Learns to Click and Type </li>
<li> Oracle Discovers the Dark Side (Finally Has Cookies)</li>
<li> Azure Goes Full Blackwell: 4,600 Reasons to Upgrade Your GPU Game</li>
<li> DataStax to the Future: AWS Hires Database CEO for Security Role</li>
<li> The Clone Wars: EBS Strikes Back with Instant Volume Copies</li>
<li> Slurm Dunk: AWS Brings HPC Scheduling to Kubernetes</li>
<li> The Great Cluster Convergence: When Slurm Met EKS</li>
<li> Codex sent me a DM that I’ll ignore too on Slack</li>
</ul>
<h2>General News </h2>
<p>01:24 <a href="https://share.google/9FyWqhZCrsMWscoF8">SonicWall: Firewall configs stolen for all cloud backup customers</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.sonicwall.com/products/remote-access/vpn-clients">SonicWall</a> confirmed that all customers using their cloud backup service had firewall configuration files exposed in a breach, expanding from their initial estimate of 5% to 100% of cloud backup users. That’s a big difference…</li>
<li style="font-weight:400;">The exposed backup files contain AES-256-encrypted credentials and configuration data, which could include MFA seeds for TOTP authentication, potentially explaining recent Akira ransomware attacks that bypassed MFA.</li>
<li style="font-weight:400;">SonicWall requires affected customers to reset all credentials, including local user passwords, TOTP codes, VPN shared secrets, API keys, and authentication tokens across their entire infrastructure.</li>
<li style="font-weight:400;">This incident highlights a fundamental security risk of cloud-based configuration backups where sensitive credentials are stored centrally, making them attractive targets for attackers.</li>
<li style="font-weight:400;">The breach demonstrates why WebAuthn/passkeys offer superior security architecture since they don’t rely on shared secrets that can be stolen from backups or servers.</li>
<li style="font-weight:400;">Interested in checking out their detailed remediation guidance? Find that <a href="https://www.sonicwall.com/support/knowledge-base/remediation-playbook/250916130050523">here</a>. </li>
</ul>
<p>02:36  Justin – “You know, providing your own encryption keys is also good; not allowing your SaaS vendor to have the encryption key is a positive thing to do. There’s all kinds of ways to protect your data in the cloud when you’re leveraging a SaaS service.”</p>
<p>04:43 <a href="https://www.theregister.com/2025/10/08/salesforce_refuses_to_pay_ransomware/">Take this rob and shove it! Salesforce issues stern retort to ransomware </a><a href="https://www.theregister.com/2025/10/08/salesforce_refuses_to_pay_ransomware/">extort</a></p>
<ul>
<li style="font-weight:400;"><a href="https://login.salesforce.com/">Salesforce</a> is refusing to pay <a href="https://www.theregister.com/2025/10/03/scattered_lapsus_hunters_latest_leak/">ransomware demands</a> from criminals claiming to have stolen nearly 1 billion customer records, stating they will not engage, negotiate with, or pay any extortion dema...</li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:00) - Cloud Pod: Oracle Explains The Dark Side</li><li>(00:01:31) - Cloud Security: Sonicwall Hacking</li><li>(00:04:44) - Salesforce Rejects Ransomware Demand</li><li>(00:07:04) - OpenAI's AI Agent Kit and More</li><li>(00:10:10) - Google's Gemini 2.5 for UIs</li><li>(00:12:20) - Amazon Is Moving 19 AWS Services to Maintenance Mode</li><li>(00:16:30) - AWS Direct Connect now offers 100 Gigabytes dedicated connections with Mac</li><li>(00:17:37) -  AWS Identity Center now supports customer-managed KMS Keys</li><li>(00:18:56) - Amazon QuickSuite M8A New Instance Launch</li><li>(00:22:31) - Amazon Hires Former Data Stack CEO as VP of Security Services and</li><li>(00:26:43) - Amazon Bedrock Agent Core</li><li>(00:28:35) - AWS Transports AI Inference to Custom Chips</li><li>(00:30:07) - Amazon EBS Volume Clones</li><li>(00:31:45) - Amazon EKS Adds Slurm to Kubernetes</li><li>(00:32:48) - GCP Introduces Gemini Enterprise as a Unified AI Platform</li><li>(00:35:44) - Google's LLM Eval Kit for Prompt Engineering</li><li>(00:37:57) - Google Cloud : NetApp Files for Enterprise Storage</li><li>(00:40:43) - GitHub to Move All Its Software to Azure</li><li>(00:45:17) - Microsoft Deploys First Production Cluster with Nvidia GB300 GPUs</li><li>(00:48:31) - Oracle's Dark Mode in Oci</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 326 of The Cloud Pod, where the forecast is always cloudy! Justin and Ryan are your guides to all things cloud and AI this week! We’ve got news from SonicWall (and it’s not great), a host of goodbyes to say over at AWS, Oracle (finally) joins the dark side, and even Slurm – and you don’t even need to ride on a creepy river to experience it. Let’s get started! 
Titles we almost went with this week

 SonicWall’s Cloud Backup Service: From 5% to Oh No, That’s Everyone
 AWS Spring Cleaning: 19 Services Get the Boot
 The Great AWS Service Purge of 2025
 Maintenance Mode: Where Good Services Go to Die
 GitHub Gets Assimilated: Resistance to Azure Migration is Futile
 Salesforce to Ransomware Gang: You Can’t Always Get What You Want
 Kansas City Gets the Need for Speed with 100G Direct Connect. Peter, what are you up too
 Gemini Takes the Wheel: Google’s AI Learns to Click and Type 
 Oracle Discovers the Dark Side (Finally Has Cookies)
 Azure Goes Full Blackwell: 4,600 Reasons to Upgrade Your GPU Game
 DataStax to the Future: AWS Hires Database CEO for Security Role
 The Clone Wars: EBS Strikes Back with Instant Volume Copies
 Slurm Dunk: AWS Brings HPC Scheduling to Kubernetes
 The Great Cluster Convergence: When Slurm Met EKS
 Codex sent me a DM that I’ll ignore too on Slack

General News 
01:24 SonicWall: Firewall configs stolen for all cloud backup customers

SonicWall confirmed that all customers using their cloud backup service had firewall configuration files exposed in a breach, expanding from their initial estimate of 5% to 100% of cloud backup users. That’s a big difference…
The exposed backup files contain AES-256-encrypted credentials and configuration data, which could include MFA seeds for TOTP authentication, potentially explaining recent Akira ransomware attacks that bypassed MFA.
SonicWall requires affected customers to reset all credentials, including local user passwords, TOTP codes, VPN shared secrets, API keys, and authentication tokens across their entire infrastructure.
This incident highlights a fundamental security risk of cloud-based configuration backups where sensitive credentials are stored centrally, making them attractive targets for attackers.
The breach demonstrates why WebAuthn/passkeys offer superior security architecture since they don’t rely on shared secrets that can be stolen from backups or servers.
Interested in checking out their detailed remediation guidance? Find that here. 

02:36  Justin – “You know, providing your own encryption keys is also good; not allowing your SaaS vendor to have the encryption key is a positive thing to do. There’s all kinds of ways to protect your data in the cloud when you’re leveraging a SaaS service.”
04:43 Take this rob and shove it! Salesforce issues stern retort to ransomware extort

Salesforce is refusing to pay ransomware demands from criminals claiming to have stolen nearly 1 billion customer records, stating they will not engage, negotiate with, or pay any extortion dema...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[326: Oracle Discovers the Dark Side (And Finally Has Cookies)]]>
                </itunes:title>
                                    <itunes:episode>326</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<h3>Welcome to episode 326 of The Cloud Pod, where the forecast is always cloudy! Justin and Ryan are your guides to all things cloud and AI this week! We’ve got news from SonicWall (and it’s not great), a host of goodbyes to say over at AWS, Oracle (finally) joins the dark side, and even Slurm – and you don’t even need to ride on a creepy river to experience it. Let’s get started! </h3>
<h3>Titles we almost went with this week</h3>
<ul>
<li> SonicWall’s Cloud Backup Service: From 5% to Oh No, That’s Everyone</li>
<li> AWS Spring Cleaning: 19 Services Get the Boot</li>
<li> The Great AWS Service Purge of 2025</li>
<li> Maintenance Mode: Where Good Services Go to Die</li>
<li> GitHub Gets Assimilated: Resistance to Azure Migration is Futile</li>
<li> Salesforce to Ransomware Gang: You Can’t Always Get What You Want</li>
<li> Kansas City Gets the Need for Speed with 100G Direct Connect. Peter, what are you up too</li>
<li> Gemini Takes the Wheel: Google’s AI Learns to Click and Type </li>
<li> Oracle Discovers the Dark Side (Finally Has Cookies)</li>
<li> Azure Goes Full Blackwell: 4,600 Reasons to Upgrade Your GPU Game</li>
<li> DataStax to the Future: AWS Hires Database CEO for Security Role</li>
<li> The Clone Wars: EBS Strikes Back with Instant Volume Copies</li>
<li> Slurm Dunk: AWS Brings HPC Scheduling to Kubernetes</li>
<li> The Great Cluster Convergence: When Slurm Met EKS</li>
<li> Codex sent me a DM that I’ll ignore too on Slack</li>
</ul>
<h2>General News </h2>
<p>01:24 <a href="https://share.google/9FyWqhZCrsMWscoF8">SonicWall: Firewall configs stolen for all cloud backup customers</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.sonicwall.com/products/remote-access/vpn-clients">SonicWall</a> confirmed that all customers using their cloud backup service had firewall configuration files exposed in a breach, expanding from their initial estimate of 5% to 100% of cloud backup users. That’s a big difference…</li>
<li style="font-weight:400;">The exposed backup files contain AES-256-encrypted credentials and configuration data, which could include MFA seeds for TOTP authentication, potentially explaining recent Akira ransomware attacks that bypassed MFA.</li>
<li style="font-weight:400;">SonicWall requires affected customers to reset all credentials, including local user passwords, TOTP codes, VPN shared secrets, API keys, and authentication tokens across their entire infrastructure.</li>
<li style="font-weight:400;">This incident highlights a fundamental security risk of cloud-based configuration backups where sensitive credentials are stored centrally, making them attractive targets for attackers.</li>
<li style="font-weight:400;">The breach demonstrates why WebAuthn/passkeys offer superior security architecture since they don’t rely on shared secrets that can be stolen from backups or servers.</li>
<li style="font-weight:400;">Interested in checking out their detailed remediation guidance? Find that <a href="https://www.sonicwall.com/support/knowledge-base/remediation-playbook/250916130050523">here</a>. </li>
</ul>
<p>02:36  Justin – “You know, providing your own encryption keys is also good; not allowing your SaaS vendor to have the encryption key is a positive thing to do. There’s all kinds of ways to protect your data in the cloud when you’re leveraging a SaaS service.”</p>
<p>04:43 <a href="https://www.theregister.com/2025/10/08/salesforce_refuses_to_pay_ransomware/">Take this rob and shove it! Salesforce issues stern retort to ransomware </a><a href="https://www.theregister.com/2025/10/08/salesforce_refuses_to_pay_ransomware/">extort</a></p>
<ul>
<li style="font-weight:400;"><a href="https://login.salesforce.com/">Salesforce</a> is refusing to pay <a href="https://www.theregister.com/2025/10/03/scattered_lapsus_hunters_latest_leak/">ransomware demands</a> from criminals claiming to have stolen nearly 1 billion customer records, stating they will not engage, negotiate with, or pay any extortion demand. </li>
<li style="font-weight:400;">This firm stance sets a precedent for how major cloud providers handle ransomware attacks.</li>
<li style="font-weight:400;">The stolen data appears to be from previous breaches rather than new intrusions, specifically from when <a href="https://krebsonsecurity.com/2025/10/shinyhunters-wage-broad-corporate-extortion-spree/">ShinyHunters</a> compromised <a href="https://www.salesloft.com/">Salesloft’s</a> Drift application earlier this year. </li>
<li style="font-weight:400;">The attackers used stolen OAuth tokens to access multiple companies’ Salesforce instances.</li>
<li style="font-weight:400;">The incident highlights the security risks of third-party integrations in cloud environments, as the breach originated through a compromised integration app rather than Salesforce’s core platform. </li>
<li style="font-weight:400;">This demonstrates how supply chain vulnerabilities can expose customer data across multiple organizations.</li>
<li style="font-weight:400;">Scattered LAPSUS$ Hunters set an October 10 deadline for payment and offered $10 in Bitcoin to anyone willing to harass executives of affected companies. This unusual tactic shows evolving extortion methods beyond traditional ransomware encryption.</li>
<li style="font-weight:400;">Salesforce maintains there’s no indication their platform has been compromised, and no known vulnerabilities in their technology were exploited. The company is working with external experts and authorities while supporting affected customers through the incident.</li>
</ul>
<p>06:31  Ryan – “I do also really like Salesforce’s response, just because I feel like the ransomware has gotten a little out of hand, and I think a lot of companies are quiet quietly sort of paying these ransoms, which has only made the attacks just skyrocket. So making a big public show of saying we’re not going to pay for this is, is a good idea.”</p>
<h2>AI is Going Great – Or How ML Makes Money </h2>
<p>07:06 <a href="https://openai.com/index/introducing-agentkit/">Introducing AgentKit</a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/">OpenAI’s</a> <a href="https://openai.com/index/introducing-agentkit/">AgentKit</a> provides a framework for building and managing AI agents with simplified deployment and customization options, addressing the growing need for autonomous AI systems in cloud environments.</li>
<li style="font-weight:400;">The tool integrates with existing OpenAI technologies and supports multiple programming languages, enabling developers to create agents that can interact with various cloud services and APIs without extensive infrastructure setup.</li>
<li style="font-weight:400;">AgentKit’s architecture allows for efficient agent lifecycle management, including deployment, monitoring, and behavior customization, which could reduce operational overhead for businesses running AI workloads at scale.</li>
<li style="font-weight:400;">Key use cases include automated customer service agents, data processing pipelines, and intelligent workflow automation that can adapt to changing conditions in cloud-native applications.</li>
<li style="font-weight:400;">This development matters for cloud practitioners as it potentially lowers the barrier to entry for implementing sophisticated AI agents while providing the scalability and reliability expected in enterprise cloud deployments</li>
</ul>
<p>09:03 <a href="https://openai.com/index/codex-now-generally-available/">Codex Now Generally Available</a></p>
<ul>
<li style="font-weight:400;">OpenAI’s Codex is now generally available, offering GPT-3-based AI that’s fine-tuned specifically for code generation and understanding across multiple programming languages. This represents a significant advancement in AI-assisted development tools becoming mainstream.</li>
<li style="font-weight:400;">Several new features, A new Slack integration: Delegate tasks or ask questions to Codex directly from a team channel or thread, just like a coworker</li>
<li style="font-weight:400;">Codex SDK to embed the same agent that powers Codex CLI to your own workflows, tools, and apps for state-of-the-art performance on <a href="https://openai.com/index/introducing-upgrades-to-codex/">GPT-5-Codex</a> without more tuning</li>
<li style="font-weight:400;">New Admin tools with environment controls, monitoring, and analytics dashboards. ChatGPT workspace admins now have more control</li>
</ul>
<p>09:48  Ryan – “I don’t know why, but something about having it available in Slack to boss it around sort of rubs me the wrong way. I feel like it’s the poor new college grad joining the team  – it’s just delegated all the crap jobs.” </p>
<p>10:14 <a href="https://blog.google/technology/google-deepmind/gemini-computer-use-model/">Introducing the Gemini 2.5 Computer Use model</a></p>
<ul>
<li style="font-weight:400;">Google released <a href="http://ai.google.dev/gemini-api/docs/computer-use">Gemini 2.5 Computer Use mode</a>l via Gemini API, enabling AI agents to interact with graphical user interfaces through clicking, typing, and scrolling actions – available in <a href="http://ai.google.dev/gemini-api/docs/computer-use">Google AI Studio</a> and <a href="https://cloud.google.com/vertex-ai/generative-ai/docs/computer-use">Vertex AI</a> for developers to build automation agents.</li>
<li style="font-weight:400;">The model operates in a loop using screenshots and action history to navigate web pages and applications, outperforming competitors on web and mobile control benchmarks while maintaining the lowest latency among tested solutions.</li>
<li style="font-weight:400;">Built-in safety features include per-step safety service validation and system instructions to prevent high-risk actions like bypassing CAPTCHA or compromising security, with developers able to require user confirmation for sensitive operations.</li>
<li style="font-weight:400;">Early adopters, including Google teams, use it for UI testing and workflow automation, with the model already powering <a href="https://deepmind.google/models/project-mariner/">Project Mariner</a>, <a href="https://firebase.blog/posts/2025/04/app-testing-agent/">Firebase Testing Agent</a>, and <a href="https://blog.google/products/search/ai-mode-agentic-personalized/">AI Mode in Search</a> – demonstrating practical enterprise applications.</li>
<li style="font-weight:400;">This represents a shift from API-only interactions to visual UI control, enabling automation of tasks that previously required human interaction like form filling, dropdown navigation, and operating behind login screens.</li>
</ul>
<p>11:48  Ryan – “I think this is the type of thing that really is going to get AI to be as big as the Agentic model in general; having it be able to understand click and UIs and operate on people’s behalf. It’s going to open up just a ton of use cases for it.”    </p>
<h2>AWS</h2>
<p>12:35 <a href="https://aws.amazon.com/products/lifecycle/announcement/">AWS Service Availability Change Announcement</a></p>
<ul>
<li style="font-weight:400;">AWS is moving 19 services to maintenance mode starting November 7, 2025, including <a href="https://docs.aws.amazon.com/amazonglacier/latest/dev/introduction.html">Amazon Glacier</a>, <a href="https://docs.aws.amazon.com/codecatalyst/latest/userguide/migration.html">AWS CodeCatalyst</a>, and <a href="https://docs.aws.amazon.com/frauddetector/latest/ug/what-is-frauddetector.html">Amazon Fraud Detector</a> – existing customers can continue using these services but new customers will be blocked from adoption.</li>
<li style="font-weight:400;">Several migration-focused services are being deprecated, including <a href="https://docs.aws.amazon.com/migrationhub/latest/ug/migrationhub-availability-change.html">AWS Migration Hub</a>, <a href="https://docs.aws.amazon.com/application-discovery/latest/userguide/application-discovery-service-availability-change.html">AWS Application Discovery Service</a>, and <a href="https://docs.aws.amazon.com/m2/latest/userguide/mainframe-modernization-availability-change.html">AWS Mainframe Modernization Service</a>, signaling AWS may be consolidating or rethinking its migration tooling strategy.</li>
<li style="font-weight:400;">The deprecation of <a href="https://aws.amazon.com/s3/features/object-lambda/">Amazon S3 Object Lambda</a> and <a href="https://aws.amazon.com/cloud-directory/">Amazon Cloud Directory</a> suggests AWS is streamlining overlapping functionality – customers will need to evaluate alternatives like Lambda@Edge or AWS Directory Service for similar capabilities.</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/snowball/latest/developer-guide/device-differences.html">AWS Snowball Edge Compute Optimized</a> and Storage Optimized entering maintenance indicates AWS is likely pushing customers toward newer edge computing solutions like AWS Outposts or Local Zones for hybrid deployments.</li>
<li style="font-weight:400;">The sunset of specialized services like AWS HealthOmics Variant Store and AWS IoT SiteWise Monitor shows AWS pruning niche offerings that may have had limited adoption or overlapping functionality with other services.</li>
</ul>
<p>13:53  Ryan – “It’s interesting, because I was a heavy user of CodeGuru and CodeCatalyst for a while, so the announcement I got as a customer was a lot less friendly than maintenance mode. It was like, your stuff’s going to end. So I don’t know if it’s true across all these services, but I know with at least those two. I did not get one for Glacier – because I also have a ton of stuff in Glacier, because I’m cheap.” </p>
<p>17:01 <a href="https://aws.amazon.com/about-aws/whats-new/2025/10/aws-direct-connect-100g-expansion-kansas-city/">AWS Direct Connect announces 100G expansion in Kansas City, MO</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/directconnect">AWS Direct Connect</a> now offers 100 Gbps dedicated connections with MACsec encryption at the Netrality KC1 data center in Kansas City, expanding high-bandwidth private connectivity options in the central US region.</li>
<li style="font-weight:400;">The Kansas City location provides direct network access to all public AWS Regions (except China), <a href="https://docs.aws.amazon.com/govcloud-us/latest/UserGuide/using-govcloud.html">AWS GovCloud Regions</a>, and <a href="https://aws.amazon.com/about-aws/global-infrastructure/localzones/">AWS Local Zones</a>, making it a strategic connectivity hub for enterprises in the Midwest.</li>
<li style="font-weight:400;">With 100G connections and MACsec encryption, organizations can achieve lower latency and enhanced security for workloads requiring high throughput, such as data analytics, media processing, or hybrid cloud architectures.</li>
<li style="font-weight:400;">This expansion brings AWS Direct Connect to over 146 locations worldwide, reinforcing AWS’s commitment to providing enterprises with reliable alternatives to internet-based connectivity for mission-critical applications.</li>
<li style="font-weight:400;">For businesses evaluating Direct Connect, the 100G option typically suits large-scale data transfers and enterprises with substantial bandwidth requirements, while the 10G option remains available for more moderate connectivity needs.</li>
</ul>
<p>18:07 <a href="https://aws.amazon.com/blogs/aws/aws-iam-identity-center-now-supports-customer-managed-kms-keys-for-encryption-at-rest/">AWS IAM Identity Center now supports customer-managed KMS keys for </a><a href="https://aws.amazon.com/blogs/aws/aws-iam-identity-center-now-supports-customer-managed-kms-keys-for-encryption-at-rest/">encryption at rest | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/iam/identity-center/">AWS IAM Identity Center</a> now supports <a href="https://docs.aws.amazon.com/kms/latest/developerguide/concepts.html">customer-managed KMS keys</a> for encrypting identity data at rest, giving organizations in regulated industries full control over encryption key lifecycle, including creation, rotation, and deletion. This addresses compliance requirements for customers who previously could only use <a href="https://aws.amazon.com/kms/">AWS-owned keys</a>.</li>
<li style="font-weight:400;">The feature requires symmetric KMS keys in the same AWS account and region as the Identity Center instance, with multi-region keys recommended for future flexibility. Implementation involves creating the key, configuring detailed permissions for Identity Center services and administrators, and updating IAM policies for cross-account access.</li>
<li style="font-weight:400;">Not all AWS managed applications currently support Identity Center with customer-managed keys – administrators <a href="https://docs.aws.amazon.com/singlesignon/latest/userguide/awsapps-that-work-with-identity-center.html">must verify compatibility</a> before enabling to avoid service disruptions. The documentation provides specific policy templates for common use cases, including delegated administrators and application administrators.</li>
<li style="font-weight:400;">Standard AWS KMS pricing applies for key storage and API usage while Identity Center remains free. The feature is available in all AWS commercial regions, GovCloud, and China regions.</li>
<li style="font-weight:400;">Key considerations include the critical nature of proper permission configuration – incorrect setup can disrupt Identity Center operations and access to AWS accounts. Organizations should implement encryption context conditions to restrict key usage to specific Identity Center instances for enhanced security.</li>
</ul>
<p>18:52  Justin – “Encrypt setup can disrupt Identity Center operations, like revoking your encryption key, might be bad for your access to your cloud. So be careful with this one.” </p>
<p>19:28 <a href="https://aws.amazon.com/blogs/aws/new-general-purpose-amazon-ec2-m8a-instances-are-now-available/">New general-purpose Amazon EC2 M8a instances are now available | AWS </a><a href="https://aws.amazon.com/blogs/aws/new-general-purpose-amazon-ec2-m8a-instances-are-now-available/">News Blog</a></p>
<ul>
<li style="font-weight:400;">AWS launches M8a instances powered by <a href="https://www.amd.com/en/products/processors/server/epyc/9005-series.html">5th Gen AMD EPYC Turin processors</a>, delivering up to 30% better performance and 19% better price-performance than M7a instances for general-purpose workloads.</li>
<li style="font-weight:400;">The new instances feature 45% more memory bandwidth and 50% improvements in networking (75 Gbps) and EBS bandwidth (60 Gbps), making them suitable for financial applications, gaming, databases, and SAP-certified enterprise workloads.</li>
<li style="font-weight:400;">M8a introduces instance bandwidth configuration (IBC), allowing customers to flexibly allocate resources between networking and EBS bandwidth by up to 25%, optimizing for specific workload requirements.</li>
<li style="font-weight:400;">Each vCPU maps to a physical CPU core without SMT, resulting in up to 60% faster <a href="https://groovy-lang.org/">GroovyJVM</a> performance and 39% faster <a href="https://cassandra.apache.org/_/index.html">Cassandra</a> performance compared to M7a instances.</li>
<li style="font-weight:400;">Available in 12 sizes from small to metal-48xl (192 vCPU, 768GiB RAM) across three regions initially, with standard pricing options including <a href="https://aws.amazon.com/ec2/pricing/on-demand/">On-Demand</a>, <a href="https://aws.amazon.com/savingsplans/">Savings Plans</a>, and <a href="https://aws.amazon.com/ec2/spot/pricing/">Spot instances</a>.</li>
</ul>
<p>20:01  Ryan – “That’s a big one! I still don’t have a use case for it.” </p>
<p> </p>
<p>20:09 <a href="https://aws.amazon.com/blogs/aws/reimagine-the-way-you-work-with-ai-agents-in-amazon-quick-suite/">Announcing Amazon Quick Suite: your agentic teammate for answering </a><a href="https://aws.amazon.com/blogs/aws/reimagine-the-way-you-work-with-ai-agents-in-amazon-quick-suite/">questions and taking action | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/quicksuite/">Amazon Quick Suite</a> combines AI-powered research, business intelligence, and automation into a single workspace, eliminating the need to switch between multiple applications for data gathering and analysis. </li>
<li style="font-weight:400;">The service includes Quick Research for comprehensive analysis across enterprise and external sources, Quick Sight for natural language BI queries, and Quick Flows/Automate for process automation.</li>
<li style="font-weight:400;">Quick Index serves as the foundational knowledge layer, creating a unified searchable repository across databases, documents, and applications that powers AI responses throughout the suite. This addresses the common enterprise challenge of fragmented data sources by consolidating everything from S3, <a href="https://www.snowflake.com/en/">Snowflake</a>, <a href="https://www.google.com/drive/">Google Drive</a>, and <a href="https://support.microsoft.com/en-us/office/sign-in-to-sharepoint-324a89ec-e77b-4475-b64a-13a0c14c45ec">SharePoint</a> into one intelligent knowledge base.</li>
<li style="font-weight:400;">The automation capabilities are split between <a href="https://aws.amazon.com/quicksuite/flows/">Quick Flows for business</a> users (natural language workflow creation) and <a href="https://docs.aws.amazon.com/quicksuite/latest/userguide/using-amazon-quick-automate.html">Quick Automate</a> for technical teams (complex multi-department processes with approval routing and system integrations). </li>
<li style="font-weight:400;">Both tools generate workflows from simple descriptions, but Quick Automate handles enterprise-scale processes like customer onboarding with advanced orchestration and monitoring.</li>
<li style="font-weight:400;">Existing Amazon QuickSight customers will be automatically upgraded to Quick Suite with all current BI capabilities preserved under the “Quick Sight” branding, maintaining the same data connectivity, security controls, and user permissions. Pricing follows a per-user subscription model with consumption-based charges for Quick Index and optional features.</li>
<li style="font-weight:400;">The service introduces “Spaces” for contextual data organization and custom chat agents that can be configured for specific departments or use cases, enabling teams to create tailored AI assistants connected to relevant datasets and workflows. This allows organizations to scale from personal productivity tools to enterprise-wide deployment while maintaining access controls.</li>
</ul>
<p>22:13  Justin – “This is a confusing product. It’s doing a lot of things, probably kind of poorly.” </p>
<p>23:13 <a href="https://www.businessinsider.com/aws-strengthens-ai-security-with-datastax-ceo-as-new-vp-2025-10?utm_campaign=business-sf&amp;utm_source=linkedin&amp;utm_medium=social">AWS Strengthens AI Security by Hiring Ex-DataStax CEO As New VP – </a><a href="https://www.businessinsider.com/aws-strengthens-ai-security-with-datastax-ceo-as-new-vp-2025-10?utm_campaign=business-sf&amp;utm_source=linkedin&amp;utm_medium=social">Business Insider</a></p>
<ul>
<li style="font-weight:400;">AWS hired Chet Kapoor, former <a href="https://www.datastax.com/">DataStax</a> CEO, as VP of Security Services and Observability, reporting directly to CEO Matt Garman, to strengthen security offerings as AWS expands its AI business.</li>
<li style="font-weight:400;">Kapoor brings experience from DataStax, where he led <a href="https://www.datastax.com/products/datastax-astra">Astra DB</a> development and integrated real-time AI capabilities, positioning him to address the security challenges of increasingly complex cloud deployments.</li>
<li style="font-weight:400;">The role consolidates leadership of security services, governance, and operations portfolios under one executive, with teams from Gee Rittenhouse, Nandini Ramani, Georgia Sitaras, and Brad Marshall now reporting to Kapoor.</li>
<li style="font-weight:400;">This hire follows recent AWS leadership changes, including the departures of VP of AI Matt Wood and VP of generative AI Vasi Philomin, signaling AWS’s focus on strengthening AI security expertise.</li>
<li style="font-weight:400;">Kapoor will work alongside AWS CISO Amy Herzog to develop security and observability services that address what Garman describes as changing requirements driven by AI adoption.</li>
</ul>
<p>26:03  Justin – “Also, DataStax was bought by IBM – and everyone knows that anything bought by IBM will be killed mercilessly.” </p>
<p>26:50 <a href="https://aws.amazon.com/about-aws/whats-new/2025/10/amazon-bedrock-agentcore-available/">Amazon Bedrock AgentCore is now generally available</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/bedrock/agentcore/">Amazon Bedrock AgentCore</a> provides a managed platform for building and deploying AI agents that can execute for up to 8 hours with complete session isolation, supporting any framework like <a href="https://www.crewai.com/">CrewAI</a>, <a href="https://www.langchain.com/langgraph">LangGraph</a>, or <a href="https://www.llamaindex.ai/">LlamaIndex</a>, and any model inside or outside Amazon Bedrock.</li>
<li style="font-weight:400;">The service includes five core components: Runtime for execution, Memory for state management, Gateway for tool integration via Model Context Protocol, Identity for OAuth and IAM authorization, and Observability with <a href="https://docs.aws.amazon.com/AmazonCloudWatch/latest/monitoring/WhatIsCloudWatch.html">CloudWatch</a> dashboards and OTEL compatibility for monitoring agents in production.</li>
<li style="font-weight:400;">AgentCore enables agents to communicate with each other through Agent-to-Agent protocol support and securely act on behalf of users with identity-aware authorization, making it suitable for enterprise automation scenarios that require extended execution times and complex tool interactions.</li>
<li style="font-weight:400;">The platform eliminates infrastructure management while providing enterprise features like VPC support, <a href="https://docs.aws.amazon.com/vpc/latest/privatelink/what-is-privatelink.html">AWS PrivateLink</a>, and <a href="https://docs.aws.amazon.com/AWSCloudFormation/latest/UserGuide/Welcome.html">CloudFormation</a> templates, with consumption-based pricing and no upfront costs across nine AWS regions.</li>
<li style="font-weight:400;">Integration with existing observability tools like <a href="https://www.datadoghq.com/">Datadog</a>, <a href="https://www.dynatrace.com/">Dynatrace</a>, and <a href="https://docs.langchain.com/langsmith/home">LangSmith</a> allows teams to monitor agent performance using their current toolchain, while the self-managed memory strategy gives developers control over how agents store and process information.</li>
</ul>
<p>28:17  Ryan – “This really to me, seems like a full app, you know, like this is a core component instead of doing development; you’re just taking  AI agents, putting them together, and giving them tasks. Then, the eight-hour runtime is crazy. It feels like it’s getting warmer in here just reading that.”</p>
<p>28:49 <a href="https://www.theinformation.com/briefings/aws-custom-chip-now-powers-key-ai-cloud-service">AWS’ Custom Chip Now Powers Most of Its Key AI Cloud Service — The </a><a href="https://www.theinformation.com/briefings/aws-custom-chip-now-powers-key-ai-cloud-service">Information</a></p>
<ul>
<li style="font-weight:400;">AWS has transitioned the majority of its AI inference workloads to its custom <a href="https://www.bing.com/ck/a?!&amp;&amp;p=75a6b3478c33c16608b3609046b750fc8b242f5c1eb52b99b9a6f67908f4610aJmltdHM9MTc2MTA5MTIwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=Inferentia+&amp;u=a1aHR0cHM6Ly9hd3MuYW1hem9uLmNvbS9haS9tYWNoaW5lLWxlYXJuaW5nL2luZmVyZW50aWEv">Inferentia chips</a>, marking a significant shift away from Nvidia GPUs for production AI services. </li>
<li style="font-weight:400;">The move demonstrates AWS’s commitment to vertical integration and cost optimization in the AI infrastructure space.</li>
<li style="font-weight:400;">Inferentia chips now handle most inference tasks for services like <a href="https://aws.amazon.com/bedrock/">Amazon Bedrock</a>, <a href="https://aws.amazon.com/sagemaker/">SageMaker</a>, and internal AI features across AWS products. </li>
<li style="font-weight:400;">This custom silicon strategy allows AWS to reduce dependency on expensive third-party GPUs while potentially offering customers lower-cost AI inference options.</li>
<li style="font-weight:400;">The shift to Inferentia represents a broader industry trend where cloud providers develop custom chips to differentiate their services and control costs. AWS can now optimize the entire stack from silicon to software for specific AI workloads, similar to Apple’s approach with its M-series chips.</li>
<li style="font-weight:400;">For AWS customers, this transition could mean more predictable pricing and better performance-per-dollar for inference workloads. The custom chips are specifically designed for inference rather than training, making them more efficient for production AI applications.</li>
<li style="font-weight:400;">This development positions AWS to compete more effectively with other cloud providers on AI pricing while maintaining control over its technology roadmap. </li>
<li style="font-weight:400;">Customers running inference-heavy workloads may see cost benefits as AWS passes along savings from reduced reliance on Nvidia hardware</li>
</ul>
<p>29:39  Ryan – “Explains all the Oracle and Azure Nvidia announcements.” </p>
<p>30:16 <a href="https://aws.amazon.com/blogs/aws/introducing-amazon-ebs-volume-clones-create-instant-copies-of-your-ebs-volumes/">Introducing Amazon EBS Volume Clones: Create instant copies of your </a><a href="https://aws.amazon.com/blogs/aws/introducing-amazon-ebs-volume-clones-create-instant-copies-of-your-ebs-volumes/">EBS volumes | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/category/storage/amazon-elastic-block-storage-ebs/">Amazon EBS</a> Volume Clones enables instant point-in-time copies of encrypted EBS volumes within the same Availability Zone through a single API call, eliminating the previous multi-step process of creating snapshots in <a href="https://aws.amazon.com/s3/">S3</a> and then new volumes.</li>
<li style="font-weight:400;">Cloned volumes are available within seconds with single-digit millisecond latency, though performance during initialization is limited to the lowest of: 3,000 IOPS/125 MiB/s baseline, source volume performance, or target volume performance.</li>
<li style="font-weight:400;">This feature targets development and testing workflows where teams need quick access to production data copies, but it complements rather than replaces EBS snapshots, which remain the recommended backup solution with 11 nines durability in S3.</li>
<li style="font-weight:400;">Pricing includes a one-time fee per GiB of source volume data at initiation, plus standard EBS charges for the new volume, making cost governance important since cloned volumes persist independently until manually deleted.</li>
<li style="font-weight:400;">The feature currently requires encrypted volumes and operates only within the same Availability Zone, supporting all EBS volume types across AWS commercial regions and select Local Zones.</li>
</ul>
<p>32:06 <a href="https://aws.amazon.com/blogs/containers/running-slurm-on-amazon-eks-with-slinky/">Running Slurm on Amazon EKS with Slinky | Containers</a></p>
<ul>
<li style="font-weight:400;">AWS introduces <a href="https://github.com/SlinkyProject">Slinky</a>, an open source project that lets you run Slurm workload manager inside Amazon EKS, enabling organizations to manage both traditional HPC batch jobs and modern <a href="https://aws.amazon.com/blogs/containers/category/compute/amazon-kubernetes-service/">Kubernetes</a> workloads on the same infrastructure without maintaining separate clusters.</li>
<li style="font-weight:400;">The solution deploys <a href="https://slurm.schedmd.com/overview.html">Slurm</a> components as Kubernetes pods with slurmctld on general-purpose nodes and slurmd on GPU/accelerated nodes, supporting features like auto-scaling worker pods based on job queues and integration with Karpenter for dynamic EC2 provisioning.</li>
<li style="font-weight:400;">Key benefit is resource optimization – AI inference workloads can scale during business hours while training jobs scale overnight using the same compute pool, with teams able to use familiar Slurm commands (sbatch, srun) alongside Kubernetes APIs.</li>
<li style="font-weight:400;">Slinky provides an alternative to <a href="https://aws.amazon.com/hpc/parallelcluster/">AWS ParallelCluster</a> (self-managed), AWS PCS (managed Slurm), and <a href="https://aws.amazon.com/sagemaker/ai/hyperpod/">SageMaker HyperPod</a> (ML-optimized) for organizations already standardized on EKS who need deterministic scheduling for long-running jobs.</li>
<li style="font-weight:400;">The architecture supports custom container images, allowing teams to package specific ML dependencies (CUDA, PyTorch versions) directly into worker pods, eliminating manual environment management while maintaining reproducibility across environments.</li>
</ul>
<h2>GCP</h2>
<p>33:09 <a href="https://cloud.google.com/blog/products/ai-machine-learning/introducing-gemini-enterprise/">Introducing Gemini Enterprise | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google launches <a href="https://www.bing.com/ck/a?!&amp;&amp;p=6866f4892c38c54d2b1af52bdb449a6ee053608589cc11edffb963501bafa58bJmltdHM9MTc2MTA5MTIwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=Gemini+Enterprise&amp;u=a1aHR0cHM6Ly9jbG91ZC5nb29nbGUuY29tL2dlbWluaS1lbnRlcnByaXNl">Gemini Enterprise</a> as a unified AI platform that combines Gemini models, no-code agent building, pre-built agents, data connectors for <a href="https://workspace.google.com/">Google Workspace</a> and <a href="https://www.office.com/">Microsoft 365</a>, and centralized governance through a single chat interface. </li>
<li style="font-weight:400;">This positions Google as offering a complete AI stack, rather than just models or toolkits like competitors.</li>
<li style="font-weight:400;">The platform includes notable integrations with Microsoft 365 and <a href="https://www.microsoft.com/microsoft-365/sharepoint/collaboration">SharePoint</a> environments while offering enhanced features when paired with Google Workspace, including new multimodal agents for video creation (Google Vids with 2.5M monthly users) and real-time speech translation in <a href="https://meet.google.com/">Google Meet</a>. This cross-platform approach differentiates it from more siloed offerings.</li>
<li style="font-weight:400;">Google introduces next-generation conversational agents with a low-code visual builder supporting 40+ languages, powered by the latest Gemini models for natural voice interactions and deep enterprise integration. </li>
<li style="font-weight:400;">Early adopters like Commerzbank report 70% inquiry resolution rates, and Mercari projects 500% ROI through 20% workload reduction.</li>
<li style="font-weight:400;">The announcement includes new developer tools like Gemini CLI (1M+ developers in 3 months) with extensions from <a href="https://www.atlassian.com/">Atlassian</a>, <a href="https://gitlab.com/users/sign_in">GitLab</a>, <a href="https://www.mongodb.com/">MongoDB</a>, and others, plus industry protocols for agent interoperability (A2A), payments (AP2), and model context (MCP). </li>
<li style="font-weight:400;">This creates a foundation infrastructure for an agent economy where developers can monetize specialized agents.</li>
<li style="font-weight:400;">Google’s partner ecosystem includes 100,000+ partners with expanded integrations for Box, Salesforce, ServiceNow, and deployment support from Accenture, Deloitte, and others. </li>
<li style="font-weight:400;">The company also launches <a href="https://cloud.google.com/blog/topics/training-certifications/google-skills-new-home-ai-learning">Google Skills</a> training platform and GEAR program to train 1 million developers, addressing the critical skills gap in enterprise AI adoption.</li>
</ul>
<p>35:01  Justin – “I think both Azure and Amazon have similar problems; they are rushing so fast to make products, that they’re creating the same products over and over again, just with slightly different limitations or use cases.” </p>
<p>36:05 <a href="https://cloud.google.com/blog/products/ai-machine-learning/introducing-llm-evalkit/">Introducing LLM-Evalkit | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google releases <a href="https://github.com/GoogleCloudPlatform/generative-ai/tree/main/tools/llmevalkit">LLM-Evalkit</a>, an open-source framework that centralizes prompt engineering workflows on <a href="https://cloud.google.com/vertex-ai">Vertex AI</a>, replacing the current fragmented approach of managing prompts across multiple documents and consoles.</li>
<li style="font-weight:400;">The tool shifts prompt development from subjective testing to data-driven iteration by requiring teams to define specific problems, create test datasets, and establish concrete metrics for measuring LLM performance.</li>
<li style="font-weight:400;">LLM-Evalkit features a no-code interface designed to democratize prompt engineering, allowing non-technical team members like product managers and UX writers to contribute to the development process.</li>
<li style="font-weight:400;">The framework integrates directly with Vertex AI SDKs and provides versioning, benchmarking, and performance tracking capabilities in a single application, addressing the lack of standardized evaluation processes in current workflows.</li>
<li style="font-weight:400;">Available now on GitHub as an open-source project, with additional evaluation features accessible through the Google Cloud console, though specific pricing details are not mentioned in the announcement.</li>
</ul>
<p>37:09  Ryan – “Reading through this announcement, it’s solving a problem I had – but I didn’t know I had.” </p>
<p>38:17 <a href="https://cloud.google.com/blog/products/storage-data-transfer/announcing-enhancements-to-google-cloud-netapp-volumes/">Announcing enhancements to Google Cloud NetApp Volumes | Google </a><a href="https://cloud.google.com/blog/products/storage-data-transfer/announcing-enhancements-to-google-cloud-netapp-volumes/">Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/netapp-volumes">Google Cloud NetApp Volumes</a> now supports iSCSI block storage alongside file storage, enabling enterprises to migrate SAN workloads to GCP without architectural changes. </li>
<li style="font-weight:400;">The service delivers up to 5 GiB/s throughput and 160K IOPS per volume with independent scaling of capacity, throughput, and IOPS.</li>
<li style="font-weight:400;"><a href="https://cloud.google.com/netapp/volumes/docs/configure-and-use/volumes/cache-ontap-volumes/overview">NetApp FlexCache</a> provides local read caches of remote volumes for distributed teams and hybrid cloud deployments. </li>
<li style="font-weight:400;">This allows organizations to access shared datasets with local-like performance across regions, supporting compute bursting scenarios that require low-latency data access.</li>
<li style="font-weight:400;">The service now integrates with Gemini Enterprise as a data store for RAG applications, allowing organizations to ground AI models on their secure enterprise data without complex ETL processes. </li>
<li style="font-weight:400;">Data remains governed within NetApp Volumes while being accessible for search and inference workflows.</li>
<li style="font-weight:400;">Auto-tiering automatically moves cold data to lower-cost storage at $0.03/GiB for the Flex service level, with configurable thresholds from 2-183 days. Large-capacity volumes now scale from 15TiB to 3PiB with over 21GiB/s throughput per volume for HPC and AI workloads.</li>
<li style="font-weight:400;"><a href="https://docs.netapp.com/us-en/ontap/concepts/snapmirror-disaster-recovery-data-transfer-concept.html">NetApp SnapMirror</a> enables replication between on-premises NetApp systems and Google Cloud with zero RPO and near-zero RTO. </li>
<li style="font-weight:400;">This positions GCP competitively against AWS FSx for NetApp ONTAP and Azure NetApp Files for enterprise storage migrations.</li>
</ul>
<p>40:30  Justin – “I have a specific workload that needs storage, that’s shared across boxes, and iSCSI is a great option for that, in addition to other methods you could use that I’m currently using, which have some sharp edges. So I’m definitely going to do some price calculation models. This might be good, because Google has multi-writer files, like EBS-type solutions, but does not have the performance that I need quite yet.”</p>
<h2>Azure</h2>
<p>41:08 <a href="https://thenewstack.io/github-will-prioritize-migrating-to-azure-over-feature-development/">GitHub Will Prioritize Migrating to Azure Over Feature Development – The </a><a href="https://thenewstack.io/github-will-prioritize-migrating-to-azure-over-feature-development/">New Stack</a></p>
<ul>
<li style="font-weight:400;">GitHub is migrating its entire infrastructure from its Virginia data center to Azure within 24 months, with teams being asked to delay feature development to focus on this migration due to capacity constraints from AI and Copilot workloads.</li>
<li style="font-weight:400;">The migration represents a significant shift from GitHub’s previous autonomy since Microsoft’s 2018 acquisition, with GitHub losing independence after CEO Thomas Dohmke’s departure and being folded deeper into Microsoft’s organizational structure.</li>
<li style="font-weight:400;">Technical challenges include migrating GitHub’s MySQL clusters that run on bare metal servers to Azure, which some employees worry could lead to more outages during the transition period, given recent service disruptions.</li>
<li style="font-weight:400;">This positions Azure to capture one of the world’s largest developer platforms as a flagship customer, demonstrating Azure’s ability to handle massive scale workloads while potentially raising concerns among open source developers about tighter Microsoft integration.</li>
<li style="font-weight:400;">The move highlights how AI workloads are straining traditional infrastructure, with GitHub citing “existential” needs to scale for AI and Copilot demands, showing how generative AI is forcing major architectural decisions across the industry.</li>
</ul>
<p>43:17  Ryan – “I just hope the service stays up; it’s so disruptive to my day job when GitHub has issues.” </p>
<p>43:33 <a href="https://www.theregister.com/2025/10/10/microsoft_365_na_outage/">Microsoft 365 services fall over in North America • The Register</a></p>
<ul>
<li style="font-weight:400;"><a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=SrfX0X8sUlwEnLKiBoZqWWpx6FDN7fwZDOPUjGchqD4cGC_TeB45S8nSFSosiKD2RDjqmKZhauLj_BblAofUBbWjJfszGbNopIAazii2HGv_c2WCCDbqyWSrqnfKsTYQ.ayaN6md5HB1FKm2IIMCRVg&amp;eddgt=f80u_3qzTk17V9UJA625yg%3D%3D&amp;rut=6a5913d6162010ff3a4f40b889feadb1c15cf11ae5197a68e18e6092accf148c&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De81ETug91Er5TeUGqcIexQWzVUCUwnNSaauPoNGcS4UiraLlb7y5rNky64luqfwJVG3cMtVKqeAalyHcKtRuWrBNTd5wSDlW_Xw_Z-XKv8nzxlo07JrltUUaUtrZ13SbSIPeF7A0Qt-c1za5h3miWJzneMnRcVxZCFs_sYe1LdeJ_kxiOZvz8_KQAtUhp6AZUOyK8q2qtQqqeeLK7SGPM3NTU-r7YRw8yEqB_wio-9rLyXN0tUl3tsXKYK4Lq5ql8PJ5124aDIEbDT5B2qcOarRHeL-6ykTJ2vAuiBb1bfGGNIvDhHV3D7RHhDpnlortWG6OdHOqyHGL_8nviisQaXxl39lR8qtSWdE01APYLOSH312Jv6g-Ikfhj3KNa93YYLW2CmpjPELmjUH-74CgVuaCpf2VIV2VADbdMVbOygQYGo_EwxUPJQSazRg1RrgDlw8mmXorG7Oq9pPnGY2ghGGGQvgytcw4T_eNy08k9Jsos9cI-qR6GTjEsGT4CDUH7FEk1FsKFYRE-cLG7zYwehOY1ZzVJtPneYI5NHhZpq4wVmvDYiaxWaA_BrD6xlqMZLwQkBiBQA2Wvf-czPfwDhQQHtJLVwUO5wuiApxMRaLiwMqrD4KB7IWv1p-EONxihmVBTGrr5ttGgMDpE_Ct3SSq1XP_1eZ_NUvM4VD6Ze9ioM8r1qGZWT2OIxuRPhsnVcdCNSe9MXwV5AjToU4LfjtnNifbexsZrzVFcp-ZROPQITvGsm2w9t63LdOId-Tj857QnJTQ%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0NDAlMjZjYW1wJTNkMTc0MTQyJTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDI1MjYlMjZjcml0ZXJpYWlkJTNka3dkLTc5MzcxNjEwMDc2NjI5JTNhbG9jLTcxMjI4JTI2Y2FtcGFpZ25pZCUzZDU5MDM2MTI0NyUyNmxvY3BoeSUzZDgwMDU4JTI2YWRncm91cGlkJTNkMTI2OTkzNzU5NTgwODY2NSUyNmNpZCUzZDc5MzcxMjA2NTMxMDIwJTI2a2R2JTNkYyUyNmtleHQlM2QlMjZrcGclM2QlMjZrcGlkJTNkJTI2cXVlcnlTdHIlM2RNaWNyb3NvZnQlMjUyMDM2NSUyNnVybCUzZGh0dHBzJTNhJTJmJTJmd3d3Lm1pY3Jvc29mdC5jb20lMmZlbi11cyUyZm1pY3Jvc29mdC0zNjUlMmZidXNpbmVzcyUyZmNvbXBhcmUtYWxsLW1pY3Jvc29mdC0zNjUtYnVzaW5lc3MtcHJvZHVjdHMtYiUzZmVmX2lkJTNkX2tfMDViNGJjMDIyNjI0MWJmMGJmZGVkNGZhYzViY2I5OWRfa18lMjZPQ0lEJTNkQUlEY21tNDc0cXA4ZWxfU0VNX19rXzA1YjRiYzAyMjYyNDFiZjBiZmRlZDRmYWM1YmNiOTlkX2tfJTI2bXNjbGtpZCUzZDA1YjRiYzAyMjYyNDFiZjBiZmRlZDRmYWM1YmNiOTlk%26rlid%3D05b4bc0226241bf0bfded4fac5bcb99d&amp;vqd=4-313593971382612555591888224610928525715&amp;iurl=%7B1%7DIG%3D78C126210A7A4798B6E3F55A789E5666%26CID%3D029466C694176E0D250B704B95F16F34%26ID%3DDevEx%2C5045.1">Microsoft 365</a> experienced a North American outage on October 9, lasting just over an hour, caused by misconfigured network infrastructure that affected all services, including Teams, highlighting the fragility of centralized cloud services when configuration errors occur.</li>
<li style="font-weight:400;">This incident followed another Azure outage where Kubernetes crashes took down <a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=NjAybRLKIa-s-_oKed4AuYqKbIz9-gt0z8CZqBQkPYwkcUG7sisriwA0g0h5b9aAswsPv3ddRDlGFz2SbTjZYxY84UIgxVlRrfi4ZDTwEzoFc5cMwCLTno-9l1LaSbaN.3WGa5-2g31KmmHD_Ia6qcA&amp;eddgt=igiASUMcicYNwDig8ZN3eQ%3D%3D&amp;rut=493a814c55666525ccf9ce3ae71013fe66e250fd54009d39f97291c000a4101d&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8UFVBGwXBHVi0fbzRdO899DVUCUzBJN5c8KlqJWnQss2QaP22xUhvbMQVRHbOq1VIWktKWZZI0jyQmwaLH066ugQJf5LYWmi5gJ3YY5T-Aewb1vyrk04iAMI4n7sb0nm-bomYKDemnHlmfGu4v4GTm4A6x038W4OwWq87P0RJf1slzCwsqnEYof6CpBGYZRK3xn1H-yrs-EFFVzQ2-2PfLxe5TmtVdEw8rhCGgSxJIMyQyScyXpiJBGvpcIoeEms4mchsHYCg_ja6D4PRlKRw_Zuu3ZhQT50oWS8X3LCgX0GY6tDe8Toy3iF5OLAQt9-P7LMS6hBqdKdB2YMC-B62owUwRGqEAL2-WBx9N31NOV4rk7iaV2JZjOeh_GN8eR3qjdKq5_0EKLCnsAhdgbTBqZnummy0r401WUZawgj2G-n4eKsCW_v9ehEgMXZPaPO9EKplpex5yqwWgPqx7O9peubEfzc7Cx8TnfDAS0SvC8i1fHhRltNPg3E_GHRiX2ZSl6ANAKYrTPeYIKeW3350BKS-vnf6HNlqbK-A9besaatLiC8sCSwL9uUVgz2tDO3E3fj7m4DfEq8uTNRJZ4Ef72TjK684XzIpg0Sw7S4T9xTJvkx4KOXI7HfFza8ydjM5Ssf2MBliQ-pxh8t2IalVbDrOTF4c_R7L35w6kgrvpSVG-KIAzntJgwgBRs_nbO23HmW_skj2pM-rAYiiy-p8SFLXV4z88lscGjMks_RlIy33_NnAY3ECMfJChnFQ_8hmuFBBRg%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0MzklMjZjYW1wJTNkMTY4ODA3JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDE5MjElMjZjcml0ZXJpYWlkJTNka3dkLTc5MzcxNjAyMjA3ODcwJTNhbG9jLTE5MCUyNmNhbXBhaWduaWQlM2Q1OTAyMzczNTglMjZsb2NwaHklM2Q3OTcxMiUyNmFkZ3JvdXBpZCUzZDEyNjk5Mzc1ODI0Mjc1NjElMjZjaWQlM2Q3OTM3MTIwNDc4MzkzNiUyNmtkdiUzZGMlMjZrZXh0JTNkJTI2a3BnJTNkJTI2a3BpZCUzZCUyNnF1ZXJ5U3RyJTNkQXp1cmUlMjUyMEZyb250JTI1MjBEb29yJTI2dXJsJTNkaHR0cHMlM2ElMmYlMmZhenVyZS5taWNyb3NvZnQuY29tJTJmZW4tdXMlMmZwcm9kdWN0cyUyZmZyb250ZG9vciUyZiUzZmVmX2lkJTNkX2tfYzc4OGE0ZWIxYWYwMWRhMDRmMDNjYzEzZjk0M2Y4MzNfa18lMjZPQ0lEJTNkQUlEY21tNWVkc3dkdXVfU0VNX19rX2M3ODhhNGViMWFmMDFkYTA0ZjAzY2MxM2Y5NDNmODMzX2tfJTI2bXNjbGtpZCUzZGM3ODhhNGViMWFmMDFkYTA0ZjAzY2MxM2Y5NDNmODMz%26rlid%3Dc788a4eb1af01da04f03cc13f943f833&amp;vqd=4-293393282745782260242804007880566003621&amp;iurl=%7B1%7DIG%3DA5118ED77DA0413A872EDA24C3A1A34F%26CID%3D0B31B4327C676A4832A8A2BF7D7F6B0E%26ID%3DDevEx%2C5045.1">Azure Front Door</a> instances, suggesting potential systemic issues with Microsoft’s infrastructure management and configuration processes that enterprise customers should factor into their reliability planning.</li>
<li style="font-weight:400;">Users reported that switching to backup circuits restored services, and some attributed issues to AT&amp;T’s network, demonstrating the importance of multi-path connectivity and diverse network providers for mission-critical cloud services.</li>
<li style="font-weight:400;">Microsoft’s response involved rerouting traffic to healthy infrastructure and analyzing configuration policies to prevent future incidents, though the lack of detailed root cause information raises questions about transparency and whether customers have sufficient visibility into infrastructure dependencies.</li>
<li style="font-weight:400;">The back-to-back outages underscore why organizations need robust disaster recovery plans beyond single cloud providers, as even brief disruptions to productivity tools like Teams can significantly impact business operations across entire regions.</li>
</ul>
<p>44:17 <a href="https://azure.microsoft.com/en-us/blog/introducing-microsoft-agent-framework/">Introducing Microsoft Agent Framework | Microsoft Azure Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aka.ms/AgentFramework/PuPr">Microsoft Agent Framework</a> converges <a href="https://github.com/microsoft/autogen">AutoGen</a> research project with <a href="https://github.com/microsoft/semantic-kernel">Semantic Kernel</a> into a unified open-source SDK for orchestrating multi-agent AI systems, addressing the fragmentation challenge as 80% of enterprises now use agent-based AI according to PwC.</li>
<li style="font-weight:400;">The framework enables developers to build locally and then deploy to Azure AI Foundry with built-in observability, durability, and compliance, while supporting integration with any API via OpenAPI and cross-runtime collaboration through Agent2Agent protocol.</li>
<li style="font-weight:400;"><a href="https://azure.microsoft.com/en-us/products/ai-foundry">Azure AI Foundry</a> now provides unified observability across multiple agent frameworks, including <a href="https://www.langchain.com/">LangChain</a>, <a href="https://www.langchain.com/langgraph">LangGraph</a>, and <a href="https://openai.github.io/openai-agents-python/">OpenAI Agents SDK</a>, through <a href="https://opentelemetry.io/">OpenTelemetry</a> contributions, positioning it as a comprehensive platform compared to <a href="https://aws.amazon.com/bedrock/">AWS Bedrock</a> or <a href="https://cloud.google.com/vertex-ai">GCP Vertex AI’s</a> more limited agent support.</li>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/ai-services/speech-service/voice-live-api-reference">Voice Live API</a> reaches general availability, offering a unified real-time speech-to-speech interface that integrates STT, generative AI, TTS, and avatar capabilities in a single low-latency pipeline for building voice-enabled agents.</li>
<li style="font-weight:400;">New responsible AI capabilities in public preview include task adherence, prompt shields with spotlighting, and PII detection, addressing McKinsey’s finding that the lack of governance tools is the top barrier to AI adoption.</li>
</ul>
<p>44:48  Justin – “We continue to be in a world of confusion around Agentic and out of control of Agentic things.” </p>
<p>45:54 <a href="https://azure.microsoft.com/en-us/blog/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads/">NVIDIA GB300 NVL72: Next-generation AI infrastructure at scale | Microsoft </a><a href="https://azure.microsoft.com/en-us/blog/microsoft-azure-delivers-the-first-large-scale-cluster-with-nvidia-gb300-nvl72-for-openai-workloads/">Azure Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://blogs.microsoft.com/blog/2025/09/18/inside-the-worlds-most-powerful-ai-datacenter/">Microsoft deployed</a> the first production cluster with over 4,600 NVIDIA GB300 NVL72 systems featuring Blackwell Ultra GPUs, enabling AI model training in weeks instead of months and supporting models with hundreds of trillions of parameters. </li>
<li style="font-weight:400;">This positions Azure as the first cloud provider to deliver Blackwell Ultra at scale for production workloads.</li>
<li style="font-weight:400;">Each <a href="https://techcommunity.microsoft.com/blog/azurehighperformancecomputingblog/accelerating-the-intelligence-age-with-azure-ai-infrastructure-and-the-ga-of-nd-/4394575">ND GB300 v6 VM</a> rack contains 72 GPUs with 130TB/second of NVLink bandwidth and 37TB of fast memory, delivering up to 1,440 PFLOPS of FP4 performance. </li>
<li style="font-weight:400;">The system uses 800 Gbps NVIDIA Quantum-X800 InfiniBand for cross-rack connectivity, doubling the bandwidth of previous GB200 systems.</li>
<li style="font-weight:400;">The infrastructure targets frontier AI workloads, including reasoning models, agentic AI systems, and multimodal generative AI, with OpenAI already using these clusters for training and deploying their largest models. </li>
<li style="font-weight:400;">This gives Azure a competitive edge over AWS and GCP in supporting next-generation AI workloads.</li>
<li style="font-weight:400;">Azure implemented custom cooling systems using standalone heat exchangers and new power distribution models to handle the high energy density requirements of these dense GPU clusters. </li>
<li style="font-weight:400;">The co-engineered software stack optimizes storage, orchestration, and scheduling for supercomputing scale.</li>
<li style="font-weight:400;">While pricing wasn’t disclosed, the scale and specialized nature of these VMs suggest they’ll target enterprise customers and AI research organizations requiring cutting-edge performance for training trillion-parameter models. Azure plans to deploy hundreds of thousands of Blackwell Ultra GPUs globally.</li>
</ul>
<p>47:24 Ryan – “Pricing isn’t disclosed because it’s the GDP of a small country.” </p>
<p>48:05 <a href="https://azure.microsoft.com/en-us/updates?id=503258">Generally Available: CLI command for migration from Availability Sets and </a><a href="https://azure.microsoft.com/en-us/updates?id=503258">basic load balancer on AKS </a></p>
<ul>
<li style="font-weight:400;">Thanks for the timely heads up on this one… </li>
<li style="font-weight:400;">Azure introduces a single CLI command to migrate AKS clusters from deprecated Availability Sets to Virtual Machine Scale Sets before the September 2025 deadline, simplifying what would otherwise be a complex manual migration process.</li>
<li style="font-weight:400;">The automated migration upgrades clusters from basic load balancers to standard load balancers, providing improved reliability, zone redundancy, and support for up to 1000 nodes compared to the basic tier’s 100-node limit.</li>
<li style="font-weight:400;">This positions Azure competitively with AWS EKS and GCP GKE, which already use more modern infrastructure patterns by default, though Azure’s migration tool reduces the operational burden for existing customers.</li>
<li style="font-weight:400;">Organizations running production AKS workloads on Availability Sets should prioritize testing this migration in non-production environments first, as the process involves recreating node pools, which could impact running applications.</li>
<li style="font-weight:400;">While the migration itself has no direct cost, customers will see increased charges from standard load balancers (approximately $0.025/hour plus data processing fees) compared to free basic load balancers.</li>
</ul>
<p>49:01  Ryan – “This is why you drag your feet on getting off of everything.” </p>
<h2>Oracle</h2>
<p>49:12 <a href="https://blogs.oracle.com/cloud-infrastructure/post/announcing-dark-mode-for-the-oci-console">Announcing Dark Mode For The OCI Console</a></p>
<ul>
<li style="font-weight:400;">Oracle finally joins the dark mode club with OCI Console, following years behind AWS (2017), Azure (2019), and GCP (2020) – a basic UI feature that took surprisingly long for a major cloud provider to implement.</li>
<li style="font-weight:400;">The feature allows users to toggle between light and dark themes in the console settings, with Oracle claiming it reduces eye strain and improves battery life on devices – standard benefits that every other cloud provider has been touting for years.</li>
<li style="font-weight:400;">Dark mode persists across browser sessions and devices when logged into the same OCI account, though Oracle hasn’t specified if this preference syncs across different OCI regions or tenancies.</li>
<li style="font-weight:400;">While this is a welcome quality-of-life improvement for developers working late hours, it highlights Oracle’s ongoing challenge of playing catch-up on basic console features that competitors have long considered table stakes.</li>
<li style="font-weight:400;">The rollout appears to be gradual with no specific timeline mentioned, and Oracle provides no details about API or CLI theme preferences, suggesting this is purely a web console enhancement.</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2170375/c1e-o838u27rrjt780gw-xxg4xkqxhjp-qcfp1h.mp3" length="97831466"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 326 of The Cloud Pod, where the forecast is always cloudy! Justin and Ryan are your guides to all things cloud and AI this week! We’ve got news from SonicWall (and it’s not great), a host of goodbyes to say over at AWS, Oracle (finally) joins the dark side, and even Slurm – and you don’t even need to ride on a creepy river to experience it. Let’s get started! 
Titles we almost went with this week

 SonicWall’s Cloud Backup Service: From 5% to Oh No, That’s Everyone
 AWS Spring Cleaning: 19 Services Get the Boot
 The Great AWS Service Purge of 2025
 Maintenance Mode: Where Good Services Go to Die
 GitHub Gets Assimilated: Resistance to Azure Migration is Futile
 Salesforce to Ransomware Gang: You Can’t Always Get What You Want
 Kansas City Gets the Need for Speed with 100G Direct Connect. Peter, what are you up too
 Gemini Takes the Wheel: Google’s AI Learns to Click and Type 
 Oracle Discovers the Dark Side (Finally Has Cookies)
 Azure Goes Full Blackwell: 4,600 Reasons to Upgrade Your GPU Game
 DataStax to the Future: AWS Hires Database CEO for Security Role
 The Clone Wars: EBS Strikes Back with Instant Volume Copies
 Slurm Dunk: AWS Brings HPC Scheduling to Kubernetes
 The Great Cluster Convergence: When Slurm Met EKS
 Codex sent me a DM that I’ll ignore too on Slack

General News 
01:24 SonicWall: Firewall configs stolen for all cloud backup customers

SonicWall confirmed that all customers using their cloud backup service had firewall configuration files exposed in a breach, expanding from their initial estimate of 5% to 100% of cloud backup users. That’s a big difference…
The exposed backup files contain AES-256-encrypted credentials and configuration data, which could include MFA seeds for TOTP authentication, potentially explaining recent Akira ransomware attacks that bypassed MFA.
SonicWall requires affected customers to reset all credentials, including local user passwords, TOTP codes, VPN shared secrets, API keys, and authentication tokens across their entire infrastructure.
This incident highlights a fundamental security risk of cloud-based configuration backups where sensitive credentials are stored centrally, making them attractive targets for attackers.
The breach demonstrates why WebAuthn/passkeys offer superior security architecture since they don’t rely on shared secrets that can be stolen from backups or servers.
Interested in checking out their detailed remediation guidance? Find that here. 

02:36  Justin – “You know, providing your own encryption keys is also good; not allowing your SaaS vendor to have the encryption key is a positive thing to do. There’s all kinds of ways to protect your data in the cloud when you’re leveraging a SaaS service.”
04:43 Take this rob and shove it! Salesforce issues stern retort to ransomware extort

Salesforce is refusing to pay ransomware demands from criminals claiming to have stolen nearly 1 billion customer records, stating they will not engage, negotiate with, or pay any extortion dema...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2170375/c1a-k5d5-25m458gghwk4-m4tt9c.jpg"></itunes:image>
                                                                            <itunes:duration>00:50:54</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2170375/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[325: Db2 or Not Db2: That Is the Backup Question]]>
                </title>
                <pubDate>Thu, 16 Oct 2025 20:10:12 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2166814</guid>
                                    <link>https://tcpfm.castos.com/episodes/325-db2-or-not-db2-that-is-the-backup-question</link>
                                <description>
                                            <![CDATA[<h3>Welcome to episode 325 of The Cloud Pod, where the forecast is always cloudy! Justin is on vacation this week, so it’s up to Ryan and Matthew to bring you all the latest news in cloud and AI, and they definitely deliver! This week we have an AWS invoice undo button, Sora 2, and quite a bit of news DigitalOcean – plus so much more. Let’s get started! </h3>
<h3>Titles we almost went with this week
</h3>
<ul>
<li>AWS Shoots for the Cloud with NBA Partnership</li>
<li>Nothing But Net: AWS Scores Big with Basketball AI Deal</li>
<li>From Courtside to Cloud-side: AWS Dunks on Sports Analytics</li>
<li>PostgreSQL Gets a Gemini Twin for Natural Language Queries</li>
<li>Fuzzy Logic: When Your Database Finally Speaks Your Language</li>
<li>CLI and Let AI: Google’s Natural Language Database Assistant</li>
<li>Satya’s Org Chart Shuffle: Now with More AI Synergy</li>
<li>Microsoft Reorgs Again: This Time It’s Personal (and Commercial)</li>
<li>Ctrl+Alt+Delete: Microsoft Reboots Its Sales Machine</li>
<li>Sora 2: The Sequel Nobody Asked For But Everyone Will Use</li>
<li>OpenAI Puts the “You” in YouTube (AI Edition)</li>
<li>Sam Altman Stars in His Own AI-Generated Reality Show</li>
<li>Grok and Roll: Microsoft’s New AI Model Rocks Azure</li>
<li>To Grok or Not to Grok: That is the Question</li>
<li>Grok Around the Clock: Azure’s 24/7 Reasoning Machine</li>
<li>Spark Joy: Google Lights Up ML Inference for Data Pipelines</li>
<li>DigitalOcean’s Storage Trinity: Hot, Cold, and Backed Up</li>
<li>NFS: Not For Suckers (Network File Storage)</li>
<li>The Goldilocks Storage Strategy: Not Too Hot, Not Too Cold, Just Right</li>
<li>NAT Gonna Cost You: DigitalOcean’s Gateway to Savings</li>
<li>BYOIP: Bring Your Own IP (But Leave Your Billing Worries Behind)</li>
<li>The Great Invoice Escape: No More Support Tickets Required Ctrl+Z for Your AWS Bills: The Undo Button Finance Teams Needed</li>
<li>Image Builder Finally Learns When to Stop Trying</li>
<li>Pipeline Dreams: Now With Built-in Reality Checks</li>
<li>EC2 Image Builder Gets a Failure Intervention Feature</li>
<li>MCP: Model Context Protocol or Marvel Cinematic Protocol?
</li>
</ul>
<h2>AI is Going Great – Or How ML Makes Money </h2>
<p>00:45 <a href="https://arstechnica.com/ai/2025/10/openais-sora-2-lets-users-insert-themselves-into-ai-videos-with-sound/">OpenAI’s Sora 2 lets users insert themselves into AI videos with sound – </a><a href="https://arstechnica.com/ai/2025/10/openais-sora-2-lets-users-insert-themselves-into-ai-videos-with-sound/">Ars Technica</a></p>
<ul>
<li style="font-weight:400;">OpenAI’s Sora 2 introduces synchronized audio generation alongside video synthesis, matching Google’s <a href="https://arstechnica.com/ai/2025/05/ai-video-just-took-a-startling-leap-in-realism-are-we-doomed/">Veo 3</a> and Alibaba’s <a href="https://wan25.net/">Wan 2.5</a> capabilities. </li>
<li style="font-weight:400;">This positions OpenAI competitively in the multimodal AI space with what they call their “GPT-3.5 moment for video.”</li>
<li style="font-weight:400;">The new <a href="https://apps.apple.com/us/app/sora-by-openai/id6744034028">iOS social app</a> feature allows users to insert themselves into AI-generated videos through “cameos,” suggesting potential applications for personalized content creation and social media integration at scale.</li>
<li style="font-weight:400;">Sora 2 demonstrates improved physical accuracy and consistency across multiple shots, addressing previous limitations where objects would teleport or deform unrealistically. </li>
<li style="font-weight:400;">The model can now simulate complex movements like gymnastics routines while maintaining proper physics.</li>
<li style="font-weight:400;">The addition of “sophisticated background soundscapes, speech, and sound effects” expands potential enterprise use cases for automated video production, training materials, and marketing content generation without separate audio post-processing.</li>
<li></li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:00) - GCP 325</li><li>(00:00:54) - OpenAI Sora 2: Creators of AI Videos</li><li>(00:03:31) - Joules: New Tools and APIs for Developers</li><li>(00:05:18) - OpenAI Doubles Down on Chip Diversity with AMD</li><li>(00:07:52) - NBA Launches 'Inside the Game' Powered by AWS</li><li>(00:14:27) - EC2 Image Builder Update</li><li>(00:18:13) - AWS releases Open Source MCP Server for Amazon Bedrock Agent</li><li>(00:22:57) - AWS Knowledge Based MCP Server</li><li>(00:27:27) -  AWS Service Quotations: Automatic Management</li><li>(00:30:31) - Amazon RDS for DB2 Launches Native Database Backups</li><li>(00:32:36) - GCP.com: Gemini CLI for PostgreSQL</li><li>(00:37:34) - Google Announces $4 Billion Investment in Arkansas</li><li>(00:42:06) - Microsoft Restructuring its Azure Commercial Organization</li><li>(00:44:58) - Microsoft Bringing Xai Grok 4 to Azure AI Foundry</li><li>(00:47:24) - Microsoft to Allow Personal Copilot in Corporate Environments</li><li>(00:51:07) - Fabric Mirroring for Azure SQL Managed Instances</li><li>(00:54:28) - Microsoft Firewall Update 1.8</li><li>(00:56:32) - DigitalOcean: AI Storage, NFS, and More</li><li>(00:59:58) - DigitalOcean Build smarter Agents with OpenAI and VPC</li><li>(01:02:07) - DigitalOcean Brings Per Second Charges to Droplet Plans</li><li>(01:04:40) -  per second billing for Windows at DigitalOcean</li><li>(01:06:15) - Snowflake Managed MCP Servers for Secure Governed Data</li><li>(01:11:51) - Week in the Cloud: September 7, 2017</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 325 of The Cloud Pod, where the forecast is always cloudy! Justin is on vacation this week, so it’s up to Ryan and Matthew to bring you all the latest news in cloud and AI, and they definitely deliver! This week we have an AWS invoice undo button, Sora 2, and quite a bit of news DigitalOcean – plus so much more. Let’s get started! 
Titles we almost went with this week


AWS Shoots for the Cloud with NBA Partnership
Nothing But Net: AWS Scores Big with Basketball AI Deal
From Courtside to Cloud-side: AWS Dunks on Sports Analytics
PostgreSQL Gets a Gemini Twin for Natural Language Queries
Fuzzy Logic: When Your Database Finally Speaks Your Language
CLI and Let AI: Google’s Natural Language Database Assistant
Satya’s Org Chart Shuffle: Now with More AI Synergy
Microsoft Reorgs Again: This Time It’s Personal (and Commercial)
Ctrl+Alt+Delete: Microsoft Reboots Its Sales Machine
Sora 2: The Sequel Nobody Asked For But Everyone Will Use
OpenAI Puts the “You” in YouTube (AI Edition)
Sam Altman Stars in His Own AI-Generated Reality Show
Grok and Roll: Microsoft’s New AI Model Rocks Azure
To Grok or Not to Grok: That is the Question
Grok Around the Clock: Azure’s 24/7 Reasoning Machine
Spark Joy: Google Lights Up ML Inference for Data Pipelines
DigitalOcean’s Storage Trinity: Hot, Cold, and Backed Up
NFS: Not For Suckers (Network File Storage)
The Goldilocks Storage Strategy: Not Too Hot, Not Too Cold, Just Right
NAT Gonna Cost You: DigitalOcean’s Gateway to Savings
BYOIP: Bring Your Own IP (But Leave Your Billing Worries Behind)
The Great Invoice Escape: No More Support Tickets Required Ctrl+Z for Your AWS Bills: The Undo Button Finance Teams Needed
Image Builder Finally Learns When to Stop Trying
Pipeline Dreams: Now With Built-in Reality Checks
EC2 Image Builder Gets a Failure Intervention Feature
MCP: Model Context Protocol or Marvel Cinematic Protocol?


AI is Going Great – Or How ML Makes Money 
00:45 OpenAI’s Sora 2 lets users insert themselves into AI videos with sound – Ars Technica

OpenAI’s Sora 2 introduces synchronized audio generation alongside video synthesis, matching Google’s Veo 3 and Alibaba’s Wan 2.5 capabilities. 
This positions OpenAI competitively in the multimodal AI space with what they call their “GPT-3.5 moment for video.”
The new iOS social app feature allows users to insert themselves into AI-generated videos through “cameos,” suggesting potential applications for personalized content creation and social media integration at scale.
Sora 2 demonstrates improved physical accuracy and consistency across multiple shots, addressing previous limitations where objects would teleport or deform unrealistically. 
The model can now simulate complex movements like gymnastics routines while maintaining proper physics.
The addition of “sophisticated background soundscapes, speech, and sound effects” expands potential enterprise use cases for automated video production, training materials, and marketing content generation without separate audio post-processing.
]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[325: Db2 or Not Db2: That Is the Backup Question]]>
                </itunes:title>
                                    <itunes:episode>394</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<h3>Welcome to episode 325 of The Cloud Pod, where the forecast is always cloudy! Justin is on vacation this week, so it’s up to Ryan and Matthew to bring you all the latest news in cloud and AI, and they definitely deliver! This week we have an AWS invoice undo button, Sora 2, and quite a bit of news DigitalOcean – plus so much more. Let’s get started! </h3>
<h3>Titles we almost went with this week
</h3>
<ul>
<li>AWS Shoots for the Cloud with NBA Partnership</li>
<li>Nothing But Net: AWS Scores Big with Basketball AI Deal</li>
<li>From Courtside to Cloud-side: AWS Dunks on Sports Analytics</li>
<li>PostgreSQL Gets a Gemini Twin for Natural Language Queries</li>
<li>Fuzzy Logic: When Your Database Finally Speaks Your Language</li>
<li>CLI and Let AI: Google’s Natural Language Database Assistant</li>
<li>Satya’s Org Chart Shuffle: Now with More AI Synergy</li>
<li>Microsoft Reorgs Again: This Time It’s Personal (and Commercial)</li>
<li>Ctrl+Alt+Delete: Microsoft Reboots Its Sales Machine</li>
<li>Sora 2: The Sequel Nobody Asked For But Everyone Will Use</li>
<li>OpenAI Puts the “You” in YouTube (AI Edition)</li>
<li>Sam Altman Stars in His Own AI-Generated Reality Show</li>
<li>Grok and Roll: Microsoft’s New AI Model Rocks Azure</li>
<li>To Grok or Not to Grok: That is the Question</li>
<li>Grok Around the Clock: Azure’s 24/7 Reasoning Machine</li>
<li>Spark Joy: Google Lights Up ML Inference for Data Pipelines</li>
<li>DigitalOcean’s Storage Trinity: Hot, Cold, and Backed Up</li>
<li>NFS: Not For Suckers (Network File Storage)</li>
<li>The Goldilocks Storage Strategy: Not Too Hot, Not Too Cold, Just Right</li>
<li>NAT Gonna Cost You: DigitalOcean’s Gateway to Savings</li>
<li>BYOIP: Bring Your Own IP (But Leave Your Billing Worries Behind)</li>
<li>The Great Invoice Escape: No More Support Tickets Required Ctrl+Z for Your AWS Bills: The Undo Button Finance Teams Needed</li>
<li>Image Builder Finally Learns When to Stop Trying</li>
<li>Pipeline Dreams: Now With Built-in Reality Checks</li>
<li>EC2 Image Builder Gets a Failure Intervention Feature</li>
<li>MCP: Model Context Protocol or Marvel Cinematic Protocol?
</li>
</ul>
<h2>AI is Going Great – Or How ML Makes Money </h2>
<p>00:45 <a href="https://arstechnica.com/ai/2025/10/openais-sora-2-lets-users-insert-themselves-into-ai-videos-with-sound/">OpenAI’s Sora 2 lets users insert themselves into AI videos with sound – </a><a href="https://arstechnica.com/ai/2025/10/openais-sora-2-lets-users-insert-themselves-into-ai-videos-with-sound/">Ars Technica</a></p>
<ul>
<li style="font-weight:400;">OpenAI’s Sora 2 introduces synchronized audio generation alongside video synthesis, matching Google’s <a href="https://arstechnica.com/ai/2025/05/ai-video-just-took-a-startling-leap-in-realism-are-we-doomed/">Veo 3</a> and Alibaba’s <a href="https://wan25.net/">Wan 2.5</a> capabilities. </li>
<li style="font-weight:400;">This positions OpenAI competitively in the multimodal AI space with what they call their “GPT-3.5 moment for video.”</li>
<li style="font-weight:400;">The new <a href="https://apps.apple.com/us/app/sora-by-openai/id6744034028">iOS social app</a> feature allows users to insert themselves into AI-generated videos through “cameos,” suggesting potential applications for personalized content creation and social media integration at scale.</li>
<li style="font-weight:400;">Sora 2 demonstrates improved physical accuracy and consistency across multiple shots, addressing previous limitations where objects would teleport or deform unrealistically. </li>
<li style="font-weight:400;">The model can now simulate complex movements like gymnastics routines while maintaining proper physics.</li>
<li style="font-weight:400;">The addition of “sophisticated background soundscapes, speech, and sound effects” expands potential enterprise use cases for automated video production, training materials, and marketing content generation without separate audio post-processing.</li>
<li style="font-weight:400;">This development signals increasing competition in the video synthesis market, with major cloud providers likely to integrate similar capabilities into their AI services portfolios to meet growing demand for automated content creation tools.</li>
</ul>
<p>02:04  Matt – “So, before, when you could sort of trust social media videos, now you can’t anymore.” </p>
<p>03:25 <a href="https://blog.google/technology/google-labs/jules-tools-jules-api/">Jules introduces new tools and API for developers</a></p>
<ul>
<li style="font-weight:400;">Google’s <a href="https://jules.google/">Jules</a> AI coding agent now offers command-line access through <a href="https://developers.googleblog.com/en/meet-jules-tools-a-command-line-companion-for-googles-async-coding-agent">Jules Tools</a> and an API for direct integration into developer workflows, moving beyond its original chat interface to enable programmatic task automation.</li>
<li style="font-weight:400;">The Jules API allows developers to trigger coding tasks from external systems like Slack bug reports or CI/CD pipelines, enabling automated code generation, bug fixes, and test writing as part of existing development processes.</li>
<li style="font-weight:400;">Recent updates include file-specific context selection, persistent memory for user preferences, and structured environment variable management, addressing reliability issues that previously limited production use.</li>
<li style="font-weight:400;">This positions Jules as a workflow automation tool rather than just a coding assistant, competing with <a href="https://github.com/features/copilot">GitHub Copilot</a> and <a href="https://aws.amazon.com/blogs/aws/amazon-codewhisperer-free-for-individual-use-is-now-generally-available/">Amazon CodeWhisperer</a> by focusing on asynchronous task execution rather than inline code completion.</li>
<li style="font-weight:400;">The shift to API-based access enables enterprises to integrate AI coding assistance into their existing toolchains without requiring developers to switch contexts or adopt new interfaces.</li>
</ul>
<p>04:41  Matt – “We’re just adding to the tools; then we need to figure out which one is gong to be actually useful for you.” </p>
<p>05:17 <a href="https://www.businessinsider.com/openai-chip-diversity-amd-nvidia-deals-2025-10">OpenAI Doubles Down on Chip Diversity With AMD, Nvidia Deals –Business Insider</a></p>
<ul>
<li style="font-weight:400;"><a href="http://openai">OpenAI</a> signed a multi-year deal with <a href="https://www.amd.com/en/support/download/drivers.html">AMD</a> for chips requiring up to 6 gigawatts of power, plus an option to acquire tens of billions in AMD stock, diversifying beyond its heavy reliance on <a href="https://www.nvidia.com/en-us/geforce/graphics-cards/">Nvidia GPUs</a> accessed through <a href="https://portal.azure.com/">Microsoft Azure</a>.</li>
<li style="font-weight:400;">The AMD partnership joins recent deals including 10 gigawatts of Nvidia GPUs with $100 billion investment, a <a href="https://www.broadcom.com/">Broadcom</a> partnership for custom AI chips in 2025, and a $300 billion <a href="https://www.oracle.com/">Oracle</a> compute deal, signaling OpenAI’s strategy to secure diverse hardware supply chains.</li>
<li style="font-weight:400;">This diversification could benefit the broader AI ecosystem by increasing competition in the AI chip market, potentially lowering prices and reducing supply chain vulnerabilities from geopolitical risks or natural disasters.</li>
<li style="font-weight:400;">AMD expects tens of billions in revenue from the deal, marking a significant validation of their AI technology in a market where Nvidia holds dominant market share, while OpenAI gains negotiating leverage and supply redundancy.</li>
<li style="font-weight:400;">These massive infrastructure investments serve as demand signals for continued AI growth, though they concentrate risk on OpenAI’s success – if OpenAI fails to grow as projected, it could impact multiple chip manufacturers and the broader AI infrastructure buildout.</li>
</ul>
<p>06:51  Ryan – “I’m stuck on this article sort of gigawatts of power as a unit of measurement for GPU. Like, that’s hilarious to me. we’re just, there’s not this many, not this many GPUs, but like this much in power of GPUs.”</p>
<h2>AWS</h2>
<p>07:55 <a href="https://kill-the-newsletter.com/feeds/wmrx6rsab4dbqgph/entries/qzt9x8n0ht2u256r82f0.html">AWS to Become the Official Cloud and Cloud AI Partner of the NBA, WNBA, </a><a href="https://kill-the-newsletter.com/feeds/wmrx6rsab4dbqgph/entries/qzt9x8n0ht2u256r82f0.html">NBA G League, Basketball Africa League and NBA Take-Two Media</a></p>
<ul>
<li style="font-weight:400;">AWS becomes the official cloud and AI partner for NBA, WNBA, and affiliated leagues, launching “NBA Inside the Game powered by AWS” – a new basketball intelligence platform that processes billions of data points using <a href="https://u29658508.ct.sendgrid.net/ls/click?upn=u001.bAmh-2FH5FGjD-2BJBmopNJCWsHJBOoaJT-2BY-2FW3B01P2woMk1BnpoOoh-2Bz2Nh8z9u9vfwTdUpFiZQJ-2BJbHoczX5xIoR1h4GJO-2B6R5CjBlaAB-2FxqN345QTDbGedKdx1JwTphiIs9XX1DwwA28uzmBM0-2FPdDbwnUp21orpFQJpRXgz9VGvfJ-2FaAxqKGi2DN8lwOZkqt7u8XwhsNiQLGRzPEq4fAGoKC-2BDBBArgvVJieITKTfXnC8D-2FHS8dqOa5dv7ufO-2BeVIRwooPcwjYjVyIt-2F1BoZuWG-2B-2Fbxoo14egL6zxe4FhupFV92JAs6SF9LDZ-2BtjT-2B6Uq1JHBW0TjWVTQpqoOciuKRdQVA3gg8dDfFDKU8zW4V4O1wWMLV4UQL0XD80ctm22e1lS2PocuH7syWenteXVwAykv6U3znI-2F-2B-2BxKj67Re0s1Ou0fobGjMrcYcyr4H47o18Nl1OsQYpzRu-2F9sIvjtFxNX9kC71QJL1NupGAt7Y-2BY-2BeospJijcGRkJxtsK8ND8qne4lx6qKfc7s2OfLij1iKhDY090zRcp8uMev7LT5ojUNe9gWt0XIogkOfkeBFDnl3xIXsep0J4U3A4l7syjOg8nwsI70eBEN0wy4QvR45RUl9JXJMqoWfLLA202FFc9vLdRbMYS73eVCxYHuujwavBhW-2Fe9H0n9N-2Fwae0GyUYgB1dhtl9V84EJugu-2FSKYz4M1IOU41XrpOVBsliKxpr4U5Oy-2BK0w6BUXW9ua1OzMTZw-2BW7JpDRe0-2Fk1ryesPG6WHEd3FvBYU2qZC2W6Kt5fzkmXYhlVltcfQJ-2BHeGzWW7hyBzeBP5VGtjANvc-2B0cOjWz4UOQIJxz1ztYkCVLSQBA-3D-3DTEyk_pJDCxgr9oBdAkLS7dJGA-2FJl14vWt2z1-2BX6JCK7yYlWh-2Bi2QO1qiOoPiHMFTopy-2FsyXNKJ-2FpyZ5Vc9-2F-2FnZ1dD-2FBJRGipCB0f9spcGquhci3mxgV-2FR9vM31SAcgVLJKULGrZE3bLVIMTeBG1uinjRn9MaUSYqArt88mAUsTT15rH5-2FV7cdvXgU1FZhdd9qmSQ3KpShfZq-2Fmw9uF-2FYt0eKIGzexZQdQi7p9HU3hWSVvhywHxbBLIWt9sq2NvCjeCGHkBZH26u9smG9-2BBqQT-2FB8WhO9RExAWOqKOxFGI3tIsu6742aeQ0-2Bq8nWMHcz0IUZmClQIigZLu-2BlnMnl1QLBdAyb9vUoRHnjtIhfFz04Ots-2Bqinqtsp4c90CovcA5V9A-2FpJeqmp-2FF-2BwmIzf-2B3-2BKs3T8bXukhpSRPOHx0pqwyQHF0uT1r2-2BCJVvyqH4i7c0P8IU">Amazon Bedrock</a> and <a href="https://u29658508.ct.sendgrid.net/ls/click?upn=u001.bAmh-2FH5FGjD-2BJBmopNJCWsHJBOoaJT-2BY-2FW3B01P2woMk1BnpoOoh-2Bz2Nh8z9u9vfwTdUpFiZQJ-2BJbHoczX5xIoR1h4GJO-2B6R5CjBlaAB-2FxrBZT-2FVS4TPhZAau249wQ9BPfZ7u0fqS1SAtawql80uEfBJ8s5eRh364k4W7cATvH-2B89j2-2Bnvgyj-2FkMUaGQA1N4CUZWl5CWCnzwRhQPeg1z2m3OnAiyO55raPMCWp5-2F6A5sR2juk6X7r3xRCxN3fkbyUmvjmsfcCGi2RQJAZKxtoqkEoAnbOu5W5UmQeUG5XxASqpKdT5DKtKNUxG6Aw2l-2B-1-V_pJDCxgr9oBdAkLS7dJGA-2FJl14vWt2z1-2BX6JCK7yYlWh-2Bi2QO1qiOoPiHMFTopy-2FsyXNKJ-2FpyZ5Vc9-2F-2FnZ1dD-2FBJRGipCB0f9spcGquhci3mxgV-2FR9vM31SAcgVLJKULGrZE3bLVIMTeBG1uinjRn9MaUSYqArt88mAUsTT15rH5-2FV7cdvXgU1FZhdd9qmSQ3KpShfZq-2Fmw9uF-2FYt0eKIGzexZQdQi7p9HU3hWSVvhywHxbBLIWt9sq2NvCjeCGHkBZH26u9smG9-2BBqQT-2FB8WhO9RExAWOqKOxFGI3tIsu65k8VNYwOD7OcbS56p5LmyCVBctpSghNv3zB2G3EbQOXOsYnSG9RuFOQ5EfUns1-2FUd0cL7hSV-2BiHU3h9HwsR2uTEV1i-2B6Rh-2BNBhSpjJ-2FEa-2BLGK8BR4kyUTGDwEoO61lmkeVwDOgUPA5LJ60u-2B-2BhDZ2F">SageMaker</a> to deliver real-time analytics and insights during live games.</li>
<li style="font-weight:400;">The platform introduces AI-powered advanced statistics that analyze 29 data points per player using machine learning to generate previously unmeasurable performance metrics, with initial stats rolling out during the 2025-26 season accessible via NBA App, NBA.com, and Prime Video broadcasts.</li>
<li style="font-weight:400;">Play Finder” technology uses AI to analyze player movements across thousands of games, enabling instant search and retrieval of similar plays for broadcasters and eventually allowing teams direct access to ML models for coaching and front office workflows.</li>
<li style="font-weight:400;">The NBA App, NBA.com, and NBA League Pass will run entirely on AWS infrastructure, supporting global fan engagement with personalized, in-language content delivery while complementing Amazon’s 11-year media rights agreement for 66 regular-season games on Prime Video.</li>
<li style="font-weight:400;">This partnership demonstrates AWS’s expanding role in sports analytics beyond traditional cloud infrastructure, showcasing how AI services like Bedrock and SageMaker can transform real-time data processing for consumer-facing applications at massive scale.</li>
</ul>
<p>10:51  Ryan – “I do like the AI analytics for sports, like AWS is already in the NFL and F! Racings and it’s sort of a neat add-on when they integrate it.”  </p>
<p>12:45 <a href="https://aws.amazon.com/about-aws/whats-new/2025/10/self-service-invoice-correction-feature/">AWS Introduces self-service invoice correction feature</a></p>
<ul>
<li style="font-weight:400;">AWS launches self-service invoice correction feature allowing customers to instantly update purchase order numbers, business legal names, and addresses on their invoices through the Billing and Cost Management console without contacting support.</li>
<li style="font-weight:400;">This addresses a common pain point for enterprise customers who need accurate invoices for accounting compliance and reduces manual support ticket volume for AWS teams.</li>
<li style="font-weight:400;">The guided workflow in the console lets customers update both their account settings and select existing invoices, providing immediate corrected versions for download.</li>
<li style="font-weight:400;">Available in all AWS regions except GovCloud and China regions, making it accessible to most commercial AWS customers globally.</li>
<li style="font-weight:400;">Particularly valuable for organizations with strict procurement processes or those who’ve undergone mergers, acquisitions, or address changes that require invoice updates for proper expense tracking.</li>
</ul>
<p>17:53 <a href="https://aws.amazon.com/about-aws/whats-new/2025/10/ec2-image-builder-capabilities-managing-image-pipelines/">EC2 Image Builder now provides enhanced capabilities for managingimage pipelines</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/imagebuilder/latest/userguide/what-is-image-builder.html">EC2 Image Builder</a> now automatically disables pipelines after consecutive failures, preventing unnecessary resource creation and reducing costs from repeatedly failed builds – a practical solution for teams dealing with flaky build processes.</li>
<li style="font-weight:400;">The new custom log group configuration allows teams to set specific retention periods and encryption settings for pipeline logs, addressing compliance requirements and giving better control over log management costs.</li>
<li style="font-weight:400;">This update targets a common pain point where failed image builds could run indefinitely, consuming resources and generating costs without producing usable outputs – particularly valuable for organizations running frequent automated builds.</li>
<li style="font-weight:400;">The features are available at no additional cost across all AWS commercial regions including China and <a href="https://cloud.gov/">GovCloud</a>, making them immediately accessible for existing Image Builder users through Console, CLI, API, <a href="https://docs.aws.amazon.com/AWSCloudFormation/latest/UserGuide/Welcome.html">CloudFormation</a>, or CDK.</li>
<li style="font-weight:400;">These enhancements position Image Builder as a more mature CI/CD tool for AMI creation, competing more effectively with third-party solutions by addressing operational concerns around cost control and logging flexibility.</li>
</ul>
<p>16:22  Matt – “I just like this because it automatically disables the pipeline, and I feel like this is more for all those old things that you forgot about that are running that just keep triggering daily that break at one point – or you hope break, so you actually don’t keep spending the money on them. That’s a pretty nice feature, in my opinion, there where it just stops it from running forever.”</p>
<p>18:26 <a href="https://aws.amazon.com/about-aws/whats-new/2025/10/open-source-mcp-server-amazon-bedrock-agentcore">Open Source Model Context Protocol (MCP) Server now available for AmazonBedrock AgentCore</a></p>
<ul>
<li style="font-weight:400;">AWS releases an open-source Model Context Protocol (MCP) server for <a href="https://aws.amazon.com/bedrock/agentcore/">Amazon Bedrock AgentCore</a>, providing a standardized interface for developers to build, analyze, and deploy AI agents directly in their development environments with one-click installation support for IDEs like <a href="https://kiro.dev/">Kiro</a>, <a href="https://www.anthropic.com/news/claude-code-plugins">Claude Code</a>, <a href="https://www.bing.com/ck/a?!&amp;&amp;p=48b1d5f033fbf62df1ef0a29044259c4e91c7adbbd1d09b53d8ca7a8de3a0f3cJmltdHM9MTc2MDMxMzYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=Cursor&amp;u=a1aHR0cHM6Ly9jdXJzb3IuY29tLw">Cursor</a>, and <a href="https://www.bing.com/ck/a?!&amp;&amp;p=4859ea11d531e65a24ec180df4ec5316f34cc691066fdcd0046e0f846348785bJmltdHM9MTc2MDMxMzYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=Amazon+Q+Developer+CLI&amp;u=a1aHR0cHM6Ly9hd3MuYW1hem9uLmNvbS9kZXZlbG9wZXIvbGVhcm5pbmcvcS1kZXZlbG9wZXItY2xpLw">Amazon Q Developer CLI</a>.</li>
<li style="font-weight:400;">The MCP server enables natural language-driven agent development, allowing developers to iteratively build agents and transform agent logic to work with the <a href="https://aws.github.io/bedrock-agentcore-starter-toolkit/">AgentCore SDK</a> before deploying to development accounts, streamlining the path from prototype to production.</li>
<li style="font-weight:400;">This integration addresses the complexity of AI agent development by providing a unified protocol that works across multiple development tools, reducing the friction between local development and AWS deployment while maintaining security and scale capabilities.</li>
<li style="font-weight:400;">Available globally via GitHub, the MCP server represents AWS’s commitment to open-source tooling for generative AI development, complementing the broader AgentCore platform which handles secure deployment and operation of AI agents at scale.</li>
<li style="font-weight:400;">For businesses looking to implement AI agents, this reduces development time and technical barriers while maintaining enterprise-grade security and scalability, with pricing following the standard Amazon Bedrock AgentCore model.</li>
</ul>
<p>20:50  Ryan- “This is one of those things where I’m a team of one right now doing a whole bunch of snowflake development of internal services, and so I’m like, what’s this for? I don’t understand the problem. But I can imagine that this is something that’s really useful more when you’re spreading out against teams so that you can get unification on some of these things, because if you have a team of people all developing separate agents that are, in theory, somehow going to work together…so I think this is maybe a step in that direction.” </p>
<p>22:02 <a href="https://aws.amazon.com/about-aws/whats-new/2025/10/amazon-ecs-one-click-event-capture-history-querying-console/">Amazon ECS now supports one-click event capture and event history querying in the AWS Management Console</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/ecs/">Amazon ECS</a> adds one-click event capture in the console that automatically creates EventBridge rules and <a href="https://docs.aws.amazon.com/AmazonCloudWatch/latest/logs/Working-with-log-groups-and-streams.html">CloudWatch log groups</a>, eliminating manual setup for monitoring task state changes and service events.</li>
<li style="font-weight:400;">The new event history tab provides pre-built query templates for common troubleshooting scenarios like stopped tasks and container exit codes, keeping data beyond the default retention limits without requiring CloudWatch Logs Insights knowledge.</li>
<li style="font-weight:400;">This addresses a long-standing pain point where ECS task events would disappear after tasks stopped, making post-mortem debugging difficult – now operators can query historical events directly from the ECS console with filters for time range, task ID, and deployment ID.</li>
<li style="font-weight:400;">The feature is available in all AWS Commercial and GovCloud regions at standard CloudWatch Logs pricing, making it accessible for teams that need better visibility into container lifecycle events without additional tooling.</li>
<li style="font-weight:400;">For DevOps teams managing production ECS workloads, this simplifies incident response by consolidating event data in one place rather than jumping between multiple AWS consoles to piece together what happened during an outage.</li>
</ul>
<p>23:14  Jonathan – “It’s a great click ops feature.” </p>
<p>24:04 <a href="https://aws.amazon.com/about-aws/whats-new/2025/10/aws-knowledge-mcp-server-generally-available">AWS Knowledge MCP Server now generally available</a></p>
<ul>
<li style="font-weight:400;">AWS launches a free MCP (Model Context Protocol) server that provides AI agents and LLM applications direct access to AWS documentation, blog posts, What’s New announcements, and Well-Architected best practices in a format optimized for language models.</li>
<li style="font-weight:400;">The server includes regional availability data for AWS APIs and CloudFormation resources, helping AI agents provide more accurate responses about service availability and reduce hallucinations when answering AWS-related questions.</li>
<li style="font-weight:400;">No AWS account required and available at no cost with rate limits, making it accessible for developers building AI assistants or chatbots that need authoritative AWS information without manual context management.</li>
<li style="font-weight:400;">Compatible with any MCP client or agentic framework supporting the protocol, allowing developers to integrate trusted AWS knowledge into their AI applications through a simple endpoint configuration.</li>
<li style="font-weight:400;">This addresses a common challenge where AI models provide outdated or incorrect AWS information by ensuring responses are anchored in official, up-to-date AWS documentation and best practices.</li>
</ul>
<p>25:46  Jonathan – “It’s the rate limiting; it’s putting realistic in controls in place, whereas before they would just scrap everything.” </p>
<p>28:48 <a href="https://aws.amazon.com/about-aws/whats-new/2025/10/automatic-quota-management-service-quotas/">Automatic quota management is now generally available for AWS Service Quotas</a></p>
<ul>
<li style="font-weight:400;">AWS <a href="https://console.aws.amazon.com/servicequotas/home">Service Quotas</a> now automatically monitors quota usage and sends proactive notifications through email, SMS, or Slack before customers hit their limits, preventing application interruptions from quota exhaustion.</li>
<li style="font-weight:400;">The feature integrates with AWS Health and CloudTrail events, enabling customers to build automated workflows that respond to quota threshold alerts and potentially request increases programmatically.</li>
<li style="font-weight:400;">This addresses a common operational pain point where teams discover quota limits only after hitting them, causing service disruptions or failed deployments during critical scaling events. (Really though, is there any other way?)</li>
<li style="font-weight:400;">The service is available at no additional cost across all commercial AWS regions, making it accessible for organizations of any size to improve their quota management practices.</li>
<li style="font-weight:400;">For DevOps teams managing multi-account environments, this provides centralized visibility into quota consumption patterns across services, helping predict future needs and plan capacity more effectively.</li>
</ul>
<p>32:06 <a href="https://aws.amazon.com/about-aws/whats-new/2025/10/amazon-rds-for-db2-native-database-backup/">Amazon RDS for Db2 launches support for native database backups</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/rds/db2/">RDS for Db2</a> now supports native database-level backups, allowing customers to selectively back up individual databases within a <a href="https://docs.aws.amazon.com/AmazonRDS/latest/UserGuide/db2-multiple-databases.html">multi-database instance</a> rather than requiring full instance snapshots. This enables more granular control for migrations and reduces storage costs.</li>
<li style="font-weight:400;">The feature addresses a common enterprise need for moving specific databases between environments – customers can now easily migrate individual databases to another RDS instance or back to on-premises Db2 installations using standard backup commands.</li>
<li style="font-weight:400;">Development teams benefit from the ability to quickly create database copies for testing environments without duplicating entire instances, while compliance teams can maintain separate backup copies of specific databases to meet regulatory requirements.</li>
<li style="font-weight:400;">Cost optimization becomes more achievable as customers only pay for storage of the specific databases they need to back up rather than full instance snapshots, particularly valuable for instances hosting multiple databases where only some require frequent backups.</li>
<li style="font-weight:400;">The feature is available in all regions where RDS for Db2 is offered, with pricing following standard RDS storage rates detailed at aws.amazon.com/rds/db2/pricing.</li>
</ul>
<h2>GCP</h2>
<p>34:19 <a href="https://cloud.google.com/blog/products/databases/gemini-cli-for-postgresql-in-action/">Gemini CLI for PostgreSQL in action | Google Cloud Blog</a></p>
<p> </p>
<ul>
<li style="font-weight:400;">Google introduces <a href="https://github.com/gemini-cli-extensions/postgres">Gemini CLI extension for PostgreSQL</a> that enables natural language database management, allowing developers to implement features like fuzzy search through conversational commands instead of manual SQL configuration and extension management.</li>
<li style="font-weight:400;">The tool automatically identifies appropriate PostgreSQL extensions (like pg_trgm for fuzzy search), checks installation status, handles setup, and generates optimized queries with proper indexing recommendations – reducing typical multi-step database tasks to simple English requests.</li>
<li style="font-weight:400;">Key capabilities include full lifecycle database control from instance creation to user management, automatic code generation based on table schemas, and intelligent schema exploration – positioning it as a database assistant rather than just a command line tool.</li>
<li style="font-weight:400;">This addresses a common developer pain point of context switching between code editors, database clients, and cloud consoles, potentially accelerating feature development for applications requiring advanced PostgreSQL capabilities like search functionality.</li>
<li style="font-weight:400;">Available through GitHub at github.com/gemini-cli-extensions/postgres, this represents Google’s broader push to integrate Gemini AI across their cloud services, though pricing details and performance benchmarks compared to traditional database management approaches aren’t specified.</li>
</ul>
<p>35:35  Matt – “I really like the potentially increasing people, because they don’t have context switch. It’s like it’s a feature.”</p>
<p>39:01 <a href="https://blog.google/inside-google/company-announcements/google-american-innovation-arkansas/">Google announces new $4 billion investment in Arkansas</a></p>
<ul>
<li style="font-weight:400;">Google is investing $4 billion in Arkansas through 2027 to build its first data center in the state at West Memphis, expanding GCP’s regional presence and capacity for cloud and AI workloads in the central US.</li>
<li style="font-weight:400;">The investment includes a 600 MW solar project partnership with Entergy and programs to reduce peak power usage, addressing the growing energy demands of AI infrastructure while improving grid stability.</li>
<li style="font-weight:400;">Google is providing free access to Google AI courses and Career Certificates to all Arkansas residents, starting with University of Arkansas and Arkansas State University students, to build local cloud and AI talent.</li>
<li style="font-weight:400;">The $25 million Energy Impact Fund for Crittenden County residents demonstrates Google’s approach to community investment alongside data center development, potentially setting a model for future expansions.</li>
<li style="font-weight:400;">This positions GCP to better serve customers in the central US with lower latency and regional data residency options, competing with AWS and Azure’s existing presence in neighboring states.</li>
</ul>
<p>40:25  Ryan – “So per some live research, Walmart is using both Azure and Google as their own private data center infrastructure.” </p>
<h2>Azure</h2>
<p>43:36 <a href="https://blogs.microsoft.com/blog/2025/10/01/accelerating-our-commercial-growth/">Accelerating our commercial growth</a></p>
<ul>
<li style="font-weight:400;">Microsoft is restructuring its commercial organization under Judson Althoff as CEO of commercial business, consolidating sales, marketing, operations, and engineering teams to accelerate AI transformation services for enterprise customers.</li>
<li style="font-weight:400;">The reorganization creates a unified commercial leadership team with shared accountability for product strategy, go-to-market readiness, and sales execution, potentially streamlining how <a href="https://azure.microsoft.com/en-us/products/ai-services/">Azure AI services</a> are delivered to customers.</li>
<li style="font-weight:400;">Operations teams now report directly to commercial leadership rather than corporate, which should tighten feedback loops between customer needs and Azure service delivery.</li>
<li style="font-weight:400;">This structural change allows Satya Nadella and engineering leaders to focus on datacenter buildout, systems architecture, and AI innovation while commercial teams handle customer-facing execution.</li>
<li style="font-weight:400;">The move signals Microsoft’s push to position itself as the primary partner for enterprise AI transformation, likely intensifying competition with AWS and Google Cloud for AI workload dominance.</li>
</ul>
<p>45:47 Matt – “Yeah, I think it’s just the AI. Even our account team changed their name a bunch; they al have AI in their name now.” </p>
<p>46:31 <a href="https://azure.microsoft.com/en-us/blog/grok-4-is-now-available-in-azure-ai-foundry-unlock-frontier-intelligence-and-business-ready-capabilities/">Grok 4 is now available in Microsoft Azure AI Foundry | Microsoft Azure Blog</a></p>
<ul>
<li style="font-weight:400;">Microsoft brings<a href="https://x.ai/"> xAI’s</a> <a href="https://x.ai/news/grok-4">Grok 4</a> model to <a href="https://ai.azure.com/">Azure AI Foundry</a> with a 128K token context window, native tool use, and integrated web search capabilities, positioning it as a competitor to GPT-4 and Claude for enterprise reasoning tasks.</li>
<li style="font-weight:400;">The model features “think mode” for first-principles reasoning that breaks down complex problems step-by-step, making it particularly suited for research analysis, tutoring, and troubleshooting scenarios where logical consistency matters.</li>
<li style="font-weight:400;">Pricing starts at $2 per million input tokens and $10 per million output tokens for Grok 4, with faster variants available at lower costs – Grok 4 Fast Reasoning at $0.60/$2.40 and Fast Non-Reasoning at $0.30/$1.20 per million tokens.</li>
<li style="font-weight:400;">Azure AI Content Safety is enabled by default for all Grok models, addressing enterprise concerns about responsible AI deployment while Microsoft continues safety testing and compliance checks.</li>
<li style="font-weight:400;">The extended context window allows processing entire code repositories or hundreds of pages of documents in a single request, reducing the need to manually chunk large inputs for analysis tasks.</li>
</ul>
<p>48:18  Ryan – “I like competition generally, and so it’s good to see another competitor model developer, but it is it like they’re adding features that are one model behind Anthopic and OpenAI.”</p>
<p>49:06 <a href="https://go.theregister.com/feed/www.theregister.com/2025/10/01/microsoft_consumer_copilot_corporate/">Microsoft to allow consumer Copilot in corporate environs • The Register</a></p>
<ul>
<li style="font-weight:400;">Question one: What? </li>
<li style="font-weight:400;">Microsoft now allows employees to use personal <a href="https://copilot.microsoft.com/">Copilot</a> subscriptions (Personal, Family, or Premium) with work <a href="https://www.office.com/">Microsoft 365</a> accounts, effectively endorsing shadow IT practices while maintaining that enterprise data protections remain intact through <a href="https://learn.microsoft.com/en-us/entra/identity/conditional-access/controls">Entra identity controls</a>.</li>
<li style="font-weight:400;">IT administrators can disable this feature (which they are rushing to do right now) through cloud policy controls and audit personal Copilot interactions, though the default enablement removes their initial authority over AI tool adoption within their organizations.</li>
<li style="font-weight:400;">This move positions Microsoft to boost Copilot adoption statistics by any means necessary counting personal usage in enterprise environments, while competing AI vendors may view this as Microsoft leveraging its Office dominance to crowd out alternatives.</li>
<li style="font-weight:400;">Government tenants (GCC/DoD) are excluded from this capability, and employees should note that their personal Copilot prompts and responses will be captured and auditable by their employers.</li>
<li style="font-weight:400;">The feature represents Microsoft’s shift from preventing shadow IT to managing it, potentially creating compliance challenges for organizations with strict data governance requirements while offering a controlled alternative to completely unmanaged AI tools.</li>
</ul>
<p>50:44  Ryan – “I think this is nutso.” </p>
<p>53:00 <a href="https://campaigns.endjin.com/t/t-l-ghtdryd-tddtjdjjc-ui/">Fabric Mirroring for Azure SQL Managed Instance (Generally Available) | Microsoft Fabric Blog | Microsoft Fabric</a></p>
<ul>
<li style="font-weight:400;">Azure SQL Managed Instance <a href="https://learn.microsoft.com/fabric/mirroring/overview">Mirroring</a> enables near real-time data replication to Microsoft Fabric’s OneLake without ETL processes, supporting both data changes and schema modifications like column additions/drops unlike traditional CDC approaches.</li>
<li style="font-weight:400;">The feature provides free compute and storage based on Fabric capacity size (F64 capacity includes 64TB free mirroring storage), with OneLake storage charges only applying after exceeding the free limit.</li>
<li style="font-weight:400;">Mirrored data becomes immediately available across all Fabric services including Power BI Direct Lake mode, Data Warehouse, Notebooks, and Copilots, allowing cross-database queries between mirrored databases, warehouses, and lakehouses.</li>
<li style="font-weight:400;">Microsoft positions this as a zero-code, zero-ETL solution competing with AWS Database Activity Streams and GCP Datastream, targeting enterprises seeking simplified operational data access and reduced total cost of ownership.</li>
<li style="font-weight:400;">The service extends beyond Managed Instance to include Azure SQL Database and SQL Server 2016-2025, creating a unified mirroring approach across Microsoft’s entire SQL portfolio into their analytics platform.</li>
<li style="font-weight:400;">Interested in pricing? Find that <a href="https://azure.microsoft.com/pricing/details/microsoft-fabric/#overview">here</a>. </li>
</ul>
<p>54:55  Ryan – “Because Microsoft SQL server is so memory intensive for performance, being able to do large queries across, you know, datasets has always been difficult with that…So I can see why this is very handy if you’re Microsoft SQL on Azure. And then the fact that they’re giving you so much for free is the incentive there. They know what they’re doing.”</p>
<p>56:35 <a href="https://azure.microsoft.com/en-us/updates?id=511722">Generally Available: Azure Firewall Updates – IP Group limit increased to 
 600 per Firewall Policy</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/firewall-manager/policy-overview">Azure Firewall Policy</a> now supports 600 IP Groups per policy, tripling the previous limit of 200, allowing organizations to consolidate more network security rules into fewer, more manageable groups.</li>
<li style="font-weight:400;">This enhancement directly addresses enterprise scalability needs by reducing rule complexity – instead of maintaining thousands of individual IP addresses across multiple policies, administrators can organize them into logical groups like “branch offices” or “partner networks.”</li>
<li style="font-weight:400;">The increased limit brings Azure Firewall closer to parity with <a href="https://aws.amazon.com/network-firewall/">AWS Network Firewall</a> and GCP Cloud Armor, which have historically offered more flexible rule management options for large-scale deployments.</li>
<li style="font-weight:400;">Primary beneficiaries include large enterprises and managed service providers who manage complex multi-tenant environments, as they can now implement more granular security policies without hitting artificial limits.</li>
<li style="font-weight:400;">While the feature itself is free, customers should note that Azure Firewall pricing starts at $1.25 per deployment hour plus data processing charges, making efficient rule management critical for cost optimization.</li>
</ul>
<p>57:50  Matt – “Azure Firewall isn’t cheap, but it’s also your but it’s also your IDS and IPS, so if you’re comparing it to Apollo Alto or any of these other massive ones, the Premiere version is not cheap, but it does give you a lot of those security things.”</p>
<h2>Other Clouds</h2>
<p>58:54 <a href="https://www.digitalocean.com/blog/nfs-cold-storage-backups">Announcing cost-efficient storage with Network file storage, cold storage, </a><a href="https://www.digitalocean.com/blog/nfs-cold-storage-backups">and usage-based backups | DigitalOcean</a></p>
<ul>
<li style="font-weight:400;">DigitalOcean is launching Network File Storage (NFS) on October 20th, a managed file system service starting at 50 GiB increments that supports NFSv3/v4 and allows multiple GPU/CPU droplets to mount the same share for <a href="https://www.digitalocean.com/products/gradient">AI/ML workloads</a>. </li>
<li style="font-weight:400;">This addresses the need for shared high-performance storage without the typical 1TB+ minimums of competitors.</li>
<li style="font-weight:400;">Spaces cold storage enters public preview at $0.007/GiB per month with one free retrieval monthly, targeting petabyte-scale datasets that need instant access but are rarely used. The pricing model avoids unpredictable retrieval fees common with other providers by including one monthly retrieval in the base price.</li>
<li style="font-weight:400;">Usage-based backups now support 4, 6, or 12-hour backup intervals with retention from 3 days to 6 months, priced from $0.01-0.04/GiB-month based on frequency. This consumption-based model helps businesses meet strict RPO requirements without paying for unused capacity.</li>
<li style="font-weight:400;">All three services target AI/ML workloads and data-intensive applications, with NFS optimized for training datasets, cold storage for archived models, and frequent backups for GPU droplet protection. </li>
<li style="font-weight:400;">The combination provides a complete storage strategy for organizations dealing with growing data footprints.</li>
<li style="font-weight:400;">The services are initially available in limited regions (NFS in ATL1 and NYC) with preview access requiring support tickets or form submissions, indicating a measured rollout approach typical of infrastructure services.</li>
</ul>
<p>1:01:24  Matt – “At lot of these companies don’t need the scale, the flexibility and everything else that AWS, GCP, and Azure provide…this is probably all they need.”  </p>
<p>1:02:36<a href="https://www.digitalocean.com/blog/new-capabilities-security-developer-tools-gradient-ai-platform">Build Smarter Agents with Image Generation, Auto-Indexing, VPC Security, and new AI Tools on DigitalOcean Gradient AI Platform | DigitalOcean</a></p>
<ul>
<li style="font-weight:400;">DigitalOcean’s <a href="https://www.digitalocean.com/products/gradient/platform">Gradient AI Platform</a> now supports image generation through OpenAI’s gpt-image-1 model, marking their first non-text modality and enabling developers to create images programmatically via the same API endpoint used for text completions.</li>
<li style="font-weight:400;">Auto-indexing for Knowledge Bases automatically detects, fetches, and re-indexes new or updated documents from connected sources into OpenSearch databases, reducing manual maintenance for keeping AI agents’ knowledge current.</li>
<li style="font-weight:400;">New VPC integration allows AI agents and indexing jobs to run on private networks within DigitalOcean’s managed infrastructure, addressing enterprise security requirements without exposing services to the public internet.</li>
<li style="font-weight:400;">Two new developer tools are coming: the Agent Development Kit (ADK) provides a code-first framework for building and deploying AI agent workflows, while</li>
<li style="font-weight:400;">Genie offers VS Code integration for designing multi-agent systems using natural language.</li>
<li style="font-weight:400;">These updates position DigitalOcean to compete more directly with major cloud providers in the AI platform space by offering multimodal capabilities, enterprise security features, and developer-friendly tooling for building production AI applications.</li>
</ul>
<p>1:04:14  Matt – “Theyre really learning about their audience, and they’re going to build specific to what their customer needs… and they’ve determined that their customers need these image generation AI features. They’re not always the fastest, but they always get there.” </p>
<p>1:05:11 <a href="https://www.digitalocean.com/blog/dropletplans-persecbilling-byoip-natgateway">Announcing per-sec billing, new Droplet plans, BYOIP, and NAT gateway preview to reduce scaling costs | DigitalOcean</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.digitalocean.com/droplets/new?fleetUuid=2d108fad-6d02-4c95-af2f-4094f43eaed3&amp;i=403c6d&amp;region=sfo2&amp;size=s-2vcpu-4gb-120gb-intel">DigitalOcean</a> is switching from hourly to per-second billing for <a href="https://www.digitalocean.com/products/droplets">Droplets</a> starting January 1, 2026, with a 60-second minimum charge, which seems like the standard now.</li>
<li style="font-weight:400;">This change could reduce costs by up to 80% for short-lived workloads like CI/CD pipelines that previously paid for full hours when only using minutes.</li>
<li style="font-weight:400;">New intermediate Droplet sizes bridge the gap between shared and dedicated CPU plans, allowing in-place upgrades without IP changes or data migration. The new plans include 5x SSD variants for CPU Optimized and 6.5x SSD variants for General Purpose, addressing the previous large cost jump between tiers.</li>
<li style="font-weight:400;">Bring Your Own IP (BYOIP) is now generally available with a 7-day setup time compared to 1-4 weeks at hyperscalers. This allows businesses to maintain their IP reputation and avoid breaking client allow-lists when migrating to DigitalOcean.</li>
<li style="font-weight:400;">VPC NAT Gateway enters public preview at $40/month including 100GB bandwidth, supporting up to 500,000 simultaneous connections. </li>
<li style="font-weight:400;">This managed service provides centralized egress with static IPs for private resources without the complexity of self-managed NAT instances.</li>
<li style="font-weight:400;">These updates target cost optimization and migration friction points, particularly benefiting ephemeral workloads, auto-scaling applications, and businesses needing to maintain IP continuity during cloud migrations.</li>
</ul>
<p>1:09:31 <a href="https://www.snowflake.com/content/snowflake-site/global/en/blog/managed-mcp-servers-secure-data-agents">Introducing Snowflake Managed MCP Servers for Secure, Governed Data Agents</a></p>
<ul>
<li style="font-weight:400;">Snowflake is introducing Managed MCP (Model Context Protocol) Servers that enable secure data agents to access enterprise data while maintaining governance and compliance controls. This addresses the challenge of giving AI agents access to sensitive data without compromising security.</li>
<li style="font-weight:400;">The MCP protocol, originally developed by Anthropic, allows AI assistants to interact with external data sources through a standardized interface. </li>
<li style="font-weight:400;">Snowflake’s implementation adds enterprise-grade security layers including authentication, authorization, and audit logging.</li>
<li style="font-weight:400;">Data agents can now query Snowflake databases, run SQL commands, and retrieve results without requiring direct database credentials or exposing sensitive connection strings. All interactions are governed by Snowflake’s existing role-based access controls and data governance policies.</li>
<li style="font-weight:400;">This integration enables organizations to build AI applications that can answer questions about their business data while ensuring compliance with data residency, privacy regulations, and internal security policies. The managed service handles infrastructure complexity and scaling automatically.</li>
<li style="font-weight:400;">Developers can connect popular AI frameworks and tools to Snowflake data through the MCP interface, reducing the complexity of building secure data pipelines for AI applications. This positions Snowflake as a bridge between enterprise data warehouses and the emerging AI agent ecosystem.</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2166814/c1e-5rkrb15q82czmk7d-5zd1m49da5k1-qjit8x.mp3" length="139465505"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 325 of The Cloud Pod, where the forecast is always cloudy! Justin is on vacation this week, so it’s up to Ryan and Matthew to bring you all the latest news in cloud and AI, and they definitely deliver! This week we have an AWS invoice undo button, Sora 2, and quite a bit of news DigitalOcean – plus so much more. Let’s get started! 
Titles we almost went with this week


AWS Shoots for the Cloud with NBA Partnership
Nothing But Net: AWS Scores Big with Basketball AI Deal
From Courtside to Cloud-side: AWS Dunks on Sports Analytics
PostgreSQL Gets a Gemini Twin for Natural Language Queries
Fuzzy Logic: When Your Database Finally Speaks Your Language
CLI and Let AI: Google’s Natural Language Database Assistant
Satya’s Org Chart Shuffle: Now with More AI Synergy
Microsoft Reorgs Again: This Time It’s Personal (and Commercial)
Ctrl+Alt+Delete: Microsoft Reboots Its Sales Machine
Sora 2: The Sequel Nobody Asked For But Everyone Will Use
OpenAI Puts the “You” in YouTube (AI Edition)
Sam Altman Stars in His Own AI-Generated Reality Show
Grok and Roll: Microsoft’s New AI Model Rocks Azure
To Grok or Not to Grok: That is the Question
Grok Around the Clock: Azure’s 24/7 Reasoning Machine
Spark Joy: Google Lights Up ML Inference for Data Pipelines
DigitalOcean’s Storage Trinity: Hot, Cold, and Backed Up
NFS: Not For Suckers (Network File Storage)
The Goldilocks Storage Strategy: Not Too Hot, Not Too Cold, Just Right
NAT Gonna Cost You: DigitalOcean’s Gateway to Savings
BYOIP: Bring Your Own IP (But Leave Your Billing Worries Behind)
The Great Invoice Escape: No More Support Tickets Required Ctrl+Z for Your AWS Bills: The Undo Button Finance Teams Needed
Image Builder Finally Learns When to Stop Trying
Pipeline Dreams: Now With Built-in Reality Checks
EC2 Image Builder Gets a Failure Intervention Feature
MCP: Model Context Protocol or Marvel Cinematic Protocol?


AI is Going Great – Or How ML Makes Money 
00:45 OpenAI’s Sora 2 lets users insert themselves into AI videos with sound – Ars Technica

OpenAI’s Sora 2 introduces synchronized audio generation alongside video synthesis, matching Google’s Veo 3 and Alibaba’s Wan 2.5 capabilities. 
This positions OpenAI competitively in the multimodal AI space with what they call their “GPT-3.5 moment for video.”
The new iOS social app feature allows users to insert themselves into AI-generated videos through “cameos,” suggesting potential applications for personalized content creation and social media integration at scale.
Sora 2 demonstrates improved physical accuracy and consistency across multiple shots, addressing previous limitations where objects would teleport or deform unrealistically. 
The model can now simulate complex movements like gymnastics routines while maintaining proper physics.
The addition of “sophisticated background soundscapes, speech, and sound effects” expands potential enterprise use cases for automated video production, training materials, and marketing content generation without separate audio post-processing.
]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2166814/c1a-k5d5-47mdnp15u6x8-bkcnbw.jpg"></itunes:image>
                                                                            <itunes:duration>01:12:32</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2166814/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[323: Databricks One: Because Seven Eight Nine]]>
                </title>
                <pubDate>Thu, 09 Oct 2025 18:29:24 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2161682</guid>
                                    <link>https://tcpfm.castos.com/episodes/323-databricks-one-because-seven-eight-nine</link>
                                <description>
                                            <![CDATA[<h3>Welcome to episode 323 of The Cloud Pod, where the forecast is always cloudy! Justin, Matt and Ryan are in the studio tonight to bring you all the latest in cloud and AI news! This week we have a close call from Entra, some DeepSeek news, Firestore, and even an acquisition! Make sure to stay tuned for the aftershow – and Matt obviously falling asleep on the job. Let’s get started! </h3>
<h3>Titles we almost went with this week</h3>
<ul>
<li>When One Key Opens Every Door: Microsoft’s Close Call with Cloud Catastrophe</li>
<li>Bedrock Goes Qwen-tum: Alibaba’s Models Join the AWS Party</li>
<li>DeepSeek and You Shall Find V3.1 in Bedrock</li>
<li>GPUs of Unusual Size? I Don’t Think They Exist (Narrator: They Do)</li>
<li>Kubernetes Without the Kubernightmares</li>
<li>Firestore and Forget: AI Takes the Wheel SCPs Get Their Full License: IAM Language Edition</li>
<li>Do What I Meant, Not What I Prompted</li>
<li>Atlassian Pays a Billion to DX the Developer Experience</li>
<li>Entra at Your Own Risk: The Azure Identity Crisis That Almost Was</li>
<li>Oracle Intelligence: The AI Nobody Asked For</li>
<li>Wisconsin Gets Cheesy with AI: Microsoft’s Dairy State Datacenter </li>
<li>Azure Opens the Data Floodgates (But Only in Europe)</li>
<li>PostgreSQL Gets a Security Blanket and Won’t Share Its TEEs</li>
<li>Microsoft’s New Cooling System Has Veins Like a Leaf and Runs Hotter Than Your Gaming PC</li>
<li>Azure Gets Cold Feet About Hot Chips, Decides to Go With the Flow
</li>
</ul>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>00:58 <a href="https://blog.google/technology/developers/ai-agents-intensive/">Google and Kaggle launch AI Agents Intensive course</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.google.com/">Google</a> and <a href="https://www.kaggle.com/">Kaggle</a> are launching a <a href="https://blog.google/technology/developers/google-kaggle-genai-intensive-recap-2025/">5-day intensive course on AI agents</a> from November 10-14. </li>
<li style="font-weight:400;">This follows their GenAI course that attracted 280,000 learners, with curriculum covering agent architectures, tools, memory systems, and production deployment.</li>
<li style="font-weight:400;">The course focuses on building autonomous AI agents and multi-agent systems, which represents a shift from traditional single-model AI to systems that can independently perform tasks, make decisions, and interact with tools and APIs.</li>
<li style="font-weight:400;">This development signals growing enterprise interest in AI agents for cloud environments, where autonomous systems can manage infrastructure, optimize resources, and handle complex workflows without constant human intervention.</li>
<li style="font-weight:400;">The hands-on approach includes codelabs and a capstone project, indicating Google’s push to democratize agent development skills as businesses increasingly need engineers who can build production-ready autonomous systems.</li>
<li style="font-weight:400;">The timing aligns with major cloud providers racing to offer agent-based services, as AI agents become essential for automating cloud operations, customer service, and business processes at scale.</li>
<li style="font-weight:400;">Interested in registering? You can do that <a href="https://rsvp.withgoogle.com/events/google-ai-agents-intensive_2025">here</a>. </li>
</ul>
<h2>Cloud Tools </h2>
<p>03:21 <a href="https://techcrunch.com/2025/09/18/atlassian-acquires-dx-a-developer-productivity-platform-for-1b/">Atlassian acquires DX, a developer productivity platform, for $1B</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.bing.com/ck/a?!&amp;&amp;p=93a849bbd67b7e45bb6c7617d46055ea9eb77c2d945277bc96bdb39b2182571cJmltdHM9MTc1OTE5MDQwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=Atlassian&amp;u=a1aHR0cHM6Ly93d3cuYXRsYXNzaWFuLmNvbS8">Atlassian</a> is acquiring <a href="https://getdx.com/">DX</a>, a developer productivity ana...</li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:00) - Cloud Podcast: Databricks 1</li><li>(00:01:11) - Google and Kegel Launch Five Day Training Course on AI Agents</li><li>(00:03:34) - Atlasian Buys DX: Will It Hurt Their Business?</li><li>(00:07:03) - Amazon Web Services: New Models for DeepSeek and DeepSe</li><li>(00:08:42) - Amazon RDS: MySQL Innovation Release 9.4 in Database Preview</li><li>(00:14:12) - QDeveloper CLI Adds Remote MCPs</li><li>(00:15:56) - Amazon Nova Act Extension</li><li>(00:18:08) - Google Cloud: Security Command Center Insights for Kubernetes</li><li>(00:20:42) - Google's Firestore: MCP for AI Systems</li><li>(00:22:59) - AI Adoption Among Software Developers Hits 90%, Says Google</li><li>(00:24:00) - AI: Return on Investment?</li><li>(00:31:05) - Microsoft's Entra ID Vulnerabilities</li><li>(00:36:37) - Microsoft Unveils $100 Million AI Data Center</li><li>(00:40:31) - Azure SQL Server 2020: Managed Instance</li><li>(00:43:20) - AKS Automatic for Kubernetes + Azure Cloud</li><li>(00:45:49) - Databricks 1.4</li><li>(00:47:11) - Microsoft's HPC Infrastructure: HBV5 Series VMs</li><li>(00:52:08) - NET (for Mobile, Desktop, and More)</li><li>(00:53:12) - Azure Monitor Kubernetes: Higher throughput & more</li><li>(00:54:56) - Microsoft SQL: Integrations with Grafana</li><li>(01:01:59) - Microsoft Expands Fabric with New Features and Collaboration</li><li>(01:05:21) - Azure Application Gateway: zero downtime upgrade capability</li><li>(01:07:28) - Oracle's AI Strategy: Setting the Standard</li><li>(01:10:42) - Week in Cloud: Exploring the Cloud</li><li>(01:11:26) - The Need for Prompt Engineering in Cloud Software</li><li>(01:18:28) - Image Generation with Google GPT5</li><li>(01:22:04) - A Week in the Life</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 323 of The Cloud Pod, where the forecast is always cloudy! Justin, Matt and Ryan are in the studio tonight to bring you all the latest in cloud and AI news! This week we have a close call from Entra, some DeepSeek news, Firestore, and even an acquisition! Make sure to stay tuned for the aftershow – and Matt obviously falling asleep on the job. Let’s get started! 
Titles we almost went with this week

When One Key Opens Every Door: Microsoft’s Close Call with Cloud Catastrophe
Bedrock Goes Qwen-tum: Alibaba’s Models Join the AWS Party
DeepSeek and You Shall Find V3.1 in Bedrock
GPUs of Unusual Size? I Don’t Think They Exist (Narrator: They Do)
Kubernetes Without the Kubernightmares
Firestore and Forget: AI Takes the Wheel SCPs Get Their Full License: IAM Language Edition
Do What I Meant, Not What I Prompted
Atlassian Pays a Billion to DX the Developer Experience
Entra at Your Own Risk: The Azure Identity Crisis That Almost Was
Oracle Intelligence: The AI Nobody Asked For
Wisconsin Gets Cheesy with AI: Microsoft’s Dairy State Datacenter 
Azure Opens the Data Floodgates (But Only in Europe)
PostgreSQL Gets a Security Blanket and Won’t Share Its TEEs
Microsoft’s New Cooling System Has Veins Like a Leaf and Runs Hotter Than Your Gaming PC
Azure Gets Cold Feet About Hot Chips, Decides to Go With the Flow


AI Is Going Great – Or How ML Makes Money 
00:58 Google and Kaggle launch AI Agents Intensive course

Google and Kaggle are launching a 5-day intensive course on AI agents from November 10-14. 
This follows their GenAI course that attracted 280,000 learners, with curriculum covering agent architectures, tools, memory systems, and production deployment.
The course focuses on building autonomous AI agents and multi-agent systems, which represents a shift from traditional single-model AI to systems that can independently perform tasks, make decisions, and interact with tools and APIs.
This development signals growing enterprise interest in AI agents for cloud environments, where autonomous systems can manage infrastructure, optimize resources, and handle complex workflows without constant human intervention.
The hands-on approach includes codelabs and a capstone project, indicating Google’s push to democratize agent development skills as businesses increasingly need engineers who can build production-ready autonomous systems.
The timing aligns with major cloud providers racing to offer agent-based services, as AI agents become essential for automating cloud operations, customer service, and business processes at scale.
Interested in registering? You can do that here. 

Cloud Tools 
03:21 Atlassian acquires DX, a developer productivity platform, for $1B

Atlassian is acquiring DX, a developer productivity ana...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[323: Databricks One: Because Seven Eight Nine]]>
                </itunes:title>
                                    <itunes:episode>323</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<h3>Welcome to episode 323 of The Cloud Pod, where the forecast is always cloudy! Justin, Matt and Ryan are in the studio tonight to bring you all the latest in cloud and AI news! This week we have a close call from Entra, some DeepSeek news, Firestore, and even an acquisition! Make sure to stay tuned for the aftershow – and Matt obviously falling asleep on the job. Let’s get started! </h3>
<h3>Titles we almost went with this week</h3>
<ul>
<li>When One Key Opens Every Door: Microsoft’s Close Call with Cloud Catastrophe</li>
<li>Bedrock Goes Qwen-tum: Alibaba’s Models Join the AWS Party</li>
<li>DeepSeek and You Shall Find V3.1 in Bedrock</li>
<li>GPUs of Unusual Size? I Don’t Think They Exist (Narrator: They Do)</li>
<li>Kubernetes Without the Kubernightmares</li>
<li>Firestore and Forget: AI Takes the Wheel SCPs Get Their Full License: IAM Language Edition</li>
<li>Do What I Meant, Not What I Prompted</li>
<li>Atlassian Pays a Billion to DX the Developer Experience</li>
<li>Entra at Your Own Risk: The Azure Identity Crisis That Almost Was</li>
<li>Oracle Intelligence: The AI Nobody Asked For</li>
<li>Wisconsin Gets Cheesy with AI: Microsoft’s Dairy State Datacenter </li>
<li>Azure Opens the Data Floodgates (But Only in Europe)</li>
<li>PostgreSQL Gets a Security Blanket and Won’t Share Its TEEs</li>
<li>Microsoft’s New Cooling System Has Veins Like a Leaf and Runs Hotter Than Your Gaming PC</li>
<li>Azure Gets Cold Feet About Hot Chips, Decides to Go With the Flow
</li>
</ul>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>00:58 <a href="https://blog.google/technology/developers/ai-agents-intensive/">Google and Kaggle launch AI Agents Intensive course</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.google.com/">Google</a> and <a href="https://www.kaggle.com/">Kaggle</a> are launching a <a href="https://blog.google/technology/developers/google-kaggle-genai-intensive-recap-2025/">5-day intensive course on AI agents</a> from November 10-14. </li>
<li style="font-weight:400;">This follows their GenAI course that attracted 280,000 learners, with curriculum covering agent architectures, tools, memory systems, and production deployment.</li>
<li style="font-weight:400;">The course focuses on building autonomous AI agents and multi-agent systems, which represents a shift from traditional single-model AI to systems that can independently perform tasks, make decisions, and interact with tools and APIs.</li>
<li style="font-weight:400;">This development signals growing enterprise interest in AI agents for cloud environments, where autonomous systems can manage infrastructure, optimize resources, and handle complex workflows without constant human intervention.</li>
<li style="font-weight:400;">The hands-on approach includes codelabs and a capstone project, indicating Google’s push to democratize agent development skills as businesses increasingly need engineers who can build production-ready autonomous systems.</li>
<li style="font-weight:400;">The timing aligns with major cloud providers racing to offer agent-based services, as AI agents become essential for automating cloud operations, customer service, and business processes at scale.</li>
<li style="font-weight:400;">Interested in registering? You can do that <a href="https://rsvp.withgoogle.com/events/google-ai-agents-intensive_2025">here</a>. </li>
</ul>
<h2>Cloud Tools </h2>
<p>03:21 <a href="https://techcrunch.com/2025/09/18/atlassian-acquires-dx-a-developer-productivity-platform-for-1b/">Atlassian acquires DX, a developer productivity platform, for $1B</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.bing.com/ck/a?!&amp;&amp;p=93a849bbd67b7e45bb6c7617d46055ea9eb77c2d945277bc96bdb39b2182571cJmltdHM9MTc1OTE5MDQwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=Atlassian&amp;u=a1aHR0cHM6Ly93d3cuYXRsYXNzaWFuLmNvbS8">Atlassian</a> is acquiring <a href="https://getdx.com/">DX</a>, a developer productivity analytics platform, for $1 billion after failing to build their own solution internally for three years. </li>
<li style="font-weight:400;">DX analyzes engineering team productivity, and identifies bottlenecks without making developers feel surveilled.</li>
<li style="font-weight:400;">DX provides both qualitative and quantitative insights into developer productivity, helping enterprises understand what’s slowing down their engineering teams. </li>
<li style="font-weight:400;">The platform serves over 350 enterprise customers including ADP, Adyen, and GitHub.</li>
<li style="font-weight:400;">The acquisition is particularly timely, as companies struggle to measure ROI on AI tool investments and understand if their growing AI budgets are being spent effectively. DX can help track how these tools impact developer productivity.</li>
<li style="font-weight:400;">90% of DX’s customers already use Atlassian tools, making this a natural integration that creates an end-to-end workflow. </li>
<li style="font-weight:400;">Teams can identify bottlenecks with DX analytics then use Atlassian’s project management tools to address them.</li>
<li style="font-weight:400;">Despite serving major enterprises and tripling their customer base annually, DX raised less than $5 million in venture funding. This bootstrapped approach aligned with Atlassian’s own growth philosophy.</li>
</ul>
<p>04:30 Justin – “I use DX, I actually really like DX, some I’m hoping Atlassian doesn’t F it up.” </p>
<h2>AWS</h2>
<p>06:51 <a href="https://aws.amazon.com/blogs/aws/qwen-models-are-now-available-in-amazon-bedrock/">Qwen models are now available in Amazon Bedrock | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/bedrock/?trk=ba8b32c9-8088-419f-9258-82e9375ad130&amp;sc_channel=el">Amazon Bedrock</a> adds four <a href="https://qwen.ai/home?trk=ba8b32c9-8088-419f-9258-82e9375ad130&amp;sc_channel=el">Qwen3</a> models from <a href="https://www.bing.com/ck/a?!&amp;&amp;p=3fabe54a44d689dd2c750651d24322f5a39baeae75b439006ca6ea6223dde06cJmltdHM9MTc1OTE5MDQwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=Alibaba+Qwen&amp;u=a1aHR0cHM6Ly93d3cuYWxpYmFiYWdyb3VwLmNvbS9kb2N1bWVudC0xODUzOTQwMjI2OTc2NjQ1MTIw">Alibaba</a>, including mixture-of-experts (MoE) and dense architectures, with the largest Qwen3-Coder-480B having 480B total parameters but only activating 35B per request for efficient inference.</li>
<li style="font-weight:400;">The models introduce hybrid thinking modes that allow developers to choose between step-by-step reasoning for complex problems or fast responses for simpler tasks, helping balance performance and cost trade-offs.</li>
<li style="font-weight:400;">Qwen3-Coder models support up to 256K tokens natively (1M with extrapolation), enabling repository-scale code analysis and long-context processing without chunking, while maintaining strong performance on coding benchmarks.</li>
<li style="font-weight:400;">All models are available as fully managed serverless offerings across multiple regions with no infrastructure setup required, and Amazon Bedrock automatically enables access for all AWS accounts starting October 2025.</li>
<li style="font-weight:400;">Key use cases include agentic workflows with built-in tool calling capabilities, code generation across entire repositories, and cost-optimized deployments using the smaller Qwen3-32B dense model for edge computing scenarios.</li>
</ul>
<p>07:22 <a href="https://aws.amazon.com/blogs/aws/deepseek-v3-1-now-available-in-amazon-bedrock/">DeepSeek-V3.1 model now available in Amazon Bedrock | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/bedrock/deepseek?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">DeepSeek-V3.1</a> is now available in <a href="https://aws.amazon.com/bedrock/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">Amazon Bedrock</a> as a fully managed foundation model that switches between thinking mode for step-by-step reasoning and non-thinking mode for faster direct answers, with <a href="https://aws.amazon.com/blogs/aws/deepseek-r1-now-available-as-a-fully-managed-serverless-model-in-amazon-bedrock/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">AWS being the first cloud provider to offer DeepSeek models in a serverless deployment</a>.</li>
<li style="font-weight:400;">The model delivers improved performance in code generation, debugging, and software engineering workflows while supporting over 100 languages with near-native proficiency, making it suitable for global enterprise applications and multilingual customer service implementations.</li>
<li style="font-weight:400;">Key technical capabilities include enhanced tool calling through post-training optimization, structured tool usage for agentic workflows, and integration with <a href="https://aws.amazon.com/bedrock/guardrails/">Amazon Bedrock Guardrails</a> for implementing custom safeguards and responsible AI policies.</li>
<li style="font-weight:400;">Available in 5 <a href="https://docs.aws.amazon.com/bedrock/latest/userguide/models-regions.html?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">AWS regions</a> (US West Oregon, Asia Pacific Tokyo/Mumbai, Europe London/Stockholm) with support for both InvokeModel and Converse APIs, allowing developers to toggle between reasoning modes based on use case requirements.</li>
<li style="font-weight:400;">AWS is simplifying model access by automatically enabling all serverless foundation models for every AWS account starting October 2025, eliminating manual activation while maintaining IAM and SCP controls for administrators to restrict access as needed.</li>
</ul>
<p>08:00  Justin – “I’m still skeptical about DeepSeek; because it sounded like it was derivative of ChatGPT, so I don’t really know what you’re getting out of it, other than it’s something cheaper.”  </p>
<p>08:34 <a href="https://aws.amazon.com/about-aws/whats-new/2025/09/amazon-rds-mysql-innovation-release-94-database-preview-environment/">Amazon RDS for MySQL announces Innovation Release 9.4 in Amazon RDS </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/09/amazon-rds-mysql-innovation-release-94-database-preview-environment/">Database Preview Environment</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/rds/mysql/">Amazon RDS</a> now offers MySQL Innovation Release 9.4 in its <a href="https://aws.amazon.com/rds/databasepreview/">Database Preview Environment</a>, giving customers early access to the latest MySQL features including bug fixes, security patches, and new capabilities before general availability.</li>
<li style="font-weight:400;">The Preview Environment provides a fully managed database experience for testing <a href="https://dev.mysql.com/doc/relnotes/mysql/9.4/en/">MySQL 9.4</a> with both Single-AZ and Multi-AZ deployments on latest generation instances, though databases are automatically deleted after 60 days.</li>
<li style="font-weight:400;">MySQL Innovation Releases follow a different support model than LTS versions – Innovation releases are only supported until the next minor release while LTS versions like MySQL 8.0 and 8.4 receive up to 8 years of community support.</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/AmazonRDS/latest/UserGuide/MySQL.Concepts.VersionMgmt.html#mysql-working-with-the-database-preview-environment">Preview Environment instances</a> are priced identically to production RDS instances in US East (Ohio), making it cost-neutral for organizations to test new MySQL versions before committing to production upgrades.</li>
<li style="font-weight:400;">This preview capability allows database teams to validate application compatibility and performance with MySQL 9.4 features in a production-like environment without risking their main workloads.</li>
<li style="font-weight:400;"><a href="https://dev.mysql.com/blog-archive/introducing-mysql-innovation-and-long-term-support-lts-versions/">https://dev.mysql.com/blog-archive/introducing-mysql-innovation-and-long-term-support-lts-versions/</a> </li>
<li style="font-weight:400;">Please: DO NOT use this for production! </li>
</ul>
<p>09:45  Ryan – “My experience with database upgrades is the opposite. No matter how much preview is offered in time and enticement, you’ll still have to kick everyone off the older version kicking and screaming.”</p>
<p>11:50 <a href="https://aws.amazon.com/about-aws/whats-new/2025/09/aws-organizations-iam-language-service-control-policies/">AWS Organizations supports full IAM policy language for service control </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/09/aws-organizations-iam-language-service-control-policies/">policies (SCPs)</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/organizations/latest/userguide/orgs_introduction.html">AWS Organizations</a> now supports the full IAM policy language for Service Control Policies (SCPs), enabling conditions, individual resource ARNs, and NotAction elements with Allow statements – bringing SCPs to feature parity with IAM managed policies.</li>
<li style="font-weight:400;">This enhancement allows organizations to create more precise permission guardrails, such as restricting access to specific S3 buckets or EC2 instances across all accounts using condition statements, rather than blanket service-level restrictions.</li>
<li style="font-weight:400;">The addition of wildcards at the beginning or middle of Action strings, and the NotResource element enables more flexible policy patterns, reducing the need for multiple SCPs to achieve complex permission boundaries.</li>
<li style="font-weight:400;">Existing SCPs remain fully compatible with no migration required, making this a zero-friction upgrade that immediately benefits organizations using AWS Organizations for multi-account governance.</li>
<li style="font-weight:400;">The feature is available in all commercial and GovCloud regions at no additional cost, strengthening AWS Organizations’ position as the primary tool for enterprise-wide security governance.</li>
</ul>
<p>12:43  Ryan – “They actually had the stones to say zero friction and SCP in the same article, huh?” </p>
<p>14:11 <a href="https://aws.amazon.com/about-aws/whats-new/2025/09/amazon-q-developer-remote-mcp-servers/">Amazon Q Developer CLI announces support for remote MCP servers</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/developer/learning/q-developer-cli/">Amazon Q Developer CLI</a> now supports remote <a href="https://modelcontextprotocol.io/introduction">Model Context Protocol (MCP)</a> servers, enabling centralized tool management with HTTP transport and OAuth authentication for services like <a href="https://www.atlassian.com/">Atlassian</a> and <a href="https://github.com/">GitHub</a>.</li>
<li style="font-weight:400;">This shifts compute resources from local machines to centralized servers, reducing individual developer workload while providing better access control and security management for development teams.</li>
<li style="font-weight:400;">Remote MCP servers allow Q Developer CLI to query available tools from external services after authentication, making third-party integrations more scalable across development organizations.</li>
<li style="font-weight:400;">Configuration requires specifying HTTP transport type, authentication URL, and optional headers in either custom agent configuration or mcp.json files.</li>
<li style="font-weight:400;">The feature is available in both Q Developer CLI and IDE plugins, expanding the ways developers can leverage centralized tool management in their existing workflows.</li>
</ul>
<p>15:18  Justin – “I think having it centralized is ideal, especially from a security and access control perspective. It’s a bit of a problem when these MCPS are running on everyone’s laptops – because that means they may not be consistent, they may not all follow all the same permissions models you need them to, or different access rights…so there’s lots of reasons why you’d like to have a remote MCP.”    </p>
<p>15:54 <a href="https://aws.amazon.com/blogs/aws/accelerate-ai-agent-development-with-the-nova-act-ide-extension/">Accelerate AI agent development with the Nova Act IDE extension</a></p>
<ul>
<li style="font-weight:400;">AWS launches <a href="https://github.com/aws/nova-act-extension">Nova Act extension</a>, a free IDE plugin for <a href="https://code.visualstudio.com/">VS Code</a>, <a href="https://cursor.com/en">Cursor</a>, and <a href="https://kiro.dev/">Kiro</a> that enables developers to build browser automation agents using natural language prompts and the Nova Act model without switching between coding and testing environments.</li>
<li style="font-weight:400;">The extension features a notebook-style builder mode that breaks automation scripts into modular cells for individual testing, plus integrated debugging with live browser preview and execution logs for complex multi-step workflows.</li>
<li style="font-weight:400;">Developers can generate automation scripts through natural language chat or use predefined templates for common tasks like shopping automation, data extraction, QA testing, and form filling, then customize with APIs and authentication.</li>
<li style="font-weight:400;">Built on the open-source <a href="https://nova.amazon.com/act">Nova Act SDK</a> (Apache 2.0 license), the extension provides a complete agent development lifecycle within the IDE – from prototyping with natural language to production-grade script validation.</li>
<li style="font-weight:400;">This positions AWS deeper into the AI agent development space, competing with standalone automation tools by integrating agent creation directly into developer workflows at no additional cost beyond Nova Act API usage.</li>
</ul>
<p>17:39  Ryan – “I get why this is more than just a model, right? This is a specific workflow for development, and there’s clearly extensions and features in here that are above and beyond what’s in Kiro and Q, presumably, but they’d have to be really good.”</p>
<h2>GCP</h2>
<p>18:07 <a href="https://cloud.google.com/blog/products/identity-security/new-gce-and-gke-dashboards-strengthen-security-posture/">New GCE and GKE dashboards strengthen security posture</a></p>
<ul>
<li style="font-weight:400;">Google embeds <a href="https://cloud.google.com/security/products/security-command-center">Security Command Center</a> insights directly into <a href="https://cloud.google.com/products/compute">GCE</a> and <a href="https://cloud.google.com/kubernetes-engine">GKE </a>consoles, providing security dashboards that surface misconfigurations, vulnerabilities, and active threats without requiring separate security tools or interfaces.</li>
<li style="font-weight:400;">The GCE dashboard displays top security findings, vulnerability trends over time, and CVE prioritization powered by Google Threat Intelligence and Mandiant analysis, helping teams identify which VMs to patch first based on exploitability and impact.</li>
<li style="font-weight:400;"><a href="https://console.cloud.google.com/kubernetes/security/dashboard">GKE’s security</a> dashboard focuses on workload configurations, container threats like cryptomining and privilege escalation, and software vulnerabilities specific to Kubernetes environments, addressing common container security blind spots.</li>
<li style="font-weight:400;">While basic security findings are included free, accessing vulnerability and threat widgets requires Security Command Center Premium with a 30-day trial available, positioning this as a value-add upsell for existing GCP customers.</li>
<li style="font-weight:400;">This integration approach differs from AWS and Azure which typically require navigating to separate security services, potentially reducing context switching for infrastructure teams managing day-to-day operations.</li>
</ul>
<p>18:58  Ryan – “I got to play around with this and it’s really cool. I love getting that security information front and center for developers and the people actually using the platform. You know, as, as a security professional, we have all this information that’s devoid of context, and, if you’re lucky, you know enough to build a detection and be able to query a workflow. It’s going to just fire off a ticket that no one’s going to look at. And so this is, I think, putting it right in the console, I think that some people – not everyone – will take the initiative and be like, this is very red. I should make it not so red.”</p>
<p>20:53 <a href="https://cloud.google.com/blog/products/ai-machine-learning/firestore-support-and-custom-tools-in-mcp-toolbox/">Firestore support and custom tools in MCP Toolbox</a></p>
<ul>
<li style="font-weight:400;">Google expands<a href="https://cloud.google.com/blog/products/ai-machine-learning/mcp-toolbox-for-databases-now-supports-model-context-protocol?e=48754805"> MCP Toolbox</a> to support <a href="https://cloud.google.com/products/firestore">Firestore</a>, enabling developers to connect AI assistants directly to their NoSQL databases through natural language commands for querying, updating documents, and validating security rules.</li>
<li style="font-weight:400;">The integration allows developers to perform database operations without writing code – for example, asking an AI assistant to “find all users whose wishlists contain discontinued product IDs” or “remove specific items from multiple user documents” directly from their IDE or<a href="https://cloud.google.com/gemini/docs/codeassist/gemini-cli"> CLI</a>.</li>
<li style="font-weight:400;">This positions Google alongside <a href="https://www.anthropic.com/news/model-context-protocol">Anthropic’s Model Context Protocol standard</a>, providing a unified way for AI systems to interact with enterprise data sources, though AWS and Azure haven’t announced similar MCP-compatible database tooling yet.</li>
<li style="font-weight:400;">The Firestore tools support document retrieval, collection queries, document updates, and security rule validation, addressing common developer pain points like debugging data issues and testing access controls before deployment.</li>
<li style="font-weight:400;">Web and mobile app developers building on Firestore can now complete tasks that previously required manual console navigation or custom scripts in minutes through conversational AI, particularly useful for e-commerce, social apps, and any application with complex document structures.</li>
</ul>
<p>21:46  Ryan – “As someone who never wants to write SQL queries ever again, I love these types of things. This is exactly how I want to interact with a database.” </p>
<p>23:17 <a href="https://blog.google/technology/developers/dora-report-2025/">How are developers using AI? Inside Google’s 2025 DORA report</a></p>
<ul>
<li style="font-weight:400;">Google’s 2025 <a href="https://dora.dev/">DORA</a> report shows AI adoption among software developers has reached 90%, up 14% from last year, with developers spending a median of 2 hours daily using AI tools for development tasks.</li>
<li style="font-weight:400;">Despite 80% of developers reporting productivity gains and 59% seeing improved code quality, a trust paradox exists where 30% trust AI “a little” or “not at all”, suggesting AI serves as a supportive tool rather than replacing human judgment.</li>
<li style="font-weight:400;">The report identifies seven team archetypes from “Harmonious high-achievers” to “Legacy bottleneck” teams, revealing that AI acts as both a mirror and multiplier – amplifying efficiency in cohesive organizations while exposing weaknesses in fragmented ones.</li>
<li style="font-weight:400;">Google introduces the <a href="https://cloud.google.com/blog/products/ai-machine-learning/introducing-doras-inaugural-ai-capabilities-model">DORA AI Capabilities Model</a>, a blueprint of seven essential capabilities combining technical and cultural factors needed for successful AI adoption in software development organizations.</li>
<li style="font-weight:400;">While AI adoption now correlates with higher software delivery throughput (reversing last year’s findings), organizations still face challenges ensuring software quality before delivery, indicating adoption alone doesn’t guarantee success.</li>
<li style="font-weight:400;">HBR Article Justin and Ryan mentioned: <a href="https://hbr.org/2025/09/ai-generated-workslop-is-destroying-productivity">https://hbr.org/2025/09/ai-generated-workslop-is-destroying-productivity</a> </li>
</ul>
<h2>Azure</h2>
<p>31:25 <a href="https://arstechnica.com/security/2025/09/microsofts-entra-id-vulnerabilities-could-have-been-catastrophic/">Microsoft’s Entra ID vulnerabilities could have been catastrophic</a></p>
<ul>
<li style="font-weight:400;">Security researcher Dirk-jan Mollema discovered two critical vulnerabilities in Microsoft’s <a href="https://entra.microsoft.com/?culture=en-us&amp;country=us">Entra ID</a> (formerly Azure Active Directory) that could have allowed attackers to gain global administrator privileges across all <a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=Lr2FNhr9EQ4lZX2Zz2R9oeFRox8c9uW7_i0MLGE1H1ed8u5lYjwOhPLv8E__0Ufju05ubFIViCDikiITVzlipyNcWMfA8seYqzKLx3zTMTR6nVos_MUFFXBjyiHnlL24.3WHRQAS7Y9KUMFrXxR1KXg&amp;eddgt=NG4zQ9dEmsMQlAUKv6yXIw%3D%3D&amp;rut=1b40352c076a7318543c2afaed57e126bdae8541235819c7dde81e909399f46a&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8oDxh7keVIm4RouVgk8N4SjVUCUz-aN0uDYtaAYK_eOT5XMm5EGvMtsGY7YbLTc8mFRyts_jjA7nyGgduocwrd7QcdTjZepe_ZUDxMWQCToKH8QZijzP5EKIUIJzSIeZT8FxH0vPn3Tyvqii_KfH22snnNqwPphOHRdM-D4eOAcnYpHNJRarrIVF_cgC3Zx9TxtSJRe-8bV26jiZO-EanpPI3K2kNxz9eOIvItrk4ctv-F4d8QrBtIp_VxVIM3QbYkFS7TjNDrE7N29sIswTVMuEBM0cWDXPcM4qE6OEZmrGBjzgxB5Xr0dum2X0fHlQaVDoIV2ZT4cm76jyEziR-_2CXhu3QkLlLFS2JXQFaGtBBsOUudgQ6g4AdtN_YeMeAhc29DUkvEVrGNg5khhspNdRXLnlRfdGhoGHJjRj-9KAwD5KkQqVrIaJ-qzdd1YrZmn4S4YHnU-zBvkSUZDzKFRa5O7BqLBAYHujYann5TKaIxCxHL-rB71pi54CqbkZBiuFSgzAahPv6pDJ4bo4pV9Y4Fm8_SeC_jGoKWxw_2aEoaUvWyWWm5P_ET7niMQOSy2uzBjmTBsksWC2WnemLt9tJ_DCT3GGts9ST01m3U4kSY3rtGCbX1fUVkNNT9QpoIKtRkD9O_XNqLFZVGxYt3ghcGdZuGK36JJGBtBoPrnpn_Mcw8YSsLqjTBbb-T8GeVqz8cwYYG4yovrt2ELqYCxX2FO9TdWMKCocMuxaKQP5WfwVm3ovQZJ0N87x7aafZIKHU2g%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0MzklMjZjYW1wJTNkMTY4ODA3JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDE5MjElMjZjcml0ZXJpYWlkJTNka3dkLTc4ODIxODQ2NDc2Njc1JTNhbG9jLTE5MCUyNmNhbXBhaWduaWQlM2Q1OTAyMzczNTglMjZsb2NwaHklM2Q4MDUxMSUyNmFkZ3JvdXBpZCUzZDEyNjExNDE0ODk3MDI1NDUlMjZjaWQlM2Q3ODgyMTQ0ODU3MjY3MyUyNmtkdiUzZGMlMjZrZXh0JTNkJTI2a3BnJTNkJTI2a3BpZCUzZCUyNnF1ZXJ5U3RyJTNkQXp1cmUlMjZ1cmwlM2RodHRwcyUzYSUyZiUyZmF6dXJlLm1pY3Jvc29mdC5jb20lMmZlbi11cyUyZnByaWNpbmclMmZwdXJjaGFzZS1vcHRpb25zJTJmYXp1cmUtYWNjb3VudCUyZnNlYXJjaCUzZmVmX2lkJTNkX2tfOTA1ODI4OTllNjlmMWUwNTNhODE0MDQ1ODdiODkyMjlfa18lMjZPQ0lEJTNkQUlEY21tNWVkc3dkdXVfU0VNX19rXzkwNTgyODk5ZTY5ZjFlMDUzYTgxNDA0NTg3Yjg5MjI5X2tfJTI2bXNjbGtpZCUzZDkwNTgyODk5ZTY5ZjFlMDUzYTgxNDA0NTg3Yjg5MjI5%26rlid%3D90582899e69f1e053a81404587b89229&amp;vqd=4-318972028386034827997776754807182358777&amp;iurl=%7B1%7DIG%3DFC3F9D03C5EA4E61840693F0A5435D3B%26CID%3D2B23CBBA90286B6B0D3FDDCD91926A87%26ID%3DDevEx%2C5046.1">Azure</a> customer tenants worldwide, potentially compromising every organization’s user identities, access controls, and subscription management tools.</li>
<li style="font-weight:400;">The vulnerabilities enabled an attacker with just a test or trial tenant to request tokens that could impersonate any user in any other tenant, allowing them to modify configurations, create admin users, and essentially achieve complete control over customer environments – a scenario that represents one of the most severe cloud security risks possible.</li>
<li style="font-weight:400;">Microsoft has presumably patched these vulnerabilities following Mollema’s responsible disclosure, but the incident highlights the concentration risk of centralized cloud identity systems where a single vulnerability can expose millions of organizations simultaneously, unlike traditional on-premises Active Directory deployments.</li>
<li style="font-weight:400;">This discovery underscores why organizations need defense-in-depth strategies even when using major cloud providers, including monitoring for unusual administrative actions, implementing conditional access policies, and maintaining incident response plans that account for potential cloud provider compromises.</li>
<li style="font-weight:400;">For Azure customers, this serves as a reminder to review Entra ID security configurations, enable all available security features like Privileged Identity Management, and ensure proper logging and alerting are configured to detect potential unauthorized access attempts or configuration changes.</li>
</ul>
<p>32:52  Matt – “We had a problem. We fixed the problem. Buy more stuff from us so you don’t have any problems in the future.”</p>
<p>36:56 <a href="https://blogs.microsoft.com/blog/2025/09/18/inside-the-worlds-most-powerful-ai-datacenter/">Inside the world’s most powerful AI datacenter – The Official Microsoft</a> <a href="https://blogs.microsoft.com/blog/2025/09/18/inside-the-worlds-most-powerful-ai-datacenter/">Blog</a></p>
<ul>
<li style="font-weight:400;">Microsoft <a href="https://aka.ms/WIdatacenter">unveiled</a> Fairwater in Wisconsin, a 315-acre AI datacenter with 1.2 million square feet that operates as a single supercomputer using NVIDIA GB200 servers with 72 GPUs per rack delivering 865,000 tokens per second, positioning it as 10x more powerful than current supercomputers.</li>
<li style="font-weight:400;">The facility uses closed-loop liquid cooling with zero operational water waste and a two-story rack configuration to minimize latency, while Azure’s reengineered storage can handle over 2 million read/write transactions per second per account with exabyte-scale capacity.</li>
<li style="font-weight:400;">Microsoft is building identical Fairwater datacenters across the US and partnering with nScale for facilities in Norway and the UK, all interconnected via AI WAN to create a distributed supercomputer network that pools compute resources across regions.</li>
<li style="font-weight:400;">This infrastructure specifically targets OpenAI, Microsoft AI, and Copilot workloads, with Azure being first to deploy NVIDIA GB200 at datacenter scale – a notable advantage over AWS and GCP who haven’t announced similar GB200 deployments.</li>
<li style="font-weight:400;">The investment represents tens of billions of dollars and positions Microsoft to offer frontier AI training capabilities that smaller cloud providers can’t match, though pricing details weren’t disclosed and will likely command premium rates given the specialized hardware.</li>
</ul>
<p>40:17 <a href="https://techcommunity.microsoft.com/blog/azuresqlblog/introducing-new-update-policy-for-azure-sql-managed-instance/4454895">Introducing new update policy for Azure SQL Managed Instance | </a><a href="https://techcommunity.microsoft.com/blog/azuresqlblog/introducing-new-update-policy-for-azure-sql-managed-instance/4454895">Microsoft Community Hub</a></p>
<ul>
<li style="font-weight:400;">Azure SQL Managed Instance now offers three <a href="https://aka.ms/sqlmiupdatepolicydocs">update policy</a> options: Always-up-to-date for immediate access to new SQL engine features, SQL Server 2022 for fixed feature sets matching on-premises versions, and the new SQL Server 2025 policy (preview) that provides database portability while including recent innovations like vector data types and JSON functions.</li>
<li style="font-weight:400;">The SQL Server 2025 policy bridges the gap between cloud innovation and enterprise requirements for regulatory compliance or contractual obligations, allowing organizations to maintain compatibility with on-premises SQL Server 2025 while benefiting from managed service capabilities.</li>
<li style="font-weight:400;">Key technical additions in the 2025 policy include <a href="https://learn.microsoft.com/en-us/sql/relational-databases/performance/optimized-locking">optimized locking</a> for better concurrency, native <a href="https://learn.microsoft.com/en-us/sql/t-sql/data-types/vector-data-type">vector data type</a> support for AI workloads, <a href="https://learn.microsoft.com/en-us/sql/relational-databases/regular-expressions/overview">regular expression functions</a>, <a href="https://learn.microsoft.com/en-us/sql/t-sql/data-types/json-data-type">JSON data type</a> with aggregate functions, and the ability to <a href="https://learn.microsoft.com/en-us/sql/relational-databases/system-stored-procedures/sp-invoke-external-rest-endpoint-transact-sql">invoke HTTP REST endpoints</a> directly from T-SQL.</li>
<li style="font-weight:400;">This positions <a href="https://www.bing.com/aclk?ld=e86E48u162jfc4FbOWsmxgGTVUCUzukrLKttBRgP7frIdj_NJNH7elWluaLzZSsUFYg5GF4EfyRbwxgVNuX28EQJlFevKY1WlssbdzhpotFa6hj4dAOiRNw3lgdbE32dMYAmCIavJ3TxLMFpFj3nodM_cfetYHKMCHk_s4rqlcXdA6AijTktMT_Zm2XvJgwFWV9ekeQQ&amp;u=aHR0cHMlM2ElMmYlMmZhenVyZS5taWNyb3NvZnQuY29tJTJmZW4tdXMlMmZwcm9kdWN0cyUyZmF6dXJlLXNxbCUyZiUzZmVmX2lkJTNkX2tfYzdmMDlkZjU3Nzc3MWYzYzczOTBmNTI5YmNlMThiZTNfa18lMjZPQ0lEJTNkQUlEY21tNWVkc3dkdXVfU0VNX19rX2M3ZjA5ZGY1Nzc3NzFmM2M3MzkwZjUyOWJjZTE4YmUzX2tfJTI2bXNjbGtpZCUzZGM3ZjA5ZGY1Nzc3NzFmM2M3MzkwZjUyOWJjZTE4YmUz&amp;rlid=c7f09df577771f3c7390f529bce18be3">Azure SQL Managed Instance</a> competitively against <a href="https://www.bing.com/ck/a?!&amp;&amp;p=c80063acff1b8e4f0d7c90b751543441ece27698ad2b556115773d9acd21ee34JmltdHM9MTc1OTE5MDQwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=AWS+RDS&amp;u=a1aHR0cHM6Ly9kb2NzLmF3cy5hbWF6b24uY29tL0FtYXpvblJEUy9sYXRlc3QvVXNlckd1aWRlL1dlbGNvbWUuaHRtbA">AWS RDS</a> and <a href="https://www.bing.com/aclk?ld=e8Wkp3x25dP-A-TJvHkvOSCzVUCUwW1zoOB_D-NdH_9SmWmLEZg4L08BtokuumMAzY6ObtiQ8iLBX13_dV7py6jHGaMrG_LN_zHzcp8eXkYxT5OgFD0iv2saQS6iRMFcaE5wuHKgCAgbnaSp35870dNkK2Oz64YIdNZWepvNVsF4GgbP51jpkpRWy2UIg-vciyru8MUw&amp;u=aHR0cHMlM2ElMmYlMmZjbG91ZC5nb29nbGUuY29tJTJmc3FsJTNmdXRtX3NvdXJjZSUzZGJpbmclMjZ1dG1fbWVkaXVtJTNkY3BjJTI2dXRtX2NhbXBhaWduJTNkbmEtVVMtYWxsLWVuLWRyLWJrd3MtYWxsLWFsbC10cmlhbC1lLWRyLTE3MTAxMzQlMjZ1dG1fY29udGVudCUzZHRleHQtYWQtbm9uZS1hbnktREVWX2MtQ1JFXy1BREdQX0Rlc2slMmIlMjU3QyUyYkJLV1MlMmItJTJiRVhBJTJiJTI1N0MlMmJUeHQtRGF0YWJhc2VzLUNsb3VkJTJiU1FMLUtXSURfMjg0ODk5MzY2OTEta3dkLTc3MTcyMzAxOTEzMTEwJTNhbG9jLTE5MCUyNnV0bV90ZXJtJTNkS1dfZ29vZ2xlJTI1MjBjbG91ZCUyNTIwc3FsLVNUX2dvb2dsZSUyYmNsb3VkJTJic3FsJTI2Z2NsaWQlM2RmYTY1MDI2OGFlODMxZWJmNGVmZjZhYjQ2NTMzNTY5MyUyNmdjbHNyYyUzZDNwLmRzJTI2bXNjbGtpZCUzZGZhNjUwMjY4YWU4MzFlYmY0ZWZmNmFiNDY1MzM1Njkz&amp;rlid=fa650268ae831ebf4eff6ab465335693">Google Cloud SQL</a> by offering more granular control over feature adoption timelines, addressing enterprise concerns about database portability while AWS typically forces customers into their latest engine versions.</li>
<li style="font-weight:400;">Organizations using SQL Server 2022 policy should plan migrations before mainstream support ends in 2027, as instances will automatically upgrade to newer policies at end of support, making this particularly relevant for enterprises with strict change management requirements.</li>
</ul>
<p>41:54  Matt – “This is different, because Azure is complicated – because Azure. You have Azure SQL, which is RDS, it’s fully managed. You have Azure Managed Instances, or Azure SQL managed instances, which is SQL on a server. You have access to the server, but they give you extra visibility and everything else into the SQL on that box, and can do the upgrades and stuff.”</p>
<p>43:19 <a href="https://azure.microsoft.com/en-us/blog/azure-kubernetes-service-automatic-fast-and-frictionless-kubernetes-for-all/">Fast, Secure Kubernetes with AKS Automatic | Microsoft Azure Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/azure/aks/intro-aks-automatic">AKS Automatic</a> delivers production-ready Kubernetes clusters with one-click deployment, removing manual configuration of node pools, networking, and security settings while maintaining full Kubernetes API compatibility and CNCF conformance.</li>
<li style="font-weight:400;">The service includes automated scaling via <a href="https://karpenter.sh/">Karpenter</a> for nodes and built-in HPA/VPA/KEDA for pods, plus automatic patching, <a href="https://learn.microsoft.com/en-us/azure/azure-monitor/alerts/itsmc-overview">Azure Monitor integration</a>, and <a href="https://learn.microsoft.com/en-us/entra/identity/authentication/concept-authentication-methods">Microsoft Entra ID authentication</a> configured by default.</li>
<li style="font-weight:400;">Microsoft positions this as competing with GKE Autopilot and EKS Fargate by offering a fully managed experience while preserving Kubernetes extensibility, targeting both startups without dedicated DevOps teams and enterprises seeking standardized deployments.</li>
<li style="font-weight:400;">Key differentiators include <a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=Y2cM8pcz9MBOhRtr6zeMj6xIib6wLGR40r6OvJdubK2UpPY7Ttax_vGxRvAI9hb_nkW-GZssqTVGhwmAVwzCUwQJEFd3LCUVwQnhAU18UYYp21LL_4NFj9ra7a7cpHms.x8VPAlMAb3f6BghVbCNhWQ&amp;eddgt=rClQr4KrpOIhScfGwYeF_g%3D%3D&amp;rut=dd6294aa73eb7cc6cd0e8abb1c3531509b08ec9f33396dcbbeb3730ee559f31a&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8RYmsbi9Cdr9ve5kYRAQyQDVUCUxTXB2ooROeVpavEhWmF3X4D3RJi8i3hCAko5IbWf_GTSSEqbCTYWprUhJsDIsI0lXC4eiazwZVeUlq6hx75UcZLuA-3g96Hs5ACpMAG4NCLnB5BCRaV4-37UE8R2KY8_T-DXhOpQWQiufQhywiFc8ZBNptaSdwfC6FuQUbWv-s8b3waor4WV2SeTWAZShxuz_qyRG6o9gJ_eD1tQqaBdu3yzp-_-m1yzcHSZZoC7Db0eA0Q_JZJN_oKbfPJb3g8w2yNICOiF_8Ym8yrN8_0ZY003jPobRc-hNY2il6wwew4WrykPqQrfOJxvizOihiHuG6JLt4NAnCJqKNDgDZ8YhQR1stI27jtTbDMJUMtIz7ezGh3A4HBe1-O_Fpfx5AEOxKoaHaf0gcU6zYtTBaJOvm1KyoLOjahYGx0NFH3HimtkRyzVEXWMh-vTaWkmH8V4vDdN4ZDKX1CrfhfGrLtRUzwBw55L5grYwmjUKM3KZaA5vAwqj-kDzP6wAwHR2eLkOppCGWGFejHdoM1siuxNN2EBZp471DYCEVFw3Yuk_jxIVW_s4m6HYvG0J-UXCEYCjCwRh-uu888u6HHJmk2zii1MYg7o5tHuKzh-GUR8IKsg1RvtI3VhyoX_SMWfA-q5myxg9idIfciZITU0z77DzKWvmlhAPntwxvtnUKUSsKl2u0gDkCRiz05ateIlT95Nl3J_eZKg3yFFHDyECz39PfIQWrn3pw4heqmZe7A9ZPLw%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0MzklMjZjYW1wJTNkMTY4ODA3JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDE5MjElMjZjcml0ZXJpYWlkJTNka3dkLTc5NzE1MjAwMDQ1MjQ4JTNhbG9jLTE5MCUyNmNhbXBhaWduaWQlM2Q1OTAyMzczNTglMjZsb2NwaHklM2Q4MDcxNCUyNmFkZ3JvdXBpZCUzZDEyNzU0MzUxNDA0MTg0OTQlMjZjaWQlM2Q3OTcxNDgwMTQ2Nzk4MSUyNmtkdiUzZGMlMjZrZXh0JTNkJTI2a3BnJTNkJTI2a3BpZCUzZCUyNnF1ZXJ5U3RyJTNkQXp1cmUlMjUyMExpbnV4JTI2dXJsJTNkaHR0cHMlM2ElMmYlMmZhenVyZS5taWNyb3NvZnQuY29tJTJmZW4tdXMlMmZzb2x1dGlvbnMlMmZsaW51eC1vbi1henVyZSUzZmVmX2lkJTNkX2tfOGUwMTVlZjYzMGU2MTdhNjI3NDEzNjJhZmQ3NWEyNWVfa18lMjZPQ0lEJTNkQUlEY21tNWVkc3dkdXVfU0VNX19rXzhlMDE1ZWY2MzBlNjE3YTYyNzQxMzYyYWZkNzVhMjVlX2tfJTI2bXNjbGtpZCUzZDhlMDE1ZWY2MzBlNjE3YTYyNzQxMzYyYWZkNzVhMjVl%26rlid%3D8e015ef630e617a62741362afd75a25e&amp;vqd=4-112611093533843317793650952511292013823&amp;iurl=%7B1%7DIG%3D439AE311D7184B789C73D671C400F0F1%26CID%3D07ADA67F01146E803C90B00700886F00%26ID%3DDevEx%2C5046.1">Azure Linux</a> nodes by default, GPU support for AI workloads, and integration with Azure’s broader platform services, though pricing details aren’t specified beyond the “Automatic” tier selection during cluster creation.</li>
<li style="font-weight:400;">The service addresses the “Kubernetes tax” by automating day-two operations like upgrades and repairs, allowing teams to deploy directly from GitHub Actions while Azure handles infrastructure management automatically.</li>
</ul>
<p>44:38  Ryan – “Yeah, in my day job I’m doing a whole bunch of vulnerability reporting on the container structure. I’m like, half of these containers are just the Kubernetes infrastructure! It’s crazy.”</p>
<p>45:19 <a href="https://azure.microsoft.com/en-us/updates?id=503235">Generally Available: AKS Automatic </a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/azure/aks/intro-aks-automatic">AKS Automatic</a> removes the operational complexity of Kubernetes by automatically managing cluster configurations, security patches, and infrastructure tuning, allowing teams to focus on application development rather than cluster maintenance.</li>
<li style="font-weight:400;">This managed approach positions Azure against <a href="https://www.bing.com/ck/a?!&amp;&amp;p=3347124ae819a5776c611219eae1c569ea83105228dd57ef936404683a73791bJmltdHM9MTc1OTE5MDQwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=AWS+EKS&amp;u=a1aHR0cHM6Ly9kb2NzLmF3cy5hbWF6b24uY29tL2Vrcy9sYXRlc3QvdXNlcmd1aWRlL3doYXQtaXMtZWtzLmh0bWw">AWS EKS</a> and <a href="https://cloud.google.com/kubernetes-engine">Google GKE</a> by offering a more hands-off experience, though specific pricing and feature comparisons aren’t detailed in the announcement.</li>
<li style="font-weight:400;">Target customers include development teams new to Kubernetes or those with limited DevOps resources who need container orchestration without the steep learning curve and ongoing management overhead.</li>
<li style="font-weight:400;">The service integrates with existing Azure security and monitoring tools, providing automated security updates and reliability improvements without manual intervention.</li>
<li style="font-weight:400;">Organizations should evaluate whether the automated management trade-offs align with their control requirements and assess potential cost implications of this convenience layer over standard AKS.</li>
</ul>
<p>PLUS</p>
<p><a href="https://techcommunity.microsoft.com/blog/linuxandopensourceblog/aks-automatic-with-azure-linux/4454284">AKS Automatic with Azure Linux | Microsoft Community Hub</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aka.ms/AKSAutomatic/blog">AKS Automatic</a> is now GA and simplifies Kubernetes management by automatically handling cluster setup, node management, scaling, security, and networking while running on <a href="https://learn.microsoft.com/en-us/azure/azure-linux/intro-azure-linux">Azure Linux</a> by default, reducing operational overhead for developers and platform teams.</li>
<li style="font-weight:400;">Azure Linux provides a minimal attack surface with only essential packages for Kubernetes workloads, passes all CIS Level 1 benchmarks by default (the only AKS-supported distribution to do so), and includes <a href="https://www.nist.gov/itl/fips.cfm">FIPS</a> and <a href="https://www.fedramp.gov/">FedRAMP</a> compliance certifications.</li>
<li style="font-weight:400;">Performance improvements include faster cluster creation, upgrades, scaling, deletion, node provisioning, and pod startup due to Azure Linux’s reduced image footprint, with automatic patching that respects maintenance schedules and undergoes rigorous testing.</li>
<li style="font-weight:400;">This positions Microsoft to compete with AWS EKS and GCP GKE by offering a more automated Kubernetes experience with end-to-end support for the entire stack, targeting organizations that want Kubernetes benefits without the operational complexity.</li>
<li style="font-weight:400;">The service comes preconfigured with monitoring, scaling, security, and networking tools, supports all current and future AKS extensions and add-ons, and enables deployment from container image to production-ready application within minutes.</li>
</ul>
<p>45:45  <a href="https://azure.microsoft.com/en-us/updates?id=503408">Public Preview: Databricks One in Azure Databricks </a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/databricks/">Databricks One</a> consolidates data engineering, analytics, and AI development into a single platform within <a href="https://learn.microsoft.com/en-us/azure/databricks/admin/workspace-settings/manage-previews">Azure Databricks</a>, addressing the common challenge of fragmented data workflows across multiple tools and services.</li>
<li style="font-weight:400;">The platform introduces unified governance across all data operations, which could help enterprises meet compliance requirements while reducing the complexity of managing permissions and access controls across separate systems.</li>
<li style="font-weight:400;">This positions Azure Databricks more directly against AWS’s fragmented approach with EMR, Glue, and SageMaker, and GCP’s Dataproc and Vertex AI, by offering a more integrated experience for data teams.</li>
<li style="font-weight:400;">Target customers include enterprises struggling with data silos and organizations looking to accelerate their AI/ML initiatives without managing multiple platforms and governance frameworks.</li>
<li style="font-weight:400;">While pricing details aren’t provided in the preview announcement, consolidation typically reduces operational overhead but may increase platform lock-in considerations for organizations evaluating multi-cloud strategies.</li>
</ul>
<p>46:42 Justin – “So if you didn’t want this, you are going to get it forced on you at some point. </p>
<p>47:15 <a href="https://azure.microsoft.com/en-us/updates?id=503129">Public Preview: Azure HBv5-series VMs</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aka.ms/AzureHBv5_Signup">Azure HBv5-series VMs launch in preview</a> in the South Central US region, targeting memory bandwidth-intensive HPC workloads like computational fluid dynamics, automotive simulations, and weather modeling that require extreme memory throughput performance.</li>
<li style="font-weight:400;">These VMs represent Microsoft’s latest push into specialized HPC infrastructure, competing directly with AWS’s memory-optimized instances like X2gd and GCP’s M3 series for scientific computing and engineering simulation workloads.</li>
<li style="font-weight:400;">HBv5 instances likely feature AMD’s latest EPYC processors with enhanced memory bandwidth capabilities, though specific technical specifications aren’t provided in the preview announcement.</li>
<li style="font-weight:400;">Target customers include automotive manufacturers running crash simulations, aerospace companies modeling aerodynamics, and meteorological organizations processing weather prediction models that bottleneck on memory bandwidth rather than compute.</li>
<li style="font-weight:400;">Preview availability in a single region suggests Microsoft is testing performance and gathering feedback before broader deployment, with pricing details expected once general availability is announced.</li>
</ul>
<p>49:56 <a href="https://azure.microsoft.com/en-us/updates?id=503134">Public Preview: Azure Functions .NET 10 support </a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/azure-functions/dotnet-isolated-process-guide?tabs=ihostapplicationbuilder%2Cwindows#deploy-to-azure-functions">Azure Functions </a>adds .NET 10 support in public preview, allowing developers to leverage the latest .NET runtime improvements including better performance and reduced memory usage in their serverless applications.</li>
<li style="font-weight:400;">The upgrade requires updating the target framework and Microsoft.Azure.Functions.Worker.Sdk to version 2.0.5 or later, providing a straightforward migration path for existing .NET Functions projects.</li>
<li style="font-weight:400;">This positions Azure Functions competitively with <a href="https://aws.amazon.com/lambda/">AWS Lambda</a> which supports .NET 8, while <a href="https://cloud.google.com/functions">Google Cloud Functions</a> currently only supports .NET Core 3.1, giving Azure a temporary advantage for .NET developers.</li>
<li style="font-weight:400;">Enterprise customers running .NET workloads can now standardize on .NET 10 across their entire Azure stack, from App Service to Functions, simplifying dependency management and security patching.</li>
<li style="font-weight:400;">The preview status suggests general availability will likely arrive in early 2025, giving organizations time to test compatibility with their existing code before production deployment.</li>
</ul>
<p>51:00  Ryan – “I’m just happy to see .NET running in serverless workloads.” </p>
<p>Show note editor Heather adds “This is a NO time of the day research thing.” </p>
<p>53:30 <a href="https://azure.microsoft.com/en-us/updates?id=503034">Generally Available: High Scale mode for Azure Monitor – Container </a><a href="https://azure.microsoft.com/en-us/updates?id=503034">Insights </a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/azure-monitor/containers/container-insights-analyze">Azure Monitor Container Insights</a> now offers High Scale mode in general availability, enabling higher log collection throughput for <a href="https://learn.microsoft.com/en-us/azure/aks/what-is-aks">Azure Kubernetes Service</a> clusters that generate substantial logging volumes.</li>
<li style="font-weight:400;">This addresses a common pain point for enterprises running large-scale AKS deployments where standard Container Insights might struggle with log ingestion rates during peak loads or debugging scenarios.</li>
<li style="font-weight:400;">The feature positions Azure competitively against <a href="https://docs.aws.amazon.com/AmazonCloudWatch/latest/monitoring/ContainerInsights.html">AWS CloudWatch Container Insights</a> and <a href="https://cloud.google.com/blog/topics/developers-practitioners/introduction-google-clouds-operations-suite">GCP’s Operations suite</a>, particularly for organizations requiring robust observability at scale without custom log aggregation solutions.</li>
<li style="font-weight:400;">Target customers include enterprises with high-transaction microservices architectures, financial services running real-time processing, and any AKS workloads generating logs beyond standard collection limits.</li>
<li style="font-weight:400;">While Microsoft hasn’t detailed specific pricing changes, customers should evaluate whether the improved throughput justifies potential increased costs from higher log ingestion and storage volumes.</li>
</ul>
<p>54:17  Matt – “The same thing as CloudWatch, it’s so expensive to take logs into any of these platforms, but you gotta get them somewhere. So you kind of just are stuck paying for it.”</p>
<p>54:49 <a href="https://azure.microsoft.com/en-us/updates?id=500795">Generally Available: Confidential computing for Azure Database for </a><a href="https://azure.microsoft.com/en-us/updates?id=500795">PostgreSQL flexible server </a></p>
<ul>
<li style="font-weight:400;"><a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=my9jlqbF6uhdyX6otq9XJfbAXNcnuvuyToljdROxSbvuf7950qo7AbrfMwbEIrvh0L_4rKMSPC9SfLBJZ5P9xh2W80csHgFXbvfiWQcEMn2yLjICh_o6WWoPjW22Vw17.94LUdCnn3H_1V5bP-c9_Ow&amp;eddgt=62hIhOGo6wsArXsURtW1mw%3D%3D&amp;rut=b4e891a5b94938e882277834c4213e5a13dcda28f7a0f42b168e2567a382d358&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8Q46eitJuYNTiQRKj8D2E6zVUCUyaEIqzzv1ENKWM46Z0gymnbPj4lEc_-46E2gxELeUGIo_vGJA3BXbYKwB9wUeIZTGyhsUk97IQLWpPHkCDXIV9tXI8ZHyxAnu4ai2XuY5Sqr6COlVOHReGWFtWT_EG_s97VBg7eB49DR4jIjj65yvUzVzLCubJAz-SOVrT0xw6dETYpHbihVV2isOX5I8EIremn93OLEWxg1CDXz2FGXse-qKWH1at8878j3iB9O_WahfB23qorEKzVgmtRtiWhAN1YhS2cDEAqoaByzlr7EjnW68Ce_arPElAb8MWrvSet5E3h-NTzNqxRyRJILsn0WmCv5hzTZ4Oi30VZn1xOYOdKvixlY8K48vheZU4R-ZWWhBGxRj7m6c7q3Ru5tSCXSAatCXEwHgFwR_Dsvs3-1mEzPt9qfb_RFb7wROpWI6d9A75M81vFv3eZZjotcyYe1DbHWnm6baxFNYrRLpLs43MAnzWvejaooHWYvfHjAaOznOQPpeiz9s-vvXOsVwRTE20-RpsCS0q36auRiCJSAoA00I5j3tQPn7638LhkrGW--aKD7chU3050blSJIhlXN2PCrPYrm-JoufDW7N0pKz1VMUkcXrO1mNmTA1hqM2RwrOsFbfjvJg0PSNoWmnFn4dL__PefY9L46Nn2Q7Eka2SHkvBk_pbyvXkHOLRFFm86tdbwt0apQulIDEprgQD4OhkF8BupOp-MuGN1bTO0SzJOJgxtLu0AzmXKmAtgzvUgQ%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0MzklMjZjYW1wJTNkMTY4ODA3JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDE5MjElMjZjcml0ZXJpYWlkJTNka3dkLTc4ODIxODQ2NDc2OTE0JTNhbG9jLTE5MCUyNmNhbXBhaWduaWQlM2Q1OTAyMzczNTglMjZsb2NwaHklM2Q3Njk3OCUyNmFkZ3JvdXBpZCUzZDEyNjExNDE0ODk3MDI3ODUlMjZjaWQlM2Q3ODgyMTQ0ODU3MjY5NyUyNmtkdiUzZGMlMjZrZXh0JTNkJTI2a3BnJTNkJTI2a3BpZCUzZCUyNnF1ZXJ5U3RyJTNkQXp1cmUlMjUyMERhdGFiYXNlJTI1MjBmb3IlMjUyMFBvc3RncmVTUUwlMjZ1cmwlM2RodHRwcyUzYSUyZiUyZmF6dXJlLm1pY3Jvc29mdC5jb20lMmZlbi11cyUyZnByb2R1Y3RzJTJmbXlzcWwlMmYlM2ZlZl9pZCUzZF9rXzI2NGFmNzhlY2FiYjFkODFhMjU4OWY1MjFlNzRiYmJhX2tfJTI2T0NJRCUzZEFJRGNtbTVlZHN3ZHV1X1NFTV9fa18yNjRhZjc4ZWNhYmIxZDgxYTI1ODlmNTIxZTc0YmJiYV9rXyUyNm1zY2xraWQlM2QyNjRhZjc4ZWNhYmIxZDgxYTI1ODlmNTIxZTc0YmJiYQ%26rlid%3D264af78ecabb1d81a2589f521e74bbba&amp;vqd=4-133719560982294937616715883868076601037&amp;iurl=%7B1%7DIG%3DE384975F61FB4F708A6EF3446B19DA0A%26CID%3D0771B8B8A212627708B6AEC0A37D633D%26ID%3DDevEx%2C5045.1">Azure Database for PostgreSQL</a> now supports <a href="https://learn.microsoft.com/en-us/azure/confidential-computing/overview">confidential computing</a> through hardware-based trusted execution environments (TEEs), ensuring data remains encrypted even during processing and preventing unauthorized access from cloud administrators or malicious insiders.</li>
<li style="font-weight:400;">This positions Azure competitively against <a href="https://aws.amazon.com/ec2/nitro/nitro-enclaves/">AWS Nitro Enclaves</a> and <a href="https://security.googlecloudcommunity.com/community-blog-42/protecting-your-data-why-confidential-computing-is-necessary-for-your-business-4007">Google Confidential Computing</a>, particularly for regulated industries like healthcare and finance that require cryptographic verification of their database environments.</li>
<li style="font-weight:400;">The feature leverages Intel SGX or AMD SEV technologies to create isolated compute environments, though customers should expect performance overhead of 10-20% and potential limitations on certain PostgreSQL extensions.</li>
<li style="font-weight:400;">Primary use cases include multi-tenant SaaS applications processing sensitive customer data, compliance with data residency requirements, and organizations needing to demonstrate zero-trust security models to auditors.</li>
<li style="font-weight:400;">Pricing follows standard PostgreSQL flexible server rates with an additional premium for confidential computing instances, making it cost-effective for high-value workloads but potentially expensive for general-purpose databases.</li>
</ul>
<p>57:11 <a href="https://techcommunity.microsoft.com/blog/microsoftdatamigration/announcing-the-azure-database-migration-service-hub-experience/4454900">Announcing the Azure Database Migration Service Hub Experience | </a><a href="https://techcommunity.microsoft.com/blog/microsoftdatamigration/announcing-the-azure-database-migration-service-hub-experience/4454900">Microsoft Community Hub</a></p>
<ul>
<li style="font-weight:400;"><a href="https://azure.microsoft.com/en-us/products/database-migration/">Azure Database Migration Service Hub</a> provides a centralized dashboard for discovering, assessing, and tracking SQL Server migrations to Azure, addressing the complexity of managing multiple migration projects across enterprise environments.</li>
<li style="font-weight:400;">The service automatically discovers SQL Servers in your environment and provides readiness assessments, helping organizations prioritize which databases to migrate first based on dependencies and potential blockers.</li>
<li style="font-weight:400;">Microsoft plans to expand beyond SQL Server to support multi-RDBMS migrations and add real-time migration tracking with status monitoring, error reporting, and completion metrics directly in the dashboard.</li>
<li style="font-weight:400;">This positions Azure competitively against AWS Database Migration Service and Google Database Migration Service by offering a more integrated assessment phase, though AWS currently supports more source database types out of the box.</li>
<li style="font-weight:400;">The Hub experience targets enterprises consolidating data centers or modernizing legacy SQL Server deployments, with the dashboard particularly useful for teams managing dozens or hundreds of database migrations simultaneously.</li>
</ul>
<p>57:57  Ryan – “It’s a great play by Azure. They have a huge advantage in this space and I think there is a desire by a lot of companies to get out of legacy deployments, so it’s smart. Hurry up with the features.”</p>
<p>58:19 <a href="https://azure.microsoft.com/en-us/updates?id=503286">Public Preview: Azure Managed Service for Prometheus now includes </a><a href="https://azure.microsoft.com/en-us/updates?id=503286">native Grafana dashboards within the Azure portal</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/azure-monitor/metrics/prometheus-metrics-overview">Azure Managed Service for Prometheus</a> now embeds <a href="https://aka.ms/DashboardsWithGrafanaDocs">Grafana</a> dashboards directly in the Azure portal at no additional cost, eliminating the need to manage separate Grafana instances for basic visualization needs.</li>
<li style="font-weight:400;">This integration reduces operational overhead by providing out-of-the-box dashboards for common Azure services while maintaining compatibility with existing <a href="https://prometheus.io/docs/prometheus/latest/querying/basics/">Prometheus query language</a> (PromQL) workflows.</li>
<li style="font-weight:400;">The feature positions Azure competitively against <a href="https://docs.aws.amazon.com/prometheus/latest/userguide/what-is-Amazon-Managed-Service-Prometheus.html">AWS Managed Service for Prometheus</a> which requires separate <a href="https://aws.amazon.com/grafana/">Amazon Managed Grafana</a> instances, though GCP’s Cloud Monitoring already offers integrated visualization.</li>
<li style="font-weight:400;">Target users include DevOps teams and platform engineers who need quick metric visualization without the complexity of managing dedicated Grafana infrastructure, particularly useful for Azure-native workloads.</li>
<li style="font-weight:400;">While this simplifies basic monitoring scenarios, organizations with complex visualization requirements or multi-cloud deployments will likely still need standalone Grafana instances for advanced customization.</li>
</ul>
<p>58:54  Justin – “I look forward to the arguments between ‘well the Azure monitoring says this, but the Grafana monitoring says this’ and it’s in the same dashboard.” </p>
<p>1:00:01 <a href="https://azure.microsoft.com/en-us/updates?id=501915">Generally Available: At-cost data transfer between Azure and an external </a><a href="https://azure.microsoft.com/en-us/updates?id=501915">endpoint</a></p>
<ul>
<li style="font-weight:400;">Azure now offers at-cost data transfer for customers moving data from Azure to external endpoints via the internet in Europe, eliminating the typical egress fees that can make multi-cloud or hybrid strategies expensive.</li>
<li style="font-weight:400;">This move directly addresses vendor lock-in concerns by reducing the financial barriers to data portability, making it easier for European customers to adopt multi-cloud architectures or migrate workloads between providers.</li>
<li style="font-weight:400;">The feature appears limited to European regions and CSP partners initially, suggesting Microsoft is responding to EU regulatory pressure around data sovereignty and cloud provider switching costs.</li>
<li style="font-weight:400;">Unlike AWS and GCP which still charge standard egress fees for most data transfers, this positions Azure as more open to hybrid and multi-cloud scenarios, though the geographic limitation reduces its competitive impact.</li>
<li style="font-weight:400;">Enterprise customers running hybrid workloads or needing to regularly sync large datasets between Azure and on-premises systems will see immediate cost benefits, particularly for backup, disaster recovery, and data lake scenarios.</li>
</ul>
<p>1:01:11 <a href="https://azure.microsoft.com/en-us/updates?id=503617">Generally Available: Introducing the new Network Security Hub </a><a href="https://azure.microsoft.com/en-us/updates?id=503617">experience</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/firewall-manager/overview">Azure Firewall Manager</a> has been rebranded as <a href="https://learn.microsoft.com/en-us/azure/event-hubs/network-security-perimeter">Network Security Hub</a>, consolidating <a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=jATNAn2xcVH2sTd16G051RS_6_ILxx-wLX5ncP2NldCmYpHHoIhQcswtzyyFRhcSQkhnX-dK0K3FnAQHzN5fU7gAcwMqY5gzNYZ5p7qcmWAHVosS_3zhW9cTVpfke62K.DuRd54we1XnZxpWzu4o7ew&amp;eddgt=TIh8s08supcKhYcfwp6AKw%3D%3D&amp;rut=5d8e0035b6858d14792b530d3e216559914c9f75e8a57a497ac37ac83cda81c4&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8bP_eBhGiyWhnYtglHo3vsTVUCUxX6fXR1jLZ9Hn3Ef7I6TenNwpzDQay5lHfcCQW2UePzcoVyQISu2h563aK_eN5_FlCHwb02XrJaDC4BbE2IiAl0unB5sguFqD4qjnaI_vipAcJne7KegT4RcJ_qUwzGUtqIGyYS4hpCh9Q_kCeC51UK1uGhgr-0Hz2-Tiq_tX95VFAjTZbMoKJM-q_YDZgsjb9rAlJgnsRDBDAjdkxrlYt3NX1o8P5RYHF2sWhzztp_XqMqKJtuVg8KhGGwHsYilKw0VdIM3LiDe08PFoyre3RG3mRifVJvCxDu34l7-Jf26BnbvnquLPuoSzm4btwlzYYXMuIPNQTJsS1gvTp2zZU0dO_-CwC1x7DEvjN48H5noPaesR75YO2tBQvZHM13yYAshOowfV1cC0xTr-WL20dCIMxcms9s4cBMrItqFHMX5nzGwOo5LOB-FOA0yXgnUnlNeqgG93hIPNimv6ywxyIQbcnnkm3UwWF72P_KPhd0qlx_Q87a0SN0oc-HioUSOszz7lpOc-Mu4ucU_OcpmohMFq2rHMtBW347D-MQ1mC4cTBOJj3R0QhUCWGioy25H-o2k1ptXv_hp2kyTUzUhwS7rqCtn8Vukdzml0hGJ35_EUA9JVPPP0ar3oucOfUmXFkuzl5JxilNnQcs31hGuVreFB5Yk4gliOORxZeoyEqlaiKkLnFXwz3ncrjjg-JvKwCls3uklWdI3KXIzgwfnRqiF6--nZyWds4Y9gZH7mfAA%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0MzklMjZjYW1wJTNkMTY4ODA3JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDE5MjElMjZjcml0ZXJpYWlkJTNka3dkLTc4NzUzMTI4NDk1Mjg2JTNhbG9jLTE5MCUyNmNhbXBhaWduaWQlM2Q1OTAyMzczNTglMjZsb2NwaHklM2Q4MDcxNCUyNmFkZ3JvdXBpZCUzZDEyNjAwNDE5Nzc5Njk1ODQlMjZjaWQlM2Q3ODc1MjcyODM0ODc4NiUyNmtkdiUzZGMlMjZrZXh0JTNkJTI2a3BnJTNkJTI2a3BpZCUzZCUyNnF1ZXJ5U3RyJTNkQXp1cmUlMjUyMEZpcmV3YWxsJTI2dXJsJTNkaHR0cHMlM2ElMmYlMmZhenVyZS5taWNyb3NvZnQuY29tJTJmZW4tdXMlMmZwcm9kdWN0cyUyZmF6dXJlLWZpcmV3YWxsJTJmJTNmZWZfaWQlM2Rfa183YzUwMjI4YzQ3NjExZDQ4ZjYwNzRhNTA5ZmIxOWM5ZF9rXyUyNk9DSUQlM2RBSURjbW01ZWRzd2R1dV9TRU1fX2tfN2M1MDIyOGM0NzYxMWQ0OGY2MDc0YTUwOWZiMTljOWRfa18lMjZtc2Nsa2lkJTNkN2M1MDIyOGM0NzYxMWQ0OGY2MDc0YTUwOWZiMTljOWQ%26rlid%3D7c50228c47611d48f6074a509fb19c9d&amp;vqd=4-64297528687563390502861678119025886612&amp;iurl=%7B1%7DIG%3DD3FB434F92794BAFA04EFAD14F481D75%26CID%3D1A1D9825AC796B172AB98E5DADCC6AA3%26ID%3DDevEx%2C5045.1">Azure Firewall</a>, <a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=nTophlrnyFT4hGUjci1_6-q3KdE2GtiwRrmQQlXStyq5jSNwfyzy4Bo5NGlBoRv9k1QdLul7KL28RWphmn2NpDYrW7umIG2ItViD8DwC2x2z0I90TycYRbZq8dv_v_Mk.E-S9ej3fKwLODNtdS-tzzQ&amp;eddgt=ly6F_vCSt0wCqSor2IMg1A%3D%3D&amp;rut=c18a04fc16e50a33fe882286ac00884c8968c6a5dfb7adc1785778a4105acaa8&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8glH3EwTb_9m7aFdAALLawjVUCUwQYtN5xg1gWsSxSTk8a-xux6suTVtSbdwvxluHZ6pn12F6QA9UZE7YatPiDtgyj3B4KkyV-8WrSPuZns_yTtx3yDRCaz1buw4rnJ4NrvOOTcn-JSrmLSSzDwxMK90MvULrRok0X_tkj9mW4ontgLVeXZ6iuFELnViI3gIDI8f75BeSfTuS_WPxbrADuBI5p4zn051fY0EDdSEJRS9i9RHGKMQr7XBcxwOEdQdUqKydSUTQxRSK0ARD8xo52syHe1oSRgsWo6EJBDzZb-FWo5EYOPNczv_g0uztrZTLNJNybwcQ6MIw6I254CkhPNcbBCdgNOKXsk5rjYS-4Rjr-l2LgnK8hSMfWxdIsInyIitr7NtQ7QhlJ3scdQ2jTduIii7vWMyyxpecaDzyzRiLQc40FYA6gKgxb0VaoYWztHNL8JlUe1AaW5heudkpMrUfrQfep3r1uN1dTQWc8cVjiACx_XmK30eWdOdXgADiGygNTQqaA8OAz0MAtTpk_iCeRUxJQpDqfNCiQ8WWalqYtRI99Ly285V7jS4x5K7pBaYVJKbEALmKCbP9KUdHHkH9tdeloZ-M7QNxhJXo0NsvKjV65X5GHNVL8OAo-ShCrNPpF_C8XoY84KDn3VyPQlWVKCu1cdLpS8r5hRm-tyyztHkyVrMKKfLcTZ7dpUPQzPecTraD4e1AUb7i7xwVrYwKSUYP2zlnQ0rBycOyoOCPy00AzivvVfK_ue_awG-gRZwivg%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0MzklMjZjYW1wJTNkMTY4ODA3JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDE5MjElMjZjcml0ZXJpYWlkJTNka3dkLTc5MDk2NzI1NzM0NTI3JTNhbG9jLTE5MCUyNmNhbXBhaWduaWQlM2Q1OTAyMzczNTglMjZsb2NwaHklM2Q3OTcyNCUyNmFkZ3JvdXBpZCUzZDEyNjU1Mzk1MzU5Njk3MTclMjZjaWQlM2Q3OTA5NjMyNjY0OTI3MyUyNmtkdiUzZGMlMjZrZXh0JTNkJTI2a3BnJTNkJTI2a3BpZCUzZCUyNnF1ZXJ5U3RyJTNkQXp1cmUlMjUyMFdlYiUyNTIwQXBwbGljYXRpb24lMjUyMEZpcmV3YWxsJTI1MjBXQUYlMjZ1cmwlM2RodHRwcyUzYSUyZiUyZmF6dXJlLm1pY3Jvc29mdC5jb20lMmZlbi11cyUyZnByb2R1Y3RzJTJmd2ViLWFwcGxpY2F0aW9uLWZpcmV3YWxsJTJmJTNmZWZfaWQlM2Rfa183NzlhNDM3ZGU0MWExZTcxZDU1MzJiYTEzY2ZhMDRhZV9rXyUyNk9DSUQlM2RBSURjbW01ZWRzd2R1dV9TRU1fX2tfNzc5YTQzN2RlNDFhMWU3MWQ1NTMyYmExM2NmYTA0YWVfa18lMjZtc2Nsa2lkJTNkNzc5YTQzN2RlNDFhMWU3MWQ1NTMyYmExM2NmYTA0YWU%26rlid%3D779a437de41a1e71d5532ba13cfa04ae&amp;vqd=4-199202037092047928103559757033036276179&amp;iurl=%7B1%7DIG%3DA07C08B5C3C7449589027DA53CB5596D%26CID%3D0167336C8EF6637A2E6E25148F6A62B6%26ID%3DDevEx%2C5045.1">Web Application Firewall (WAF)</a>, and <a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=OJRZKJeGNVh28sMvQ95AAbMrxKHxP5CA20d_MTnhhxYMTRUT5XLU1vNIAjRnSSO8K9DEu2gSjXftwdoqtB5d99kqj2aahrzsklMLc85KG6DiSRovapAj01Yg9PKgCTRl.N3AFou0kcXiIZfO_FaGf7A&amp;eddgt=-d5rw3GTJWM_CaxJtEykVA%3D%3D&amp;rut=d2ccc10fde70c14115d79b39151925f7d294aecf759d22fe217443ccdb0c4349&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8VwZ0YxQKIXUCDxkMI8RPYjVUCUz0en8ETwJIiwgHHheAzSWWcxm-_tOBsI97GFOwT3cm_rVYG73NaVUAmM_rqYyOpfN4HTvYKdEtbifs8N5hf4FxOVReAdlhDBmLP38TZEwNZEmM-xP6VdPnGmuFQOmi-kmfsZweW2CX1Dp36kBJPfCKB5HwhXnj1zV5SsCYY4ZVoRDyC1dslksXUBCYW44AVmAsnXHeFJp5L7jj3js5WMsR-kQpk9D3DZ_SE4oKzy6az9xrbovRdyL-QU99l1qs8EENXxuJGo-jTcKSFM7QlbYQs5iju1aG306LsNYPbgf6MEG9w2zLBU1GenqKnpV4Uvj0hhdDYKZI4s3mYq4-lm9iaSEt_oDa5UGjSG6cVDbFkuAbrT2PUTIntzyk-4VyQHdOc9FNqYYYNFGtJuRd0fxcuLEMNPKQlAmnnkvBUPVT6yvpUwaGiNG-CzrRQfnoEf6m2Zcmey4dnH9UFvT43-Pq0_EoDwJYxnMDvpZcD4WdQVgs1CWDGgdn2GH1xR9b2SwOhV5_BJl1U7_VGfeSUJuxJybtz5oDX6qk0lv6sikVqnAxWHuMVPzeAcH5sh64CwVKOlSeo309ebq9LENKFWErswvfq43k7K2wY62w3Qovfd84GuNt1OZ_FxYZvItcHA5GX62kE2QHKMlDy-aeEyb4G4iTq0_N1lGoLBUBa9iFkX66gJM7nzKVkE5p7IggW7f2_LEuZTykGTblaftwQBURm2U0WLU7fzGi8O6Fj-TPfA%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0MzklMjZjYW1wJTNkMTY4ODA3JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDE5MjElMjZjcml0ZXJpYWlkJTNka3dkLTc4ODIxODQ2NDc3MDU0JTNhbG9jLTE5MCUyNmNhbXBhaWduaWQlM2Q1OTAyMzczNTglMjZsb2NwaHklM2Q3ODk2MiUyNmFkZ3JvdXBpZCUzZDEyNjExNDE0ODk3MDI5MTMlMjZjaWQlM2Q3ODgyMTQ0ODU3MjcxMiUyNmtkdiUzZGMlMjZrZXh0JTNkJTI2a3BnJTNkJTI2a3BpZCUzZCUyNnF1ZXJ5U3RyJTNkQXp1cmUlMjUyMEREb1MlMjUyMFByb3RlY3Rpb24lMjZ1cmwlM2RodHRwcyUzYSUyZiUyZmF6dXJlLm1pY3Jvc29mdC5jb20lMmZlbi11cyUyZnByb2R1Y3RzJTJmZGRvcy1wcm90ZWN0aW9uJTJmJTNmZWZfaWQlM2Rfa18wMjMwMTdlMTVhNTkxMjllODJkZjdmM2Q4MmIyYTAyYl9rXyUyNk9DSUQlM2RBSURjbW01ZWRzd2R1dV9TRU1fX2tfMDIzMDE3ZTE1YTU5MTI5ZTgyZGY3ZjNkODJiMmEwMmJfa18lMjZtc2Nsa2lkJTNkMDIzMDE3ZTE1YTU5MTI5ZTgyZGY3ZjNkODJiMmEwMmI%26rlid%3D023017e15a59129e82df7f3d82b2a02b&amp;vqd=4-45676181548487530578080481973275330070&amp;iurl=%7B1%7DIG%3D98080DFEB5F94CFDAC0E26777FCF0786%26CID%3D1A2FD2E89331609F3E44C490925E61CB%26ID%3DDevEx%2C5045.1">DDoS Protection</a> into a single management interface for simplified security operations.</li>
<li style="font-weight:400;">This centralization addresses a common pain point where customers had to navigate multiple portals to manage different security services, now providing unified policy management and monitoring across network security tools.</li>
<li style="font-weight:400;">The hub approach aligns Azure more closely with AWS Security Hub’s consolidated view, though Azure’s implementation focuses specifically on network security rather than broader security posture management.</li>
<li style="font-weight:400;">Primary use cases include enterprises managing complex multi-region deployments who need consistent security policies across Azure Firewall instances, WAF rules, and DDoS protection settings from one location.</li>
<li style="font-weight:400;">While pricing remains unchanged for the underlying services, the consolidated management experience should reduce operational overhead and the time required to implement and audit security policies across Azure environments.</li>
</ul>
<p>1:01:51  Matt – “From my preliminary research, it’s just a nice gooey update that they’ve done to kind of make it be a little bit cleaner. It looks like it’s easier to manage some of these things just with Terraform across the way, but, you know, they’re trying to make this be better for companies at a larger scale.”</p>
<p>1:02:32 <a href="https://blog.fabric.microsoft.com/en-GB/blog/september-2025-fabric-feature-summary/">Fabric September 2025 Feature Summary | Microsoft Fabric Blog | </a><a href="https://blog.fabric.microsoft.com/en-GB/blog/september-2025-fabric-feature-summary/">Microsoft Fabric</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.microsoft.com/en-us/microsoft-fabric/resources/data-101/what-is-fabric">Microsoft Fabric’s</a> September 2025 update delivers over 100 new features across data engineering, analytics, and AI workloads, with key additions including general availability of governance APIs, Purview data protection policies, and native support for pandas DataFrames in User Data Functions that leverage Apache Arrow for improved performance.</li>
<li style="font-weight:400;">The new <a href="https://www.nimblelearn.com/blog/8-things-you-should-know-about-mcp-in-microsoft-fabric/">Fabric MCP (Model Context Protocol)</a> server enables AI-assisted code generation directly within VS Code and GitHub Codespaces, while the open-sourced <a href="https://learn.microsoft.com/en-us/fabric/data-engineering/api-graphql-local-model-context-protocol">Fabric CLI</a> and new <a href="https://learn.microsoft.com/en-us/fabric/extensibility-toolkit/overview-story">Extensibility Toolkit</a> allow developers to build custom Fabric items in hours rather than days using <a href="https://copilot.microsoft.com/">Copilot</a>-optimized starter kits.</li>
<li style="font-weight:400;">Real-time intelligence capabilities expand significantly with Maps visualization for geospatial data, 10x performance boost for Activator (now supporting 10,000 events per second), and direct <a href="https://learn.microsoft.com/en-us/azure/azure-monitor/logs/data-platform-logs">Azure Monitor Logs</a> integration via Eventstream, positioning Fabric as a comprehensive alternative to standalone analytics platforms.</li>
<li style="font-weight:400;">Data Factory introduces simplified “pipelines” branding, adds 20+ new connectors including Google BigQuery and Oracle, and enables workspace-level workload assignment, allowing teams to add capabilities without tenant-wide changes while maintaining governance controls.</li>
<li style="font-weight:400;">Database mirroring extends to <a href="https://cloud.google.com/bigquery">Google BigQuery</a> and Oracle with near real-time replication into <a href="https://learn.microsoft.com/en-us/fabric/onelake/onelake-overview">OneLake</a>, plus VNET and on-premises gateway support for secure connectivity, enabling organizations to unify multi-cloud and hybrid data estates without complex ETL processes.</li>
</ul>
<p>1:03:30  Justin – “I appreciate all this Fabric stuff; Fabric is Azure’s Q.” </p>
<p>1:04:09 <a href="https://www.geekwire.com/2025/microsoft-tames-intense-chip-heat-with-ai-designed-liquid-cooling-veins-inspired-by-biology/">Microsoft tames intense chip heat with liquid cooling veins, designed by </a><a href="https://www.geekwire.com/2025/microsoft-tames-intense-chip-heat-with-ai-designed-liquid-cooling-veins-inspired-by-biology/">AI and inspired by biology – GeekWire</a></p>
<ul>
<li style="font-weight:400;">Microsoft developed AI-designed microfluidic cooling that brings liquid coolant directly inside processors through vein-like channels, enabling servers to run hotter and faster through overclocking while handling spiky workloads like Teams meetings without needing excess idle capacity.</li>
<li style="font-weight:400;">The cooling system is up to 3x more effective than current cold plates at removing heat from chips’ hottest spots, which can have heat density comparable to the sun’s surface, and Microsoft plans to integrate this into future Azure Cobalt chips and Maia AI accelerators.</li>
<li style="font-weight:400;">This positions Microsoft to compete more effectively with AWS and Google in AI workloads by reducing the number of servers needed while improving performance, addressing the industry challenge of either overbuilding capacity or risking performance issues during peak demand.</li>
<li style="font-weight:400;">Microsoft is making this an industry standard through partnerships, potentially enabling future 3D chip stacking architectures where coolant flows between silicon layers – a development that could significantly advance computing capabilities beyond current limitations.</li>
<li style="font-weight:400;">The company also announced partnerships with Corning and Heraeus for hollow core fiber production to reduce data center latency, and with Stegra for green steel that cuts carbon emissions by 95% in datacenter construction.</li>
</ul>
<p>1:05:13  Ryan- “Necessity is the mother of all innovation, right? And so this is not only as trying to offset carbon credits, but it’s also all the demand for AI and more compute – and less space and less power and water. So I think it’s neat to see innovations come out of that, and the way they make the sound just makes it seem like sci-fi, which is cool.”</p>
<p>1:06:18 <a href="https://azure.microsoft.com/updates?id=501017">Generally Available: Application Gateway upgrades with no performance </a><a href="https://azure.microsoft.com/updates?id=501017">impact</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/application-gateway/overview">Azure Application Gateway</a> now maintains full capacity during upgrades by automatically provisioning new gateway instances, eliminating the performance degradation that previously occurred during maintenance windows.</li>
<li style="font-weight:400;">This zero-downtime upgrade capability addresses a common pain point where load balancers would operate at reduced capacity during updates, potentially causing slowdowns for high-traffic applications.</li>
<li style="font-weight:400;">The feature puts Azure on par with <a href="https://docs.aws.amazon.com/elasticloadbalancing/latest/application/introduction.html">AWS Application Load Balancer</a> and <a href="https://cloud.google.com/load-balancing">Google Cloud Load Balancing</a>, both of which have offered hitless upgrades for several years.</li>
<li style="font-weight:400;">Enterprise customers running mission-critical workloads will benefit most, as they no longer need to schedule maintenance windows or over-provision capacity to handle upgrade periods.</li>
<li style="font-weight:400;">While the announcement doesn’t specify additional costs, the automatic provisioning of temporary instances during upgrades may result in brief periods of increased compute charges.</li>
</ul>
<p>1:07:10  Matt – “About two years ago they added the feature called Mac Surge, which is when you have a scale set, you add a node and then you delete it. So here, they are adding their app gateways; so essentially if you have 10, you would go to 11 and then you would remove one of the original ones. And they essentially are just leveraging that as part of the app gateways… But if you’re also auto scaling, which if you have the app that can handle that, you don’t control your nodes. So you would just lose capacity at one point. So it’s one of those quality of life improvements.</p>
<h2>Oracle</h2>
<p>1:08:27 <a href="https://www.oracle.com/news/announcement/blog/oracle-sets-the-standard-in-enterprise-ai-2025-09-18/">Oracle Sets The Standard In Enterprise Ai</a></p>
<ul>
<li style="font-weight:400;">Oracle announced comprehensive AI capabilities across its cloud platform, positioning itself as the enterprise AI standard with integrated solutions spanning infrastructure to applications.</li>
<li style="font-weight:400;">Oracle’s AI strategy centers on three pillars: AI infrastructure with NVIDIA GPUs and OCI Supercluster, embedded AI in all SaaS applications, and custom AI development tools – a vertical integration play that AWS and Azure don’t match but may lock customers deeper into Oracle’s ecosystem.</li>
<li style="font-weight:400;">The company claims 50+ AI features across <a href="https://www.oracle.com/">Oracle Cloud Application</a> including supply chain optimization and financial forecasting, though specific performance metrics or customer adoption rates weren’t disclosed, making it difficult to assess real-world impact versus marketing.</li>
<li style="font-weight:400;"><a href="https://www.oracle.com/artificial-intelligence/data-science/">OCI Data Science</a> platform now includes automated ML capabilities and pre-built models for common enterprise tasks, competing directly with <a href="https://aws.amazon.com/sagemaker/">AWS SageMaker</a> and Azure ML but arriving years later to market with unclear differentiation beyond Oracle database integration.</li>
<li style="font-weight:400;">Oracle emphasizes “responsible AI” with built-in governance and explainability features, addressing enterprise concerns about AI transparency – though implementation details and how this compares to competitors’ AI governance tools remain vague.</li>
<li style="font-weight:400;">The integrated approach from infrastructure to applications could simplify AI adoption for existing Oracle customers, but may struggle to attract new enterprises already invested in hyperscaler AI platforms unless pricing is significantly competitive.</li>
</ul>
<p>1:09:42  Justin – “The best thing about this article is they basically imply that they invented AI.”</p>
<h2>After Show</h2>
<p>1:21:40 <a href="https://www.oreilly.com/radar/prompt-engineering-is-requirements-engineering/">Prompt Engineering Is Requirements Engineering – O’Reilly</a></p>
<ul>
<li style="font-weight:400;">Prompt engineering is fundamentally requirements engineering applied to AI interactions – the same communication challenges that have plagued software development since the 1960s NATO conference now appear when working with AI models to generate code or solutions.</li>
<li style="font-weight:400;">Context engineering emerges as a critical skill for cloud developers using AI tools – determining what information to include in prompts (surrounding code, test inputs, design constraints) directly impacts output quality, similar to how requirements scope has always affected project success.</li>
<li style="font-weight:400;">The shift from static documentation to iterative refinement mirrors Agile’s evolution – just as user stories replaced heavyweight specifications, prompt engineering requires continuous conversation with AI rather than single-shot commands, though AI won’t ask clarifying questions like human teammates.</li>
<li style="font-weight:400;">Cloud-based AI services amplify traditional requirements failures – when AI generates code directly from natural language without the structured syntax guardrails, small variations in problem framing can produce significantly different outputs that look plausible but fail in practice.</li>
<li style="font-weight:400;">Organizations falling into the “prompt library trap” repeat 1990s template mistakes – standardized prompts can’t replace the core skill of understanding and communicating intent, just as perfect requirements templates never guaranteed successful software delivery.</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2161682/c1e-k5d5sgp82rh5gmqm-ndvwmr50u411-zgtzhx.mp3" length="158096694"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 323 of The Cloud Pod, where the forecast is always cloudy! Justin, Matt and Ryan are in the studio tonight to bring you all the latest in cloud and AI news! This week we have a close call from Entra, some DeepSeek news, Firestore, and even an acquisition! Make sure to stay tuned for the aftershow – and Matt obviously falling asleep on the job. Let’s get started! 
Titles we almost went with this week

When One Key Opens Every Door: Microsoft’s Close Call with Cloud Catastrophe
Bedrock Goes Qwen-tum: Alibaba’s Models Join the AWS Party
DeepSeek and You Shall Find V3.1 in Bedrock
GPUs of Unusual Size? I Don’t Think They Exist (Narrator: They Do)
Kubernetes Without the Kubernightmares
Firestore and Forget: AI Takes the Wheel SCPs Get Their Full License: IAM Language Edition
Do What I Meant, Not What I Prompted
Atlassian Pays a Billion to DX the Developer Experience
Entra at Your Own Risk: The Azure Identity Crisis That Almost Was
Oracle Intelligence: The AI Nobody Asked For
Wisconsin Gets Cheesy with AI: Microsoft’s Dairy State Datacenter 
Azure Opens the Data Floodgates (But Only in Europe)
PostgreSQL Gets a Security Blanket and Won’t Share Its TEEs
Microsoft’s New Cooling System Has Veins Like a Leaf and Runs Hotter Than Your Gaming PC
Azure Gets Cold Feet About Hot Chips, Decides to Go With the Flow


AI Is Going Great – Or How ML Makes Money 
00:58 Google and Kaggle launch AI Agents Intensive course

Google and Kaggle are launching a 5-day intensive course on AI agents from November 10-14. 
This follows their GenAI course that attracted 280,000 learners, with curriculum covering agent architectures, tools, memory systems, and production deployment.
The course focuses on building autonomous AI agents and multi-agent systems, which represents a shift from traditional single-model AI to systems that can independently perform tasks, make decisions, and interact with tools and APIs.
This development signals growing enterprise interest in AI agents for cloud environments, where autonomous systems can manage infrastructure, optimize resources, and handle complex workflows without constant human intervention.
The hands-on approach includes codelabs and a capstone project, indicating Google’s push to democratize agent development skills as businesses increasingly need engineers who can build production-ready autonomous systems.
The timing aligns with major cloud providers racing to offer agent-based services, as AI agents become essential for automating cloud operations, customer service, and business processes at scale.
Interested in registering? You can do that here. 

Cloud Tools 
03:21 Atlassian acquires DX, a developer productivity platform, for $1B

Atlassian is acquiring DX, a developer productivity ana...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2161682/c1a-k5d5-z3pzj1mvu71m-abk35p.jpg"></itunes:image>
                                                                            <itunes:duration>01:22:16</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2161682/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[324: Clippy’s Revenge: The AI Assistant That Actually Works - Sort Of]]>
                </title>
                <pubDate>Thu, 09 Oct 2025 13:00:34 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2161684</guid>
                                    <link>https://tcpfm.castos.com/episodes/324-clippys-revenge-the-ai-assistant-that-actually-works-sort-of</link>
                                <description>
                                            <![CDATA[<h3>Welcome to episode 324 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Jonathan are your hosts, bringing you all the latest news and announcements in Cloud and AI. This week we have some exec changes over at Oracle, a LOT of announcements about Sonnet 4.5, and even some marketplace updates over at Azure! Let’s get started. </h3>
<h3>Titles we almost went with this week
</h3>
<ul>
<li>Oracle’s Executive Shuffle: Promoting from Within While Chasing from Behind</li>
<li>Copilot Takes the Wheel on Your Legacy Code Highway</li>
<li>Queue Up for GPUs: Google’s Take-a-Number Approach to AI Computing</li>
<li>License to Bill: Google’s 400% Markup Grievance</li>
<li>Autopilot Engages: GKE Goes Full Self-Driving Mode</li>
<li>SQL Server Finally Gets a Lake House Instead of a Server Room</li>
<li>Microsoft Gives Office Apps Their Own AI Interns</li>
<li>Claude and Present Danger: The AI That Codes for 30 Hours Straight</li>
<li>The Claude Father Part 4.5: An Offer Your Code Can’t Refuse</li>
<li>CUD You Believe It? Google Makes Discounts Actually Flexible</li>
<li>ECS Goes Full IPv6: No IPv4s Given</li>
<li>Breaking News: AWS Finally Lets You Hit the Emergency Stop Button</li>
<li>One Marketplace to Rule Them All</li>
<li>BigQuery Gets a Crystal Ball and a Chatty Friend</li>
<li>Azure’s September to Remember: When Certificates and Allocators Attack</li>
<li>Shall I Compare Thee to a Sonnet? 4.5 Ways Anthropic Just Leveled Up</li>
<li>AWS provides a big red button
</li>
</ul>
<h2>Follow Up </h2>
<p>01:26 <a href="https://cloud.google.com/blog/topics/inside-google-cloud/global-harms-restrictive-cloud-licensing-one-year-later/">The global harms of restrictive cloud licensing, one year later | Google </a><a href="https://cloud.google.com/blog/topics/inside-google-cloud/global-harms-restrictive-cloud-licensing-one-year-later/">Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/">Google Cloud</a> <a href="https://cloud.google.com/blog/topics/inside-google-cloud/filing-eu-complaint-against-microsoft-licensing">filed a formal complaint</a> with the European Commission one year ago about Microsoft’s anti-competitive cloud licensing practices, specifically the 400% price markup Microsoft imposes on customers who move Windows Server workloads to non-Azure clouds.</li>
<li style="font-weight:400;">The <a href="https://www.gov.uk/government/organisations/competition-and-markets-authority">UK Competition and Markets Authority</a> found that restrictive licensing costs UK cloud customers £500 million annually due to lack of competition, while US government agencies overspend by $750 million yearly because of Microsoft’s licensing tactics.</li>
<li style="font-weight:400;">Microsoft recently disclosed that <a href="https://www.microsoft.com/en-us/investor/events/fy-2025/earnings-fy-2025-q4">forcing software customers to use Azure</a> is one of three pillars driving its growth and is implementing <a href="https://partner.microsoft.com/en-ie/blog/article/new-licensing-benefits-make-bringing-workloads-and-licenses-to-partners-clouds-easier">new licensing changes</a> preventing managed service providers from hosting certain workloads on Azure competitors.</li>
<li style="font-weight:400;">Multiple regulators globally including South Africa and the US FTC are now investigating Microsoft’s cloud licensing practices, with the CMA finding that Azure has gained customers at 2-3x the rate of competitors since implementing restrictive terms.</li>
<li style="font-weight:400;">A <a href="https://ecipe.org/">European Centre for International Political Economy</a> study suggests ending restrictive licensing could unlock €1.2 trillion in additional EU GDP by 2030 and generate €450 billion annually in fiscal savings and productivity gains.</li>
</ul>
<p>03:32  Jonathan – “I’d feel happier about these complaints Google were making if they actually reciprocated the deals they make for their customers in the...</p>
<h3>Chapters</h3>
<ul><li>(00:00:00) - GCP Alumni</li><li>(00:01:35) - Microsoft's Cloud Licensing Practices</li><li>(00:05:22) - Microsoft introduces Office Agent in Copilot Chat</li><li>(00:08:13) - Claude Sonet 4.5 Launches</li><li>(00:09:33) - Claude 4.5 New Feature Announcement</li><li>(00:15:12) - Bill Gates on ChatGPT and Bots</li><li>(00:16:10) - Snowflake, Cloud Sonnet 4.5, and SQL Server</li><li>(00:17:39) - Amazon EC2, ECS now supporting IPv6 Only workloads</li><li>(00:20:23) - Amazon Machine Image Governance (New Parameter)</li><li>(00:25:42) - Easy to Auto-Scalping (New Feature)</li><li>(00:29:23) - Amazon EC2: Managed Serverless Instances</li><li>(00:33:28) - AWS Outposts: Third-Party Storage Integration</li><li>(00:36:45) - Google's Flex Start VMS for AI & GKE Autop</li><li>(00:41:48) - Google Launches Cloud SQL, BigQuery Extensions</li><li>(00:45:11) - BigQuery and Google Analytics: AI Data Analysis & Forecast</li><li>(00:47:02) - Microsoft Azure Migrate and Modernize: Cloud Code vs. Microsoft</li><li>(00:53:22) - Microsoft's Azure Marketplace Unifying with AppSource</li><li>(00:56:06) - Azure Compute Gallery: Soft Delete</li><li>(00:57:49) - Microsoft Azure Outages: Lessons Learned</li><li>(01:03:32) - Week in Cloud: A Week of Consistency</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 324 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Jonathan are your hosts, bringing you all the latest news and announcements in Cloud and AI. This week we have some exec changes over at Oracle, a LOT of announcements about Sonnet 4.5, and even some marketplace updates over at Azure! Let’s get started. 
Titles we almost went with this week


Oracle’s Executive Shuffle: Promoting from Within While Chasing from Behind
Copilot Takes the Wheel on Your Legacy Code Highway
Queue Up for GPUs: Google’s Take-a-Number Approach to AI Computing
License to Bill: Google’s 400% Markup Grievance
Autopilot Engages: GKE Goes Full Self-Driving Mode
SQL Server Finally Gets a Lake House Instead of a Server Room
Microsoft Gives Office Apps Their Own AI Interns
Claude and Present Danger: The AI That Codes for 30 Hours Straight
The Claude Father Part 4.5: An Offer Your Code Can’t Refuse
CUD You Believe It? Google Makes Discounts Actually Flexible
ECS Goes Full IPv6: No IPv4s Given
Breaking News: AWS Finally Lets You Hit the Emergency Stop Button
One Marketplace to Rule Them All
BigQuery Gets a Crystal Ball and a Chatty Friend
Azure’s September to Remember: When Certificates and Allocators Attack
Shall I Compare Thee to a Sonnet? 4.5 Ways Anthropic Just Leveled Up
AWS provides a big red button


Follow Up 
01:26 The global harms of restrictive cloud licensing, one year later | Google Cloud Blog

Google Cloud filed a formal complaint with the European Commission one year ago about Microsoft’s anti-competitive cloud licensing practices, specifically the 400% price markup Microsoft imposes on customers who move Windows Server workloads to non-Azure clouds.
The UK Competition and Markets Authority found that restrictive licensing costs UK cloud customers £500 million annually due to lack of competition, while US government agencies overspend by $750 million yearly because of Microsoft’s licensing tactics.
Microsoft recently disclosed that forcing software customers to use Azure is one of three pillars driving its growth and is implementing new licensing changes preventing managed service providers from hosting certain workloads on Azure competitors.
Multiple regulators globally including South Africa and the US FTC are now investigating Microsoft’s cloud licensing practices, with the CMA finding that Azure has gained customers at 2-3x the rate of competitors since implementing restrictive terms.
A European Centre for International Political Economy study suggests ending restrictive licensing could unlock €1.2 trillion in additional EU GDP by 2030 and generate €450 billion annually in fiscal savings and productivity gains.

03:32  Jonathan – “I’d feel happier about these complaints Google were making if they actually reciprocated the deals they make for their customers in the...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[324: Clippy’s Revenge: The AI Assistant That Actually Works - Sort Of]]>
                </itunes:title>
                                    <itunes:episode>324</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<h3>Welcome to episode 324 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Jonathan are your hosts, bringing you all the latest news and announcements in Cloud and AI. This week we have some exec changes over at Oracle, a LOT of announcements about Sonnet 4.5, and even some marketplace updates over at Azure! Let’s get started. </h3>
<h3>Titles we almost went with this week
</h3>
<ul>
<li>Oracle’s Executive Shuffle: Promoting from Within While Chasing from Behind</li>
<li>Copilot Takes the Wheel on Your Legacy Code Highway</li>
<li>Queue Up for GPUs: Google’s Take-a-Number Approach to AI Computing</li>
<li>License to Bill: Google’s 400% Markup Grievance</li>
<li>Autopilot Engages: GKE Goes Full Self-Driving Mode</li>
<li>SQL Server Finally Gets a Lake House Instead of a Server Room</li>
<li>Microsoft Gives Office Apps Their Own AI Interns</li>
<li>Claude and Present Danger: The AI That Codes for 30 Hours Straight</li>
<li>The Claude Father Part 4.5: An Offer Your Code Can’t Refuse</li>
<li>CUD You Believe It? Google Makes Discounts Actually Flexible</li>
<li>ECS Goes Full IPv6: No IPv4s Given</li>
<li>Breaking News: AWS Finally Lets You Hit the Emergency Stop Button</li>
<li>One Marketplace to Rule Them All</li>
<li>BigQuery Gets a Crystal Ball and a Chatty Friend</li>
<li>Azure’s September to Remember: When Certificates and Allocators Attack</li>
<li>Shall I Compare Thee to a Sonnet? 4.5 Ways Anthropic Just Leveled Up</li>
<li>AWS provides a big red button
</li>
</ul>
<h2>Follow Up </h2>
<p>01:26 <a href="https://cloud.google.com/blog/topics/inside-google-cloud/global-harms-restrictive-cloud-licensing-one-year-later/">The global harms of restrictive cloud licensing, one year later | Google </a><a href="https://cloud.google.com/blog/topics/inside-google-cloud/global-harms-restrictive-cloud-licensing-one-year-later/">Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/">Google Cloud</a> <a href="https://cloud.google.com/blog/topics/inside-google-cloud/filing-eu-complaint-against-microsoft-licensing">filed a formal complaint</a> with the European Commission one year ago about Microsoft’s anti-competitive cloud licensing practices, specifically the 400% price markup Microsoft imposes on customers who move Windows Server workloads to non-Azure clouds.</li>
<li style="font-weight:400;">The <a href="https://www.gov.uk/government/organisations/competition-and-markets-authority">UK Competition and Markets Authority</a> found that restrictive licensing costs UK cloud customers £500 million annually due to lack of competition, while US government agencies overspend by $750 million yearly because of Microsoft’s licensing tactics.</li>
<li style="font-weight:400;">Microsoft recently disclosed that <a href="https://www.microsoft.com/en-us/investor/events/fy-2025/earnings-fy-2025-q4">forcing software customers to use Azure</a> is one of three pillars driving its growth and is implementing <a href="https://partner.microsoft.com/en-ie/blog/article/new-licensing-benefits-make-bringing-workloads-and-licenses-to-partners-clouds-easier">new licensing changes</a> preventing managed service providers from hosting certain workloads on Azure competitors.</li>
<li style="font-weight:400;">Multiple regulators globally including South Africa and the US FTC are now investigating Microsoft’s cloud licensing practices, with the CMA finding that Azure has gained customers at 2-3x the rate of competitors since implementing restrictive terms.</li>
<li style="font-weight:400;">A <a href="https://ecipe.org/">European Centre for International Political Economy</a> study suggests ending restrictive licensing could unlock €1.2 trillion in additional EU GDP by 2030 and generate €450 billion annually in fiscal savings and productivity gains.</li>
</ul>
<p>03:32  Jonathan – “I’d feel happier about these complaints Google were making if they actually reciprocated the deals they make for their customers in the EU in the US.” </p>
<h2>AI is Going Great – Or How ML Makes Money </h2>
<p>05:14 <a href="https://www.microsoft.com/en-us/microsoft-365/blog/2025/09/29/vibe-working-introducing-agent-mode-and-office-agent-in-microsoft-365-copilot/">Vibe working: Introducing Agent Mode and Office Agent in Microsoft 365 </a><a href="https://www.microsoft.com/en-us/microsoft-365/blog/2025/09/29/vibe-working-introducing-agent-mode-and-office-agent-in-microsoft-365-copilot/">Copilot | Microsoft 365 Blog</a></p>
<ul>
<li style="font-weight:400;">Microsoft introduces<a href="https://www.microsoft.com/en-us/microsoft-365/blog/2025/09/29/vibe-working-introducing-agent-mode-and-office-agent-in-microsoft-365-copilot/"> Agent Mode for Office apps</a> and Office Agent in <a href="https://m365copilot.com/">Copilot chat</a>, leveraging OpenAI’s latest reasoning models and Anthropic models to enable multi-step, iterative AI workflows for document creation. </li>
<li style="font-weight:400;">This represents a shift from single-prompt AI assistance to conversational, agentic productivity where AI can evaluate results, fix issues, and iterate until outcomes are verified.</li>
<li style="font-weight:400;">Agent Mode in Excel democratizes expert-level spreadsheet capabilities by enabling AI to “speak Excel” natively, handling complex formulas, data visualizations, and financial analysis tasks. </li>
<li style="font-weight:400;">The system achieved notable performance on <a href="https://spreadsheetbench.github.io/">SpreadsheetBench</a> benchmarks, and can execute prompts like creating financial reports, loan calculators, and budget trackers with full validation steps.</li>
<li style="font-weight:400;">Agent Mode in Word transforms document creation into an interactive dialogue where Copilot drafts content, suggests refinements, and asks clarifying questions while maintaining Word’s native formatting. This enables faster iteration on complex documents like monthly reports and project updates through conversational prompts rather than manual editing. (FYI, this is a good way to get AI Slop, so buyer beware.)</li>
<li style="font-weight:400;">The thing we’re the most excited about, however, is Office Agent in Copilot chat, which creates complete PowerPoint presentations and Word documents through a three-step process: clarifying intent, conducting web-based research with reasoning capabilities, and producing quality-checked content using code generation. (Justin, being an exec, really just likes the pretty slides.) </li>
<li style="font-weight:400;">This addresses previous AI limitations in creating well-structured presentations by showing chain of thought and providing live previews.</li>
<li style="font-weight:400;">The features are rolling out through Microsoft’s <a href="https://aka.ms/FrontierProgram">Frontier program</a> for Microsoft 365 Copilot licensed customers and Personal/Family subscribers, with Excel and Word Agent Mode available on web (desktop coming soon) and <a href="https://aka.ms/OfficeAgent25">Office Agent</a> currently US-only in English. </li>
<li style="font-weight:400;">This positions Microsoft to compete directly with other AI productivity tools while leveraging their existing Office ecosystem.</li>
</ul>
<p>17:27  Justin – “There’s web apps for all of them. They’re not as good as Google web apps, but they pretend to be.” </p>
<p>08:14 <a href="https://www.anthropic.com/news/claude-sonnet-4-5">Introducing Claude Sonnet 4.5 \ Anthropic</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/claude/sonnet">Claude Sonnet 4.5</a> achieves 77.2% on SWE-bench verified, positioning it as the leading coding model with the ability to maintain focus for over 30 hours on complex multi-step tasks. </li>
<li style="font-weight:400;">The model is available via API at $3/$15 per million tokens, matching the previous Sonnet 4 pricing.</li>
<li style="font-weight:400;">The Claude Agent SDK provides developers with the same infrastructure that powers <a href="https://anthropic.com/news/enabling-claude-code-to-work-more-autonomously">Claude Code</a>, enabling creation of custom AI agents for various tasks beyond coding. </li>
<li style="font-weight:400;">This includes memory management for long-running tasks, permission systems, and subagent coordination capabilities.</li>
<li style="font-weight:400;">Computer use capabilities improved significantly with 61.4% on OSWorld benchmark (up from 42.2% four months ago), enabling direct browser navigation, spreadsheet manipulation, and task completion. </li>
<li style="font-weight:400;">The <a href="https://www.anthropic.com/news/claude-for-chrome">Claude for Chrome</a> extension brings these capabilities to Max subscribers.</li>
<li style="font-weight:400;">New product features include checkpoints in Claude Code for progress saving and rollback, a <a href="https://marketplace.visualstudio.com/items?itemName=anthropic.claude-code">native VS Code extension</a>, <a href="https://anthropic.com/news/context-management">context editing with memory tools</a> in the API, and direct code execution with file creation (spreadsheets, slides, documents) in <a href="https://claude.ai/redirect/website.v1.974b81d9-5b0e-470e-b16d-6bfb54822fbc/download">Claude apps</a>.</li>
<li style="font-weight:400;">Early customer results show 44% reduction in vulnerability intake time for security agents, 18% improvement in planning performance for Devin, and zero error rate on internal code editing benchmarks (down from 9%). </li>
<li style="font-weight:400;">The model operates under ASL-3 safety protections with improved alignment metrics.</li>
</ul>
<p>12:02  Ryan – “I’ve been using Sonnet 4 pretty much exclusively for coding, just because the results I’ve been getting on everything else is really hit or miss. But I definitely won’t let it go off, because it WILL go off on some tangents.” </p>
<p>16:22 <a href="https://www.databricks.com/blog/claude-sonnet-45-here">Claude Sonnet 4.5 Is Here | Databricks Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://r.search.yahoo.com/rdclks/dWU9MTV1czJocGtlOTc2ZCZ1dD0xNzU5ODEyODEzMTE5JnVvPTgyNDYzNzU1NDg0NDM3Jmx0PTImcz0xJmVzPXVjcXlZOUEwTDBYdmw4WENMeXVTXzJuLmhiWXVTTE4xTzh1QjlkWUhzQXBOQTBfTEJxNlkwVVdpOV9FNDNxWGhPZVJyTW9XOUFXMUdhYmV0/RV=2/RE=1762404813/RO=14/RU=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8iDKEn7OPZY13iQrQc1HSAjVUCUy_yvf5m4bMIJBhQs4VB-nEAYTO6il0LHoNIqL5zZhKi5SSx3adozJSGIhhwdsFPBXlASyQs195mZcswG5iMbjBH85KctE-GTTtSKYdgszU2Z4s7uI3dWUZuKyTA_o3xsvh42odnEaG7rsFvkN5uiylYizbIf7hHrRJmhPlBw9Pjg%26u%3DaHR0cHMlM2ElMmYlMmZsb2dpbi5kYXRhYnJpY2tzLmNvbSUyZnNpZ251cCUzZnByb3ZpZGVyJTNkREIlMjZzY2lkJTNkNzAxOFkwMDAwMDFGaTBOUUFTJTI2dXRtX21lZGl1bSUzZHBhaWQlMmJzZWFyY2glMjZ1dG1fc291cmNlJTNkYmluZyUyNnV0bV9jYW1wYWlnbiUzZDQxNDgwNzQ3MyUyNnV0bV9hZGdyb3VwJTNkMTMxOTQxNTM0MTg1NTA5NCUyNnV0bV9jb250ZW50JTNkdHJpYWwlMjZ1dG1fb2ZmZXIlM2R0cmlhbCUyNnV0bV9hZCUzZCUyNnV0bV90ZXJtJTNkZGF0YWJyaWNrcy5jb20lMjZkYnhfc291cmNlJTNkcGFpZCUyNm1zY2xraWQlM2QxZWQ0OTI1NDAxZWUxMDY2YTdhMmZhYjA5ZDIzMzYyMQ%26rlid%3D1ed4925401ee1066a7a2fab09d233621/RK=2/RS=MvCoY79vNWev.oQwXq6fWnTus9g-;_ylt=AwrOtvDNnORoUpsBe21XNyoA;_ylu=Y29sbwNncTEEcG9zAzEEdnRpZAMEc2VjA292LXRvcA--;_ylc=X3IDMgRydAMw?IG=0aceb6f086e34f869b024edef833dcf5">Databricks</a> integrates Claude Sonnet 4.5 directly into their platform through <a href="https://www.databricks.com/blog/2023/04/18/introducing-ai-functions-integrating-large-language-models-databricks-sql.html">AI Functions</a>, allowing enterprises to apply the model to governed data without moving it to external APIs. </li>
<li style="font-weight:400;">This preserves data lineage and security while enabling complex analysis at scale.</li>
<li style="font-weight:400;">The integration enables SQL and Python users to treat Claude as a built-in operator for analyzing unstructured data like contracts, PDFs, and images. Databricks automatically handles backend scaling from single rows to millions of records.</li>
<li style="font-weight:400;">Key technical advancement is bringing AI models to data rather than exporting data to models, solving governance and compliance challenges. </li>
<li style="font-weight:400;">This approach maintains existing data pipelines while adding AI capabilities for tasks like contract analysis and compliance risk detection.</li>
<li style="font-weight:400;"><a href="https://www.databricks.com/product/artificial-intelligence/agent-bricks">Agent Bricks</a> allows enterprises to build domain-specific agents using Claude Sonnet 4.5, with built-in evaluation and continuous improvement mechanisms. The platform handles model tuning and performance monitoring for production deployments.</li>
<li style="font-weight:400;">Claude Sonnet 4.5 launches just seven weeks after <a href="https://www.anthropic.com/claude/opus">Claude Opus 4.1</a>, highlighting rapid model evolution. </li>
<li style="font-weight:400;">Databricks’ model-agnostic approach lets enterprises switch between providers as needs change without rebuilding infrastructure.</li>
</ul>
<p>16:31 <a href="https://www.snowflake.com/en/blog/cortex-ai-claude-sonnet-4-5/">Announcing Anthropic Claude Sonnet 4.5 on Snowflake Cortex AI</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.snowflake.com/en/">Snowflake</a> now offers same-day availability of Anthropic’s <a href="https://www.anthropic.com/claude/sonnet">Claude Sonnet 4.5 </a>model through<a href="https://www.snowflake.com/en/product/features/cortex/"> Cortex AI</a>, accessible via SQL functions and <a href="https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-llm-rest-api">REST API</a> within Snowflake’s secure data perimeter. </li>
<li style="font-weight:400;">The model shows improvements in domain knowledge for finance and cybersecurity, enhanced agentic capabilities for multi-step workflows, and achieved higher scores on SWE-bench Verified for coding tasks.</li>
<li style="font-weight:400;">Enterprises can leverage Sonnet 4.5 through three main interfaces: <a href="https://ai.snowflake.com/">Snowflake Intelligence</a> for natural language business queries, <a href="https://docs.snowflake.com/en/user-guide/snowflake-cortex/llm-functions">Cortex AISQL</a> for multimodal data analysis directly in SQL, and <a href="https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-agents">Cortex Agents</a> for building intelligent systems that handle complex business processes. </li>
<li style="font-weight:400;">The integration maintains Snowflake’s existing security and governance capabilities while processing both structured and unstructured data.</li>
<li style="font-weight:400;">The model is available in supported regions with cross-region inference for non-supported areas, and Snowflake reports over 6,100 accounts using their AI capabilities in Q2 FY26. </li>
<li style="font-weight:400;">Developers can access the model using simple SQL commands like AI_COMPLETE or through<a href="https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-llm-rest-api"> REST API</a> calls for low-latency inference in native applications.</li>
<li style="font-weight:400;">This partnership represents a shift toward embedding frontier AI models directly into data warehouses, allowing analysts to run advanced AI operations using familiar SQL syntax without moving data outside their secure environment. </li>
<li style="font-weight:400;">This approach reduces the complexity of building AI pipelines while maintaining enterprise-grade security and governance.</li>
</ul>
<p>16:41 <a href="https://www.databricks.com/blog/announcing-sql-server-connector-lakeflow-connect-now-generally-available">Announcing SQL Server connector from Lakeflow Connect, now Generally </a><a href="https://www.databricks.com/blog/announcing-sql-server-connector-lakeflow-connect-now-generally-available">Available | Databricks Blog</a></p>
<ul>
<li style="font-weight:400;">Databricks’ <a href="https://docs.databricks.com/aws/en/ingestion/lakeflow-connect/sql-server-source-setup">SQL Server connector</a> for <a href="https://www.databricks.com/product/data-engineering/lakeflow-connect'">Lakeflow Connect</a> is now GA, providing fully managed data ingestion from SQL Server to the lakehouse with built-in CDC and Change Tracking support, eliminating the need for custom pipelines or complex ETL tools.</li>
<li style="font-weight:400;">The connector addresses the common challenge of SQL Server data being locked in transactional systems by enabling incremental data capture without impacting production performance, supporting both on-premises and cloud SQL Server environments through a simple point-and-click UI or API.</li>
<li style="font-weight:400;">Key capabilities include automatic SCD Type 2 support for tracking historical changes, integration with <a href="https://docs.databricks.com/aws/en/dev-tools/bundles/'">Databricks Asset Bundles</a> and <a href="https://docs.databricks.com/aws/en/dev-tools/terraform/">Terraform</a> for CI/CD workflows, and the ability to ingest from multiple SQL Server instances simultaneously without full table refreshes.</li>
<li style="font-weight:400;">Early adopters like Cirrus Aircraft report migrating hundreds of tables in days instead of months, while Australian Red Cross Lifeblood uses it to build reliable pipelines without complex data engineering, demonstrating real-world value for enterprises moving to lakehouse architectures.</li>
<li style="font-weight:400;">This release is part of Lakeflow Connect’s broader ecosystem that now includes GA connectors for <a href="https://www.servicenow.com/">ServiceNow</a> and <a href="https://360suite.google.com/">Google Analytics</a>, with <a href="https://www.postgresql.org/">PostgreSQL</a>, <a href="https://support.microsoft.com/en-us/office/sign-in-to-sharepoint-324a89ec-e77b-4475-b64a-13a0c14c45ec">SharePoint</a>, and query-based connectors for <a href="https://www.oracle.com/">Oracle</a>, <a href="https://www.mysql.com/">MySQL</a>, and <a href="https://www.teradata.com/">Teradata</a> coming soon.</li>
</ul>
<p>17:35  Ryan – “This has been a challenge for awhile; getting data out of these transactional databases so that you can run large reporting jobs on them. So I like any sort of “easy button” that moves you out of that ecosystem.” </p>
<h2>AWS</h2>
<p>17:53 <a href="https://aws.amazon.com/blogs/aws/introducing-claude-sonnet-4-5-in-amazon-bedrock-anthropics-most-intelligent-model-best-for-coding-and-complex-agents/">Introducing Claude Sonnet 4.5 in Amazon Bedrock: Anthropic’s most </a><a href="https://aws.amazon.com/blogs/aws/introducing-claude-sonnet-4-5-in-amazon-bedrock-anthropics-most-intelligent-model-best-for-coding-and-complex-agents/">intelligent model, best for coding and complex agents | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/news/claude-sonnet-4-5">Claude Sonnet 4.5</a> is now available in <a href="https://aws.amazon.com/bedrock/">Amazon Bedrock</a> as <a href="https://www.anthropic.com/">Anthropic’s</a> most advanced model, specifically optimized for coding tasks and complex agent applications with enhanced tool handling, memory management, and context processing capabilities.</li>
<li style="font-weight:400;">The model introduces three key API features: <a href="https://aws.amazon.com/blogs/security/context-window-overflow-breaking-the-barrier/">Smart Context Window</a> Management that generates responses up to available limits instead of erroring out, Tool Use Clearing for automatic cleanup of interaction history to reduce token costs, and Cross-Conversation Memory that persists information across sessions using local memory files.</li>
<li style="font-weight:400;">Integration with <a href="https://aws.amazon.com/bedrock/agentcore/?trk=ac97e39c-d115-4d4a-b3fe-c695e0c9a7ee&amp;sc_channel=el">Amazon Bedrock AgentCore</a> enables 8-hour long-running support with complete session isolation and comprehensive observability, making it suitable for autonomous security operations, financial analysis, and research workflows that require extended processing times.</li>
<li style="font-weight:400;">Claude Sonnet 4.5 excels at autonomous long-horizon coding tasks where it can plan and execute complex software projects spanning hours or days, with demonstrated strength in cybersecurity for proactive vulnerability patching and finance for transforming manual audits into intelligent risk management.</li>
<li style="font-weight:400;">Access requires using inference profiles that define which <a href="https://aws.amazon.com/about-aws/global-infrastructure/regions_az/">AWS Regions</a> process requests, with system-defined cross-Region profiles available for optimal performance distribution across multiple regions.</li>
</ul>
<p>18:06 Justin – “I was mad because it wasn’t working, and then I remembered, “oh yeah…in Bedrock you have to go enable the new model one by one. So if you’re trying to use Bedrock and it’s not working, remember to update your model access.” </p>
<p>18:21 <a href="https://aws.amazon.com/blogs/containers/amazon-ecs-announces-ipv6-only-support/">Amazon ECS announces IPv6-only support | Containers</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/ecs/">Amazon ECS</a> now supports IPv6-only workloads, allowing containers to run without any IPv4 dependencies while maintaining full compatibility with AWS services like ECR, <a href="https://docs.aws.amazon.com/AmazonCloudWatch/latest/monitoring/WhatIsCloudWatch.html">CloudWatch</a>, and <a href="https://docs.aws.amazon.com/secretsmanager/latest/userguide/intro.html">Secrets Manager</a> through native IPv6 endpoints.</li>
<li style="font-weight:400;">This addresses IPv4 address exhaustion challenges and eliminates the need for NAT gateways in private subnets, reducing operational complexity and costs associated with NAT gateway hours and public IPv4 address charges.</li>
<li style="font-weight:400;">The implementation requires minimal configuration changes – simply use IPv6-only subnets with your ECS tasks, and the service automatically adapts without needing IPv6-specific<a href="https://docs.aws.amazon.com/systems-manager/latest/userguide/systems-manager-parameter-store.html"> parameters</a>, supporting awsvpc, bridge, and host <a href="https://docs.aws.amazon.com/AmazonECS/latest/developerguide/task-networking.html">networking modes</a>.</li>
<li style="font-weight:400;">Migration strategies include in-place updates for non-load-balanced services or blue-green deployments using weighted target groups for<a href="https://aws.amazon.com/elasticloadbalancing/application-load-balancer/"> ALB</a>/<a href="https://aws.amazon.com/elasticloadbalancing/network-load-balancer/">NLB</a> workloads, with DNS64/NAT64 available for connecting to IPv4-only internet services.</li>
<li style="font-weight:400;">Federal agencies and organizations with IPv6 compliance requirements can now run containerized workloads that meet regulatory mandates while simplifying their network architecture and improving security posture through streamlined access control. </li>
</ul>
<p>18:57 <a href="https://aws.amazon.com/about-aws/whats-new/2025/09/amazon-ec2-auto-scaling-ipv6/">Amazon EC2 Auto Scaling now supports Internet Protocol Version 6 (IPv6)</a></p>
<ul>
<li style="font-weight:400;">EC2 Auto Scaling now supports IPv6 in dual-stack configuration alongside IPv4, addressing the growing scarcity of IPv4 addresses and enabling virtually unlimited scaling for applications.</li>
<li style="font-weight:400;">The dual-stack approach allows gradual migration from IPv4 to IPv6, reducing risk during transitions while providing contiguous IP ranges that simplify microservice architectures and network management.</li>
<li style="font-weight:400;">This update arrives as enterprises face IPv4 exhaustion challenges, with IPv6 adoption becoming essential for large-scale deployments and IoT workloads that require extensive address spaces.</li>
<li style="font-weight:400;">Available in all commercial AWS regions except New Zealand (we’re not sure what the deal is there, but sorry Kiwis). </li>
<li style="font-weight:400;">The feature integrates with existing VPC configurations and requires no additional charges beyond standard EC2 and networking costs.</li>
<li style="font-weight:400;">Organizations running containerized workloads or microservices architectures will benefit from simplified IP management and the ability to assign dedicated ranges to each service without address constraints.</li>
</ul>
<p>19:47  Matt- “It is amazing how fast that IPv4 cost does add up in your account, especially if you have load balancers, multiple subnets, and you’re running multiple ECS containers and public subnets for some reason.”</p>
<p>20:36 <a href="https://aws.amazon.com/about-aws/whats-new/2025/09/amazon-ec2-allowed-amis-setting-parameters-ami-governance/">Amazon EC2 Allowed AMIs setting adds new parameters for enhanced AMI </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/09/amazon-ec2-allowed-amis-setting-parameters-ami-governance/">governance</a></p>
<ul>
<li style="font-weight:400;">EC2’s Allowed AMIs setting now supports four new parameters – marketplace codes, deprecation time, creation date, and AMI names – giving organizations more granular control over which Amazon Machine Images can be discovered and launched across their AWS accounts.</li>
<li style="font-weight:400;">The marketplace codes parameter addresses a common security concern by allowing teams to restrict usage to specific vetted marketplace AMIs, while deprecation time and creation date parameters help enforce policies against outdated or potentially vulnerable images.</li>
<li style="font-weight:400;">AMI name parameter enables enforcement of naming conventions, which is particularly useful for large organizations that use standardized naming patterns to indicate compliance status, department ownership, or approved software stacks.</li>
<li style="font-weight:400;">These parameters integrate with <a href="https://www.bing.com/ck/a?!&amp;&amp;p=64d3776c53343568b728fd88b4751e0fc533d8a3437cd3da3904b2a339c86b44JmltdHM9MTc1OTc5NTIwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=AWS+Declarative+Policies&amp;u=a1aHR0cHM6Ly9kb2NzLmF3cy5hbWF6b24uY29tL29yZ2FuaXphdGlvbnMvbGF0ZXN0L3VzZXJndWlkZS9vcmdzX21hbmFnZV9wb2xpY2llc19kZWNsYXJhdGl2ZS5odG1s">AWS Declarative Policies</a> for organization-wide governance, allowing central IT teams to enforce AMI compliance across hundreds or thousands of accounts without manual intervention.</li>
<li style="font-weight:400;">The feature is available in all AWS regions at no additional cost and represents a practical solution to the challenge of shadow IT and unauthorized software deployment in cloud environments.</li>
</ul>
<p>25:07  Jonathan – “Just wait six months, they’ll all have the same features anyway.” </p>
<p>26:00 <a href="https://aws.amazon.com/about-aws/whats-new/2025/09/amazon-ec2-auto-scaling-forced-cancellation-instance/">Amazon EC2 Auto Scaling now supports forced cancellation of instance </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/09/amazon-ec2-auto-scaling-forced-cancellation-instance/">refreshes</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/autoscaling/ec2/userguide/cancel-instance-refresh.html#cancel-instance-refresh-cli">EC2 Auto Scaling</a> now allows forced cancellation of instance refreshes by setting WaitForTransitioningInstances to false in the CancelInstanceRefresh API, enabling immediate abort without waiting for in-progress launches or terminations to complete.</li>
<li style="font-weight:400;">This feature addresses emergency scenarios where rapid roll forward is needed, such as when a current deployment causes service disruptions and teams need to quickly abandon the problematic refresh and start a new one.</li>
<li style="font-weight:400;">The enhancement provides better control over Auto Scaling group updates by bypassing lifecycle hooks and pending instance activities, reducing downtime during critical deployment issues.</li>
<li style="font-weight:400;">Available in all AWS regions including GovCloud, this feature integrates with existing Auto Scaling workflows and requires no additional cost beyond standard EC2 and Auto Scaling charges.</li>
<li style="font-weight:400;">For organizations using instance refreshes for configuration updates or deployments, this capability reduces recovery time objectives (RTO) when deployments go wrong, particularly valuable for production environments requiring quick remediation.</li>
</ul>
<p>26:38  Justin – “I was like, this isn’t really that big of an issue, and then I remembered well, I’ve had a really big autoscaling group, and this could be a really big problem. If you have like 5 webservers, you probably don’t care. But if you have hundreds? This could be a big lifesaver for you.” </p>
<p>29:00 <a href="https://aws.amazon.com/blogs/aws/announcing-amazon-ecs-managed-instances-for-containerized-applications/">Announcing Amazon ECS Managed Instances for containerized </a><a href="https://aws.amazon.com/blogs/aws/announcing-amazon-ecs-managed-instances-for-containerized-applications/">applications | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;">Your hosts spent quite a bit of time arguing about this one…</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/category/compute/amazon-ec2/">Amazon ECS</a> Managed Instances bridges the gap between serverless simplicity and <a href="https://aws.amazon.com/ec2">EC2</a> flexibility by providing fully managed container compute that supports all EC2 instance types including GPUs and specialized architectures while <a href="https://aws.amazon.com/">AWS</a> handles provisioning, scaling, and security patching.</li>
<li style="font-weight:400;">The service automatically selects cost-optimized instances by default but allows customers to specify up to 20 instance attributes when workloads require specific capabilities, addressing the limitation that prevented customers with EC2 pricing commitments from using serverless options.</li>
<li style="font-weight:400;">Infrastructure management includes automated security patches every 14 days using <a href="https://bottlerocket.dev/en/os/">Bottlerocket OS</a>, intelligent task placement to consolidate workloads onto fewer instances, and automatic termination of idle instances to optimize costs.</li>
<li style="font-weight:400;">Pricing consists of standard EC2 instance costs plus a management fee, initially available in 6 regions including US East, US West, Europe, Africa, and Asia Pacific with support for console, CLI, CDK, and CloudFormation deployment.</li>
<li style="font-weight:400;">For The Cloud Pod specifically, one single node was $.03 for the management fee. </li>
<li style="font-weight:400;">This addresses a key customer pain point where teams wanted serverless operational simplicity but needed specific compute capabilities like GPU acceleration or particular CPU architectures that weren’t available in <a href="https://aws.amazon.com/fargate/">Fargate</a>.</li>
</ul>
<p>30:12  Justin – “I love Fargate, but I don’t like paying for Fargate. That’s why I run our Cloud Pod website on an EC2 instance because it’s way cheaper. So for three cents more a gig versus going to Fargate, this is probably where I would land if I didn’t really want to manage the host.”</p>
<p>33:11 <a href="https://aws.amazon.com/blogs/aws/announcing-aws-outposts-third-party-storage-integration-with-dell-and-hpe/">Announcing AWS Outposts third-party storage integration with Dell and </a><a href="https://aws.amazon.com/blogs/aws/announcing-aws-outposts-third-party-storage-integration-with-dell-and-hpe/">HPE | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/category/compute/aws-outposts/">AWS Outposts</a> now integrates with <a href="https://www.dell.com/en-us/shop/storage-servers-and-networking-for-business/sf/power-store">Dell PowerStore</a> and <a href="https://www.hpe.com/us/en/storage/alletra.html">HPE Alletra Storage MP B10000</a> arrays, joining existing support for <a href="https://www.netapp.com/data-management/ontap-data-management-software/">NetApp</a> and <a href="https://www.purestorage.com/products/nvme/flasharray-x.html">Pure Storage</a>, allowing customers to use their third-party storage investments with Outposts through native AWS tooling.</li>
<li style="font-weight:400;">The integration supports both data and boot volumes with two boot methods – iSCSI SANboot for read/write volumes and Localboot for read-only volumes using iSCSI or NVMe-over-TCP protocols, manageable through the <a href="https://aws.amazon.com/ec2/">EC2</a> Launch Instance Wizard.</li>
<li style="font-weight:400;">This addresses two key customer needs: organizations migrating VMware workloads who need to maintain existing storage during transition, and companies with strict data residency requirements that must keep data on-premises while using AWS services.</li>
<li style="font-weight:400;">Available at no additional charge across all Outposts form factors (2U servers and both rack generations) in all supported regions, with AWS-verified <a href="https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/AMIs.html">AMIs</a> for <a href="https://www.microsoft.com/en-us/windows-server">Windows Server 2022</a> and <a href="https://www.redhat.com/en/technologies/linux-platforms/enterprise-linux">RHEL 9</a> plus automation scripts on <a href="https://github.com/aws-samples/sample-outposts-third-party-storage-integration">AWS Samples</a>.</li>
<li style="font-weight:400;">Second-generation <a href="https://aws.amazon.com/outposts/rack/faqs/">Outposts racks</a> can now combine doubled compute performance (2x vCPU, memory, and network bandwidth) with customers’ preferred storage arrays, providing flexibility for hybrid cloud deployments.</li>
</ul>
<p>34:37  Jonathan – “It’s more that you can not have AWS provide the storage layer, but you can have them still support S3 and EBS and those other things on top of this third party storage subsystem.” </p>
<h2>GCP</h2>
<p>36:35 <a href="https://cloud.google.com/blog/products/compute/introducing-flex-start-vms-for-the-compute-engine-instance-api/">Introducing Flex-start VMs for the Compute Engine Instance API. | Google </a><a href="https://cloud.google.com/blog/products/compute/introducing-flex-start-vms-for-the-compute-engine-instance-api/">Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google launches <a href="https://cloud.google.com/compute/docs/instances/about-flex-start-vms">Flex-start VMs</a> in GA, a new consumption model that queues GPU requests for up to 2 hours instead of failing immediately, addressing the persistent challenge of GPU scarcity for AI workloads.</li>
<li style="font-weight:400;">This appears to be unique among major cloud providers – rather than competing on raw capacity, Google is innovating on the access model itself by introducing a fair queuing system with <a href="https://cloud.google.com/products/dws/pricing?e=48754805&amp;hl=en">significant discounts</a> compared to on-demand pricing.</li>
<li style="font-weight:400;">The service integrates directly with Compute Engine’s existing instance API and CLI, allowing easy adoption into current workflows without requiring migration to a separate scheduling service, with VMs running for up to 7 days uninterrupted.</li>
<li style="font-weight:400;">Key use cases include AI model fine-tuning, batch inference, and HPC workloads that can tolerate delayed starts in exchange for better resource availability and lower costs, particularly valuable for research and development teams.</li>
<li style="font-weight:400;">The stop/start capability with automatic re-queuing and configurable termination actions (preserving VM state after 7 days) provides flexibility for long-running experiments while managing costs effectively.</li>
</ul>
<p>37:32  Ryan – “I love this. This is great. You’re still going to see a whole bunch of data scientists spamming the workbooks trying to get this to run, but I do think that from a pure capacity standpoint this is the right answer to some of these things, just because a lot of these jobs are very long running and it’s not really instant results.”  </p>
<p>39:52 <a href="https://cloud.google.com/blog/products/containers-kubernetes/gke-autopilot-now-available-to-all-qualifying-clusters/">GKE Autopilot now available to all qualifying clusters | Google Cloud Blo</a>g</p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/kubernetes-engine?e=48754805">GKE</a> Autopilot features are now available in Standard clusters through compute classes, allowing existing GKE users to access container-optimized compute without migrating to dedicated Autopilot clusters – this brings efficient bin-packing and rapid scaling to 70% of GKE clusters that weren’t using Autopilot mode.</li>
<li style="font-weight:400;">The <a href="https://cloud.google.com/blog/products/containers-kubernetes/container-optimized-compute-delivers-autoscaling-for-autopilot">container-optimized compute platform</a> starts at just 50 milli-CPU (5% of one core) and scales to 28vCPU, with customers only paying for requested resources rather than entire nodes – addressing the common Kubernetes challenge of overprovisioning and wasted compute capacity.</li>
<li style="font-weight:400;">New automatic provisioning for <a href="https://cloud.google.com/kubernetes-engine/docs/concepts/about-compute-classes">compute classes</a> lets teams gradually adopt Autopilot features alongside existing node pools without disrupting current workloads, solving the previous all-or-nothing approach that made migration risky for production environments.</li>
<li style="font-weight:400;">AI workloads can now run on GPUs and TPUs with Autopilot’s managed node properties and enterprise-grade security controls, competing directly with <a href="https://docs.aws.amazon.com/eks/latest/userguide/automode.html">AWS EKS Auto Mode</a> and <a href="https://learn.microsoft.com/en-us/azure/aks/node-autoprovision">Azure AKS automatic node provisioning</a> but with tighter integration to Google’s AI ecosystem.</li>
<li style="font-weight:400;">Available starting with GKE version 1.33.1 in the Rapid release channel, with 30% of new GKE clusters already created in Autopilot mode in 2024, suggesting strong customer adoption of managed Kubernetes operations.</li>
</ul>
<p>37:32  Ryan – “So now you can have not only dedicated compute, but preemptible and now autopilot capacity all in the single cluster. Kind of cool.”</p>
<p>41:58 <a href="https://cloud.google.com/blog/products/databases/gemini-cli-extensions-for-google-data-cloud/">Gemini CLI extensions for Google Data Cloud | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google launches <a href="https://blog.google/technology/developers/introducing-gemini-cli-open-source-ai-agent/">Gemini CLI</a> extensions for Data Cloud services including <a href="https://cloud.google.com/sql">Cloud SQL</a>, <a href="https://cloud.google.com/alloydb/docs/overview">AlloyDB</a>, and <a href="https://cloud.google.com/bigquery">BigQuery</a>, enabling developers to manage databases and run analytics directly from their terminal using natural language prompts.</li>
<li style="font-weight:400;">What could go wrong? </li>
<li style="font-weight:400;">The extensions allow developers to provision databases, create tables, generate APIs, and perform data analysis through conversational commands, potentially reducing the time needed for common database operations and eliminating context switching between tools.</li>
<li style="font-weight:400;">BigQuery’s extension includes AI-powered forecasting capabilities and conversational analytics APIs, letting users ask business questions in natural language and receive insights without writing SQL queries.</li>
<li style="font-weight:400;">This positions Google against <a href="https://aws.amazon.com/blogs/devops/introducing-amazon-codewhisperer-for-command-line/">AWS’s recent CodeWhisperer CLI</a> integration and <a href="https://learn.microsoft.com/en-us/azure/developer/github-copilot-azure/get-started">Azure’s GitHub Copilot CLI</a>, though Google’s approach focuses specifically on data services rather than general cloud operations.</li>
<li style="font-weight:400;">Key use cases include rapid prototyping for startups, data exploration for analysts who aren’t SQL experts, and streamlining database operations for DevOps teams managing multiple Cloud SQL or AlloyDB instances.</li>
</ul>
<p>43:28 <a href="https://cloud.google.com/blog/products/ai-machine-learning/announcing-claude-sonnet-4-5-on-vertex-ai/">Announcing Claude Sonnet 4.5 on Vertex AI | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Surprise surprise…</li>
<li style="font-weight:400;">Google Cloud now offers <a href="https://console.cloud.google.com/vertex-ai/publishers/anthropic/model-garden/claude-sonnet-4-5">Claude Sonnet 4.5</a> on <a href="https://cloud.google.com/vertex-ai">Vertex AI</a>, Anthropic’s most advanced model designed for autonomous agents that can work independently for hours on complex coding, cybersecurity, financial analysis, and research tasks.</li>
<li style="font-weight:400;">The integration includes Vertex AI’s <a href="https://google.github.io/adk-docs/">Agent Development Kit</a> and <a href="https://cloud.google.com/vertex-ai/generative-ai/docs/agent-engine/overview">Agent Engine</a> for building multi-agent systems, plus provisioned throughput for dedicated capacity at fixed costs, addressing enterprise needs for reliable AI deployment.</li>
<li style="font-weight:400;">Claude Sonnet 4.5 supports a 1 million token context window, batch predictions, and prompt caching on Vertex AI, with global endpoint routing that automatically serves traffic from the nearest available region for reduced latency.</li>
<li style="font-weight:400;">Customers like <a href="https://cloud.google.com/customers/augment">Augment Code</a>, <a href="https://cloud.google.com/customers/springnew">spring.new</a>, and <a href="https://cloud.google.com/customers/telusai">TELUS</a> are already using Claude on Vertex AI, with spring.new reporting application development time reduced from three months to 1-2 hours using natural language prompts.</li>
<li style="font-weight:400;">The model is available through <a href="https://cloud.google.com/model-garden">Vertex AI Model Garden</a> and <a href="https://cloud.google.com/marketplace">Google Cloud Marketplace</a>, with VS Code extension support and <a href="https://www.anthropic.com/news/enabling-claude-code-to-work-more-autonomously">Claude Code 2.0</a> terminal interface featuring checkpoints for more autonomous development operations.</li>
</ul>
<p>43:51 <a href="https://cloud.google.com/blog/products/compute/adopt-new-vm-series-with-gke-compute-classes-flexible-cuds/">Adopt new VM series with GKE compute classes, Flexible CUDs | Google </a><a href="https://cloud.google.com/blog/products/compute/adopt-new-vm-series-with-gke-compute-classes-flexible-cuds/">Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/kubernetes-engine/docs/concepts/about-custom-compute-classes">GKE compute classes</a> let you define a prioritized list of machine families for autoscaling, automatically falling back to alternative VM types if your preferred option isn’t available – solving the challenge of adopting new Gen4 machines like N4 and C4 while maintaining workload availability.</li>
<li style="font-weight:400;">Compute Flexible CUDs provide spend-based discounts up to 46% that follow your workload across different machine families, unlike resource-based CUDs that lock you to specific VM types – enabling financial flexibility when migrating between machine generations.</li>
<li style="font-weight:400;">The combination addresses real adoption barriers: compatibility testing through gradual rollouts, regional capacity constraints with automatic fallbacks, and financial commitment alignment by allowing discounts to apply across multiple VM families including both new and legacy options.</li>
<li style="font-weight:400;">Shopify successfully used this approach during Black Friday/Cyber Monday 2024, prioritizing new N4 machines with N2 fallbacks to handle massive scale while maintaining cost optimization through Flex CUDs.</li>
<li style="font-weight:400;">This approach particularly benefits organizations running large GKE fleets or high-performance workloads that want to leverage new C4/C4D series VMs for better price-performance without sacrificing availability or losing existing discount commitments.</li>
</ul>
<p>44:08  Justin – “So this is a solution to a problem that Google has because they’;re terrible at capacity planning. Perfect.” </p>
<p>45:35 <a href="https://cloud.google.com/blog/products/data-analytics/ai-based-forecasting-and-analytics-in-bigquery-via-mcp-and-adk/">AI-based forecasting and analytics in BigQuery via MCP and ADK | Google </a><a href="https://cloud.google.com/blog/products/data-analytics/ai-based-forecasting-and-analytics-in-bigquery-via-mcp-and-adk/">Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">BigQuery now offers two new AI tools for data analysis: ask_data_insights enables natural language queries against structured data using <a href="https://cloud.google.com/blog/products/data-analytics/understanding-lookers-conversational-analytics-api">Conversational Analytics API</a>, while BigQuery Forecast provides time-series predictions using the built-in TimesFM model without requiring separate ML infrastructure setup.</li>
<li style="font-weight:400;">These tools integrate with both Google’s <a href="https://google.github.io/adk-docs/">Agent Development Kit (ADK)</a> and <a href="https://github.com/googleapis/genai-toolbox">Model Context Protocol (MCP) Toolbox</a>, allowing developers to build AI agents that can analyze BigQuery data and generate forecasts with just a few lines of code – positioning Google against AWS Bedrock and Azure OpenAI Service in the enterprise AI agent space.</li>
<li style="font-weight:400;">The ask_data_insights tool provides transparency by showing step-by-step query formulation and execution logs, addressing enterprise concerns about AI black boxes when analyzing sensitive business data, while BigQuery Forecast leverages the <a href="https://cloud.google.com/bigquery/docs/reference/standard-sql/bigqueryml-syntax-ai-forecast">AI.FORECAST</a> function to deliver predictions with confidence intervals.</li>
<li style="font-weight:400;">Key use cases include retail sales forecasting, web traffic prediction, and inventory management, with the demo showing <a href="https://marketingplatform.google.com/about/analytics-360/">Google Analytics 360</a> data analysis – particularly valuable for businesses already invested in Google’s analytics ecosystem who want to extract deeper insights without data science expertise.</li>
<li style="font-weight:400;">Both tools are available today in the <a href="https://googleapis.github.io/genai-toolbox/resources/tools/bigquery/">MCP Toolbox</a> and <a href="https://google.github.io/adk-docs/tools/built-in-tools/#bigquery">ADK’s built-in toolset</a>, with users only needing read access to BigQuery tables, though specific pricing details aren’t mentioned beyond standard BigQuery query and ML costs.</li>
</ul>
<p>46:38  Ryan – “…this is really neat. And then the fact that it does show you the logic all the way through, which I think is super important. You can ask natural-line questions, and it just comes back with a whole bunch of analysis, and then what happens if that doesn’t work consistently? How do you debug that? This is basically building it, which is how I learned anyway, so it works really well when it’s spitting out the actual config for me instead of just telling me what the results are.”</p>
<h2>Azure</h2>
<p>49:06 <a href="https://azure.microsoft.com/en-us/blog/accelerate-migration-and-modernization-with-agentic-ai/">Announcing migration and modernization agentic AI tools | Microsoft Azure </a><a href="https://azure.microsoft.com/en-us/blog/accelerate-migration-and-modernization-with-agentic-ai/">Blog</a></p>
<ul>
<li style="font-weight:400;">Microsoft announced agentic AI tools for migration and modernization at their <a href="https://www.microsoft.com/en-us/events/launch-events/migrate-and-modernize-summit">Migrate and Modernize Summit</a>, with <a href="https://www.bing.com/ck/a?!&amp;&amp;p=f341ba7a1971834d118a4bcb3132b49e471d992e048aa471359746ce467705f2JmltdHM9MTc1OTc5NTIwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=GitHub+Copilot&amp;u=a1aHR0cHM6Ly9naXRodWIuY29tL2ZlYXR1cmVzL2NvcGlsb3Q">GitHub Copilot</a> now automating <a href="https://aka.ms/ghcp-appmod/Java">Java</a> and <a href="https://aka.ms/ghcp-appmod/dotNET">.NET</a> app upgrades that previously took months down to days or hours.</li>
<li style="font-weight:400;"><a href="https://aka.ms/azm/news">Azure Migrate</a> introduces AI-powered guidance and connects directly with GitHub Copilot for app modernization, enabling IT and developer teams to collaborate seamlessly while providing application-awareness by default and expanded support for <a href="https://www.bing.com/ck/a?!&amp;&amp;p=2761843852f9cb62e452b0df6cf8bb97e9999ed8fda81ab016b5f4b63407664fJmltdHM9MTc1OTc5NTIwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=PostgreSQL&amp;u=a1aHR0cHM6Ly93d3cucG9zdGdyZXNxbC5vcmcvZG93bmxvYWQv">PostgreSQL</a> and Linux distributions.</li>
<li style="font-weight:400;">The new <a href="https://azure.microsoft.com/en-us/solutions/azure-accelerate/">Azure Accelerate program</a> combines expert guidance with funding for eligible projects and includes the <a href="https://partner.microsoft.com/en-us/asset/collection/cloud-accelerate-factory">Cloud Accelerate Factory</a> where Microsoft engineers provide zero-cost deployment support for over 30 Azure services.</li>
<li style="font-weight:400;">GitHub Copilot’s app modernization capabilities analyze codebases, detect breaking changes, suggest migration paths, containerize code, and generate deployment artifacts – with Ford China reporting 70% reduction in time and effort for middleware app modernization.</li>
<li style="font-weight:400;">This positions Microsoft competitively against AWS and GCP by addressing the 37% of application portfolios requiring modernization, though specific pricing details weren’t provided beyond the zero-cost deployment support through Azure Accelerate.</li>
</ul>
<p>50:12  Ryan – “Get these things migrated. Because you can’t run them on these ancient frameworks that are full of vulnerabilities.” </p>
<p>54:32 <a href="https://blogs.microsoft.com/blog/2025/09/25/introducing-microsoft-marketplace-thousands-of-solutions-millions-of-customers-one-marketplace/">Introducing Microsoft Marketplace — Thousands of solutions. Millions of </a><a href="https://blogs.microsoft.com/blog/2025/09/25/introducing-microsoft-marketplace-thousands-of-solutions-millions-of-customers-one-marketplace/">customers. One Marketplace. – The Official Microsoft Blog</a></p>
<ul>
<li style="font-weight:400;">Microsoft unifies Azure Marketplace and AppSource into a single <a href="https://marketplace.microsoft.com/?ocid=cmmlvdg0mq9">Microsoft Marketplace</a>, creating one destination for cloud solutions, AI apps, and agents with over 3,000 AI offerings now available for direct integration into <a href="https://ai.azure.com/">Azure AI Foundry</a> and <a href="https://m365.cloud.microsoft/?omkt=en-001&amp;source=post_page---------------------------">Microsoft 365 Copilot</a>.</li>
<li style="font-weight:400;">The marketplace introduces multiparty private offers and CSP integration, allowing channel partners like Arrow, Crayon, and TD SYNNEX to resell solutions through their own marketplaces while maintaining Microsoft’s security and governance standards.</li>
<li style="font-weight:400;">For <a href="https://learn.microsoft.com/en-us/azure/cost-management-billing/manage/track-consumption-commitment">Azure Consumption Commitment</a> customers, 100% of purchases for Azure benefit eligible solutions count toward their commitment, providing a financial incentive to consolidate software procurement through the marketplace.</li>
<li style="font-weight:400;">Configuration time for AI apps has been reduced from 20 minutes to 1 minute per instance according to Siemens, with solutions now deployable directly within Microsoft products using Model Context Protocol (MCP) standards.</li>
<li style="font-weight:400;">This positions Microsoft competitively against AWS Marketplace and Google Cloud Marketplace by offering tighter integration with productivity tools like Microsoft 365, though AWS still maintains a larger overall catalog of third-party solutions.</li>
</ul>
<p>55:23  Justin – “I guess it’s nice to have one marketplace to rule them all, but 3,000 AI apps sounds like a lot of AI slop.”</p>
<p>56:59 <a href="https://azure.microsoft.com/updates?id=506886">Public Preview: Soft Delete feature in Azure Compute Gallery</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/azure-resource-manager/management/preview-features">Azure Compute Gallery</a> now includes soft delete functionality with a 7-day retention period, allowing recovery of accidentally deleted VM images and application packages before permanent deletion.</li>
<li style="font-weight:400;">This feature addresses a common operational risk where teams accidentally delete critical golden images or application templates, providing a safety net similar to AWS AMI deregistration’s 24-hour pending state.</li>
<li style="font-weight:400;">The 7-day retention window aligns with typical enterprise change control cycles, giving IT teams sufficient time to detect and recover from deletion errors during weekend maintenance windows.</li>
<li style="font-weight:400;">Target use cases include DevOps teams managing large image libraries, enterprises with strict compliance requirements for image retention, and managed service providers handling multiple customer environments.</li>
<li style="font-weight:400;">While pricing details aren’t specified, users should expect storage costs during the retention period similar to standard gallery storage rates, making this a low-cost insurance policy against operational mistakes.</li>
</ul>
<p>57:21  Matt – “So essentially it’s an easy way to do upgrades versus the way AWS – and you have to press (and by press I mean type your cancel API command) to stop the rolling upgrade of the system…this also prevents the same issue that we’ve all run into where I’ve stopped sharing this across accounts and we just broke production somewhere.”</p>
<p>58:48 <a href="https://share.google/KhE5WiA89c9gn7SbF">Switzerland Azure Outage</a></p>
<ul>
<li style="font-weight:400;">Azure experienced two major regional outages in September 2025 – Switzerland North suffered a 22-hour outage affecting 20+ services due to a malformed certificate prefix, while East US 2 had a 10-hour incident caused by an Allocator service issue that created cascading failures across availability zones</li>
<li style="font-weight:400;">The East US 2 incident reveals critical architectural challenges in Azure’s control plane design – aggressive retry logic meant to improve reliability actually amplified the problem by creating massive backlogs that took hours to drain even after the initial issue was resolved</li>
<li style="font-weight:400;">Both incidents highlight gaps in Azure’s incident communication systems – automated alerts only covered a subset of affected services, forcing manual notifications and public status page updates hours into the outages, leaving many customers uninformed during critical periods</li>
<li style="font-weight:400;">Microsoft’s response includes immediate fixes like reverting the problematic Allocator behavior and adjusting throttling configurations, plus longer-term improvements to load testing, backlog drainage tools, and communication systems scheduled through June 2026. (So be prepared for this to happen at least three more times before then.) </li>
<li style="font-weight:400;">These outages underscore the importance of multi-region deployment strategies for mission-critical workloads – customers relying on single-region deployments faced extended downtime with no failover options during these regional control plane failures.</li>
</ul>
<h2>Oracle</h2>
<p>1:01:54 <a href="https://www.oracle.com/news/announcement/oracle-corporation-announces-promotion-of-clay-magouyrk-and-mike-scilia-2025-09-22/">Oracle Corporation Announces Promotion Of Clay Magouyrk And Mike </a><a href="https://www.oracle.com/news/announcement/oracle-corporation-announces-promotion-of-clay-magouyrk-and-mike-scilia-2025-09-22/">Scilia 2025 09 22</a></p>
<ul>
<li style="font-weight:400;">Oracle promoted Clay Magouyrk to Executive Vice President of Oracle Cloud Infrastructure, and Mike Sicilia to Executive Vice President of Oracle Industries, signaling continued investment in cloud infrastructure and vertical market strategies despite their distant third-place position behind AWS and Azure.</li>
<li style="font-weight:400;">Magouyrk’s promotion after leading OCI engineering suggests Oracle is doubling down on their infrastructure-first approach, though they’ll need significant innovation to close the gap with hyperscalers who have 10+ year head starts and vastly larger customer bases.</li>
<li style="font-weight:400;">Sicilia’s elevation to lead Oracle Industries indicates a focus on vertical-specific solutions, a strategy that could differentiate Oracle from AWS/Azure/GCP by leveraging their deep enterprise relationships in healthcare, financial services, and telecommunications.</li>
<li style="font-weight:400;">These executive changes come as Oracle tries to position OCI as the preferred cloud for enterprise workloads, particularly for customers already invested in Oracle databases and applications who want integrated stack benefits.</li>
<li style="font-weight:400;">The promotions suggest organizational stability at Oracle Cloud during a critical growth phase, though the real test will be whether new leadership can accelerate customer adoption beyond Oracle’s traditional installed base. </li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2161684/c1e-jkjku5k892tzo877-0v7dg0jquvdk-prxeru.mp3" length="123953449"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 324 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Jonathan are your hosts, bringing you all the latest news and announcements in Cloud and AI. This week we have some exec changes over at Oracle, a LOT of announcements about Sonnet 4.5, and even some marketplace updates over at Azure! Let’s get started. 
Titles we almost went with this week


Oracle’s Executive Shuffle: Promoting from Within While Chasing from Behind
Copilot Takes the Wheel on Your Legacy Code Highway
Queue Up for GPUs: Google’s Take-a-Number Approach to AI Computing
License to Bill: Google’s 400% Markup Grievance
Autopilot Engages: GKE Goes Full Self-Driving Mode
SQL Server Finally Gets a Lake House Instead of a Server Room
Microsoft Gives Office Apps Their Own AI Interns
Claude and Present Danger: The AI That Codes for 30 Hours Straight
The Claude Father Part 4.5: An Offer Your Code Can’t Refuse
CUD You Believe It? Google Makes Discounts Actually Flexible
ECS Goes Full IPv6: No IPv4s Given
Breaking News: AWS Finally Lets You Hit the Emergency Stop Button
One Marketplace to Rule Them All
BigQuery Gets a Crystal Ball and a Chatty Friend
Azure’s September to Remember: When Certificates and Allocators Attack
Shall I Compare Thee to a Sonnet? 4.5 Ways Anthropic Just Leveled Up
AWS provides a big red button


Follow Up 
01:26 The global harms of restrictive cloud licensing, one year later | Google Cloud Blog

Google Cloud filed a formal complaint with the European Commission one year ago about Microsoft’s anti-competitive cloud licensing practices, specifically the 400% price markup Microsoft imposes on customers who move Windows Server workloads to non-Azure clouds.
The UK Competition and Markets Authority found that restrictive licensing costs UK cloud customers £500 million annually due to lack of competition, while US government agencies overspend by $750 million yearly because of Microsoft’s licensing tactics.
Microsoft recently disclosed that forcing software customers to use Azure is one of three pillars driving its growth and is implementing new licensing changes preventing managed service providers from hosting certain workloads on Azure competitors.
Multiple regulators globally including South Africa and the US FTC are now investigating Microsoft’s cloud licensing practices, with the CMA finding that Azure has gained customers at 2-3x the rate of competitors since implementing restrictive terms.
A European Centre for International Political Economy study suggests ending restrictive licensing could unlock €1.2 trillion in additional EU GDP by 2030 and generate €450 billion annually in fiscal savings and productivity gains.

03:32  Jonathan – “I’d feel happier about these complaints Google were making if they actually reciprocated the deals they make for their customers in the...]]>
                </itunes:summary>
                                                                            <itunes:duration>01:04:28</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2161684/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[322: Did OpenAI and Microsoft Break Up? It’s Complicated…]]>
                </title>
                <pubDate>Wed, 24 Sep 2025 23:00:55 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2149152</guid>
                                    <link>https://tcpfm.castos.com/episodes/322-did-openai-and-microsoft-break-up-its-complicated</link>
                                <description>
                                            <![CDATA[<h3>Welcome to episode 322 of The Cloud Pod, where the forecast is always cloudy! We have BIG NEWS – Jonathan is back! He’s joined in the studio by Justin and Ryan to bring you all the latest in cloud and AI news, including ongoing drama in the Microsoft/OpenAI drama, saying goodbye to data transfer fees (in the EU), M4 Power, and more. Let’s get started!  </h3>
<h3>Titles we almost went with this week
</h3>
<ul>
<li>EU Later, Egress Fees: Google’s Brexit from Data Transfer Charges</li>
<li>The Keys to the Cosmos: Azure Unlocks Customer Control</li>
<li>Breaking Up is Hard to Do: Google Splits LLM Inference for Better Performance</li>
<li>OpenAI and Microsoft: From Exclusive to It’s Complicated </li>
<li>Google’s New Model Has Trust Issues (And That’s a Good Thing)</li>
<li>Mac to the Future: AWS Brings M4 Power to the Cloud</li>
<li>Oracle’s Cloud Nine: Stock Soars on Half-Trillion Dollar Dreams</li>
<li>ChatGPT: From Chat Bot to Hat Bot (Everyone’s Wearing Different Professional Hats)</li>
<li>Five Billion Reasons to Love British AI</li>
<li>NVMe Gonna Give You Up: AWS Delivers the Storage Metrics You’ve Been Missing</li>
<li>Tea and AI: OpenAI Crosses the Pond</li>
<li>The Norway Bug Strikes Back: A New YAML Hope
</li>
</ul>
<p>A big thanks to this week’s sponsor:</p>
<p>We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our Slack channel for more info.</p>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>01:33 <a href="https://www.geekwire.com/2025/a-new-deal-for-microsoft-and-openai-reading-between-the-lines-of-their-secretive-agreement/?utm_source=GeekWire+Newsletters&amp;utm_campaign=db0ae6c088-daily-digest-email&amp;utm_medium=email&amp;utm_term=0_4e93fc7dfd-db0ae6c088-233353605&amp;mc_cid=db0ae6c088&amp;mc_eid=04fad859c0">Microsoft and OpenAI make a deal: Reading between the lines of their </a><a href="https://www.geekwire.com/2025/a-new-deal-for-microsoft-and-openai-reading-between-the-lines-of-their-secretive-agreement/?utm_source=GeekWire+Newsletters&amp;utm_campaign=db0ae6c088-daily-digest-email&amp;utm_medium=email&amp;utm_term=0_4e93fc7dfd-db0ae6c088-233353605&amp;mc_cid=db0ae6c088&amp;mc_eid=04fad859c0">secretive new agreement – GeekWire</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.microsoft.com/en-us">Microsoft</a> and <a href="https://openai.com/">OpenAI</a> have signed a non-binding memorandum of understanding that will restructure their partnership, with OpenAI’s nonprofit entity receiving an equity stake exceeding $100 billion in a new public benefit corporation where Microsoft will play a major role.</li>
<li style="font-weight:400;">The deal addresses the AGI clause that previously allowed OpenAI to unilaterally dissolve the partnership upon achieving artificial general intelligence, which had been a significant risk for Microsoft’s multi-billion-dollar investment.</li>
<li style="font-weight:400;">Both companies are diversifying their partnerships – Microsoft is now using Anthropic’s technology for some Office 365 AI features, while OpenAI has signed a $300 billion computing contract with Oracle over five years.</li>
<li style="font-weight:400;">Microsoft’s exclusivity on OpenAI cloud workloads has been replaced with a right of first refusal, enabling OpenAI to participate in the $500 billion Stargate AI project with Oracle and other partners.</li>
<li style="font-weight:400;">The restructuring allows OpenAI to raise capital for its mission while ensuring the nonprofit’s resources grow proportionally, with plans to use funds for community impact, including a recently launched $50 million grant program.</li>
</ul>
<p>ALSO:</p>
<p><a href="https://arstechnica.com/ai/2025/09/openai-and-microsoft-sign-preliminary-deal-to-revise-partnership-terms/">OpenAI and Microsoft sign preliminary deal to revise partnership terms – </a><a></a></p>
<h3>Chapters</h3>
<ul><li>(00:00:00) - The Cloud Pod</li><li>(00:00:34) - Microsoft and OpenAI Restructuring</li><li>(00:06:55) - OpenAI's ChatGPT 5.0 Update</li><li>(00:12:33) - ChatGPT: How People Are Using the Technology</li><li>(00:16:33) - OpenAI's Stargate UK Announcement</li><li>(00:18:24) - LocalStack for Mac: New Instances Launch</li><li>(00:25:06) - Amazon EC2: More NVME Performance Metrics with EFA</li><li>(00:26:43) - AWS Launches R8GN</li><li>(00:28:20) -  AWS CDK Preview: Refactoring with Cloudformation</li><li>(00:29:59) - Amazon CloudTrail: AI Security Analysis with a McP Server</li><li>(00:33:44) - Amazon Web Services: Cloud Commitment Insurance</li><li>(00:35:37) - Google Cloud Launches Multi-Cloud Data Transfer Essentials</li><li>(00:40:13) - Kubernetes 1.34</li><li>(00:44:17) - Google Cloud introduces new recipe for disaggregated AI Inferance</li><li>(00:46:47) - Google's Data Science Agent Now Generates Code for BigQuery,</li><li>(00:49:09) - Google Cloud Launches DNS Armor to Detect Cyberthreats</li><li>(00:52:02) - Google's Agent Payments Protocol (AP2)</li><li>(00:54:32) - Google Cloud: Alloy DB on C4</li><li>(00:56:42) - Google Cloud Trace now supports Open telemetry protocol (OTEL)</li><li>(01:00:19) - Google's New 'Practical Guide to Data Science'</li><li>(01:02:26) - Vault Gemma: The First Large Language Model with Privacy</li><li>(01:06:05) - Customer Managed Keys</li><li>(01:12:39) - Azure Logic Apps: Model Context Protocol Server (MCP)</li><li>(01:14:46) - Microsoft's Kubernetes Storage v2</li><li>(01:16:46) - Microsoft Fabric and AI Foundry: New Features, New Features</li><li>(01:18:50) - Oracle Stock Jumping On Cloud Revenue Forecast</li><li>(01:22:40) - Week in the Cloud: September 7, 2017</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 322 of The Cloud Pod, where the forecast is always cloudy! We have BIG NEWS – Jonathan is back! He’s joined in the studio by Justin and Ryan to bring you all the latest in cloud and AI news, including ongoing drama in the Microsoft/OpenAI drama, saying goodbye to data transfer fees (in the EU), M4 Power, and more. Let’s get started!  
Titles we almost went with this week


EU Later, Egress Fees: Google’s Brexit from Data Transfer Charges
The Keys to the Cosmos: Azure Unlocks Customer Control
Breaking Up is Hard to Do: Google Splits LLM Inference for Better Performance
OpenAI and Microsoft: From Exclusive to It’s Complicated 
Google’s New Model Has Trust Issues (And That’s a Good Thing)
Mac to the Future: AWS Brings M4 Power to the Cloud
Oracle’s Cloud Nine: Stock Soars on Half-Trillion Dollar Dreams
ChatGPT: From Chat Bot to Hat Bot (Everyone’s Wearing Different Professional Hats)
Five Billion Reasons to Love British AI
NVMe Gonna Give You Up: AWS Delivers the Storage Metrics You’ve Been Missing
Tea and AI: OpenAI Crosses the Pond
The Norway Bug Strikes Back: A New YAML Hope


A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our Slack channel for more info.
AI Is Going Great – Or How ML Makes Money 
01:33 Microsoft and OpenAI make a deal: Reading between the lines of their secretive new agreement – GeekWire

Microsoft and OpenAI have signed a non-binding memorandum of understanding that will restructure their partnership, with OpenAI’s nonprofit entity receiving an equity stake exceeding $100 billion in a new public benefit corporation where Microsoft will play a major role.
The deal addresses the AGI clause that previously allowed OpenAI to unilaterally dissolve the partnership upon achieving artificial general intelligence, which had been a significant risk for Microsoft’s multi-billion-dollar investment.
Both companies are diversifying their partnerships – Microsoft is now using Anthropic’s technology for some Office 365 AI features, while OpenAI has signed a $300 billion computing contract with Oracle over five years.
Microsoft’s exclusivity on OpenAI cloud workloads has been replaced with a right of first refusal, enabling OpenAI to participate in the $500 billion Stargate AI project with Oracle and other partners.
The restructuring allows OpenAI to raise capital for its mission while ensuring the nonprofit’s resources grow proportionally, with plans to use funds for community impact, including a recently launched $50 million grant program.

ALSO:
OpenAI and Microsoft sign preliminary deal to revise partnership terms – ]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[322: Did OpenAI and Microsoft Break Up? It’s Complicated…]]>
                </itunes:title>
                                    <itunes:episode>322</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<h3>Welcome to episode 322 of The Cloud Pod, where the forecast is always cloudy! We have BIG NEWS – Jonathan is back! He’s joined in the studio by Justin and Ryan to bring you all the latest in cloud and AI news, including ongoing drama in the Microsoft/OpenAI drama, saying goodbye to data transfer fees (in the EU), M4 Power, and more. Let’s get started!  </h3>
<h3>Titles we almost went with this week
</h3>
<ul>
<li>EU Later, Egress Fees: Google’s Brexit from Data Transfer Charges</li>
<li>The Keys to the Cosmos: Azure Unlocks Customer Control</li>
<li>Breaking Up is Hard to Do: Google Splits LLM Inference for Better Performance</li>
<li>OpenAI and Microsoft: From Exclusive to It’s Complicated </li>
<li>Google’s New Model Has Trust Issues (And That’s a Good Thing)</li>
<li>Mac to the Future: AWS Brings M4 Power to the Cloud</li>
<li>Oracle’s Cloud Nine: Stock Soars on Half-Trillion Dollar Dreams</li>
<li>ChatGPT: From Chat Bot to Hat Bot (Everyone’s Wearing Different Professional Hats)</li>
<li>Five Billion Reasons to Love British AI</li>
<li>NVMe Gonna Give You Up: AWS Delivers the Storage Metrics You’ve Been Missing</li>
<li>Tea and AI: OpenAI Crosses the Pond</li>
<li>The Norway Bug Strikes Back: A New YAML Hope
</li>
</ul>
<p>A big thanks to this week’s sponsor:</p>
<p>We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our Slack channel for more info.</p>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>01:33 <a href="https://www.geekwire.com/2025/a-new-deal-for-microsoft-and-openai-reading-between-the-lines-of-their-secretive-agreement/?utm_source=GeekWire+Newsletters&amp;utm_campaign=db0ae6c088-daily-digest-email&amp;utm_medium=email&amp;utm_term=0_4e93fc7dfd-db0ae6c088-233353605&amp;mc_cid=db0ae6c088&amp;mc_eid=04fad859c0">Microsoft and OpenAI make a deal: Reading between the lines of their </a><a href="https://www.geekwire.com/2025/a-new-deal-for-microsoft-and-openai-reading-between-the-lines-of-their-secretive-agreement/?utm_source=GeekWire+Newsletters&amp;utm_campaign=db0ae6c088-daily-digest-email&amp;utm_medium=email&amp;utm_term=0_4e93fc7dfd-db0ae6c088-233353605&amp;mc_cid=db0ae6c088&amp;mc_eid=04fad859c0">secretive new agreement – GeekWire</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.microsoft.com/en-us">Microsoft</a> and <a href="https://openai.com/">OpenAI</a> have signed a non-binding memorandum of understanding that will restructure their partnership, with OpenAI’s nonprofit entity receiving an equity stake exceeding $100 billion in a new public benefit corporation where Microsoft will play a major role.</li>
<li style="font-weight:400;">The deal addresses the AGI clause that previously allowed OpenAI to unilaterally dissolve the partnership upon achieving artificial general intelligence, which had been a significant risk for Microsoft’s multi-billion-dollar investment.</li>
<li style="font-weight:400;">Both companies are diversifying their partnerships – Microsoft is now using Anthropic’s technology for some Office 365 AI features, while OpenAI has signed a $300 billion computing contract with Oracle over five years.</li>
<li style="font-weight:400;">Microsoft’s exclusivity on OpenAI cloud workloads has been replaced with a right of first refusal, enabling OpenAI to participate in the $500 billion Stargate AI project with Oracle and other partners.</li>
<li style="font-weight:400;">The restructuring allows OpenAI to raise capital for its mission while ensuring the nonprofit’s resources grow proportionally, with plans to use funds for community impact, including a recently launched $50 million grant program.</li>
</ul>
<p>ALSO:</p>
<p><a href="https://arstechnica.com/ai/2025/09/openai-and-microsoft-sign-preliminary-deal-to-revise-partnership-terms/">OpenAI and Microsoft sign preliminary deal to revise partnership terms – </a><a href="https://arstechnica.com/ai/2025/09/openai-and-microsoft-sign-preliminary-deal-to-revise-partnership-terms/">Ars Technica</a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/">OpenAI</a> and <a href="https://www.microsoft.com/en-us">Microsoft</a> <a href="https://blogs.microsoft.com/blog/2025/09/11/a-joint-statement-from-microsoft-and-openai/">signed a non-binding memorandum of understanding</a> to revise their partnership terms, requiring formal contract finalization as OpenAI transitions from nonprofit to for-profit structure, with Microsoft holding over $13 billion in investments.</li>
<li style="font-weight:400;">The partnership revision addresses growing competition between the companies for AI customers and OpenAI’s need for compute capacity beyond what Microsoft Azure can currently provide, leading OpenAI to explore additional cloud partnerships.</li>
<li style="font-weight:400;">Contract complications include provisions that would restrict Microsoft’s access to OpenAI technology once AGI is achieved, now defined by both companies as AI systems generating at least $100 billion in profit rather than technical capabilities.</li>
<li style="font-weight:400;">OpenAI abandoned its original full for-profit conversion plan after regulatory pressure and lawsuits from Elon Musk, who argues the shift violates OpenAI’s founding nonprofit mission to benefit humanity.</li>
<li style="font-weight:400;">This restructuring impacts cloud infrastructure planning as hyperscalers must balance exclusive partnerships against the reality that leading AI companies need multi-cloud strategies to meet their massive compute demands.</li>
</ul>
<p>02:59  Justin – “I’m not convinced that we can get to true AGI with the way that we’re building these models. I think there’s things that could lead us to breakthroughs that would get us to AGI, but the transformer model, and the way we do this, and predictive text, is not AGI. As good as you can be at predicting things, doesn’t mean you can have conscious thought.” </p>
<p>07:45 <a href="https://openai.com/index/introducing-upgrades-to-codex">Introducing Upgrades to Codex</a></p>
<ul>
<li style="font-weight:400;">OpenAI upgraded <a href="https://openai.com/index/introducing-codex/">Codex</a> to better translate natural language into code with improvements in handling complex programming tasks, edge cases, and expanded multi-language support. </li>
<li style="font-weight:400;">This enhances developer productivity in cloud-native applications where rapid prototyping and automation are essential.</li>
<li style="font-weight:400;">The architecture changes and training data updates enable more accurate code generation, which could reduce development time for cloud infrastructure automation scripts, API integrations, and serverless function creation.</li>
<li style="font-weight:400;">Enhanced Codex capabilities directly benefit cloud developers by automating repetitive coding tasks like writing boilerplate code for cloud service integrations, database queries, and deployment configurations.</li>
<li style="font-weight:400;">The improved edge case handling makes Codex more reliable for production use cases, potentially enabling automated code generation for cloud monitoring scripts, data pipeline creation, and infrastructure-as-code templates.</li>
<li style="font-weight:400;">These upgrades position Codex as a practical tool for accelerating cloud application development, particularly for teams building microservices, implementing CI/CD pipelines, or managing multi-cloud deployments.</li>
</ul>
<p>10:14  Jonathan – “I think Codex is probably better at some classes of coding. I think it’s great at React; you want to build a UI, use Codex and use OpenAI stuff. You want to build a backend app written in C or Python or something else? I’d use Claude Code. There seem to be different focuses.”</p>
<p>13:24 <a href="https://openai.com/index/how-people-are-using-chatgpt">How people are using ChatGPT</a></p>
<ul>
<li style="font-weight:400;">OpenAI’s analysis reveals <a href="https://www.bing.com/ck/a?!&amp;&amp;p=fc7927e7cdf17a4c53519c0acd90b3b0a1826dc05f35bab60e46deb84504ca1aJmltdHM9MTc1ODU4NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=ChatGPT&amp;u=a1aHR0cHM6Ly9jaGF0Z3B0LmNvbS8">ChatGPT</a> usage patterns across diverse professional domains, with significant adoption in software development, content creation, education, and business operations, demonstrating the technology’s broad applicability beyond initial expectations.</li>
<li style="font-weight:400;">The data shows developers using ChatGPT for code generation, debugging, and documentation tasks, while educators leverage it for lesson planning and personalized learning experiences, indicating practical integration into existing cloud-based workflows.</li>
<li style="font-weight:400;">Business users report productivity gains through automated report generation, data analysis assistance, and customer service applications, suggesting potential for deeper integration with cloud platforms and enterprise systems.</li>
<li style="font-weight:400;">Usage patterns highlight the need for cloud providers to optimize infrastructure for conversational AI workloads, including considerations for API rate limits, response latency, and cost management for high-volume applications.</li>
<li style="font-weight:400;">The findings underscore growing demand for AI-powered tools in cloud environments, with implications for platform providers to develop specialized services for LLM deployment, fine-tuning, and integration with existing cloud services.</li>
</ul>
<p>14:51  Jonathan – “I wish it was more detailed; like how many people are talking to it like it’s a person? How many people are doing nonsense (like on) Reddit?”</p>
<p>17:42 <a href="https://openai.com/index/introducing-stargate-uk/">Introducing Stargate UK</a></p>
<ul>
<li style="font-weight:400;">OpenAI’s <a href="https://www.bing.com/ck/a?!&amp;&amp;p=298563b6c9f44d83b35f12a1cdc0d56cee978072ce8cddc783eb1dc40704b24bJmltdHM9MTc1ODU4NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=Stargate+UK&amp;u=a1aHR0cHM6Ly9vcGVuYWkuY29tL2luZGV4L2ludHJvZHVjaW5nLXN0YXJnYXRlLXVrLw">Stargate UK</a> appears to be a regional deployment or infrastructure expansion focused on the UK market, potentially offering localized AI services with reduced latency and compliance with UK data sovereignty requirements.</li>
<li style="font-weight:400;">This development suggests OpenAI is building dedicated cloud infrastructure in the UK, which could enable faster API response times for European customers and address GDPR compliance needs for AI workloads.</li>
<li style="font-weight:400;">The UK-specific deployment may include region-locked models or features tailored to British English and UK-specific use cases, similar to how cloud providers offer region-specific services.</li>
<li style="font-weight:400;">For businesses, this could mean the ability to keep AI processing and data within UK borders, addressing regulatory requirements for financial services, healthcare, and government sectors that require data localization.</li>
<li style="font-weight:400;">The move indicates a broader trend of AI companies following traditional cloud provider patterns by establishing regional presence to meet performance, compliance, and data residency demands.</li>
</ul>
<p>18:19  Justin – “I mean, we already have a GPU shortage, so to now make a regionalized need for AI is going to further strain the GPU capacity issues, and so I should probably buy some Nvidia stuff.”</p>
<h2>AWS</h2>
<p>19:37 <a href="https://aws.amazon.com/blogs/aws/announcing-amazon-ec2-m4-and-m4-pro-mac-instances/">Announcing Amazon EC2 M4 and M4 Pro Mac instances | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;">AWS launches <a href="https://aws.amazon.com/ec2/instance-types/mac/">EC2 M4 and M4 Pro Mac instances</a> built on <a href="https://www.apple.com/mac-mini/specs/">Apple M4 Mac mini</a> hardware, offering up to 20% better build performance than M2 instances with 24GB unified memory for standard M4 and 48GB for M4 Pro variants.</li>
<li style="font-weight:400;">Each instance includes 2TB of local SSD storage for improved caching and build performance, though this storage is ephemeral and tied to the instance lifecycle rather than the dedicated host.</li>
<li style="font-weight:400;">The instances integrate with AWS services like <a href="https://www.bing.com/ck/a?!&amp;&amp;p=c94fd93deda8e976fef2ac6bb29b8d66692b00cd9edbcfa07ea8e8ea611ed874JmltdHM9MTc1ODU4NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=CodeBuild&amp;u=a1aHR0cHM6Ly9kb2NzLmF3cy5hbWF6b24uY29tL2NvZGVidWlsZC9sYXRlc3QvdXNlcmd1aWRlL3dlbGNvbWUuaHRtbA">CodeBuild</a>, <a href="https://www.bing.com/ck/a?!&amp;&amp;p=b544481873f7b0fbf2a608cdff85ce909699ff1bc270af11841beda87636b55aJmltdHM9MTc1ODU4NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=CodePipeline&amp;u=a1aHR0cHM6Ly9hd3MuYW1hem9uLmNvbS9jb2RlcGlwZWxpbmUv">CodePipeline</a>, and Secrets Manager for CI/CD workflows, while supporting <a href="https://www.bing.com/ck/a?!&amp;&amp;p=07ca781d41edaab5012ec203bc2017fc95bf5024b8fd62824a7f58d69d3d4e41JmltdHM9MTc1ODU4NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=macOS+Sequoia+15.6&amp;u=a1aHR0cHM6Ly9zdXBwb3J0LmFwcGxlLmNvbS9lbi11cy8xMjAyODM">macOS Sequoia 15.6</a> and later with up to 10 Gbps VPC and 8 Gbps EBS bandwidth through Thunderbolt connections.</li>
<li style="font-weight:400;">Pricing follows the standard EC2 Mac model with a 24-hour minimum allocation period on dedicated hosts, available through On-Demand and Savings Plans in US East and US West regions initially.</li>
<li style="font-weight:400;">Beyond iOS/macOS development, the 16-core Neural Engine makes these instances suitable for ML inference workloads, expanding their use cases beyond traditional Apple platform development.</li>
</ul>
<p>22:00 <a href="https://aws.amazon.com/blogs/aws/accelerate-serverless-testing-with-localstack-integration-in-vs-code-ide/">Accelerate serverless testing with LocalStack integration in VS Code IDE | </a><a href="https://aws.amazon.com/blogs/aws/accelerate-serverless-testing-with-localstack-integration-in-vs-code-ide/">AWS News Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.bing.com/ck/a?!&amp;&amp;p=37453f646d5613ae6fefb0f1b1ccbef0ac8f323466ae34dae6d5b8028d7d6dd4JmltdHM9MTc1ODU4NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=AWS+Toolkit+for+VS+Code&amp;u=a1aHR0cHM6Ly9hd3MuYW1hem9uLmNvbS92aXN1YWxzdHVkaW9jb2RlLw">AWS Toolkit for VS Code</a> now integrates with <a href="https://localstack.cloud/">LocalStack</a>, enabling developers to test serverless applications locally without switching between tools or managing complex configurations. </li>
<li style="font-weight:400;">The integration allows direct connection to LocalStack endpoints for emulating services like Lambda, SQS, EventBridge, and DynamoDB.</li>
<li style="font-weight:400;">This addresses a key gap in serverless development workflows where AWS SAM CLI handles unit testing well, but developers need better solutions for local integration testing of multi-service architectures. Previously, LocalStack required standalone management and manual endpoint configuration.</li>
<li style="font-weight:400;">The integration provides a tiered testing approach: LocalStack for early development without IAM/VPC complexity, then transition to cloud-based testing with remote debugging when needed. Developers can deploy stacks locally using familiar sam deploy commands with a LocalStack profile.</li>
<li style="font-weight:400;">Available in AWS Toolkit v3.74.0 across all commercial AWS Regions, the LocalStack Free tier covers core services with no additional AWS costs. Paid LocalStack tiers offer expanded service coverage for teams needing broader emulation capabilities.</li>
<li style="font-weight:400;">The feature continues AWS’s push to make VS Code the primary serverless development environment, building on recent console-to-IDE integration and remote debugging capabilities launched in July 2025.</li>
</ul>
<p>23:05  Ryan – “It’s interesting; it’s one of those things where I’ve been able to deal with the complexity, so didn’t realize the size of the gap, but I can see how a developer, without infrastructure knowledge, might struggle a little bit.” </p>
<p>26:38 <a href="https://aws.amazon.com/about-aws/whats-new/2025/09/amazon-ec2-detailed-performance-stats-nvme-local-volumes/">Amazon EC2 supports detailed performance stats on all NVMe local </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/09/amazon-ec2-detailed-performance-stats-nvme-local-volumes/">volumes</a></p>
<ul>
<li style="font-weight:400;">EC2 now provides 11 detailed performance metrics for instance store NVMe volumes at one-second granularity, including IOPS, throughput, queue length, and latency histograms broken down by IO size – matching the monitoring capabilities previously only available for EBS volumes.</li>
<li style="font-weight:400;">This feature addresses a significant monitoring gap for workloads using local NVMe storage on Nitro-based instances, enabling teams to troubleshoot performance issues and optimize IO patterns without additional tooling or cost.</li>
<li style="font-weight:400;">The latency histograms by IO size provide granular insights that help identify whether performance bottlenecks are related to small random reads, large sequential writes, or specific IO patterns in database and analytics workloads.</li>
<li style="font-weight:400;">Available by default on all Nitro-based EC2 instances with local NVMe storage across all AWS regions at no additional charge, making it immediately accessible for existing deployments.</li>
<li style="font-weight:400;">This brings feature parity between ephemeral instance store and persistent EBS storage monitoring, simplifying operations for hybrid storage architectures that use both storage types</li>
</ul>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2025/09/aws-efa-metrics-improved-observability-networking">New EFA metrics for improved observability of AWS networking</a></p>
<ul>
<li style="font-weight:400;">AWS adds five new <a href="https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/efa-working-monitor.html">Elastic Fabric Adapter metrics</a> to help diagnose network performance issues in AI/ML and HPC workloads by tracking retransmitted packets, timeout events, and unresponsive connections.</li>
<li style="font-weight:400;">The metrics are stored as counters in the sys filesystem and can be integrated with Prometheus and Grafana for monitoring dashboards and alerting, addressing the observability gap for high-performance networking workloads.</li>
<li style="font-weight:400;">Available only on <a href="https://docs.aws.amazon.com/ec2/latest/instancetypes/ec2-nitro-instances.html">Nitro v4</a> and later instances with EFA installer 1.43.0+, this targets customers running distributed training or tightly-coupled HPC applications where network performance directly impacts job completion times.</li>
<li style="font-weight:400;">These device-level counters help identify whether performance degradation stems from network congestion or instance misconfiguration, enabling faster troubleshooting for workloads that can cost thousands per hour.</li>
<li style="font-weight:400;">The feature arrives as AWS faces increased competition in AI infrastructure from specialized providers, making network observability critical for customers deciding between cloud and on-premises deployments for large-scale training.</li>
</ul>
<p>27:37  Jonathan – “That’s cool, it’s great that it’s local and it’s not through CloudWatch at .50 cents a metric per however long.”</p>
<p>28:19 <a href="https://aws.amazon.com/about-aws/whats-new/2025/09/generally-available-amazon-ec2-r8gn-instances">Now generally available: Amazon EC2 R8gn instances</a></p>
<ul>
<li style="font-weight:400;">AWS launches <a href="https://aws.amazon.com/ec2/instance-types/r8g">R8gn instances</a> powered by <a href="https://www.bing.com/ck/a?!&amp;&amp;p=4ed1b46b0fdce5ebe36a0af20e37fb263aa14ff03082093fd35c064401b0a991JmltdHM9MTc1ODU4NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=Graviton4+processors&amp;u=a1aHR0cHM6Ly9hd3MuYW1hem9uLmNvbS9lYzIvZ3Jhdml0b24v">Graviton4 processors</a>, delivering 30% better compute performance than Graviton3 and featuring up to 600 Gbps network bandwidth – the highest among network-optimized EC2 instances.</li>
<li style="font-weight:400;">These memory-optimized instances scale up to 48xlarge with 1,536 GiB RAM and 60 Gbps EBS bandwidth, targeting network-intensive workloads like SQL/NoSQL databases and in-memory computing applications.</li>
<li style="font-weight:400;">R8gn instances support Elastic Fabric Adapter (EFA) on larger sizes (16xlarge and up), enabling lower latency for tightly coupled HPC clusters and distributed computing workloads.</li>
<li style="font-weight:400;">Currently available only in US East (N. Virginia) and US West (Oregon) regions, with metal sizes restricted to N. Virginia – suggesting a phased rollout approach for this new instance family.</li>
<li style="font-weight:400;">The combination of Graviton4 processors and 6th-generation Nitro Cards positions R8gn as AWS’s premium offering for customers needing both high memory capacity and extreme network performance in a single instance type.</li>
</ul>
<p>29:18  Jonathan – “That’s what you need for VLM clustering across multiple machines. That’s fantastic.”</p>
<p>29:55 <a href="https://aws.amazon.com/about-aws/whats-new/2025/09/aws-cdk-refactor-preview/">Introducing AWS CDK Refactor (Preview)</a></p>
<ul>
<li style="font-weight:400;">AWS CDK now includes a ‘cdk refactor’ command in preview that enables safe infrastructure reorganization by preserving deployed resource states when renaming constructs or moving resources between stacks. This addresses a long-standing pain point where code restructuring could accidentally trigger resource replacement and potential downtime.</li>
<li style="font-weight:400;">The feature leverages AWS CloudFormation’s refactor capabilities with automated mapping computation to maintain logical ID consistency during architectural changes. This allows teams to break down monolithic stacks, implement inheritance patterns, or upgrade to higher-level constructs without complex migration procedures.</li>
<li style="font-weight:400;">Real-world impact includes enabling continuous infrastructure code evolution for production environments without service disruption. Teams can now confidently refactor their CDK applications to improve maintainability and adopt best practices without risking stateful resources like databases or S3 buckets.</li>
<li style="font-weight:400;">The feature is available in all AWS regions where CDK is supported, with no additional cost beyond standard CloudFormation usage. Documentation and a detailed walkthrough are available at docs.aws.amazon.com/cdk/v2/guide/refactor.html.</li>
<li style="font-weight:400;">This development matters for AWS customers managing complex infrastructure as code deployments who previously had to choose between maintaining technical debt or risking production stability during refactoring operations.</li>
</ul>
<p>30:56  Ryan – “It’s interesting, I want to see – because how it works is key, right? Because in Terraform, you can do this, it’s just clunky and hard. And so I’m hoping that this is a little smoother. I don’t use CDK enough to really know how it structures.”</p>
<p>31:36 <a href="https://aws.amazon.com/about-aws/whats-new/2025/09/aws-cloudtrail-mcp-server-enhanced-security-analysis/">AWS launches CloudTrail MCP Server for enhanced security analysis</a></p>
<ul>
<li style="font-weight:400;">AWS introduces a Model Context Protocol (MCP) server for <a href="https://docs.aws.amazon.com/awscloudtrail/latest/userguide/cloudtrail-user-guide.html">CloudTrail</a> that enables AI agents to analyze security events and user activities through natural language queries instead of traditional API calls.</li>
<li style="font-weight:400;">The MCP server provides access to 90-day management event histories via <a href="https://docs.aws.amazon.com/awscloudtrail/latest/APIReference/API_LookupEvents.html">LookupEvents API</a> and up to 10 years of data through CloudTrail Lake using Trino SQL queries, streamlining security investigations and compliance workflows.</li>
<li style="font-weight:400;">This open-source integration (available at github.com/awslabs/mcp/tree/main/src/cloudtrail-mcp-server) allows organizations to leverage existing AI assistants for security analysis without building custom API integrations.</li>
<li style="font-weight:400;">The service is available in all regions supporting CloudTrail LookupEvents API or CloudTrail Lake, with costs based on standard CloudTrail pricing for event lookups and Lake queries.</li>
<li style="font-weight:400;">Key use cases include automated security incident investigation, compliance auditing through conversational interfaces, and simplified access to CloudTrail data for teams without deep AWS API knowledge.</li>
</ul>
<p>32:23  Ryan – “This is fantastic, just because it’s so tricky to sort of structure queries in whatever SQL language to get the data you want. And being able to phrase things in natural language has really made security operations just completely simpler.”</p>
<h2>GCP</h2>
<p>36:35 <a href="https://cloud.google.com/blog/products/networking/new-for-the-uk-and-eu-no-cost-multicloud-data-transfer-essentials/">New for the U.K. and EU: No-cost, multicloud Data Transfer Essentials | </a><a href="https://cloud.google.com/blog/products/networking/new-for-the-uk-and-eu-no-cost-multicloud-data-transfer-essentials/">Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google Cloud launches <a href="https://cloud.google.com/data-transfer-essentials/docs/overview">Data Transfer Essentials</a>, a no-cost service for EU and UK customers to transfer data between Google Cloud and other cloud providers for multicloud workloads. </li>
<li style="font-weight:400;">The service meets <a href="https://cloud.google.com/blog/products/identity-security/navigating-the-eu-ai-act-google-clouds-proactive-approach">EU Data Act</a> requirements for cloud interoperability, while Google chooses not to pass on costs to customers, despite the Act allowing it.</li>
<li style="font-weight:400;">Data Transfer Essentials targets organizations running parallel workloads across multiple clouds, enabling them to process data without incurring Google Cloud egress fees. </li>
<li style="font-weight:400;">Customers must opt-in and configure their multicloud traffic, which will appear as zero-charge line items on bills while non-qualifying traffic continues at standard Network Service Tier rates.</li>
<li style="font-weight:400;">This positions Google Cloud ahead of competitors on multicloud data transfer costs, as AWS and Azure still charge significant egress fees for cross-cloud transfers. </li>
<li style="font-weight:400;">The service builds on Google’s previous moves, like waiving exit fees entirely and launching <a href="https://www.bing.com/ck/a?!&amp;&amp;p=53b529bf743e5631cb41ce4360c66a26f7f0fc0c6cf7092296bb9f561a367f2cJmltdHM9MTc1ODU4NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=BigQuery+Omni&amp;u=a1aHR0cHM6Ly9jbG91ZC5nb29nbGUuY29tL2JpZ3F1ZXJ5L2RvY3Mvb21uaS1pbnRyb2R1Y3Rpb24">BigQuery Omni</a> for multicloud <a href="https://cloud.google.com/blog/products/data-analytics/introducing-bigquery-omni">data warehousing</a>.</li>
<li style="font-weight:400;">Key use cases include distributed analytics workloads, multi-region disaster recovery setups, and organizations using best-of-breed services across different clouds. </li>
<li style="font-weight:400;">Financial services and healthcare companies with strict data residency requirements could benefit from cost-free data movement between clouds.</li>
<li style="font-weight:400;">The service requires manual configuration through Google’s guide to designate qualifying multicloud traffic, adding operational overhead compared to standard networking. </li>
<li style="font-weight:400;">Organizations must ensure traffic genuinely serves multicloud workloads to be eligible for zero-cost transfers.</li>
</ul>
<p>41:13 <a href="https://opensource.googleblog.com/2025/09/kubernetes-134-is-available-on-gke.html">Kubernetes 1.34 is available on GKE! | Google Open Source Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.34.md">Kubernetes 1.34</a> brings Dynamic Resource Allocation (DRA) to GA, finally giving production-ready support for better GPU, TPU, and specialized hardware management – a critical feature for AI/ML workloads that need precise resource allocation and sharing.</li>
<li style="font-weight:400;">The introduction of KYAML addresses the infamous “Norway Bug” and YAML’s whitespace nightmares by enforcing stricter parsing rules while remaining compatible with existing parsers – just set KUBECTL_KYAML=true to avoid those frustrating debugging sessions from stray spaces.</li>
<li style="font-weight:400;">Pod-level resource limits (now beta) simplify multi-container resource management by letting you set a total resource budget for the entire pod instead of juggling individual container limits, with pod-level settings taking precedence when both are defined.</li>
<li style="font-weight:400;">Several stability improvements landed, including ordered namespace deletion for security (preventing NetworkPolicy removal before pods), streaming LIST responses to reduce API server memory pressure in large clusters, and resilient watch cache initialization to prevent thundering herd scenarios.</li>
<li style="font-weight:400;">GKE’s rapid channel delivered this release just 5 days after the OSS release, showcasing Google’s commitment to keeping its managed Kubernetes service current with upstream developments.</li>
</ul>
<p>42:57  Jonathan- “I like to think of it as fixing a problem with JSON, rather than fixing a problem with YAML, because what it looks like is JSON, but now you can have comments – inline comments, like you could always do with YAML.”</p>
<p>45:22 <a href="https://cloud.google.com/blog/products/compute/ai-inference-recipe-using-nvidia-dynamo-with-ai-hypercomputer/">AI Inference recipe using NVIDIA Dynamo with AI Hypercomputer | Google </a><a href="https://cloud.google.com/blog/products/compute/ai-inference-recipe-using-nvidia-dynamo-with-ai-hypercomputer/">Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google Cloud introduces a new recipe for disaggregated AI inference using <a href="https://www.nvidia.com/en-us/ai/dynamo/">NVIDIA Dynamo</a> on <a href="https://cloud.google.com/solutions/ai-hypercomputer">AI Hypercomputer</a>, which physically separates the prefill (prompt processing) and decode (token generation) phases of LLM inference across different GPU pools to improve performance and reduce costs.</li>
<li style="font-weight:400;">The solution leverages A3 Ultra instances with <a href="https://www.bing.com/ck/a?!&amp;&amp;p=f581706eb805f1441b97a10238405f818dd303b2cebdef3132a9abba3bd80338JmltdHM9MTc1ODU4NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=NVIDIA+H200+GPUs&amp;u=a1aHR0cHM6Ly93d3cucnVucG9kLmlvL2FydGljbGVzL2d1aWRlcy9udmlkaWEtaDIwMC1ncHU">NVIDIA H200 GPUs</a> orchestrated by GKE, with NVIDIA Dynamo acting as the inference server that intelligently routes workloads between specialized GPU pools – one optimized for compute-heavy prefill tasks and another for memory-bound decode operations.</li>
<li style="font-weight:400;">This architecture addresses a fundamental inefficiency in traditional GPU serving, where both inference phases compete for the same resources, causing bottlenecks when long prefill operations block rapid token generation, leading to poor GPU utilization and higher costs.</li>
<li style="font-weight:400;">The recipe supports popular inference engines, including vLLM, SGLang, and TensorRT-LLM, with initial configurations available for single-node (4 GPUs prefill, 4 GPUs decode) and multi-node deployments for models like Llama-3.3-70B-Instruct, available at github.com/AI-Hypercomputer/gpu-recipes.</li>
<li style="font-weight:400;">While AWS and Azure offer various inference optimization techniques, Google’s approach of physically disaggregating inference phases with dedicated GPU pools and intelligent routing represents a distinct architectural approach to solving the compute vs memory bandwidth challenge in LLM serving.</li>
</ul>
<p>46:52  Jonathan – “It’s just like any app, any monolith, where different parts of the monolith get used at different rates, or have different resource requirements. Do you scale the entire monolith up and then have wasted CPU or RAM on some of them? Or do you break it up into different components and optimize for each particular task? And that’s all they’re doing. It’s a pretty good idea.”</p>
<p>47:56 <a href="https://cloud.google.com/blog/products/data-analytics/data-science-agent-now-supports-bigquery-ml-dataframes-and-spark/">Data Science Agent now supports BigQuery ML, DataFrames, and Spark | </a><a href="https://cloud.google.com/blog/products/data-analytics/data-science-agent-now-supports-bigquery-ml-dataframes-and-spark/">Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google’s <a href="https://cloud.google.com/bigquery/docs/colab-data-science-agent">Data Science Agent</a> now generates code for <a href="https://cloud.google.com/bigquery/docs/bqml-introduction">BigQuery ML</a>, <a href="https://cloud.google.com/bigquery/docs/bigquery-dataframes-introduction">BigQuery DataFrames</a>, and <a href="https://cloud.google.com/products/serverless-spark">Apache Spark</a>, enabling users to scale data processing and ML workflows directly on BigQuery infrastructure or distributed Spark clusters by simply including keywords like “BQML”, “BigFrames”, or “PySpark” in prompts.</li>
<li style="font-weight:400;">The agent introduces @ mentions for BigQuery table discovery within the current project and automatic metadata retrieval, allowing users to reference tables directly in prompts without manual navigation – though cross-project searches still require the traditional “+” button interface.</li>
<li style="font-weight:400;">This positions GCP competitively against <a href="https://www.bing.com/ck/a?!&amp;&amp;p=c633fae2c4b5d5c4778db2deeef64bb8de8401369664c1b58907a31f223a966aJmltdHM9MTc1ODU4NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=AWS+SageMaker%27s&amp;u=a1aHR0cHM6Ly9hd3MuYW1hem9uLmNvbS9zYWdlbWFrZXIv">AWS SageMaker’s</a> code generation features and <a href="https://www.bing.com/ck/a?!&amp;&amp;p=7fa17b547cc7ceb612c343e29cdf8583596c937e884fa58084b49ed781d354a4JmltdHM9MTc1ODU4NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=Azure%27s+Copilot+integrations&amp;u=a1aHR0cHM6Ly9sZWFybi5taWNyb3NvZnQuY29tL2VuLXVzL2F6dXJlL2NvcGlsb3Qvb3ZlcnZpZXc">Azure’s Copilot integrations</a> by offering native BigQuery scaling advantages, particularly for organizations already invested in BigQuery’s ecosystem for data warehousing and analytics.</li>
<li style="font-weight:400;">The key limitation is that the agent currently generates only Spark 4.0 code, which may require organizations on earlier Spark versions to upgrade or avoid using the agent for PySpark workflows until backward compatibility is added.</li>
<li style="font-weight:400;">The feature targets data scientists and analysts working with large-scale datasets that exceed single-machine memory limits, with practical applications in forecasting, customer segmentation, and predictive modeling using serverless infrastructure to minimize operational overhead.</li>
</ul>
<p>48:52  Ryan – “This kind of makes me wonder what the data science agent did before this announcement…”</p>
<p>50:18 <a href="https://cloud.google.com/blog/products/identity-security/introducing-dns-armor-to-mitigate-domain-name-system-risks/">Introducing DNS Armor to mitigate domain name system risks | Google </a><a href="https://cloud.google.com/blog/products/identity-security/introducing-dns-armor-to-mitigate-domain-name-system-risks/">Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google Cloud launches <a href="https://cloud.google.com/dns/docs/threat-detection">DNS Armor</a> in preview, partnering with Infoblox to provide DNS-based threat detection that catches malicious domains 68 days earlier than traditional security tools by analyzing over 70 billion DNS events daily.</li>
<li style="font-weight:400;">The service detects command and control server connections, DNS tunneling for data exfiltration, and malware distribution sites using both feed-based detection for known threats and machine learning algorithms for emerging attack patterns.</li>
<li style="font-weight:400;">DNS Armor operates as a fully managed service requiring no VMs, integrates with <a href="https://www.bing.com/ck/a?!&amp;&amp;p=1e8f5a7dfa99527ace0e8a542cb15ff09253ffa1c818df9f97814e0ab19a7283JmltdHM9MTc1ODU4NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=Cloud+Logging&amp;u=a1aHR0cHM6Ly9jbG91ZC5nb29nbGUuY29tL2xvZ2dpbmc">Cloud Logging</a> and <a href="https://www.bing.com/ck/a?!&amp;&amp;p=7ab862414d1a290be93a101e931bef94282b7045d92a8cbe85758f21fbd95569JmltdHM9MTc1ODU4NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=Security+Command+Center&amp;u=a1aHR0cHM6Ly9jbG91ZC5nb29nbGUuY29tL3NlY3VyaXR5L3Byb2R1Y3RzL3NlY3VyaXR5LWNvbW1hbmQtY2VudGVy">Security Command Center</a>, and can be enabled at the project level across VPCs with no performance impact on Cloud DNS.</li>
<li style="font-weight:400;">This positions GCP competitively against AWS Route 53 Resolver DNS Firewall and Azure DNS Private Resolver, offering similar DNS security capabilities but with Infoblox’s threat intelligence that adds 4 million new threat indicators monthly.</li>
<li style="font-weight:400;">Enterprise customers running workloads in GCP gain an additional security layer that addresses the fact that 92% of malware uses DNS for command and control, making this particularly valuable for financial services, healthcare, and other regulated industries.</li>
</ul>
<p>51:16  Ryan – “This is cool. This is one of the harder problems to solve in security is just that there’s so many services where you have to populate DNS entries and then to route traffic to them. And then it can basically be abandoned over time in bit rot. And so then, it can be snatched up by someone else and then abused; this will help you detect that scenario.”</p>
<p>53:13 <a href="https://cloud.google.com/blog/products/ai-machine-learning/announcing-agents-to-payments-ap2-protocol/">Announcing Agent Payments Protocol (AP2) | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google announced <a href="http://goo.gle/ap2">Agent Payments Protocol (AP2)</a>, an open protocol for secure AI agent-led payments that works with<a href="https://a2a-protocol.org/"> A2A and Model Context Protocol</a>, addressing critical gaps in authorization, authenticity, and accountability when AI agents make purchases on behalf of users</li>
<li style="font-weight:400;">The protocol uses cryptographically-signed “Mandates” as tamper-proof digital contracts that create verifiable audit trails for both real-time purchases (human present) and delegated tasks (human not present), solving the trust problem when agents transact autonomously</li>
<li style="font-weight:400;">AP2 supports multiple payment types, including credit cards, stablecoins, and cryptocurrencies, with the A2A x402 extension already providing production-ready crypto payment capabilities in collaboration with Coinbase and Ethereum Foundation</li>
<li style="font-weight:400;">Over 60 major organizations are participating, including American Express, Mastercard, PayPal, Salesforce, and ServiceNow, positioning this as an industry-wide initiative rather than a Google-only solution</li>
<li style="font-weight:400;">The protocol enables new commerce models like automated price monitoring and purchasing, personalized merchant offers through agent-to-agent communication, and coordinated multi-vendor transactions within budget constraints</li>
</ul>
<p>54:26  Jonathan – “This may be the path to the micro payments thing that people have been trying to get off the ground for years. You run a blog or something, and something like this could actually get you the half cent per view that would cover the cost of the server or something.”</p>
<p>55:56 <a href="https://cloud.google.com/blog/products/databases/c4a-axion-processors-for-alloydb-now-ga/">C4A Axion processors for AlloyDB now GA | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/blog/products/compute/google-axion-powers-cloud-sql-and-alloydb?e=48754805">AlloyDB on C4A Axion processors</a> delivers up to 45% better price-performance than N-series VMs for transactional workloads and achieves 3 million transactions per minute, with the new 1 vCPU option cutting entry costs by 50% for development environments.</li>
<li style="font-weight:400;">Google’s custom <a href="https://www.bing.com/ck/a?!&amp;&amp;p=eb1654019e86415d2bb44930ee084aab6cbd6afe5d4139002342368bcc204743JmltdHM9MTc1ODY3MjAwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=ARM-based+Axion+processors&amp;u=a1aHR0cHM6Ly9jbG91ZC5nb29nbGUuY29tL3Byb2R1Y3RzL2F4aW9u">ARM-based Axion processors</a> outperform <a href="https://www.bing.com/ck/a?!&amp;&amp;p=dcb3ca080a8e85caa70274cd129f8e16789fa776881568fdd996235ef480b7ecJmltdHM9MTc1ODY3MjAwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=Amazon%27s+Graviton4&amp;u=a1aHR0cHM6Ly9hd3MuYW1hem9uLmNvbS9ibG9ncy9hd3Mvbm93LWF2YWlsYWJsZS1ncmF2aXRvbjQtcG93ZXJlZC1tZW1vcnktb3B0aW1pemVkLWFtYXpvbi1lYzIteDhnLWluc3RhbmNlcy8">Amazon’s Graviton4</a> offerings by 2x in throughput and 3x in price-performance for PostgreSQL workloads, according to independent Gigaom testing, positioning GCP competitively in the ARM database market.</li>
<li style="font-weight:400;">The addition of a 1 vCPU/8GB memory configuration addresses developer needs for cost-effective sandbox environments, though it lacks uptime SLAs even in HA configurations, while production workloads can scale up to 72 vCPUs with a new 48 vCPU intermediate option.</li>
<li style="font-weight:400;">C4A instances are priced identically to N2 VMs while delivering superior performance, making migration a straightforward cost optimization opportunity for existing <a href="https://www.bing.com/ck/a?!&amp;&amp;p=7ab2f19b2e96b9ed7f2ea31f78cf1cc54c844b17b6819568b513b633510cebedJmltdHM9MTc1ODY3MjAwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=AlloyDB&amp;u=a1aHR0cHM6Ly9jbG91ZC5nb29nbGUuY29tL2FsbG95ZGIvZG9jcy9vdmVydmlldw">AlloyDB</a> customers without pricing penalties.</li>
<li style="font-weight:400;">Limited regional availability in select Google Cloud regions may impact adoption timing, but the GA status signals production readiness for customers already testing in preview who cited both performance gains and cost reductions.</li>
</ul>
<p>58:04 <a href="https://cloud.google.com/blog/products/management-tools/opentelemetry-now-in-google-cloud-observability/">OpenTelemetry now in Google Cloud Observability | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.bing.com/ck/a?!&amp;&amp;p=6dd2fd37456dc52f21d0d157fe8234050a57608efad40a02d66ffd1f170f3496JmltdHM9MTc1ODY3MjAwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=Google+Cloud+Trace&amp;u=a1aHR0cHM6Ly9jbG91ZC5nb29nbGUuY29tL3RyYWNlL2RvY3Mv">Google Cloud Trace</a> now supports <a href="https://cloud.google.com/trace/docs/migrate-to-otlp-endpoints">OpenTelemetry Protocol (OTLP) for trace data</a> ingestion via<a href="http://telemetry.googleapis.com"> telemetry.googleapis.com</a>, enabling vendor-agnostic telemetry pipelines that eliminate the need for Google-specific exporters and preserve the OTel data model during transmission.</li>
<li style="font-weight:400;">The new OTLP endpoint significantly increases storage limits: attribute keys expand from 128 to 512 bytes, values from 256 bytes to 64 KiB, span names from 128 to 1024 bytes, and attributes per span from 32 to 1024, addressing previous limitations for high-volume trace data users.</li>
<li style="font-weight:400;">Cloud Trace’s internal storage now natively utilizes the OpenTelemetry data model and leverages OTel semantic conventions, such as <a href="http://service.name">service.name</a> and <a href="https://opentelemetry.io/docs/concepts/signals/traces/#span-status">span status</a>, in the Trace Explorer UI, thereby improving the user experience for filtering and analyzing traces.</li>
<li style="font-weight:400;">Google positions this as the first step in a broader strategy to support OTLP across all telemetry types (traces, metrics, and logs), with future plans for server-side processing, flexible routing, and unified telemetry management across environments.</li>
<li style="font-weight:400;">Organizations using multi-cloud or hybrid environments benefit from reduced client-side complexity and the ability to easily send telemetry to multiple observability backends without additional exporters or format conversions.</li>
</ul>
<p>1:00:41 <a href="https://blog.google/around-the-globe/google-europe/united-kingdom/waltham-cross-data-centre/">Our new Waltham Cross data center is part of our two-year, £5 billion </a><a href="https://blog.google/around-the-globe/google-europe/united-kingdom/waltham-cross-data-centre/">investment to help power the UK’s AI economy.</a></p>
<ul>
<li style="font-weight:400;">Google is investing £5 billion over two years in UK infrastructure, including a new data center in Waltham Cross, Hertfordshire, to support growing demand for AI services like Google Cloud, Search, and Maps.</li>
<li style="font-weight:400;">The investment encompasses capital expenditure, R&amp;D, and engineering resources, with projections to support 8,250 jobs annually in the UK while strengthening the country’s AI economy.</li>
<li style="font-weight:400;">Google <a href="https://www.googlecloudpresscorner.com/2025-09-16-Google-Opens-Waltham-Cross-Data-Centre-as-Part-of-Two-year-GBP5-Billion-Investment-in-the-UK-to-Help-Power-its-AI-Economy">partnered with Shell</a> to manage its UK carbon-free energy portfolio and deploy battery technology that stores surplus clean energy and feeds it back to the grid during peak demand.</li>
<li style="font-weight:400;">This expansion positions Google to compete more effectively with AWS and Azure in the UK market by providing local infrastructure for AI workloads and reducing latency for UK customers.</li>
<li style="font-weight:400;">The data center will support Google DeepMind’s AI research in science and healthcare, offering UK enterprises and researchers improved access to Google’s AI capabilities and cloud services.</li>
</ul>
<p>1:01:31  Justin – “The Deep Mind AI research is the most obvious reason why they did this.” </p>
<p>1:02:22 <a href="https://cloud.google.com/blog/topics/developers-practitioners/announcing-the-new-practical-guide-to-data-science-on-google-cloud/">Announcing the new Practical Guide to Data Science on Google Cloud | </a><a href="https://cloud.google.com/blog/topics/developers-practitioners/announcing-the-new-practical-guide-to-data-science-on-google-cloud/">Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google released a new ebook called <a href="https://services.google.com/fh/files/misc/a_practical_guide_to_data_science_with_google_cloud.pdf">A Practical Guide to Data Science with Google Cloud</a> that demonstrates how to use <a href="https://cloud.google.com/bigquery">BigQuery</a>, <a href="https://cloud.google.com/vertex-ai">Vertex AI</a>, and <a href="https://cloud.google.com/products/serverless-spark">Serverless Spark</a> together for modern data science workflows.</li>
<li style="font-weight:400;">The guide emphasizes unified workflows through <a href="https://cloud.google.com/colab/docs/introduction">Colab Enterprise notebooks</a> that blend SQL, <a href="https://www.bing.com/ck/a?!&amp;&amp;p=e5bc031193fbd91b0c1a97a5efc38ec3d6d0b8fee8be401bcae46a67970229d3JmltdHM9MTc1ODY3MjAwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=python&amp;u=a1aHR0cHM6Ly93d3cucHl0aG9uLm9yZy9kb3dubG9hZHMv">Python</a>, and <a href="https://cloud.google.com/products/serverless-spark">Spark</a> code in one place, with AI assistive features that generate multi-step plans and code from high-level goals.</li>
<li style="font-weight:400;">Google’s approach allows data scientists to manage structured and unstructured data in one foundation, using familiar SQL syntax to process documents or analyze images directly through BigQuery.</li>
<li style="font-weight:400;">The ebook includes real-world use cases like retail demand forecasting and agricultural risk assessment, with each example linking to executable notebooks for immediate hands-on practice.</li>
<li style="font-weight:400;">This positions Google Cloud as offering more integrated data science tooling compared to <a href="https://www.bing.com/ck/a?!&amp;&amp;p=5188aacf62e22e5006226cf745838011a4a72892cf4bef817c7103ef49f9d61dJmltdHM9MTc1ODY3MjAwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=AWS+SageMaker&amp;u=a1aHR0cHM6Ly9kb2NzLmF3cy5hbWF6b24uY29tL25leHQtZ2VuZXJhdGlvbi1zYWdlbWFrZXIvbGF0ZXN0L3VzZXJndWlkZS93aGF0LWlzLXNhZ2VtYWtlci5odG1s">AWS SageMaker</a> or <a href="https://www.bing.com/aclk?ld=e8Dwo0aqeSOJc4S7f4lbLx8TVUCUy7GjujYsmFAuYoKW47YDHCvVAc-uoqumHHt4Uf6oKSSqpKdC2cTypQpEkqY0MiCQ5BYPYHJ7ZeI_mHZ9zh7UwEsd1dSMnKhIShH1a3E6GTePzjzLfg-Pvl9yooSe7Mr5L6sY3zxxX8c9BcQsnKLCQEk30JLF0O-rx0JR8mRQ5nbw&amp;u=aHR0cHMlM2ElMmYlMmZhenVyZS5taWNyb3NvZnQuY29tJTJmZW4tdXMlMmZwcmljaW5nJTJmcHVyY2hhc2Utb3B0aW9ucyUyZmF6dXJlLWFjY291bnQlMmZzZWFyY2glM2ZpY2lkJTNkbWFjaGluZS1sZWFybmluZyUyNmVmX2lkJTNkX2tfNzNlNmRjYTFmMTZlMTBiZGM1ODJiNDcyNmI3ZTk5Mzdfa18lMjZPQ0lEJTNkQUlEY21tNWVkc3dkdXVfU0VNX19rXzczZTZkY2ExZjE2ZTEwYmRjNTgyYjQ3MjZiN2U5OTM3X2tfJTI2bXNjbGtpZCUzZDczZTZkY2ExZjE2ZTEwYmRjNTgyYjQ3MjZiN2U5OTM3&amp;rlid=73e6dca1f16e10bdc582b4726b7e9937">Azure ML</a>, particularly with the SQL-based approach to unstructured data analysis through BigQuery.</li>
</ul>
<p>1:04:29 <a href="https://arstechnica.com/ai/2025/09/google-releases-vaultgemma-its-first-privacy-preserving-llm/">Google releases VaultGemma, its first privacy-preserving LLM – Ars </a><a href="https://arstechnica.com/ai/2025/09/google-releases-vaultgemma-its-first-privacy-preserving-llm/">Technica</a></p>
<ul>
<li style="font-weight:400;">Google Research has developed VaultGemma, its first large language model implementing differential privacy techniques that prevent the model from memorizing and potentially exposing sensitive training data by introducing calibrated noise during training.</li>
<li style="font-weight:400;">The research establishes new scaling laws for private LLMs, demonstrating that increased privacy (more noise) requires either higher compute budgets measured in FLOPs or larger data budgets measured in tokens to maintain model performance.</li>
<li style="font-weight:400;">This addresses a critical challenge as tech companies increasingly rely on potentially sensitive user data for training, with the noise-batch ratio serving as the key parameter for balancing privacy protection against model accuracy.</li>
<li style="font-weight:400;">For cloud providers and enterprises, this technology enables the deployment of LLMs that can train on proprietary or regulated data without risk of exposing that information through model outputs, opening new use cases in healthcare, finance, and other privacy-sensitive domains.</li>
<li style="font-weight:400;">The approach provides a mathematical framework for developers to calculate the optimal trade-offs between privacy guarantees, computational costs, and model performance when building privacy-preserving AI systems.</li>
</ul>
<p>1:05:36  Justin – “You want to train a model based off of sensitive data, and then you want to offer the output of that model through a chatbot or whatever it is publicly. And it’s terrifying, as a security professional, because you don’t know what data is going to be spit out, and you can’t predict it, and it’s very hard to analyze within the model what’s in there… And so if solutions like this, where you can sort of have mathematical guarantees – or at least something you can point at, that would go a long way in making those workloads a reality, which is fantastic.”</p>
<h2>Azure</h2>
<p>1:08:20 <a href="https://azure.microsoft.com/en-us/updates?id=501980">Generally Available: Azure Cosmos DB for MongoDB (vCore) encryption </a><a href="https://azure.microsoft.com/en-us/updates?id=501980">with customer-managed key </a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/cosmos-db/mongodb/vcore/">Azure Cosmos DB for MongoDB vCore</a> now supports customer-managed keys (CMK) in addition to the default service-managed encryption, providing enterprises with full control over their encryption keys through <a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=FuVGBFc1a65RNxc66P7WzF0l7N7E4T7YLa60Xp5yChSuZqq7F13bfmP-WiQUddERbFHca3J185opKfQR_5N9wIUPE75WHa4ducQ3L4_O4a2kDj2aXk_r1G6iUCD7d19t.8UBNKwxqdFfgDz6RO-cZ0g&amp;eddgt=eMs4473wX3h29vC20mrcBg%3D%3D&amp;rut=01c3170645557144ff17c5909baeae6907d5a8c6d44b35349e75b4cd679689f6&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8eQ48ZacYoXHUrMT0r07uSzVUCUzPEFyOO2UIdqNcDcscngy6FMhCZv-0D0EXepFHzzQMfIuDdoXvUDqVQ2F24jfUNZfNjwwVzWPthy8BEy4_CrYrMFg6MqdOg1Fb0lUq0RsE3_X83d7x2c7VTQnROJWxQRav_93eaRA6Ojj95eCA5yk7VZuf1M-FdYxOQoWCnrqz4ful-spXWQ0POap3rV7AhkWZFGRjHrS0c96P0Pn_wNAqSxZPpQt4l1vP2tZhemSLQ-ExDUak1BFdaqQ4Y0zGQdomkvnv7V5r7iOPcU5FvBiA5lG54wXQ5rG3DtfDHArrlyYpGF3dSoDCMNYADRpcWbfyRgws93m5Ls8vioyN5InckhUYVSz-pvx6ALZmNK6Ec0YsN3ZFv_aq1-tsada3xnduwD82_0x8VIaSsfLBbSVaeYtyzXQk8UhpcLNDhUCq0acBC1CVHmviIlbTuJoVDdQf2NWGzB0dX8CBki-QSbvkG0RRbukdNwPITKomSh9TTbLCl4i5m6rgUYZQeSeoB406OWO8IxpCKp6RFy0sUv-sIfXO7CYFIdgP3f0WNATpPgqmn3Ywbk8uNNoFdUJp_1-et2KSxO6kwivdjU2NWhAu0wTHaY-n-C4lnaNq0oSLqLrzfaJy7til_Go6h86gCmgxa6mr-ZjYI-u2kPWCaTbs_jVKtpp4iYV_eRCv_-2VtHl5-crArLJbZU55vq3_A0bDzXGhoMhkZByAejJWxLGf7gqTPDribQDseeLrEhBtdw%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0MzklMjZjYW1wJTNkMTY4ODA3JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDE5MjElMjZjcml0ZXJpYWlkJTNka3dkLTc5NzE1MjAwMDQ0OTY3JTNhbG9jLTE5MCUyNmNhbXBhaWduaWQlM2Q1OTAyMzczNTglMjZsb2NwaHklM2Q3OTY3OCUyNmFkZ3JvdXBpZCUzZDEyNzU0MzUxNDA0MTgyMjIlMjZjaWQlM2Q3OTcxNDgwMTQ2Nzk1NSUyNmtkdiUzZGMlMjZrZXh0JTNkJTI2a3BnJTNkJTI2a3BpZCUzZCUyNnF1ZXJ5U3RyJTNkQXp1cmUlMjUyMEtleSUyNTIwVmF1bHQlMjZ1cmwlM2RodHRwcyUzYSUyZiUyZmF6dXJlLm1pY3Jvc29mdC5jb20lMmZlbi11cyUyZnByb2R1Y3RzJTJma2V5LXZhdWx0JTJmJTNmZWZfaWQlM2Rfa19hYWIwZjQ3ZmZiNjcxMDBhM2QwNDljYWM2MGNmMGQyNV9rXyUyNk9DSUQlM2RBSURjbW01ZWRzd2R1dV9TRU1fX2tfYWFiMGY0N2ZmYjY3MTAwYTNkMDQ5Y2FjNjBjZjBkMjVfa18lMjZtc2Nsa2lkJTNkYWFiMGY0N2ZmYjY3MTAwYTNkMDQ5Y2FjNjBjZjBkMjU%26rlid%3Daab0f47ffb67100a3d049cac60cf0d25&amp;vqd=4-261278282211478547119388813045535902808&amp;iurl=%7B1%7DIG%3D08B5CE85BC604F7692AC5848F5D86E69%26CID%3D29EC6D3744C36FEE25E87B4645AC6ECB%26ID%3DDevEx%2C5046.1">Azure Key Vault</a> integration.</li>
<li style="font-weight:400;">This dual-layer encryption approach aligns Azure with <a href="https://docs.aws.amazon.com/documentdb/latest/developerguide/what-is.html">AWS DocumentDB</a> and <a href="https://www.mongodb.com/products/platform">MongoDB Atlas</a> encryption capabilities, addressing compliance requirements for regulated industries like healthcare and finance that mandate customer-controlled encryption.</li>
<li style="font-weight:400;">The feature enables key rotation, revocation, and audit logging through Azure Key Vault, though customers should note potential performance impacts and additional Key Vault costs beyond standard Cosmos DB pricing.</li>
<li style="font-weight:400;">Organizations can implement bring-your-own-key (BYOK) scenarios for multi-cloud deployments or maintain encryption key consistency across hybrid environments, particularly useful for migrations from on-premises MongoDB.</li>
<li style="font-weight:400;">The vCore deployment model already differentiates from Cosmos DB’s RU-based pricing by offering predictable compute-based costs, and CMK support strengthens its appeal for traditional MongoDB workloads requiring familiar operational patterns.</li>
</ul>
<p>1:09:31  Ryan – “I do like these models, but I do think it should be used sparingly – because I don’t think there’s a whole lot of advantage of bringing your own key… because you can revoke the key and then Azure can’t edit your data, and it feels like an unwarranted layer of protection.” </p>
<p>1:14:57 <a href="https://techcommunity.microsoft.com/blog/integrationsonazureblog/introducing-logic-apps-mcp-servers-public-preview/4450419">Introducing Logic Apps MCP servers (Public Preview) | Microsoft </a><a href="https://techcommunity.microsoft.com/blog/integrationsonazureblog/introducing-logic-apps-mcp-servers-public-preview/4450419">Community Hub</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/logic-apps/logic-apps-overview">Azure Logic Apps</a> now supports Model Context Protocol (MCP) servers in public preview, allowing developers to transform Logic Apps connectors into reusable MCP tools for building AI agents, with two deployment options: registering connectors through <a href="https://learn.microsoft.com/en-us/azure/api-center/overview">Azure API Center </a>or enabling existing Logic Apps as remote MCP servers.</li>
<li style="font-weight:400;">The API Center integration provides automated workflow creation and Easy Auth configuration in minutes, while also registering MCP servers in a centralized enterprise catalog for discovery and management across organizations.</li>
<li style="font-weight:400;">This positions Azure against AWS’s agent-building capabilities by leveraging Logic Apps’ extensive connector ecosystem (over 1,000 connectors) as pre-built tools for AI agents, reducing development overhead compared to building custom integrations from scratch.</li>
<li style="font-weight:400;">Target customers include enterprises building AI agents that need to integrate with multiple systems – the MCP approach allows modular composition of capabilities like data access, messaging, and workflow orchestration without extensive custom coding.</li>
<li style="font-weight:400;">Implementation requires Logic Apps Standard tier (consumption-based pricing starting at $0.000025 per action), Microsoft Entra app registration for authentication, and HTTP Request/Response triggers with proper schema descriptions for tool discovery.</li>
</ul>
<p>1:16:04  Ryan – “For me, the real value in this is that central catalog. The minute MCP was out there, people were standing up their own MCP servers and building their own agents, and then it was duplicative, and so you’ve got every team basically running their own server doing the exact same thing. And now you get the efficiency of centralizing that through a catalog. Also, you don’t have to redo all the work that’s involved with that. There’s efficiency there as well.”</p>
<p>1:17:13 <a href="https://azure.microsoft.com/en-us/blog/accelerating-ai-and-databases-with-azure-container-storage-now-7-times-faster-and-open-source/">Accelerating AI and databases with Azure Container Storage, now 7 times </a><a href="https://azure.microsoft.com/en-us/blog/accelerating-ai-and-databases-with-azure-container-storage-now-7-times-faster-and-open-source/">faster and open source | Microsoft Azure Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://azure.microsoft.com/en-us/blog/accelerating-ai-and-databases-with-azure-container-storage-now-7-times-faster-and-open-source/">Azure Container Storage v2.0.0</a> delivers 7x higher IOPS and 4x lower latency for Kubernetes workloads using local NVMe drives, with PostgreSQL showing 60% better transaction throughput. </li>
<li style="font-weight:400;">The service is now completely free with no per-GB fees, making it cost-competitive against AWS EBS and Google Persistent Disk, which charge for management overhead.</li>
<li style="font-weight:400;">Microsoft open-sourced the entire platform at github.com/Azure/local-csi-driver, allowing deployment on any Kubernetes cluster beyond AKS. This positions Azure as more open than competitors while maintaining feature parity between managed and self-hosted versions.</li>
<li style="font-weight:400;">The new architecture reduces CPU consumption to less than 12.5% of node resources (down from up to 50% previously) while delivering better performance. This efficiency gain directly translates to cost savings since customers can run more workloads on the same infrastructure.</li>
<li style="font-weight:400;">Integration with KAITO (<a href="https://learn.microsoft.com/en-us/azure/aks/ai-toolchain-operator">Kubernetes AI Toolchain Operator</a>) enables 5x faster AI model loading for inference workloads on GPU-enabled VMs with local NVMe. This targets the growing market of organizations running LLMs and AI workloads on Kubernetes, competing with AWS SageMaker and GCP Vertex AI.</li>
<li style="font-weight:400;">Single-node deployment support removes the previous 3-node minimum requirement, making it practical for edge computing, development environments, and cost-conscious deployments. This flexibility addresses a key limitation compared to traditional SAN-based storage solutions.</li>
</ul>
<p>1:19:17 <a href="https://blogs.microsoft.com/blog/2025/09/16/microsoft-leads-shift-beyond-data-unification-to-organization-delivering-next-gen-ai-readiness-with-new-microsoft-fabric-capabilities/">Microsoft leads shift beyond data unification to organization, delivering </a><a href="https://blogs.microsoft.com/blog/2025/09/16/microsoft-leads-shift-beyond-data-unification-to-organization-delivering-next-gen-ai-readiness-with-new-microsoft-fabric-capabilities/">next-gen AI readiness with new Microsoft Fabric capabilities</a></p>
<ul>
<li style="font-weight:400;">Microsoft Fabric introduces Graph and Maps capabilities to help organizations structure data for AI agents, moving beyond simple data unification to create contextualized, relationship-aware data foundations that AI systems can reason over effectively.</li>
<li style="font-weight:400;">The new Graph in Fabric feature uses LinkedIn’s graph design principles to visualize and query relationships across enterprise data like customers, partners, and supply chains, while Maps in Fabric adds geospatial analytics for location-based decision making.</li>
<li style="font-weight:400;">OneLake, Fabric’s unified data lake, now supports mirroring from Oracle and Google BigQuery, plus new shortcuts to Azure Blob Storage, allowing organizations to access all their data regardless of location while maintaining governance through new security controls.</li>
<li style="font-weight:400;">Microsoft is integrating Fabric with Azure AI Foundry to create a complete data-to-AI pipeline, where Fabric provides the structured data foundation and AI Foundry enables developers to build and scale AI applications using familiar tools like GitHub and Visual Studio.</li>
<li style="font-weight:400;">The platform targets enterprises ready to move from AI experimentation to production deployment, with over 50,000 Fabric certifications already achieved by users preparing for these new AI-ready data capabilities.</li>
</ul>
<p>1:20:35  Justin – “The fabric stuff is interesting because it’s basically just a ton of stuff, like Power BI and the Data Lake and stuff, shoved into one unified platform, which is nice, and it makes it easier to do data processes. So I don’t expect it to be a major cost increase for customers who are already using fabric.”</p>
<h2>Oracle</h2>
<p>1:21:40 <a href="https://siliconangle.com/2025/09/09/oracles-stock-makes-biggest-single-day-gain-26-years-huge-cloud-revenue-projections/">Oracle’s stock makes biggest single-day gain in 26 years on huge cloud </a><a href="https://siliconangle.com/2025/09/09/oracles-stock-makes-biggest-single-day-gain-26-years-huge-cloud-revenue-projections/">revenue projections – SiliconANGLE</a></p>
<ul>
<li style="font-weight:400;">Oracle’s stock jumped 36% after announcing projected cloud infrastructure revenue of $144 billion by fiscal 2030, with RPO (remaining performance obligations) hitting $455 billion – a 359% year-over-year increase driven by four multibillion-dollar contracts signed this quarter.</li>
<li style="font-weight:400;">Oracle’s projected $18 billion in OCI revenue for the current fiscal year still trails AWS ($112B) and Azure ($75B), but their aggressive growth trajectory suggests they’re positioning to become a legitimate third hyperscaler option, particularly for enterprises already invested in Oracle databases.</li>
<li style="font-weight:400;">The upcoming Oracle AI Database service (launching October) will allow customers to run LLMs from OpenAI, Anthropic, and others directly against Oracle database data – a differentiator from AWS/Azure, which lack native database integration at this level.</li>
<li style="font-weight:400;">Oracle’s partnership strategy with AWS, Microsoft, and Google to provide data center infrastructure creates an unusual dynamic where competitor growth actually benefits Oracle, while their 4.5GW data center expansion with OpenAI shows they’re securing critical AI infrastructure capacity.</li>
<li style="font-weight:400;">The market’s enthusiasm appears driven more by Oracle’s confidence in projecting 5-year revenue forecasts (unusual in cloud infrastructure) than actual Q1 results, which missed both earnings ($1.47 vs $1.48 expected) and revenue ($14.93 B vs $15.04 B expected) targets.</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2149152/c1e-jkjku5jj2whz5w4r-mkjn06qka030-nqyl7v.mp3" length="160262421"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 322 of The Cloud Pod, where the forecast is always cloudy! We have BIG NEWS – Jonathan is back! He’s joined in the studio by Justin and Ryan to bring you all the latest in cloud and AI news, including ongoing drama in the Microsoft/OpenAI drama, saying goodbye to data transfer fees (in the EU), M4 Power, and more. Let’s get started!  
Titles we almost went with this week


EU Later, Egress Fees: Google’s Brexit from Data Transfer Charges
The Keys to the Cosmos: Azure Unlocks Customer Control
Breaking Up is Hard to Do: Google Splits LLM Inference for Better Performance
OpenAI and Microsoft: From Exclusive to It’s Complicated 
Google’s New Model Has Trust Issues (And That’s a Good Thing)
Mac to the Future: AWS Brings M4 Power to the Cloud
Oracle’s Cloud Nine: Stock Soars on Half-Trillion Dollar Dreams
ChatGPT: From Chat Bot to Hat Bot (Everyone’s Wearing Different Professional Hats)
Five Billion Reasons to Love British AI
NVMe Gonna Give You Up: AWS Delivers the Storage Metrics You’ve Been Missing
Tea and AI: OpenAI Crosses the Pond
The Norway Bug Strikes Back: A New YAML Hope


A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our Slack channel for more info.
AI Is Going Great – Or How ML Makes Money 
01:33 Microsoft and OpenAI make a deal: Reading between the lines of their secretive new agreement – GeekWire

Microsoft and OpenAI have signed a non-binding memorandum of understanding that will restructure their partnership, with OpenAI’s nonprofit entity receiving an equity stake exceeding $100 billion in a new public benefit corporation where Microsoft will play a major role.
The deal addresses the AGI clause that previously allowed OpenAI to unilaterally dissolve the partnership upon achieving artificial general intelligence, which had been a significant risk for Microsoft’s multi-billion-dollar investment.
Both companies are diversifying their partnerships – Microsoft is now using Anthropic’s technology for some Office 365 AI features, while OpenAI has signed a $300 billion computing contract with Oracle over five years.
Microsoft’s exclusivity on OpenAI cloud workloads has been replaced with a right of first refusal, enabling OpenAI to participate in the $500 billion Stargate AI project with Oracle and other partners.
The restructuring allows OpenAI to raise capital for its mission while ensuring the nonprofit’s resources grow proportionally, with plans to use funds for community impact, including a recently launched $50 million grant program.

ALSO:
OpenAI and Microsoft sign preliminary deal to revise partnership terms – ]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2149152/c1a-k5d5-7z9741pju80z-vp9rvy.jpg"></itunes:image>
                                                                            <itunes:duration>01:23:24</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2149152/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[321: The Cloud Pod is in Tears Trying to Understand Azure Tiers]]>
                </title>
                <pubDate>Fri, 19 Sep 2025 15:57:06 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2145087</guid>
                                    <link>https://tcpfm.castos.com/episodes/321-the-cloud-pod-is-in-tears-trying-to-understand-azure-tiers</link>
                                <description>
                                            <![CDATA[<h3>The Cloud Pod is in Tears Trying to Understand Azure Tiers  </h3>
<h3> Welcome to episode 321 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Matt are all on hand to bring you the latest in cloud and AI news, including increased metrics data (because who doesn’t love more data), some issues over at Cloudflare, and even bigger issues at Builder.ai  – plus so much more. 
Let’s get started! </h3>
<h3>Titles we almost went with this week
</h3>
<ul>
<li>Lost in Translation: Google Helps IPv6 Find Its Way to IPv4</li>
<li>BigQuery’s Soft Landing for Hard Problems</li>
<li>CloudWatch Gets a Two-Week Memory Upgrade</li>
<li>VM Glow-Up: From Gen1 Zero to Gen2 Hero</li>
<li>Azure Gets Contextual: API Management Learns to Speak AI</li>
<li>The Cloud Pod: Now Broadcasting from 20,000 Leagues Under the Sea</li>
<li>LoRA LoRA on the Wall, Who’s the Finest Model of Them All</li>
<li>Azure Says MFA or the Highway for Resource Management</li>
<li>Two-Factor or Two-Furious: Azure’s Security Ultimatum</li>
<li>Agent 007: License to Build</li>
<li>CUD You Believe It? Google’s Discounts Get More Flexible</li>
<li>WAF’s New Deal: Free Logs with Every Million Requests Served</li>
<li>SOC It To Me: Google’s AI Security Workshop Tour</li>
<li>MFA mandatory in Azure, now you too can hate/hate MS Authenticator</li>
<li>AWS AMIs no longer the Tribbles of cloud computing</li>
<li>ECS Exec; Justin’s prediction from 2018 finally comes true</li>
</ul>
<h2>General News
</h2>
<p>00:56 <a href="https://finopsweekly.com/finops-weekly-summit-2025/">FinOps Weekly Summit 2025</a></p>
<ul>
<li style="font-weight:400;">Victor Garcia reached out and asked us to share the news about the FinOps Weekly Summit coming up on October 23rd, 2025. </li>
<li style="font-weight:400;">A lot of great speakers; if you’re in the FinOps space, we recommend it. </li>
<li style="font-weight:400;">Want to register? You can do that <a href="https://finopsweekly.com/finops-weekly-summit-2025/#register">here</a>. </li>
</ul>
<p>01:53 <a href="https://ignite.microsoft.com/en-US/home">Ignite Registration Opens</a> </p>
<ul>
<li style="font-weight:400;">San Francisco, Moscone Center</li>
<li style="font-weight:400;">November 18–21, 2025</li>
<li style="font-weight:400;">Need to convince your manager to pay for you to go? Find that letter <a href="https://aka.ms/MSIgnite_FY26_CYM">here</a>. </li>
</ul>
<p>02:45 <a href="https://blog.cloudflare.com/unauthorized-issuance-of-certificates-for-1-1-1-1/">Addressing the unauthorized issuance of multiple TLS certificates for </a><a href="https://blog.cloudflare.com/unauthorized-issuance-of-certificates-for-1-1-1-1/">1.1.1.1</a></p>
<ul>
<li style="font-weight:400;">Some issues over at <a href="https://blog.cloudflare.com/introducing-certificate-transparency-monitoring/">Cloudflare</a> recently…</li>
<li style="font-weight:400;"><a href="https://www.fina.hr/">Fina CA</a> issued 12 unauthorized TLS certificates for Cloudflare’s <a href="https://one.one.one.one/">1.1.1.1 DNS resolver IP address</a> between February 2024 and August 2025, violating domain control validation requirements and potentially allowing man-in-the-middle attacks on DNS-over-TLS and DNS-over-HTTPS connections.</li>
<li style="font-weight:400;">The incident highlights vulnerabilities in the Certificate Authority trust model where any trusted CA can issue certificates for any domain or IP without proper validation, though exploitation would require the attacker to have the private key, intercept traffic, and target clients that trust Fina CA (primarily Microsoft systems).</li>
<li style="font-weight:400;">Cloudflare failed to detect these certificates for months despite operating its own Certificate Transparency monitoring service because its system wasn’t configured to alert on IP address certificates rather than domain names, exposing gaps in its internal security monitoring.</li>
<li style="font-weight:400;">The certificates have been <a href=""></a></li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:00) - The Cloud Pod: Trying to Understand Azure tiers</li><li>(00:01:04) - Two Up! Finops Weekly Summit and Ignite</li><li>(00:02:56) - Cloudflare: Certificate Transparency is Critical Infrastructure</li><li>(00:06:08) - AI is How ML Makes Money</li><li>(00:08:44) - Visual Studio: August Update to Copilot</li><li>(00:11:16) - Amazon.com: Regions and Zones in AWS Global View</li><li>(00:14:19) - CloudWatch Metrics Insights: Extended to 3 Hours</li><li>(00:16:19) - CloudWatch: Single Monitoring Alarms for Dynamic Resource Fleets</li><li>(00:17:32) - AWS User Notifications now support centralized notification management across multi-</li><li>(00:19:46) - ECS: Monitoring AMI usage with Cloud Shell</li><li>(00:23:39) -  AWS Terraform: Five Year Old Code</li><li>(00:25:14) -  AWS IAM: Network Parameter Controls for VPCs</li><li>(00:27:56) - AWS WAF now provides 500 MB of free CloudWatch log</li><li>(00:31:00) - WASP Config: Resource Tag Tracking for IAM Policies</li><li>(00:33:01) - GCP: DNS64 and NAT64 for IPv6</li><li>(00:34:28) - BigQuery Data Storage: Soft Failover</li><li>(00:35:58) - Google Expands Cloud CUDs to Include HANA, Cloud</li><li>(00:39:04) - Google Cloud Launches Society Operations Center Workshop</li><li>(00:40:13) - Google Data Proc now supports multi-tenant cluster</li><li>(00:41:37) - Google's Official Rust SDK</li><li>(00:43:22) - Microsoft Azure: Upgrade to Gen2 with Trustful Launch enabled</li><li>(00:45:34) - Azure API Management: New Features and Native Auto-Scaling</li><li>(00:46:37) - Microsoft Launches GPT Real Time on Azure AI Foundry</li><li>(00:50:47) - Azure AI Foundry</li><li>(00:53:23) - Week in Cloud: September 7, 2018</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[The Cloud Pod is in Tears Trying to Understand Azure Tiers  
 Welcome to episode 321 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Matt are all on hand to bring you the latest in cloud and AI news, including increased metrics data (because who doesn’t love more data), some issues over at Cloudflare, and even bigger issues at Builder.ai  – plus so much more. 
Let’s get started! 
Titles we almost went with this week


Lost in Translation: Google Helps IPv6 Find Its Way to IPv4
BigQuery’s Soft Landing for Hard Problems
CloudWatch Gets a Two-Week Memory Upgrade
VM Glow-Up: From Gen1 Zero to Gen2 Hero
Azure Gets Contextual: API Management Learns to Speak AI
The Cloud Pod: Now Broadcasting from 20,000 Leagues Under the Sea
LoRA LoRA on the Wall, Who’s the Finest Model of Them All
Azure Says MFA or the Highway for Resource Management
Two-Factor or Two-Furious: Azure’s Security Ultimatum
Agent 007: License to Build
CUD You Believe It? Google’s Discounts Get More Flexible
WAF’s New Deal: Free Logs with Every Million Requests Served
SOC It To Me: Google’s AI Security Workshop Tour
MFA mandatory in Azure, now you too can hate/hate MS Authenticator
AWS AMIs no longer the Tribbles of cloud computing
ECS Exec; Justin’s prediction from 2018 finally comes true

General News

00:56 FinOps Weekly Summit 2025

Victor Garcia reached out and asked us to share the news about the FinOps Weekly Summit coming up on October 23rd, 2025. 
A lot of great speakers; if you’re in the FinOps space, we recommend it. 
Want to register? You can do that here. 

01:53 Ignite Registration Opens 

San Francisco, Moscone Center
November 18–21, 2025
Need to convince your manager to pay for you to go? Find that letter here. 

02:45 Addressing the unauthorized issuance of multiple TLS certificates for 1.1.1.1

Some issues over at Cloudflare recently…
Fina CA issued 12 unauthorized TLS certificates for Cloudflare’s 1.1.1.1 DNS resolver IP address between February 2024 and August 2025, violating domain control validation requirements and potentially allowing man-in-the-middle attacks on DNS-over-TLS and DNS-over-HTTPS connections.
The incident highlights vulnerabilities in the Certificate Authority trust model where any trusted CA can issue certificates for any domain or IP without proper validation, though exploitation would require the attacker to have the private key, intercept traffic, and target clients that trust Fina CA (primarily Microsoft systems).
Cloudflare failed to detect these certificates for months despite operating its own Certificate Transparency monitoring service because its system wasn’t configured to alert on IP address certificates rather than domain names, exposing gaps in its internal security monitoring.
The certificates have been ]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[321: The Cloud Pod is in Tears Trying to Understand Azure Tiers]]>
                </itunes:title>
                                    <itunes:episode>321</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<h3>The Cloud Pod is in Tears Trying to Understand Azure Tiers  </h3>
<h3> Welcome to episode 321 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Matt are all on hand to bring you the latest in cloud and AI news, including increased metrics data (because who doesn’t love more data), some issues over at Cloudflare, and even bigger issues at Builder.ai  – plus so much more. 
Let’s get started! </h3>
<h3>Titles we almost went with this week
</h3>
<ul>
<li>Lost in Translation: Google Helps IPv6 Find Its Way to IPv4</li>
<li>BigQuery’s Soft Landing for Hard Problems</li>
<li>CloudWatch Gets a Two-Week Memory Upgrade</li>
<li>VM Glow-Up: From Gen1 Zero to Gen2 Hero</li>
<li>Azure Gets Contextual: API Management Learns to Speak AI</li>
<li>The Cloud Pod: Now Broadcasting from 20,000 Leagues Under the Sea</li>
<li>LoRA LoRA on the Wall, Who’s the Finest Model of Them All</li>
<li>Azure Says MFA or the Highway for Resource Management</li>
<li>Two-Factor or Two-Furious: Azure’s Security Ultimatum</li>
<li>Agent 007: License to Build</li>
<li>CUD You Believe It? Google’s Discounts Get More Flexible</li>
<li>WAF’s New Deal: Free Logs with Every Million Requests Served</li>
<li>SOC It To Me: Google’s AI Security Workshop Tour</li>
<li>MFA mandatory in Azure, now you too can hate/hate MS Authenticator</li>
<li>AWS AMIs no longer the Tribbles of cloud computing</li>
<li>ECS Exec; Justin’s prediction from 2018 finally comes true</li>
</ul>
<h2>General News
</h2>
<p>00:56 <a href="https://finopsweekly.com/finops-weekly-summit-2025/">FinOps Weekly Summit 2025</a></p>
<ul>
<li style="font-weight:400;">Victor Garcia reached out and asked us to share the news about the FinOps Weekly Summit coming up on October 23rd, 2025. </li>
<li style="font-weight:400;">A lot of great speakers; if you’re in the FinOps space, we recommend it. </li>
<li style="font-weight:400;">Want to register? You can do that <a href="https://finopsweekly.com/finops-weekly-summit-2025/#register">here</a>. </li>
</ul>
<p>01:53 <a href="https://ignite.microsoft.com/en-US/home">Ignite Registration Opens</a> </p>
<ul>
<li style="font-weight:400;">San Francisco, Moscone Center</li>
<li style="font-weight:400;">November 18–21, 2025</li>
<li style="font-weight:400;">Need to convince your manager to pay for you to go? Find that letter <a href="https://aka.ms/MSIgnite_FY26_CYM">here</a>. </li>
</ul>
<p>02:45 <a href="https://blog.cloudflare.com/unauthorized-issuance-of-certificates-for-1-1-1-1/">Addressing the unauthorized issuance of multiple TLS certificates for </a><a href="https://blog.cloudflare.com/unauthorized-issuance-of-certificates-for-1-1-1-1/">1.1.1.1</a></p>
<ul>
<li style="font-weight:400;">Some issues over at <a href="https://blog.cloudflare.com/introducing-certificate-transparency-monitoring/">Cloudflare</a> recently…</li>
<li style="font-weight:400;"><a href="https://www.fina.hr/">Fina CA</a> issued 12 unauthorized TLS certificates for Cloudflare’s <a href="https://one.one.one.one/">1.1.1.1 DNS resolver IP address</a> between February 2024 and August 2025, violating domain control validation requirements and potentially allowing man-in-the-middle attacks on DNS-over-TLS and DNS-over-HTTPS connections.</li>
<li style="font-weight:400;">The incident highlights vulnerabilities in the Certificate Authority trust model where any trusted CA can issue certificates for any domain or IP without proper validation, though exploitation would require the attacker to have the private key, intercept traffic, and target clients that trust Fina CA (primarily Microsoft systems).</li>
<li style="font-weight:400;">Cloudflare failed to detect these certificates for months despite operating its own Certificate Transparency monitoring service because its system wasn’t configured to alert on IP address certificates rather than domain names, exposing gaps in its internal security monitoring.</li>
<li style="font-weight:400;">The certificates have been <a href="http://rdc.fina.hr/RDC2020/FinaRDCCA2020partc1.crl">revoked</a> and no evidence of malicious use was found, but the incident demonstrates why Certificate Transparency logs are critical infrastructure – without Fina CA voluntarily logging these test certificates, they might never have been discovered.</li>
<li style="font-weight:400;">Organizations should review their root certificate stores and consider removing or restricting CAs with poor validation practices, while DNS client developers should implement <a href="https://radar.cloudflare.com/certificate-transparency">Certificate Transparency</a> validation requirements similar to modern browsers to prevent future incidents.</li>
</ul>
<p>02:58  Matt – “I really like how in this they say we messed up, but also you should go review everyone that you don’t trust, and only keep ours, because we ARE trusted, and look what we just found and how we fixed it.”</p>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>06:02 <a href="https://www.nytimes.com/2025/08/31/technology/builder-ai-collapse.html?smid=nytcore-android-share">How Builder.ai Collapsed Amid Silicon Valley’s Biggest Boom – The New </a><a href="https://www.nytimes.com/2025/08/31/technology/builder-ai-collapse.html?smid=nytcore-android-share">York Times</a></p>
<ul>
<li style="font-weight:400;">Builder.ai collapsed from a $1.5 billion valuation to bankruptcy after the board discovered sales were overstated by 75% – reported $217M revenue in 2024 was actually $51M, highlighting risks in AI startup valuations during the current investment boom</li>
<li style="font-weight:400;">The company spent 80% of revenue on marketing rather than product development, using terms like “AI-powered” and “machine learning” without substantial AI technology – its “<a href="https://www.bing.com/ck/a?!&amp;&amp;p=8a12a792576679044fd713440299ae8d67845970572b0e3930298aec13685262JmltdHM9MTc1Nzk4MDgwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=Natasha+AI&amp;u=a1aHR0cHM6Ly90ZWNoZnVuZGluZ25ld3MuY29tL2Zha2UtaXQtdGlsbC15b3UtdW5pY29ybi1idWlsZGVyLWFpcy1uYXRhc2hhLXdhcy1uZXZlci1haS1qdXN0LTcwMC1pbmRpYW4tY29kZXJzLWJlaGluZC10aGUtY3VydGFpbi8">Natasha AI</a>” product manager was reportedly assisted by 700 Indian programmers rather than autonomous AI</li>
<li style="font-weight:400;">Microsoft invested $30M and partnered with Builder for cloud storage integration, while other investors included <a href="https://www.bing.com/ck/a?!&amp;&amp;p=72658ae16a82e025fa744736c13d781e379da73ea807239520c5d717ea5e6a74JmltdHM9MTc1Nzk4MDgwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=Qatar+Investment+Authority&amp;u=a1aHR0cHM6Ly93d3cucWlhLnFhL2VuL0Fib3V0L3BhZ2VzL2RlZmF1bHQuYXNweA">Qatar Investment Authority</a>, <a href="https://www.bing.com/ck/a?!&amp;&amp;p=5d8c7282eb9cd24745cb858d5c3dab166a9504484f6aa88408a7da8ae09ddad7JmltdHM9MTc1Nzk4MDgwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=SoftBank%27s+DeepCore&amp;u=a1aHR0cHM6Ly9ncm91cC5zb2Z0YmFuay9lbi9uZXdzL3ByZXNzLzIwMTgwMTI5">SoftBank’s DeepCore</a>, and Jeffrey Katzenberg – total funding reached $450M before the collapse</li>
<li style="font-weight:400;">SEC has charged multiple AI startups with fraud this year, including GameOn ($60M investor losses) and Nate (shopping app using Filipino contractors instead of AI), with Builder now under investigation by Southern District of New York prosecutors</li>
<li style="font-weight:400;">The .ai domain registrations are approaching 1 million addresses with 1,500 new ones daily, compared to an estimated 10,000 total ventures during the dot-com era, which demonstrates the scale of the current AI investment frenzy, where companies rebrand to attract funding</li>
</ul>
<p>07:30  Ryan – “I’ve definitely seen this before, and you know, this sort of model of that’s like ‘we’ve got machine learning, we got this, and now it’s with AI too’. It’s the same sort of thing – fake it till you make it only goes so far.”</p>
<p>09:31 <a href="https://devblogs.microsoft.com/visualstudio/the-visual-studio-august-update-is-here-smarter-ai-better-debugging-and-more-control/">The Visual Studio August Update is here – smarter AI, better debugging, </a><a href="https://devblogs.microsoft.com/visualstudio/the-visual-studio-august-update-is-here-smarter-ai-better-debugging-and-more-control/">and more control – Visual Studio Blog</a></p>
<ul>
<li style="font-weight:400;">Visual Studio’s August 2025 update <a href="https://devblogs.microsoft.com/visualstudio/gpt-5-now-available-in-visual-studio/">integrates GPT-5</a> and introduces Model Context Protocol (MCP) support, enabling developers to connect AI agents directly to databases, code search, and deployment systems without custom integrations for each tool.</li>
<li style="font-weight:400;">MCP functions as “the HTTP of tool connectivity” with OAuth support for any provider, one-click server installation from web repositories, and governance controls via GitHub policy settings for enterprise compliance.</li>
<li style="font-weight:400;">The enhanced <a href="https://www.bing.com/ck/a?!&amp;&amp;p=1ccdff6f47bbf2868d4fe461d1c7c03e824c25775a2806991d3f4948654d09c9JmltdHM9MTc1Nzk4MDgwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=Copilot+Chat&amp;u=a1aHR0cHM6Ly9jb3BpbG90Lm1pY3Jvc29mdC5jb20vP21zb2NraWQ9MzNjZjgyNWZhZDU5NjkxMjE5ZTM5NDM4YWMzMzY4YjE">Copilot Chat</a> now uses improved semantic code search to automatically retrieve relevant code snippets from natural language queries across entire solutions, reducing manual navigation time.</li>
<li style="font-weight:400;">Developers can now bring their own AI models using API keys from <a href="https://www.bing.com/ck/a?!&amp;&amp;p=93e6af08d6980b522078c7725e7ac9dead91bac7f820337c18f13f2f1dc6ce4fJmltdHM9MTc1Nzk4MDgwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=OpenAI%2C&amp;u=a1aHR0cHM6Ly9vcGVuYWkuY29tLw">OpenAI,</a> <a href="https://www.bing.com/ck/a?!&amp;&amp;p=a0fca627a471901a03a5542e1255d02695659a0d071ab3c06eaeff1969d57d65JmltdHM9MTc1Nzk4MDgwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=google&amp;u=a1aHR0cHM6Ly93d3cuZ29vZ2xlLmNvbS8">Google</a>, or <a href="https://www.bing.com/ck/a?!&amp;&amp;p=7024d802bb3826b4ee164ede8911670e1eb486ac45404abd29167213d09f2356JmltdHM9MTc1Nzk4MDgwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=Anthropic&amp;u=a1aHR0cHM6Ly93d3cuYW50aHJvcGljLmNvbS8">Anthropic</a>, providing flexibility for teams with specific performance, privacy, or cost requirements in their cloud development workflows.</li>
<li style="font-weight:400;">New features include partial code completion acceptance (word-by-word or line-by-line), Git history context in chat, and unified debugging for <a href="https://www.bing.com/ck/a?!&amp;&amp;p=2d5ad013d38e1f6c88ceeb8a463c583e25befa19d4005cb5f9e2e36d317e5b8bJmltdHM9MTc1Nzk4MDgwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=33cf825f-ad59-6912-19e3-9438ac3368b1&amp;psq=Unreal+Engine&amp;u=a1aHR0cHM6Ly93d3cudW5yZWFsZW5naW5lLmNvbS8">Unreal Engine</a> that combines Blueprint and native C++ code in a single session.</li>
</ul>
<p>10:50  Ryan – “I’ve been using Copilot almost exclusively for a little while in VS Code, just because it’s better than some of the add-ons. There’s a couple of other integrations you can use with AWS Q and Gemini, and you can sort of tack them on, but Copilot, you can use multiple languages, and it has just built-in hooks into the client itself. So I don’t know if it’s a matter of it’s the first one I use, so I’m biased or what, but I really like it.”</p>
<h2>AWS</h2>
<p>11:37 <a href="https://aws.amazon.com/about-aws/whats-new/2025/09/manage-access-aws-regions-local-zones/">AWS adds the ability to centrally manage access to AWS Regions and AWS </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/09/manage-access-aws-regions-local-zones/">Local Zones</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/global-view.html">AWS Global View</a> now provides centralized management of Region and Local Zone access through a single console page, eliminating the need to check opt-in status across multiple locations individually.</li>
<li style="font-weight:400;">The Regions and Zones page displays infrastructure location details, opt-in status, and parent Region relationships, giving administrators a comprehensive view of their global AWS footprint for compliance and governance purposes.</li>
<li style="font-weight:400;">This feature addresses a common pain point for enterprises managing multi-region deployments who previously had to navigate to each Region separately to verify access and opt-in status.</li>
<li style="font-weight:400;">The capability integrates with existing AWS Global View functionality that allows viewing resources across multiple Regions, extending the service’s utility for global infrastructure management.</li>
<li style="font-weight:400;">Available in all commercial AWS Regions at no additional cost, the feature simplifies Region access auditing and helps prevent accidental deployments to unauthorized locations.</li>
<li style="font-weight:400;">This is available for free…so thanks, Amazon. We’ll always happily accept services that should have existed a decade ago. </li>
</ul>
<p>14:42 <a href="https://aws.amazon.com/about-aws/whats-new/2025/09/amazon-cloudwatch-query-metrics-data-two-weeks/">Amazon CloudWatch now supports querying metrics data up to two weeks </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/09/amazon-cloudwatch-query-metrics-data-two-weeks/">old</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/AmazonCloudWatch/latest/monitoring/query_with_cloudwatch-metrics-insights.html">CloudWatch Metrics Insights</a> now queries metrics data up to 2 weeks old instead of just 3 hours, enabling longer-term trend analysis and post-incident investigations using SQL-based queries.</li>
<li style="font-weight:400;">This extension addresses a significant limitation for teams monitoring dynamic resource groups, who previously couldn’t visualize historical data beyond 3 hours when using Metrics Insights queries.</li>
<li style="font-weight:400;">The feature is automatically available at no additional cost in all commercial AWS regions, with standard CloudWatch pricing applying only for alarms, dashboards, and API usage. (Although you’re already paying for CloudWatch metric insights, so don’t let them fool you.) </li>
<li style="font-weight:400;">Operations teams can now investigate incidents days after they occur and identify patterns across their infrastructure without switching between different query methods or data sources.</li>
<li style="font-weight:400;">This positions CloudWatch Metrics Insights as a more viable alternative to third-party monitoring solutions that already offer extended historical data access for SQL-based metric queries.</li>
</ul>
<p>15:35  Ryan – “ 3 hours is nowhere near enough. So many workloads are cyclical across a day, or we’ll even have different traffic patterns across a week, so it’s kind of crazy to me – 3 hours. I never used CloudWatch Metrics and now I understand why.”     </p>
<p>16:46 <a href="https://aws.amazon.com/about-aws/whats-new/2025/09/amazon-cloudwatch-alarm-multiple-metrics/">Amazon CloudWatch query alarms now support monitoring metrics </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/09/amazon-cloudwatch-alarm-multiple-metrics/">individually</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/AmazonCloudWatch/latest/monitoring/WhatIsCloudWatch.html">CloudWatch</a> query alarms now monitor multiple individual metrics through a single alarm using Metrics Insights SQL queries with GROUP BY and ORDER BY conditions, automatically adjusting as resources are created or deleted.</li>
<li style="font-weight:400;">This solves the operational burden of managing separate alarms for dynamic resource fleets like auto-scaling groups, where teams previously had to choose between aggregated monitoring or maintaining individual alarms for each resource.</li>
<li style="font-weight:400;">The feature works by creating alarms on Metrics Insights queries that dynamically update results with each evaluation, ensuring no resources go unmonitored as infrastructure scales up or down.</li>
<li style="font-weight:400;">Available in all commercial AWS regions plus GovCloud and China regions, with standard Metrics Insights query alarm pricing applying per the CloudWatch pricing page.</li>
<li style="font-weight:400;">Yet another of the “this should have been here 10 years ago” features. But what do we know? </li>
<li style="font-weight:400;">Real-world use cases include monitoring per-instance metrics across auto-scaling groups, tracking individual Lambda function performance in serverless architectures, or watching container metrics in dynamic ECS/EKS clusters without manual alarm management.</li>
</ul>
<p>17:34  Ryan – “I can’t believe this took so long.” </p>
<p>18:09 <a href="https://aws.amazon.com/about-aws/whats-new/2025/09/general-availability-organizational-notification-configurations-aws-user-notifications/">Announcing general availability of Organizational Notification </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/09/general-availability-organizational-notification-configurations-aws-user-notifications/">Configurations for AWS User Notifications</a></p>
<ul>
<li style="font-weight:400;">AWS User Notifications now supports centralized notification management across Organizations, allowing Management Accounts or up to 5 Delegated Administrators to configure and view notifications for specific OUs or entire organizations from a single location.</li>
<li style="font-weight:400;">The feature integrates with <a href="https://docs.aws.amazon.com/eventbridge/latest/userguide/eb-service-event-list.html">Amazon EventBridge Events</a>, enabling organizations to create notification rules for security events like console sign-ins without MFA, with alerts delivered to the <a href="https://aws.amazon.com/console/mobile/">AWS Console Mobile Application</a> and Console Notifications Center.</li>
<li style="font-weight:400;">This addresses a key operational challenge for multi-account organizations by eliminating the need to configure notifications individually in each member account, significantly reducing administrative overhead for security and compliance monitoring.</li>
<li style="font-weight:400;">Organizations can now implement consistent notification policies across hundreds or thousands of accounts, improving incident response times and ensuring critical events don’t go unnoticed in sprawling AWS environments.</li>
<li style="font-weight:400;">The service is available in all AWS Regions where User Notifications is supported, with no additional pricing beyond standard EventBridge and notification delivery costs.</li>
</ul>
<p>21:15  Justin – “The theme of the Amazon section today is just everything Ryan and I asked for ten years ago, in general.” </p>
<p>20:27 <a href="https://aws.amazon.com/about-aws/whats-new/2025/09/amazon-ec2-ami-usage-monitor-amis/">Amazon EC2 announces AMI Usage to better monitor the use of AMIs</a></p>
<ul>
<li style="font-weight:400;">AMI Usage provides free visibility into which AWS accounts are consuming your AMIs across <a href="https://aws.amazon.com/ec2/">EC2</a> instances and launch templates, eliminating the need for custom tracking scripts that previously created operational overhead.</li>
<li style="font-weight:400;">The feature enables dependency checking within your account to identify resources using specific AMIs, including EC2 instances, launch templates, Image Builder recipes, and SSM parameters before deregistration.</li>
<li style="font-weight:400;">This addresses a common operational challenge where organizations struggle to track AMI proliferation across multiple accounts and teams, potentially reducing costs from unused or orphaned AMIs.</li>
<li style="font-weight:400;">The service is available at no additional cost in all AWS regions, including China and <a href="https://aws.amazon.com/govcloud-us/">GovCloud</a>, making it accessible for compliance-sensitive workloads that need AMI governance.</li>
<li style="font-weight:400;">Organizations can now safely deprecate old AMIs by understanding their full usage footprint, supporting better security hygiene, and reducing the attack surface from outdated images.</li>
</ul>
<p>22:21 <a href="https://aws.amazon.com/about-aws/whats-new/2025/09/ecs-exec-aws-management-console/">ECS Exec is now available in the AWS Management Console</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/AmazonECS/latest/developerguide/ecs-exec.html">ECS Exec</a> now provides direct console access to running containers without SSH keys or inbound ports, eliminating the need to switch between console and CLI for debugging tasks.</li>
<li style="font-weight:400;">The feature integrates with <a href="https://cloud.google.com/shell/docs/launching-cloud-shell">CloudShell</a> to open interactive sessions directly from task details pages, while displaying the underlying CLI command for local terminal use.</li>
<li style="font-weight:400;">Console configuration includes encryption and logging settings at the cluster level, with ECS Exec enablement available during service and task creation or updates.</li>
<li style="font-weight:400;">This addresses a common debugging workflow where developers need quick container access for troubleshooting applications and examining running processes in production environments.</li>
<li style="font-weight:400;">Available in all AWS commercial regions with no additional charges beyond standard ECS and CloudShell usage costs.</li>
</ul>
<p>23:10  Justin – “You can get to EC2 through SSM, and then you could access ECS tasks from there. But now you can just go right from the console, which is kind of nice.”</p>
<p>26:55 <a href="https://aws.amazon.com/about-aws/whats-new/2025/08/aws-iam-new-vpc-endpoint-condition-keys/">AWS IAM launches new VPC endpoint condition keys for network perimeter </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/08/aws-iam-new-vpc-endpoint-condition-keys/">controls</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/IAM/latest/UserGuide/reference_policies_condition-keys.html">AWS IAM</a> introduces three new global condition keys (aws:VpceAccount, aws:VpceOrgPaths, aws:VpceOrgID) that enable organizations to enforce network perimeter controls by ensuring requests to AWS resources only come through their VPC endpoints.</li>
<li style="font-weight:400;">These condition keys automatically scale with VPC usage and eliminate the need to manually enumerate VPC endpoints or update policies when adding or removing endpoints, working across SCPs, RCPs, resource-based policies, and identity-based policies.</li>
<li style="font-weight:400;">The feature addresses a common security requirement for enterprises that need to restrict access to AWS resources from specific network boundaries, particularly useful for organizations with strict compliance requirements around data locality and network isolation.</li>
<li style="font-weight:400;">Currently limited to a select set of AWS services that support <a href="https://docs.aws.amazon.com/vpc/latest/privatelink/what-is-privatelink.html">AWS PrivateLink</a>, which may require careful planning for organizations looking to implement comprehensive network perimeter controls across their entire AWS footprint.</li>
<li style="font-weight:400;">This enhancement simplifies zero-trust network architectures by providing granular control at the account, organization path, or entire organization level without the operational overhead of maintaining extensive VPC endpoint lists in policies.</li>
</ul>
<p>27:39  Ryan – “It’s a good thing to have. It’s definitely on a lot of control frameworks, so it’s nice to have that easier button to check that compliance box.”  </p>
<p>28:50 <a href="https://aws.amazon.com/about-aws/whats-new/2025/09/aws-waf-free-vended-logs-request-volume/">AWS WAF now includes free WAF Vended Logs based on request volume</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/waf/">AWS WAF</a> now provides 500 MB of free <a href="https://docs.aws.amazon.com/AmazonCloudWatch/latest/logs/WhatIsCloudWatchLogs.html">CloudWatch Logs</a> Vended Logs ingestion for every 1 million WAF requests processed, helping customers reduce logging costs while maintaining security visibility.</li>
<li style="font-weight:400;">The free allocation applies automatically to your AWS bill at month’s end and covers both CloudWatch and S3 destinations, with usage beyond the included amount charged at standard WAF Vended Logs pricing.</li>
<li style="font-weight:400;">This change addresses a common customer pain point where WAF logging costs could become substantial for high-traffic applications, making comprehensive security monitoring more accessible for cost-conscious organizations.</li>
<li style="font-weight:400;">Customers can leverage CloudWatch’s analytics capabilities, including Log Insights queries, anomaly detection, and dashboards, to analyze web traffic patterns and security events without worrying about base logging costs.</li>
<li style="font-weight:400;">The pricing model scales with usage, meaning customers who process more requests through WAF automatically receive more free log storage, aligning logging costs with actual traffic volume.</li>
</ul>
<p>31:27 <a href="https://aws.amazon.com/about-aws/whats-new/2025/09/aws-config-resource-tags-iam-policies/">AWS Config now supports resource tags for IAM Policies</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/config/latest/developerguide/WhatIsConfig.html">AWS Config</a> now adds resource tag tracking for IAM policies, enabling teams to filter and evaluate IAM policy configurations based on tags for improved governance and compliance monitoring.</li>
<li style="font-weight:400;">This enhancement allows Config rules to evaluate IAM policies selectively using tags, making it easier to enforce different compliance standards across development, staging, and production policies without creating separate rules for each environment.</li>
<li style="font-weight:400;">Multi-account organizations can now use Config aggregators to collect IAM policy data across accounts filtered by tags, streamlining centralized governance for policies that match specific tag criteria like department or compliance scope.</li>
<li style="font-weight:400;">The feature arrives at no additional cost in all supported AWS regions and automatically populates tags when recording IAM policy resource types, requiring only Config recorder configuration to enable.</li>
<li style="font-weight:400;">This addresses a common pain point where teams struggled to apply granular Config rules to subsets of IAM policies, previously requiring custom Lambda functions or manual processes to achieve tag-based policy governance.</li>
</ul>
<p>32:32  Ryan – “Taking away all that Lamda spackle…making that no longer necessary? That’s fantastic.”  </p>
<h2>GCP</h2>
<p>33:23 <a href="https://cloud.google.com/blog/products/networking/connect-ipv6-only-workloads-to-ipv4-with-dns64-and-nat64/">Connect IPv6-only workloads to IPv4 with DNS64 and NAT64 | Google </a><a href="https://cloud.google.com/blog/products/networking/connect-ipv6-only-workloads-to-ipv4-with-dns64-and-nat64/">Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google Cloud introduces DNS64 and NAT64 to enable IPv6-only workloads to communicate with IPv4 services, addressing the critical gap as enterprises transition away from increasingly scarce IPv4 addresses while maintaining access to legacy IPv4 applications.</li>
<li style="font-weight:400;">This feature allows organizations to build pure IPv6 environments without dual-stack complexity, using DNS64 to synthesize IPv6 addresses from IPv4 DNS records and NAT64 gateways to translate the actual traffic between protocols.</li>
<li style="font-weight:400;">The implementation leverages Google’s existing Cloud NAT infrastructure with a simple three-step setup process: create IPv6-only VPC and subnets, enable a DNS64 server policy, and configure a NAT64 gateway through Cloud Router.</li>
<li style="font-weight:400;">Key use cases include enterprises facing private IPv4 address exhaustion, organizations with IPv6 compliance requirements, and companies wanting to future-proof their infrastructure while maintaining backward compatibility with IPv4-only services.</li>
<li style="font-weight:400;">While AWS offers similar functionality through <a href="https://cloud.google.com/vpc/docs/ipv6-to-ipv4-overview">NAT64 and DNS64</a> in their VPCs, Google’s approach integrates directly with their <a href="https://cloud.google.com/solutions/cross-cloud-network?hl=en">Cross-Cloud Network</a> strategy, potentially simplifying multi-cloud IPv6 deployments for organizations using hybrid architectures.</li>
</ul>
<p>34:50 <a href="https://cloud.google.com/blog/products/data-analytics/bigquery-managed-disaster-recovery-adds-soft-failover/">BigQuery Managed Disaster Recovery adds soft failover | Google Cloud </a><a href="https://cloud.google.com/blog/products/data-analytics/bigquery-managed-disaster-recovery-adds-soft-failover/">Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/bigquery/docs/managed-disaster-recovery">BigQuery Managed Disaster Recovery</a> now offers soft failover, which waits for complete data replication before promoting the secondary region, eliminating the risk of data loss during planned failovers compared to traditional hard failover, which could lose up to 15 minutes of data within the RPO window.</li>
<li style="font-weight:400;">This addresses a key enterprise concern where companies previously had to choose between immediate failover with potential data loss or delayed recovery while waiting for a primary region that might never recover, making DR testing particularly challenging for compliance-driven industries like financial services.</li>
<li style="font-weight:400;">The feature provides multiple failover options through BigQuery UI, DDL, and CLI, giving administrators granular control over disaster recovery transitions while maintaining their required RTO and RPO objectives without the operational complexity of manual verification.</li>
<li style="font-weight:400;">While <a href="https://aws.amazon.com/rds/">AWS RDS</a> offers similar automated failover capabilities and <a href="https://azure.microsoft.com/en-us/products/azure-sql/database/">Azure SQL Database</a> has auto-failover groups, BigQuery’s implementation focuses specifically on analytics workloads with built-in support for cross-region dataset replication and compute failover in a single managed service.</li>
<li style="font-weight:400;">The soft failover capability enables realistic DR drills without production impact, particularly valuable for regulated industries that require regular disaster recovery testing for compliance while maintaining zero data loss tolerance during planned maintenance windows.</li>
</ul>
<p>35:36  Ryan – “There’s nothing worse than trying to DR for a giant data set, especially if you have big data querying or job-based things that you’re fronting into your application with those insights. It just can be so nightmarish.”</p>
<p>36:26 <a href="https://cloud.google.com/blog/products/compute/expanded-coverage-for-compute-flex-cuds/">Expanded coverage for Compute Flex CUDs | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google expands <a href="https://cloud.google.com/compute/docs/instances/committed-use-discounts-overview#spend_based">Compute Flex CUDs</a> to cover memory-optimized VMs (M1-M4), HPC instances (H3,<a href="https://cloud.google.com/blog/products/compute/new-h4d-vms-optimized-for-hpc?e=48754805"> H4D</a>), and serverless offerings like <a href="https://cloud.google.com/run">Cloud Run</a> and Cloud Functions, allowing customers to apply spend commitments across more services.</li>
<li style="font-weight:400;">The new billing model charges discounted rates directly instead of using credits, simplifying cost tracking while expanding coverage beyond traditional compute instances to specialized workloads.</li>
<li style="font-weight:400;">This positions GCP competitively against <a href="https://aws.amazon.com/ec2/pricing/reserved-instances/">AWS Reserved Instances</a> and <a href="https://azure.microsoft.com/en-us/pricing/reserved-vm-instances/">Azure Reserved VM Instances</a> by offering more flexibility – commitments aren’t tied to specific resource types or regions.</li>
<li style="font-weight:400;">Key beneficiaries include <a href="https://www.sap.com/products/data-cloud/hana/what-is-sap-hana.html">SAP HANA</a> deployments, scientific computing workloads, and organizations with mixed traditional and serverless architectures who can now optimize costs across their entire stack.</li>
<li style="font-weight:400;">Customers can opt in immediately, with automatic transition for all accounts by January 21, 2026, though new billing accounts created after July 15, 2025, will automatically use the new model.</li>
</ul>
<p>37:18  Justin – “So, you have to remember there’s CUD and there’s Flex CUDs. So Flex CUDs were only on certain instance types, and it’s more like a savings plan, where the CUD is more like an RI. You get a better discount with a non-flex CUD. So if your workload is pretty static, then a CUD is actually a better use case. But then, when you do want to upgrade, you’re kind of hosed. So this ability allows you to move between the different versions without losing that CUD benefit.”</p>
<p>39:33 <a href="https://cloud.google.com/blog/products/identity-security/introducing-the-agentic-soc-workshops-for-security-professionals/">Introducing the Agentic SOC Workshops for security professionals | </a><a href="https://cloud.google.com/blog/products/identity-security/introducing-the-agentic-soc-workshops-for-security-professionals/">Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google Cloud is launching Agentic SOC Workshops, a free half-day training series for security professionals to learn practical AI applications in security operations centers, starting in Los Angeles and Chicago this September.</li>
<li style="font-weight:400;">The workshops focus on teaching security teams how to use AI agents to automate routine security tasks and reduce alert fatigue, positioning Google’s vision of every customer having a virtual security assistant trained by leading security experts.</li>
<li style="font-weight:400;">Participants will get hands-on experience with Gemini in Google Security Operations through practical exercises and a Capture the Flag challenge, learning to automate workflows that currently consume analyst time.</li>
<li style="font-weight:400;">This initiative targets security architects, SOC managers, analysts, and CISOs who want to move beyond AI marketing hype to actual implementation, with workshops planned for major cities across North America.</li>
<li style="font-weight:400;">While AWS and Azure offer security training and AI tools separately, Google is combining both into a focused workshop format specifically designed for SOC modernization, though no pricing details are provided for the underlying Google Security Operations platform.</li>
</ul>
<p>40:44 <a href="https://cloud.google.com/blog/products/data-analytics/announcing-dataproc-multi-tenant-clusters/">Announcing Dataproc multi-tenant clusters | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google <a href="https://cloud.google.com/dataproc/docs/concepts/overview">Dataproc</a> now supports multi-tenant clusters, allowing multiple data scientists to share compute resources while maintaining per-user authorization to data resources through service account mappings. </li>
<li style="font-weight:400;">This addresses the traditional tradeoff between resource efficiency and workload isolation in shared environments.</li>
<li style="font-weight:400;">The feature enables dynamic user-to-service-account mapping updates on running clusters and supports YAML-based configuration for managing large user bases. </li>
<li style="font-weight:400;">Each user’s workloads run with dedicated OS users, Kerberos principals, and restricted access to only their mapped service account credentials.</li>
<li style="font-weight:400;">Integration with <a href="https://cloud.google.com/vertex-ai/docs/workbench/introduction">Vertex AI Workbench</a> and third-party <a href="https://jupyter.org/">JupyterLab</a> deployments provides notebook users with distributed Jupyter kernels across cluster worker nodes. </li>
<li style="font-weight:400;">The <a href="https://cloud.google.com/bigquery/docs/jupyterlab-plugin">BigQuery JupyterLab extension</a> enables seamless connectivity, with kernel launch times of 30-50 seconds.</li>
<li style="font-weight:400;">This positions GCP competitively against <a href="https://aws.amazon.com/emr/features/studio/">AWS EMR Studio</a> and <a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=YmnRbwfGhUgHCi3ZPNRD14dY1sAaxkDzGGBbn3N9-0SBeUGRR8KehOUKCxSlbWlefSqQKKMGFX2XmG3r7nTGJh5-1T95i9N3d1F6fpet70h8nWCK9roYEBbiY3SNOewa.VIYam6s9OkB4QUTz2fJdLg&amp;eddgt=8LHiGPkrLRbXc9K0-juhkQ%3D%3D&amp;rut=bfcfaec95491ec716af49159940d19b8f26caea67bb5ee720c45b98666c6a3cd&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8Ig17jlZTupTg5SKboqA-KDVUCUxIrIvNmEw7XMZtm06yJxsUr-C3M4GsDIGzz1293ABkjVfvBdXzBQ2_boNpM_Efo20Q9rDJLClKtNxL6Jc8UVN_jA8KIZ9FWzois8YnZxR59wp3sVyor8IRQyMA38vwcf-Qtl82imQvLaikgKDxK0dT-Lw06vIAgENndOzSnH2t6VvCrqmfb3V-XxIJXn_LlL_Iu-6eTMrEZl-zz1F6tPUKCG46JTBu38QMpsxbDxzgd07SoDpZiJBTeTHRYR3W4lkofeGHsZwXb--FUbnfxbiNRTvuLV3CDHfHWGt-Fw0mim5fTYIZm3JSE1IUMQcI8tvnYsBhZWwymVMD5ps_FnLu---11KytfdJ1BBaeEmZJyLJk-RnQrmo54xxOPbUiBjyt1e8Az3BZqfE0jAEgOcphVPb7bg-sGJPbc7uajY5Hl6qdJTSYE4HH6Eqww6PbzQVowk28-f9TW41-k86bDzWTmg5YX74QMVP-Qzg_hUygud15-79-pDvQtevZV7RXGk-7PF4cmPC9J8dItFszJURF_wVMk5hcLPsvQL1I7z4oCm8zHwYSPn7uKncD9IbRD30WOVBr0KnnkmXi2Sg3zcxzAkCqD0bZxAUGyltKjuZAfgU4XwpjcssPaU6X57cLfFXZCKdY-muV_T29piuhW7EiNLBsDK4oGLQbpF-4gtnTPjJ-MVLzREhWvaZc-4yivTs_mhq4N6zu4v4bkMk-n9ALnwxdpI1e2R2Lscyw1dTGAw%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0MzklMjZjYW1wJTNkMTY4ODA3JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDE5MjElMjZjcml0ZXJpYWlkJTNka3dkLTc4OTU5Mjg1NDU3NDE0JTNhbG9jLTE5MCUyNmNhbXBhaWduaWQlM2Q1OTAyMzczNTglMjZsb2NwaHklM2Q4MDUzNCUyNmFkZ3JvdXBpZCUzZDEyNjMzNDA1MTI3NDQ2MjclMjZjaWQlM2Q3ODk1ODg4NjM2NzQwNyUyNmtkdiUzZGMlMjZrZXh0JTNkJTI2a3BnJTNkJTI2a3BpZCUzZCUyNnF1ZXJ5U3RyJTNkQXp1cmUlMjUyMFN5bmFwc2UlMjUyMFNwYXJrJTI2dXJsJTNkaHR0cHMlM2ElMmYlMmZhenVyZS5taWNyb3NvZnQuY29tJTJmZW4tdXMlMmZwcm9kdWN0cyUyZnN5bmFwc2UtYW5hbHl0aWNzJTJmJTNmZWZfaWQlM2Rfa18xMTY5MGI1MTc2YTMxZGQ0YjBlYTYzOWIyOWEyNmJkOF9rXyUyNk9DSUQlM2RBSURjbW01ZWRzd2R1dV9TRU1fX2tfMTE2OTBiNTE3NmEzMWRkNGIwZWE2MzliMjlhMjZiZDhfa18lMjZtc2Nsa2lkJTNkMTE2OTBiNTE3NmEzMWRkNGIwZWE2MzliMjlhMjZiZDg%26rlid%3D11690b5176a31dd4b0ea639b29a26bd8&amp;vqd=4-31152001379862362929017181040995572189&amp;iurl=%7B1%7DIG%3D67221A2B628F48188FACA3FB0066D631%26CID%3D38A5496DCE766EE00D095F07CF906F8E%26ID%3DDevEx%2C5045.1">Azure Synapse Spark</a> pools by offering granular IAM-based access control in shared clusters. The autoscaling capability allows administrators to optimize costs by scaling worker nodes based on demand rather than provisioning isolated resources per user.</li>
<li style="font-weight:400;">Currently in public preview with no specific pricing announced beyond standard Dataproc cluster costs. </li>
<li style="font-weight:400;">Key use cases include data science teams in financial services, healthcare, and retail who need collaborative environments with strict data access controls.</li>
</ul>
<p>41:21  Ryan – “Like two months ago, they announced serverless Dataproc, and I thought that that would basically mean you wouldn’t need this anymore? Because this means you’re going to host a giant Dataproc cluster and just pay for it all the time in order to use this.”</p>
<p>42:16 <a href="https://cloud.google.com/blog/topics/developers-practitioners/now-available-rust-sdk-for-google-cloud/">Now available: Rust SDK for Google Cloud | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google Cloud launches its first official Rust SDK supporting over 140 APIs, including <a href="https://cloud.google.com/vertex-ai">Vertex AI</a>, <a href="https://cloud.google.com/kms/docs/">Cloud KMS</a>, and IAM, addressing the gap where developers previously relied on unofficial or community-maintained libraries that lacked consistent support and security updates.</li>
<li style="font-weight:400;">The SDK includes built-in authentication with Application Default Credentials, OAuth2, API Keys, and service accounts, with <a href="https://cloud.google.com/iam/docs/workload-identity-federation">Workload Identity Federation</a> coming soon, making it easier for Rust developers to integrate with Google Cloud’s security model.</li>
<li style="font-weight:400;">This positions Google Cloud competitively with AWS (which has had an official Rust SDK since 2021) and Azure (which offers Rust support through community SDKs), particularly targeting high-performance backend services, data processing pipelines, and real-time analytics workloads.</li>
<li style="font-weight:400;">The SDK is available on crates.io and GitHub with comprehensive documentation and code samples, though pricing follows standard Google Cloud API usage rates with no additional SDK-specific costs.</li>
<li style="font-weight:400;">Key use cases include building memory-safe microservices, secure data processing systems, and performance-critical applications where Rust’s zero-cost abstractions and memory safety guarantees provide advantages over traditional languages.</li>
</ul>
<p>43:41  Justin – “Good to see more Rusts happening, hopefully to replace legacy C++ apps that are not thread safe.” </p>
<h2>Azure</h2>
<p>45:22 <a href="https://azure.microsoft.com/en-us/updates?id=499104">Generally Available: Upgrade existing Azure Gen1 VMs to Gen2-Trusted </a><a href="https://azure.microsoft.com/en-us/updates?id=499104">launch</a></p>
<ul>
<li style="font-weight:400;">Azure now allows customers to upgrade existing Generation 1 VMs to <a href="https://learn.microsoft.com/en-us/azure/virtual-machines/generation-2">Generation 2</a> with <a href="https://aka.ms/TrustedLaunch">Trusted Launch</a> enabled, addressing security gaps in legacy infrastructure without requiring VM recreation or data migration.</li>
<li style="font-weight:400;">Trusted Launch provides foundational security features, including Secure Boot and vTPM (virtual Trusted Platform Module), protecting VMs against boot kits, rootkits, and kernel-level malware – capabilities that were previously unavailable to Gen1 VM users.</li>
<li style="font-weight:400;">This positions Azure competitively with <a href="https://aws.amazon.com/ec2/nitro/">AWS Nitro System</a> and <a href="https://cloud.google.com/security/products/shielded-vm">GCP Shielded VMs</a>, though Azure’s approach focuses on retrofitting existing workloads rather than requiring new deployments, potentially saving customers significant migration costs and downtime.</li>
<li style="font-weight:400;">The upgrade path targets enterprises running legacy Windows Server 2012/2016 and older Linux distributions on Gen1 hardware, enabling them to meet modern compliance requirements without application refactoring.</li>
<li style="font-weight:400;">While the upgrade process requires a VM restart and temporary downtime, it preserves existing configurations, network settings, and data disks, making it practical for production workloads during maintenance windows.</li>
</ul>
<p>45:25  Matt – “So unlike Windows, Azure sometimes takes a scorched Earth technique – kind of like Apple does – when they release a lot of features and it takes them a while to get that migration path in there, and I kind of think some of it is because they want that time to test it out and get the scale.”  </p>
<p>46:25 <a href="https://azure.microsoft.com/en-us/updates?id=501829">Generally Available: Gateway-level metrics and native autoscaling for </a><a href="https://azure.microsoft.com/en-us/updates?id=501829">Azure API Management v2 tiers </a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/api-management/v2-service-tiers-overview">Azure API Management v2 tiers</a> now include gateway-level metrics that provide granular visibility into API performance, request patterns, and error rates at the gateway level rather than just service-wide metrics.</li>
<li style="font-weight:400;">Native autoscaling automatically adjusts compute capacity based on real-time gateway usage metrics, eliminating manual scaling operations and reducing costs during low-traffic periods while maintaining performance during spikes.</li>
<li style="font-weight:400;">This positions Azure API Management closer to AWS API Gateway’s automatic scaling capabilities, though Azure’s implementation focuses on gateway-specific metrics rather than Lambda-style request-based scaling.</li>
<li style="font-weight:400;">The feature targets enterprises running mission-critical APIs that need predictable performance without overprovisioning, particularly useful for organizations with variable traffic patterns or seasonal workloads.</li>
<li style="font-weight:400;">Available across all v2 tiers (Basic, Standard, and Premium), making enterprise-grade scaling accessible to smaller deployments while maintaining the simplified pricing model introduced with v2 tiers.</li>
</ul>
<p>46:54  Matt – “The Premier Tier – it’s an arm and a leg, so be careful what you’re doing, and by default it’s not HA. It adds up real fast.”  </p>
<p>47:42 <a href="https://techcommunity.microsoft.com/blog/azure-ai-foundry-blog/announcing-gpt-realtime-on-azure-ai-foundry/4449666">Announcing gpt-realtime on Azure AI Foundry: | Microsoft Community Hub</a></p>
<ul>
<li style="font-weight:400;">Microsoft launches<a href="https://aka.ms/GPT-Realtime"> gpt-realtime</a> on <a href="https://ai.azure.com/">Azure AI Foundry</a>, a speech-to-speech model that combines voice synthesis improvements into a single API with 20% lower pricing than the preview version, positioning Azure to compete with Google’s voice AI capabilities and <a href="https://aws.amazon.com/polly/">Amazon’s Polly</a> service.</li>
<li style="font-weight:400;">The model introduces two new natural voices (Marin and Cedar), enhanced instruction following, and image input support that allows users to discuss visual content through voice without requiring video, expanding beyond traditional text-to-speech limitations.</li>
<li style="font-weight:400;">Pricing starts at $40 per million input tokens and $160 per million output tokens for the standard tier, with function calling capabilities that let developers integrate custom code directly into voice interactions for building conversational AI applications.</li>
<li style="font-weight:400;">Target use cases include customer service automation, accessibility tools, and real-time translation services, with the Real-time API enabling developers to build interactive voice applications that process speech input and generate natural responses in a single pass.</li>
<li style="font-weight:400;">Integration with Azure AI Foundry provides direct model access through Azure’s infrastructure, offering enterprise customers built-in compliance and security features while simplifying deployment compared to managing separate speech recognition and synthesis services.</li>
</ul>
<p>48:56 <a href="https://techcommunity.microsoft.com/blog/azure-ai-services-blog/the-responses-api-in-azure-ai-foundry-is-now-generally-available/4446567">The Responses API in Azure AI Foundry is now generally available | </a><a href="https://techcommunity.microsoft.com/blog/azure-ai-services-blog/the-responses-api-in-azure-ai-foundry-is-now-generally-available/4446567">Microsoft Community Hub</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/ai-foundry/openai/how-to/responses">Azure’s Responses API</a> simplifies building AI agents by handling multi-turn conversations, tool orchestration, and state management in a single API call, eliminating the need for complex orchestration code that developers typically write themselves.</li>
<li style="font-weight:400;">The API includes six built-in tools: File Search for unstructured content, Function Calling for custom APIs, Code Interpreter for Python execution, Computer Use for UI automation, Image Generation, and Remote MCP Server connectivity, allowing agents to decide which tools to use without manual intervention.</li>
<li style="font-weight:400;">This positions Azure between AWS Bedrock Agents (which requires more manual orchestration) and Google’s Vertex AI Agent Builder, offering a middle ground with pre-built tools while supporting all OpenAI models, including GPT-5 series and fine-tuned models.</li>
<li style="font-weight:400;">Early adopters like UiPath are using it for enterprise automation where agents interpret natural language and execute actions across SaaS applications and legacy desktop software, with other implementations in financial services for compliance tasks and healthcare for document analysis.</li>
<li style="font-weight:400;">The API integrates with Azure AI Foundry’s broader agent stack, where developers can start with the Responses API for single agents, then scale to Agent Service for multi-agent orchestration and enterprise integrations with SharePoint, Bing, and Microsoft Fabric.</li>
</ul>
<p>49:34  Ryan – “I like these things, but I’ve been burnt by the 365 Graph API so many times…I would use it, but I don’t trust it.”</p>
<p>50:19 <a href="https://azure.microsoft.com/en-us/blog/azure-mandatory-multifactor-authentication-phase-2-starting-in-october-2025/">Azure mandatory multifactor authentication: Phase 2 starting in October </a><a href="https://azure.microsoft.com/en-us/blog/azure-mandatory-multifactor-authentication-phase-2-starting-in-october-2025/">2025 | Microsoft Azure Blog</a></p>
<ul>
<li style="font-weight:400;">Azure is implementing mandatory MFA for all resource management operations starting October 1, 2025, expanding beyond the portal-only enforcement that was completed in March 2025. </li>
<li style="font-weight:400;">This Phase 2 enforcement covers Azure CLI, PowerShell, REST APIs, SDKs, and Infrastructure as Code tools, addressing the fact that MFA blocks 99.2% of account compromise attacks.</li>
<li style="font-weight:400;">The enforcement uses <a href="http://aka.ms/AZUREPOLICY">Azure Policy</a> for gradual rollout and allows Global Administrators to postpone implementation if needed. </li>
<li style="font-weight:400;">Workload identities like managed identities and service principals remain unaffected, maintaining automation capabilities while securing human access.</li>
<li style="font-weight:400;">Organizations need to update to <a href="https://learn.microsoft.com/en-us/cli/azure/release-notes-azure-cli?view=azure-cli-latest">Azure CLI version 2.76</a> and <a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=b8Jy6CxlBh7TJIapExQUETGkbMVU1b2mLqZNqMvOxIcRGiRYsmP59gipcek8K48arnnT56RzhFBn6lH9ZTS47IV7xxwIK2rhgwHii8KG-uSDF4NccnSS9QqFHE7LnYbm.XpN6lap2fh48PAz1ueVu3w&amp;eddgt=RxWvGLAnq_AFrR402kHNPg%3D%3D&amp;rut=f2e2017ce03feab03270c11e84028394a689e627210c3aa44efdd5707bcf912b&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8c4CdTQIhk-cfN4Z6CJKsszVUCUy3q9kivL8d-eyeAL-vNXDIwsb7CqxBLKX8Gb7lqz1alLEEgkdsCMkT3V_4pXfTR3yvP09OZA0eeQwhrw-UhAr0SzsZ1X6XzsJuPQTh2_jtVn0dPNgmzNCcZTg7nC3vW1dqg_hZ4ZT2kkVULC_mW8614Ofrrd0hfeGoZxbqvyisjTQyTutV_5aMmWCoTkDp0HPG-4UvpqQmrV9DECk_c7TLTv8vYC3-s1MOLpLBbhfUChWdb5wtAhasdrX9S6dIw7D3QnhfmDPldInNjX08m_hLY1nJ2GMQFOuTxKI8dvH7_1XyjLbOTJqFMIYoCszkjQIm11F-3Cr_nO6OAjiDb__NHRW8Hx4ZyU5eMNgtJ53aJG0mHhfWzsRYaxp-86iyLJnvmACDYqLtvP1owb0u8t2oCiwGS8oJahnkU6ovHmqrCzHINvgJ4B8Brh38MQPitFgyohhRcoqkbmp2MWoQRZRvg_viSS6HTDv2zRHY3XL9cUePQzd4Wf4K26TC4ieJGrSNDAkbzg_iX6NAjdvi0WfJSsyZXdfS4bCg8VCVsortpn0RTCHJZMcd49ReHfgz6rsEmu7F8e8vIxJ26F9Wu8Fw0kjnCZYdrv68D3gcg1eQwIDNzz-DHQQG2crKXiPCbbdR8Wq7oVsBqJDerEPycZuwhr5D24Dj14b8uACqBTpea2rAvo5J4Fbub67Nz9K5-WfJr3mW40RVFDqgcq5yJyg1F0TBkYxaNlOkCnnMBbmDPg%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0MzklMjZjYW1wJTNkMTY4ODA3JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDE5MjElMjZjcml0ZXJpYWlkJTNka3dkLTc4ODkwNTY2MjgwMzc5JTNhbG9jLTE5MCUyNmNhbXBhaWduaWQlM2Q1OTAyMzczNTglMjZsb2NwaHklM2Q3OTM4MCUyNmFkZ3JvdXBpZCUzZDEyNjIyNDEwMDEzOTE4NTglMjZjaWQlM2Q3ODg5MDE2NzMyNjAzNyUyNmtkdiUzZGMlMjZrZXh0JTNkJTI2a3BnJTNkJTI2a3BpZCUzZCUyNnF1ZXJ5U3RyJTNkQXp1cmUlMjUyMFBvd2VyU2hlbGwlMjZ1cmwlM2RodHRwcyUzYSUyZiUyZmF6dXJlLm1pY3Jvc29mdC5jb20lMmZlbi11cyUyZmdldC1zdGFydGVkJTJmYXp1cmUtcG9ydGFsJTJmY2xvdWQtc2hlbGwlMmYlM2ZlZl9pZCUzZF9rXzY5Mzc0MzJjYTkyNDFjMjdlMTQzMmJlNmMxMmIxY2UzX2tfJTI2T0NJRCUzZEFJRGNtbTVlZHN3ZHV1X1NFTV9fa182OTM3NDMyY2E5MjQxYzI3ZTE0MzJiZTZjMTJiMWNlM19rXyUyNm1zY2xraWQlM2Q2OTM3NDMyY2E5MjQxYzI3ZTE0MzJiZTZjMTJiMWNlMw%26rlid%3D6937432ca9241c27e1432be6c12b1ce3&amp;vqd=4-157994225332595185128466443524249393146&amp;iurl=%7B1%7DIG%3DAD7B5D2CBCA94240AE0B483F9CF18994%26CID%3D26D9CA43633E6DB80466DC29628B6C52%26ID%3DDevEx%2C5046.1">Azure PowerShell</a> version 14.3 or later for compatibility. Microsoft provides built-in Azure Policy definitions to test impact before enforcement, allowing gradual application across different resource scopes, types, or regions.</li>
<li style="font-weight:400;">This positions Azure ahead of AWS and GCP in mandatory security controls, as neither competitor currently enforces MFA for all management operations by default. The approach balances security improvements with operational flexibility through postponement options and phased rollouts.</li>
<li style="font-weight:400;">The enforcement applies to Azure Public Cloud only, with no announced timeline for Azure Government or other sovereign clouds. </li>
<li style="font-weight:400;">Organizations can use Azure Service Health notifications and email alerts to track their enforcement timeline and prepare accordingly.</li>
</ul>
<p>50:53  Justin – “It went so well – the first phase of this – I can’t imagine Phase 2 is going to go any better than the first phase did.” </p>
<p>52:16 <a href="https://azure.microsoft.com/en-us/blog/agent-factory-from-prototype-to-production-developer-tools-and-rapid-agent-development/">Agent Factory: From prototype to production—developer tools and rapid </a><a href="https://azure.microsoft.com/en-us/blog/agent-factory-from-prototype-to-production-developer-tools-and-rapid-agent-development/">agent development | Microsoft Azure Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://azure.microsoft.com/en-us/products/ai-foundry">Azure AI Foundry</a> addresses the challenge of rapidly moving AI agents from prototype to production by providing a unified development experience across <a href="https://learn.microsoft.com/en-us/azure/ai-foundry/how-to/develop/get-started-projects-vs-code">VS Code</a>, <a href="https://github.blog/news-insights/product-news/github-copilot-meet-the-new-coding-agent/">GitHub</a>, and enterprise deployment channels. </li>
<li style="font-weight:400;">The platform supports both <a href="https://devblogs.microsoft.com/semantic-kernel/semantic-kernel-and-autogen-part-2/">Microsoft frameworks</a>, like <a href="https://devblogs.microsoft.com/semantic-kernel/guest-blog-building-multi-agent-solutions-with-semantic-kernel-and-a2a-protocol/">Semantic Kernel</a> and AutoGen, alongside open-source options, including LangGraph, LlamaIndex, and CrewAI, allowing developers to use their preferred tools while maintaining enterprise-grade capabilities.</li>
<li style="font-weight:400;">The platform implements open protocols, including Model Context Protocol (MCP) for tool interoperability and Agent-to-Agent (A2A) for cross-platform agent collaboration, positioning Azure as protocol-agnostic compared to more proprietary approaches from competitors. </li>
<li style="font-weight:400;">This enables agents built on different frameworks to communicate and share capabilities across vendor boundaries.</li>
<li style="font-weight:400;">Azure AI Foundry integrates directly with Microsoft 365 and Copilot through the <a href="https://learn.microsoft.com/en-us/microsoft-365/agents-sdk/">Microsoft 365 Agents SDK</a>, allowing developers to deploy agents to Teams, BizChat, and other productivity surfaces where business users already work. The platform also provides REST API exposure and Logic Apps integration with thousands of prebuilt connectors to enterprise systems.</li>
<li style="font-weight:400;">The VS Code extension enables local agent development with integrated tracing, evaluation, and one-click deployment to Foundry Agent Service, while the unified Model Inference API allows model swapping without code changes. This addresses the common pain point of agents working locally but requiring extensive rewrites for production deployment.</li>
<li style="font-weight:400;">Built-in observability, continuous evaluation through CI/CD integration, and enterprise guardrails for identity, networking, and compliance are integrated into the development workflow rather than added post-deployment. This positions Azure AI Foundry as focusing on production readiness from the start, targeting enterprises that need rapid agent development without sacrificing governance.</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2145087/c1e-1xdxb5owg6i4mx9g-pkxp620mav0-kzgztm.mp3" length="64940416"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[The Cloud Pod is in Tears Trying to Understand Azure Tiers  
 Welcome to episode 321 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Matt are all on hand to bring you the latest in cloud and AI news, including increased metrics data (because who doesn’t love more data), some issues over at Cloudflare, and even bigger issues at Builder.ai  – plus so much more. 
Let’s get started! 
Titles we almost went with this week


Lost in Translation: Google Helps IPv6 Find Its Way to IPv4
BigQuery’s Soft Landing for Hard Problems
CloudWatch Gets a Two-Week Memory Upgrade
VM Glow-Up: From Gen1 Zero to Gen2 Hero
Azure Gets Contextual: API Management Learns to Speak AI
The Cloud Pod: Now Broadcasting from 20,000 Leagues Under the Sea
LoRA LoRA on the Wall, Who’s the Finest Model of Them All
Azure Says MFA or the Highway for Resource Management
Two-Factor or Two-Furious: Azure’s Security Ultimatum
Agent 007: License to Build
CUD You Believe It? Google’s Discounts Get More Flexible
WAF’s New Deal: Free Logs with Every Million Requests Served
SOC It To Me: Google’s AI Security Workshop Tour
MFA mandatory in Azure, now you too can hate/hate MS Authenticator
AWS AMIs no longer the Tribbles of cloud computing
ECS Exec; Justin’s prediction from 2018 finally comes true

General News

00:56 FinOps Weekly Summit 2025

Victor Garcia reached out and asked us to share the news about the FinOps Weekly Summit coming up on October 23rd, 2025. 
A lot of great speakers; if you’re in the FinOps space, we recommend it. 
Want to register? You can do that here. 

01:53 Ignite Registration Opens 

San Francisco, Moscone Center
November 18–21, 2025
Need to convince your manager to pay for you to go? Find that letter here. 

02:45 Addressing the unauthorized issuance of multiple TLS certificates for 1.1.1.1

Some issues over at Cloudflare recently…
Fina CA issued 12 unauthorized TLS certificates for Cloudflare’s 1.1.1.1 DNS resolver IP address between February 2024 and August 2025, violating domain control validation requirements and potentially allowing man-in-the-middle attacks on DNS-over-TLS and DNS-over-HTTPS connections.
The incident highlights vulnerabilities in the Certificate Authority trust model where any trusted CA can issue certificates for any domain or IP without proper validation, though exploitation would require the attacker to have the private key, intercept traffic, and target clients that trust Fina CA (primarily Microsoft systems).
Cloudflare failed to detect these certificates for months despite operating its own Certificate Transparency monitoring service because its system wasn’t configured to alert on IP address certificates rather than domain names, exposing gaps in its internal security monitoring.
The certificates have been ]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2145087/c1a-k5d5-6z36jjg8s0g9-jm88oo.jpg"></itunes:image>
                                                                            <itunes:duration>00:54:07</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2145087/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[320: Azure gives your Finops person a heart attack]]>
                </title>
                <pubDate>Thu, 11 Sep 2025 01:51:04 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2137820</guid>
                                    <link>https://tcpfm.castos.com/episodes/320-aws-cost-mcp-your-billing-data-now-speaks-human</link>
                                <description>
                                            <![CDATA[<h3>Welcome to episode 320 of The Cloud Pod, where the forecast is always cloudy! Justin, Matt, and Ryan are coming to you from Justin’s echo chamber and bringing all the latest in AI and Cloud news, including updates to Google’s Anti-trust case, AWS Cost MCP, new regions, updates to EKS, Veo, and Claude, and more! Let’s get into it. </h3>
<h3>Titles we almost went with this week:
</h3>
<ul>
<li>Breaking Bad Bottlenecks: AWS  Cooks Up Faster Container Pulls</li>
<li>The Bucket List: Finding Your Lost Storage Dollars</li>
<li>State of Denial: Terraform Finally Stops Saving Your Passwords</li>
<li>Three Stages of Azure Grief: Development, Preview, and Launch</li>
<li>Ground Control to Major Cloud: Microsoft Launches Planetary Computer Pro</li>
<li>Veo Vidi Vici: Google Conquers Video Editing</li>
<li>Red Alert: AWS Makes Production Accounts Actually Look Dangerous</li>
<li>Amazon EKS Discovers the F5 Key </li>
<li>Chaos Theory Meets ChatGPT: When Your Reliability Data Gets an AI Therapist</li>
<li>Breaking Bad (Services): How AI Helps You Find What’s Already   Broken</li>
<li>Breaking Up is Hard to Cloud: Gemini Moves Back In</li>
<li>Intel Inside Your Secrets: TDX Takes Over Google Cloud</li>
<li>Lord of the Regions: The Return of the Kiwi </li>
<li>All Blacks and All Stacks: AWS Goes Full Kiwi</li>
<li>Azure Forecast: 100% Chance of Budget Alert Storms</li>
<li>Google Keeps Its Cloud Together: A $2.5T Near Miss</li>
<li>Shell We Dance? AWS Makes CLI Scripting Less Painful</li>
<li>AWS Finally Admits Nobody Remembers All Those CLI Commands</li>
<li>Cache Me If You Claude</li>
<li>Your AWS Console gets its Colors, just don’t choose red shirts</li>
<li>Amazon Q walks into a bar, Tells MCP to order it a beer.. The Bartender sighs and mutters “at least chatgpt just hallucinates its beer”</li>
<li>Ryan’s shitty scripts now as a AWS CLI Library</li>
</ul>
<p>A big thanks to this week’s sponsor:</p>
<p>We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our Slack channel for more info.</p>
<h2>General News</h2>
<p>00:57 <a href="https://www.politico.com/news/2025/09/02/google-dodges-a-2-5t-breakup-00540419">Google Dodges A 2.5t Breakup</a></p>
<ul>
<li style="font-weight:400;">We have breaking news – and it’s good news for Google. </li>
<li style="font-weight:400;">Google successfully avoided a potential $2.5 trillion breakup following antitrust proceedings, maintaining its current corporate structure <a href="https://storage.courtlistener.com/recap/gov.uscourts.dcd.223205/gov.uscourts.dcd.223205.1436.0_2.pdf">despite regulatory pressure</a>.</li>
<li style="font-weight:400;">The decision represents a significant outcome for Big Tech antitrust cases, potentially setting a precedent for how regulators approach market dominance issues in the cloud and technology sectors.</li>
<li style="font-weight:400;">Cloud customers and partners can expect business continuity with Google Cloud Platform services, avoiding potential disruptions that could have resulted from a corporate restructuring.</li>
<li style="font-weight:400;">The ruling may influence how other major cloud providers structure their businesses and approach regulatory compliance, particularly around bundling services and market competition.</li>
<li style="font-weight:400;">Enterprise customers relying on Google’s integrated ecosystem of cloud, advertising, and productivity tools can continue their current architectures without concerns about service separation.</li>
<li style="font-weight:400;">You just KNOW Microsoft is super mad about this. </li>
</ul>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>02:16 <a href="https://openai.com/index/introducing-gpt-realtime/">Introducing GPT-Realtime</a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/">OpenAI</a>‘s <a href="https://openai.com/index/introducing-the-realtime-api..."></a></li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:07) - Cloud Pod: Azure vs GCP</li><li>(00:01:01) - Google Stops Exploring a Breakup</li><li>(00:03:49) - Terraform Cloud Provider 7.0 in general availability</li><li>(00:06:13) - How to Query Gremlin's LLM with Chaos Engineering Data</li><li>(00:08:32) - Amazon EKS: Parallel Polls for AI & Windows</li><li>(00:15:52) - Amazon.com: Terraform Deployment for SFTP Connectors</li><li>(00:19:11) - Amazon Q Developer Adds Central Admin Control for MCP Servers</li><li>(00:21:04) - AWS i8ge and M8i Flex Instances</li><li>(00:24:55) - Amazon M7i Flex Instances: Best Cloud Instances</li><li>(00:27:53) - Wales: New AWS Region Launches in New Zealand</li><li>(00:32:56) - Google Cloud: New Features and No Cost Option for Videos</li><li>(00:37:11) - GKE Container Optimized Compute</li><li>(00:38:42) - Intel TDX for Confidential Computing with Google</li><li>(00:40:17) - GCP EventArc Advanced is Now Generally Available</li><li>(00:42:31) - Azure AI Foundry: Comprehensive agent observability capabilities</li><li>(00:46:59) - Microsoft's Planetary Computer Pro: An All-in-One for</li><li>(00:50:56) - Microsoft's Migration From MOSP to Microsoft Accounts Causes False Budget Alert</li><li>(00:52:49) - Microsoft to Make UltraDs More Affordable in Multiple Regions</li><li>(00:54:00) - The Business Talk Podcast</li><li>(00:55:00) - Week in Cloud: Exploring the Cloud</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 320 of The Cloud Pod, where the forecast is always cloudy! Justin, Matt, and Ryan are coming to you from Justin’s echo chamber and bringing all the latest in AI and Cloud news, including updates to Google’s Anti-trust case, AWS Cost MCP, new regions, updates to EKS, Veo, and Claude, and more! Let’s get into it. 
Titles we almost went with this week:


Breaking Bad Bottlenecks: AWS  Cooks Up Faster Container Pulls
The Bucket List: Finding Your Lost Storage Dollars
State of Denial: Terraform Finally Stops Saving Your Passwords
Three Stages of Azure Grief: Development, Preview, and Launch
Ground Control to Major Cloud: Microsoft Launches Planetary Computer Pro
Veo Vidi Vici: Google Conquers Video Editing
Red Alert: AWS Makes Production Accounts Actually Look Dangerous
Amazon EKS Discovers the F5 Key 
Chaos Theory Meets ChatGPT: When Your Reliability Data Gets an AI Therapist
Breaking Bad (Services): How AI Helps You Find What’s Already   Broken
Breaking Up is Hard to Cloud: Gemini Moves Back In
Intel Inside Your Secrets: TDX Takes Over Google Cloud
Lord of the Regions: The Return of the Kiwi 
All Blacks and All Stacks: AWS Goes Full Kiwi
Azure Forecast: 100% Chance of Budget Alert Storms
Google Keeps Its Cloud Together: A $2.5T Near Miss
Shell We Dance? AWS Makes CLI Scripting Less Painful
AWS Finally Admits Nobody Remembers All Those CLI Commands
Cache Me If You Claude
Your AWS Console gets its Colors, just don’t choose red shirts
Amazon Q walks into a bar, Tells MCP to order it a beer.. The Bartender sighs and mutters “at least chatgpt just hallucinates its beer”
Ryan’s shitty scripts now as a AWS CLI Library

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our Slack channel for more info.
General News
00:57 Google Dodges A 2.5t Breakup

We have breaking news – and it’s good news for Google. 
Google successfully avoided a potential $2.5 trillion breakup following antitrust proceedings, maintaining its current corporate structure despite regulatory pressure.
The decision represents a significant outcome for Big Tech antitrust cases, potentially setting a precedent for how regulators approach market dominance issues in the cloud and technology sectors.
Cloud customers and partners can expect business continuity with Google Cloud Platform services, avoiding potential disruptions that could have resulted from a corporate restructuring.
The ruling may influence how other major cloud providers structure their businesses and approach regulatory compliance, particularly around bundling services and market competition.
Enterprise customers relying on Google’s integrated ecosystem of cloud, advertising, and productivity tools can continue their current architectures without concerns about service separation.
You just KNOW Microsoft is super mad about this. 

AI Is Going Great – Or How ML Makes Money 
02:16 Introducing GPT-Realtime

OpenAI‘s ]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[320: Azure gives your Finops person a heart attack]]>
                </itunes:title>
                                    <itunes:episode>320</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<h3>Welcome to episode 320 of The Cloud Pod, where the forecast is always cloudy! Justin, Matt, and Ryan are coming to you from Justin’s echo chamber and bringing all the latest in AI and Cloud news, including updates to Google’s Anti-trust case, AWS Cost MCP, new regions, updates to EKS, Veo, and Claude, and more! Let’s get into it. </h3>
<h3>Titles we almost went with this week:
</h3>
<ul>
<li>Breaking Bad Bottlenecks: AWS  Cooks Up Faster Container Pulls</li>
<li>The Bucket List: Finding Your Lost Storage Dollars</li>
<li>State of Denial: Terraform Finally Stops Saving Your Passwords</li>
<li>Three Stages of Azure Grief: Development, Preview, and Launch</li>
<li>Ground Control to Major Cloud: Microsoft Launches Planetary Computer Pro</li>
<li>Veo Vidi Vici: Google Conquers Video Editing</li>
<li>Red Alert: AWS Makes Production Accounts Actually Look Dangerous</li>
<li>Amazon EKS Discovers the F5 Key </li>
<li>Chaos Theory Meets ChatGPT: When Your Reliability Data Gets an AI Therapist</li>
<li>Breaking Bad (Services): How AI Helps You Find What’s Already   Broken</li>
<li>Breaking Up is Hard to Cloud: Gemini Moves Back In</li>
<li>Intel Inside Your Secrets: TDX Takes Over Google Cloud</li>
<li>Lord of the Regions: The Return of the Kiwi </li>
<li>All Blacks and All Stacks: AWS Goes Full Kiwi</li>
<li>Azure Forecast: 100% Chance of Budget Alert Storms</li>
<li>Google Keeps Its Cloud Together: A $2.5T Near Miss</li>
<li>Shell We Dance? AWS Makes CLI Scripting Less Painful</li>
<li>AWS Finally Admits Nobody Remembers All Those CLI Commands</li>
<li>Cache Me If You Claude</li>
<li>Your AWS Console gets its Colors, just don’t choose red shirts</li>
<li>Amazon Q walks into a bar, Tells MCP to order it a beer.. The Bartender sighs and mutters “at least chatgpt just hallucinates its beer”</li>
<li>Ryan’s shitty scripts now as a AWS CLI Library</li>
</ul>
<p>A big thanks to this week’s sponsor:</p>
<p>We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our Slack channel for more info.</p>
<h2>General News</h2>
<p>00:57 <a href="https://www.politico.com/news/2025/09/02/google-dodges-a-2-5t-breakup-00540419">Google Dodges A 2.5t Breakup</a></p>
<ul>
<li style="font-weight:400;">We have breaking news – and it’s good news for Google. </li>
<li style="font-weight:400;">Google successfully avoided a potential $2.5 trillion breakup following antitrust proceedings, maintaining its current corporate structure <a href="https://storage.courtlistener.com/recap/gov.uscourts.dcd.223205/gov.uscourts.dcd.223205.1436.0_2.pdf">despite regulatory pressure</a>.</li>
<li style="font-weight:400;">The decision represents a significant outcome for Big Tech antitrust cases, potentially setting a precedent for how regulators approach market dominance issues in the cloud and technology sectors.</li>
<li style="font-weight:400;">Cloud customers and partners can expect business continuity with Google Cloud Platform services, avoiding potential disruptions that could have resulted from a corporate restructuring.</li>
<li style="font-weight:400;">The ruling may influence how other major cloud providers structure their businesses and approach regulatory compliance, particularly around bundling services and market competition.</li>
<li style="font-weight:400;">Enterprise customers relying on Google’s integrated ecosystem of cloud, advertising, and productivity tools can continue their current architectures without concerns about service separation.</li>
<li style="font-weight:400;">You just KNOW Microsoft is super mad about this. </li>
</ul>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>02:16 <a href="https://openai.com/index/introducing-gpt-realtime/">Introducing GPT-Realtime</a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/">OpenAI</a>‘s <a href="https://openai.com/index/introducing-the-realtime-api/">GPT-Realtime</a> introduces real-time processing capabilities to <a href="https://www.chatbase.co/blog/gpt-models">GPT models</a>, reducing latency for interactive applications and enabling more responsive AI experiences in cloud environments.</li>
<li style="font-weight:400;">The technology leverages optimized model inference and architectural changes to deliver sub-second response times, making it suitable for live customer service, real-time translation, and interactive coding assistants.</li>
<li style="font-weight:400;">Cloud providers can integrate GPT-Realtime through new API endpoints, offering developers the ability to build applications that require immediate AI responses without traditional batch processing delays.</li>
<li style="font-weight:400;">This development addresses a key limitation in current LLM deployments where response latency has restricted use cases in time-sensitive applications like live streaming, gaming, and financial trading systems.</li>
<li style="font-weight:400;">For businesses running AI workloads in the cloud, GPT-Realtime could reduce infrastructure costs by eliminating the need for pre-processing queues and enabling more efficient resource utilization through streaming inference.</li>
</ul>
<p>02:58  Matt – “More AI scam calling coming your way.” </p>
<h2>Cloud Tools</h2>
<p>04:14 <a href="https://www.hashicorp.com/en/blog/terraform-provider-for-google-cloud-7-0-is-now-ga">Terraform provider for Google Cloud 7.0 is now GA</a></p>
<ul>
<li style="font-weight:400;">Terraform Google Cloud provider 7.0 introduces ephemeral resources and write-only attributes that prevent sensitive data, such as access tokens and passwords, from being stored in state files, addressing a major security concern for infrastructure teams.</li>
<li style="font-weight:400;">The provider now supports over 800 resources and 300 data sources with 1.4 billion downloads, making it one of the most comprehensive infrastructure-as-code tools for Google Cloud Platform management.</li>
<li style="font-weight:400;">New validation logic catches configuration errors during Terraform plan rather than apply, providing fail-fast behavior that makes deployments more predictable and reduces failed infrastructure changes.</li>
<li style="font-weight:400;">Breaking changes in 7.0 align the provider with Google Cloud’s latest APIs and mark functionally required attributes as mandatory in schemas, requiring teams to review upgrade guides before migrating from version 6.</li>
<li style="font-weight:400;">The ephemeral resource feature leverages <a href="https://www.hashicorp.com/en/blog/terraform-1-10-improves-handling-secrets-in-state-with-ephemeral-values">Terraform 1.10</a>+ capabilities to handle temporary credentials, such as service account access tokens, without exposing state file attributes (write-only). This solves the long-standing problem of secret management in GitOps workflows.</li>
</ul>
<p>05:19  Ryan – “I like the ephemeral resources; I think it’s a neat model for handling sensitive information and stuff you don’t want to store. It’s kind of a neat process.” </p>
<p>06:50 <a href="https://www.gremlin.com/blog/how-to-get-fast-easy-insights-with-the-gremlin-mcp-server">How to get fast, easy insights with the Gremlin MCP Server</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.gremlin.com/docs/platform-reliability-intelligence">Gremlin’s MCP Server</a> connects chaos engineering data to LLMs like <a href="https://chatgpt.com/overview">ChatGPT</a> or <a href="https://claude.ai/login">Claude</a>, enabling teams to query their reliability testing results using natural language to uncover insights about service dependencies, test coverage gaps, and which services to test next.</li>
<li style="font-weight:400;">The server architecture consists of three components: the LLM client, a containerized MCP server that interfaces with Gremlin’s API, and the Gremlin API itself – designed for read-only operations to prevent accidental system damage during data exploration.</li>
<li style="font-weight:400;">This solves the problem of making sense of complex reliability testing data by allowing engineers to ask plain English questions like “Which of my services should I test next?” Instead of manually analyzing test results and metrics.</li>
<li style="font-weight:400;">The tool requires a Gremlin account with REST API key, an AI interface that supports MCP servers like <a href="https://claude.ai/download">Claude Desktop</a>, and <a href="http://node.js">Node.js 22+</a> – making it accessible to teams already using Gremlin for chaos engineering.</li>
<li style="font-weight:400;">During internal beta testing at Gremlin, the MCP server helped uncover production-impacting bugs before release, demonstrating its practical value for improving service reliability through AI-assisted data analysis.</li>
</ul>
<p>07:38  Ryan – “It’s amazing they limited this to read-only commands, the API. I don’t know why they did that…it’s kind of neat to see the interaction model with different services.”</p>
<h2>AWS</h2>
<p>09:21 <a href="https://aws.amazon.com/blogs/containers/introducing-seekable-oci-parallel-pull-mode-for-amazon-eks/">Introducing Seekable OCI Parallel Pull mode for Amazon EKS | Containers</a></p>
<ul>
<li style="font-weight:400;">AWS introduces <a href="https://github.com/awslabs/soci-snapshotter/releases/tag/v0.11.0">SOCI Parallel Pull</a> mode for <a href="https://aws.amazon.com/eks/">EKS</a> to address container image pull bottlenecks, particularly for AI/ML workloads where images can exceed 10GB and take several minutes to download using traditional methods.</li>
<li style="font-weight:400;">The feature parallelizes both the download and unpacking phases, utilizing multiple HTTP connections per layer for downloads and concurrent CPU cores for unpacking, to achieve up to 60% faster pull times compared to standard containerd configurations.</li>
<li style="font-weight:400;">SOCI Parallel Pull is built into recent Amazon EKS Optimized AMIs for <a href="https://aws.amazon.com/linux/amazon-linux-2023/">Amazon Linux 2023</a> and <a href="https://aws.amazon.com/bottlerocket/">Bottlerocket</a>, with configurable parameters for download concurrency (recommended 10-20 for ECR), chunk size (16MB recommended), and unpacking parallelism based on your instance resources.</li>
<li style="font-weight:400;">The solution trades reduced pull times for higher network, CPU, and storage utilization, requiring optimized EBS volumes with 1000 MiB/s throughput or instance store NVMe disks for optimal performance on instances like m6i.8xlarge.</li>
<li style="font-weight:400;">This directly impacts deployment responsiveness and cluster scaling operations, with container startup time reductions from nearly 2 minutes to 45 seconds for a 10GB Deep Learning Container, making it particularly valuable for organizations running large-scale AI/ML workloads on EKS.</li>
<li style="font-weight:400;">What Matt was remembering: <a href="https://aws.amazon.com/about-aws/whats-new/2023/11/aws-fargate-amazon-ecs-tasks-selectively-leverage-soci/">https://aws.amazon.com/about-aws/whats-new/2023/11/aws-fargate-amazon-ecs-tasks-selectively-leverage-soci/</a> </li>
</ul>
<p>10:24  Justin – “I personally don’t use all the CPU memory or the network of most of my container instances. So yes, that’s a willing trade-off I’m willing to make.”</p>
<p>13:13 <a href="https://aws.amazon.com/about-aws/whats-new/2025/08/aws-management-console-assigning-color-aws-account/">AWS Management Console now supports assigning a color to an AWS </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/08/aws-management-console-assigning-color-aws-account/">account for easier identification</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/console/">AWS Management Console</a> now allows admins to assign colors to accounts (like red for production, yellow for testing) that appear in the navigation bar, replacing the need to memorize account numbers for identification across multi-account environments.</li>
<li style="font-weight:400;">The feature addresses a common pain point for organizations managing multiple AWS accounts for different workloads, business units, or environments by providing instant visual differentiation when switching between accounts.</li>
<li style="font-weight:400;">Implementation requires admin privileges to set colors through the Account menu, and users need either the AWSManagementConsoleBasicUserAccess managed policy or the custom uxc:getaccountcolor permission to view the assigned colors.</li>
<li style="font-weight:400;">This quality-of-life improvement reduces the risk of accidental changes in the wrong environment and speeds up context switching for engineers and operators who regularly work across multiple AWS accounts.</li>
<li style="font-weight:400;">The feature is available now in all public regions at no additional cost, representing AWS’s continued focus on console usability improvements for enterprise customers managing complex multi-account architectures.</li>
</ul>
<p>14:57  Matt – “I use it for Chrome and that’s always where I’ve identified different users depending on where it was, I kind of like it where it’s something that can be set.”</p>
<p>17:07 <a href="https://aws.amazon.com/about-aws/whats-new/2025/08/aws-transfer-family-terraform-sftp-connectors/">AWS Transfer Family introduces Terraform support for deploying SFTP </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/08/aws-transfer-family-terraform-sftp-connectors/">connectors</a></p>
<ul>
<li style="font-weight:400;"><a href="https://registry.terraform.io/modules/aws-ia/transfer-family/aws/latest">AWS Transfer Family</a> now supports Terraform deployment for SFTP connectors, enabling Infrastructure as Code automation for file transfers between S3 and remote SFTP servers. This extends beyond the existing SFTP server endpoint support to include the connector functionality.</li>
<li style="font-weight:400;">SFTP connectors provide fully managed, low-code file copying between S3 and remote SFTP servers, and the new <a href="https://github.com/aws-ia/terraform-aws-transfer-family">Terraform module</a> allows programmatic provisioning with dependencies and customizations in a single deployment.</li>
<li style="font-weight:400;">The module includes end-to-end examples for automating file transfer workflows using schedule or event triggers, eliminating manual configuration errors and providing repeatable, scalable deployments.</li>
<li style="font-weight:400;">This addresses a common enterprise need for automated file transfers between cloud storage and legacy SFTP systems, particularly useful for organizations migrating to the cloud or maintaining hybrid architectures.</li>
<li style="font-weight:400;">The Terraform module is available on GitHub at github.com/aws-ia/terraform-aws-transfer-family with documentation at registry.terraform.io/modules/aws-ia/transfer-family/aws/latest.</li>
</ul>
<p>18:57  Ryan – “You know you’re getting deep into enterprise orchestration in terms of your customer base when you’re doing stuff like this, because this is ROUGH. “</p>
<p>19:20 <a href="https://aws.amazon.com/about-aws/whats-new/2025/08/amazon-eks-on-demand-insights-refresh/">Amazon EKS introduces on-demand insights refresh</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/eks/latest/userguide/what-is-eks.html">Amazon EKS</a> now allows on-demand refresh of cluster insights, letting customers immediately verify if their applied recommendations and configuration changes have taken effect instead of waiting for periodic automatic checks.</li>
<li style="font-weight:400;">This feature addresses a key pain point during Kubernetes upgrades by providing instant feedback on whether required changes have been properly implemented, reducing the time between making changes and validating them.</li>
<li style="font-weight:400;">The insights system checks for issues like deprecated APIs before version upgrades and provides specific remediation steps, with the refresh capability now available in all commercial AWS regions.</li>
<li style="font-weight:400;">For DevOps teams managing multiple EKS clusters, this eliminates the guesswork and waiting periods during maintenance windows, particularly useful when performing rolling upgrades across environments.</li>
<li style="font-weight:400;">The feature integrates with existing EKS cluster management workflows at no additional cost, accessible through the EKS console or API as documented at docs.aws.amazon.com/eks/latest/userguide/cluster-insights.html.</li>
</ul>
<p>20:41 <a href="https://aws.amazon.com/about-aws/whats-new/2025/08/amazon-q-developer-mcp-admin-control/">Amazon Q Developer now supports MCP admin control</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/q/developer/">Amazon Q Developer</a> adds centralized admin control for <a href="https://github.com/modelcontextprotocol/servers">Model Context Protocol (MCP) servers</a>, allowing organizations to enable or disable MCP functionality across all Q Developer clients from the AWS console.</li>
<li style="font-weight:400;">The feature provides session-level enforcement, checking admin settings at startup and every 24 hours during runtime, ensuring consistent policy application across <a href="https://code.visualstudio.com/download">VSCode</a>, <a href="https://duckduckgo.com/y.js?ad_domain=jetbrains.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=wUZ_LSgNvqL1DKVaoGV_tR4ZFNnwwiIJL7dePWvk1BQ8ysWWtVUPIrLAOZkXiKayCGnBWL6TfRllebQzHX7PUWRI3zI1H4FUsY73aSztmj0X0cYB6_UyTGExI_fptQT2.69QJ2zLUm3Tf6xOcjADdrw&amp;eddgt=CfcirB-WXt24BQODOJSlYA%3D%3D&amp;rut=8166f522207f2239fc7ce88c3df36c27df31192348919cf7efff090f65e24ad0&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8A06nvs7PnjJGQ-T8K-vKTTVUCUyIVtP85LaiO-WTKP73pLmnKk1U3DB4OXqJ46MvdGbClTkmiNZ6DENXfGIG6IHu5qyfF2vkFTICVnYFRWCA2345ERA_3AjUNlgxxf72c3rz8945b66zHsqrnjV1lGviFIBA7uzAf3GNbpg48XMkIe-xu9W-rm7-e4B-HFjBxdEv7jQu6ttRsQ_5kd3XYWy5X6BPTA6mz69OG98AW5WMY_9uPYwkAw1JVbWaUxemCiKEq1jZjXtAQhgSCekCECgnCdTJj79M9kQSYYmBflcYVIKU3U9YzH-MHoavc0O6QBnCqjY-5bZAJv5VGz7frJSiZs12myy5_u6lcHweTSfsCLmAeyX2u8XyLEBBb2xSG2UAAo0BQpVJB2mXlbd0FJFK3e4AgFM3BkmsF0xguPqK_UELQCkU3z7tyaTfxGLC0FVPdUyJc-1S8EhrNzY-9zrY6yh7uXw7oeylgL5sPSai1LJEHbDJlXQ7zqhCqSuTsNowYkhv_Fz0BEmdq8vtyyL4y66Q_cAGHB5d5QmU0vtl-ZIa8xVEG3K-3_n8P5DgXXTi5oJBQLUKVgNAQiixal1GNTPe6ye8raZvdQyr03xw7wqv4s1NcnOSrZjURGsdlls1hPBHmQjJiotrt_AARc9eV2AL1uiGWEkVGdl85illyk_bL3MMu0zuykzghIt_a4Av84aKRJ9YWjPU_5IR0m-TxjtmPDgBPEmj2Vq2BV5b_Y5kqaVouTUmf9CKzua0YGd9Mg%26u%3DaHR0cHMlM2ElMmYlMmZscC5qZXRicmFpbnMuY29tJTJmaW50ZWxsaWotaWRlYS1wcm9tbyUyZiUzZm1zY2xraWQlM2RjNzQzMDNmMTNlZTUxZTAzNzMyNjZiMzFlNzJkY2FmYiUyNnV0bV9zb3VyY2UlM2RiaW5nJTI2dXRtX21lZGl1bSUzZGNwYyUyNnV0bV9jYW1wYWlnbiUzZEFNRVJfZW5fVVMtUFNUJTI1MkJNU1RfSURFQV9CcmFuZGVkJTI2dXRtX3Rlcm0lM2RpbnRlbGxpaiUyNTIwSURFQSUyNnV0bV9jb250ZW50JTNkaW50ZWxsaWolMjUyMGlkZWE%26rlid%3Dc74303f13ee51e0373266b31e72dcafb&amp;vqd=4-104893368874981554249078490907971548754&amp;iurl=%7B1%7DIG%3D6F737D604C40406E8400851EF188A886%26CID%3D1B3D89C6A03763352BDA9FA5A1586222%26ID%3DDevEx%2C5045.1">JetBrains</a>, <a href="https://visualstudio.microsoft.com/">Visual Studio</a>, <a href="https://www.eclipse.org/downloads/">Eclipse</a>, and the <a href="https://aws.amazon.com/developer/learning/q-developer-cli/">Q Developer CLI</a>.</li>
<li style="font-weight:400;">Organizations gain granular control over external resource access through MCP servers, addressing security concerns by preventing users from adding unauthorized servers when the functionality is disabled.</li>
<li style="font-weight:400;">This update positions Q Developer as a more enterprise-ready AI coding assistant by giving IT administrators the governance tools needed to manage AI-powered development environments at scale.</li>
<li style="font-weight:400;">The control mechanism operates at no additional cost and integrates with existing Q Developer subscriptions, making it immediately available to current enterprise customers without deployment overhead.</li>
</ul>
<p>21:33  Ryan – “This future is going to be a little weird, you know, as we sort it out. You think about like chatbots and being able to sort of create infrastructure there and then, kind of bypassing a lot of the permissions and stuff. This is kind of the same problem, but magnified a lot more. And so like, it’s going to be interesting to see how companies adapt.”</p>
<p>22:48 <a href="https://aws.amazon.com/about-aws/whats-new/2025/08/amazon-ec2-i8ge-instances-generally-available/">Introducing Amazon EC2 I8ge instances</a> </p>
<ul>
<li style="font-weight:400;">AWS launches <a href="https://aws.amazon.com/ec2/instance-types/i8g/">I8ge instances</a> with <a href="https://aws.amazon.com/ec2/graviton/level-up-with-graviton/">Graviton4</a> processors delivering 60% better compute performance than previous <a href="https://aws.amazon.com/ec2/graviton/">Graviton2</a> storage-optimized instances, plus 120TB of local NVMe storage – the highest density among Graviton-based storage instances.</li>
<li style="font-weight:400;">The new third-generation <a href="https://aws.amazon.com/blogs/aws/aws-nitro-ssd-high-performance-storage-for-your-i-o-intensive-applications/">AWS Nitro SSDs</a> provide 55% better real-time storage performance per TB with 60% lower I/O latency compared to I4gn instances, making them ideal for latency-sensitive workloads like real-time databases and streaming analytics.</li>
<li style="font-weight:400;">I8ge instances scale up to 48xlarge with 1,536 GiB memory and offer 300 Gbps networking bandwidth – the highest among storage-optimized EC2 instances – addressing the needs of data-intensive applications requiring both storage density and network throughput.</li>
<li style="font-weight:400;">Currently available only in US East (Ohio), US East (N. Virginia), and US West (Oregon), limiting deployment options for global workloads compared to other EC2 instance families.</li>
<li style="font-weight:400;">The combination of high storage density, improved I/O performance, and Graviton4 efficiency positions these instances for cost-effective deployment of search clusters, time-series databases, and real-time analytics platforms that previously required multiple instances or external storage.</li>
</ul>
<p>PLUS</p>
<p><a href="https://aws.amazon.com/blogs/aws/new-general-purpose-amazon-ec2-m8i-and-m8i-flex-instances-are-now-available/">New general-purpose Amazon EC2 M8i and M8i Flex instances are now </a><a href="https://aws.amazon.com/blogs/aws/new-general-purpose-amazon-ec2-m8i-and-m8i-flex-instances-are-now-available/">available | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;">AWS launches <a href="https://aws.amazon.com/ec2/instance-types/m8i/">M8i and M8i-Flex instances</a> with custom Intel Xeon 6 processors running at 3.9 GHz all-core turbo, delivering up to 15% better price-performance and 2.5x memory bandwidth compared to M7i generation.</li>
<li style="font-weight:400;">M8i-Flex offers a 5% lower price point for workloads that don’t need sustained CPU performance, reaching full CPU performance 95% of the time while maintaining compatibility with existing applications.</li>
<li style="font-weight:400;">Performance gains include 60% faster NGINX web serving, 30% faster PostgreSQL database operations, and 40% faster AI deep learning recommendation models compared to the previous generation.</li>
<li style="font-weight:400;">New sixth-generation <a href="https://aws.amazon.com/ec2/nitro/">AWS Nitro Cards</a> provide 2x network and EBS bandwidth with configurable 25% allocation adjustments between network and storage, improving database query processing and logging speeds.</li>
<li style="font-weight:400;">Available in 4 regions (US East Virginia/Ohio, US West Oregon, Europe Spain) with sizes up to 384 vCPUs and 1.5TB memory, including bare metal options and SAP certification for enterprise workloads.</li>
</ul>
<p>29:30 <a href="https://aws.amazon.com/blogs/aws/now-open-aws-asia-pacific-new-zealand-region/">Now Open — AWS Asia Pacific (New Zealand) Region | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;">AWS launches its 38th global region in <a href="https://aws.amazon.com/local/new-zealand/">New Zealand (ap-southeast-6)</a> with three availability zones, representing a NZD 7.5 billion investment that’s expected to contribute NZD 10.8 billion to New Zealand’s GDP and create 1,000 jobs annually.</li>
<li style="font-weight:400;">The region addresses data residency requirements for New Zealand organizations and government agencies operating under the country’s cloud-first policy, with AWS supporting 143 security standards, including PCI DSS, HIPAA, and GDPR compliance certifications.</li>
<li style="font-weight:400;">New Zealand customers like MATTR, Xero, and Thematic are already leveraging AWS services, including Amazon Bedrock for generative AI applications, with the region powered by renewable energy through an agreement with Mercury New Zealand from day one.</li>
<li style="font-weight:400;">AWS has been building infrastructure in New Zealand since 2013, including CloudFront edge locations, an Auckland Local Zone for single-digit millisecond latency, and Direct Connect locations, with this full region launch completing their local infrastructure footprint.</li>
<li style="font-weight:400;">The launch brings AWS to 120 Availability Zones across 38 regions globally, with strong local partner ecosystem support from companies like Custom D, Grant Thornton Digital, MongoDB, and Parallo serving New Zealand customers.</li>
</ul>
<p>30:54 <a href="https://aws.amazon.com/about-aws/whats-new/2025/09/open-source-aws-cli-scripts/">Announcing a new open source project for scenario-focused AWS CLI </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/09/open-source-aws-cli-scripts/">scripts</a></p>
<ul>
<li style="font-weight:400;">AWS launched an open source project providing tested shell scripts for over 60 AWS services, addressing the common challenge of writing error-handling and cleanup logic when using the <a href="https://aws.amazon.com/cli/">AWS CLI</a> for infrastructure automation.</li>
<li style="font-weight:400;">The <a href="https://aws.amazon.com/training/learn-about/developer/">AWS Developer Tutorials</a> project on GitHub includes end-to-end scripts with built-in resource tracking and cleanup operations, reducing the time developers spend debugging CLI commands and preventing orphaned resources.</li>
<li style="font-weight:400;">Developers can generate new scripts in as little as 15 minutes using generative AI tools like <a href="https://aws.amazon.com/developer/learning/q-developer-cli/">Amazon Q Developer CLI</a>, leveraging existing documentation to create working scripts through an iterative test-and-improve process.</li>
<li style="font-weight:400;">Each script comes with tutorials explaining the AWS service API interactions, making it easier for teams to understand and modify scripts for their specific use cases rather than starting from scratch.</li>
<li style="font-weight:400;">The project accepts community contributions and provides instructions for generating new scripts, potentially building a comprehensive library of production-ready CLI automation patterns across AWS services.</li>
<li style="font-weight:400;">We hereby nominate Ryan’s shitty scripts to the community as a contribution. You’re welcome, world.  </li>
</ul>
<p>31:56  Ryan – “I will definitely give it a look. It’s kind of strange, because most of the contributions right now are very specific to tutorials, like trying to learn a new Amazon service, and there’s very little documentation on what error handling and advanced sorts of logic are built into these scripts. All of the documentation is just directing you at Q and say, Hey Q, build me a thing that looks like that.”</p>
<p>33:15 <a href="https://aws.amazon.com/about-aws/whats-new/2025/09/cache-management-anthropics-claude-models-bedrock/">Simplified Cache Management for Anthropic’s Claude models in Amazon </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/09/cache-management-anthropics-claude-models-bedrock/">Bedrock</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/bedrock/">Amazon Bedrock</a> simplifies prompt caching for Claude models by automatically identifying and reusing the longest previously cached prefix, eliminating manual cache point management for developers using <a href="https://www.anthropic.com/claude/haiku">Claude 3.5 Haiku</a>, <a href="https://assets.anthropic.com/m/785e231869ea8b3b/original/claude-3-7-sonnet-system-card.pdf">Claude 3.7</a>, and <a href="https://www.anthropic.com/claude/sonnet">Claude 4</a>.</li>
<li style="font-weight:400;">The update reduces token consumption and costs since cache read tokens don’t count toward token per minute (TPM) quotas, making multi-turn conversations and research assistants more economical to operate.</li>
<li style="font-weight:400;">Developers now only need to set a single cache breakpoint at the end of their request instead of tracking multiple cache segments, significantly reducing implementation complexity for applications with repetitive context.</li>
<li style="font-weight:400;">This feature addresses a common pain point in LLM applications where repeated context (like system prompts or document analysis) previously required manual cache management logic that was error-prone and time-consuming.</li>
<li style="font-weight:400;">Available immediately in all regions supporting these Claude models on Bedrock, with implementation details in the Amazon Bedrock Developer Guide for teams looking to optimize their existing Claude deployments.</li>
</ul>
<p>34:07  Ryan – “I’m just really glad I don’t have to create any applications that need to be this focused on token usage. It sounds painful.” </p>
<h2>GCP</h2>
<p>35:02 <a href="https://blog.google/feed/new-ai-vids-no-cost-option/">Google Workspace announces new gen AI features and a no-cost option for </a><a href="https://blog.google/feed/new-ai-vids-no-cost-option/">Vids</a></p>
<ul>
<li style="font-weight:400;"><a href="https://workspace.google.com/products/vids/">Google Vids</a> now includes generative AI capabilities powered by <a href="https://veo-3.ai/">Veo 3</a> that can transform static images into short videos, available to paid Workspace customers and Google AI Pro/Ultra subscribers. </li>
<li style="font-weight:400;">This positions Google against competitors like <a href="https://clipchamp.com/en/">Microsoft’s Clipchamp</a> and Adobe’s AI video tools by integrating video creation directly into the productivity suite.</li>
<li style="font-weight:400;">The basic Vids editor without AI features launches as a no-cost option for consumers, marking Google’s first free video editing tool within Workspace. This creates a clear freemium model where basic editing is free, but AI-powered features like avatars and automatic transcript trimming require paid subscriptions.</li>
<li style="font-weight:400;">The Veo 3 integration represents Google’s latest attempt to embed its foundational AI models across productivity tools, similar to how <a href="https://gemini.google.com/">Gemini</a> powers other Workspace features. </li>
<li style="font-weight:400;">This could benefit marketing teams, educators, and content creators who need quick video content from existing image assets.</li>
<li style="font-weight:400;">The feature addresses the growing demand for video content in business communications and training materials, where users often have images but lack video production skills or resources. The automatic transcript trim feature particularly targets corporate training and documentation use cases.</li>
<li style="font-weight:400;">Pricing remains tied to existing Workspace tiers rather than separate charges, making it accessible to current enterprise customers without additional procurement processes. The instructional “Vids on Vids” series suggests Google expects significant adoption and wants to reduce the learning curve.</li>
<li style="font-weight:400;">Expect shenanigans. </li>
</ul>
<p>36:34 <a href="https://cloud.google.com/blog/topics/hybrid-cloud/gemini-is-now-available-anywhere/">Gemini is now available anywhere | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google now offers <a href="https://cloud.google.com/blog/products/ai-machine-learning/run-gemini-and-ai-on-prem-with-google-distributed-cloud">Gemini AI models on-premises</a> through <a href="https://cloud.google.com/distributed-cloud">Google Distributed Cloud</a> (GDC), allowing organizations with strict data sovereignty requirements to run advanced AI workloads in their own data centers without compromising security or compliance.</li>
<li style="font-weight:400;">The platform includes <a href="https://deepmind.google/models/gemini/flash/">Gemini 2.5 Flash</a> and <a href="https://ai.google.dev/gemini-api/docs/models">Pro</a> models, supports <a href="https://www.nvidia.com/en-us/data-center/technologies/hopper-architecture/">NVIDIA Hopper</a> and <a href="https://www.nvidia.com/en-us/data-center/technologies/blackwell-architecture/">Blackwell GPUs,</a> and provides managed infrastructure with automatic scaling, load balancing, and confidential computing capabilities for both CPUs and GPUs.</li>
<li style="font-weight:400;">This positions Google against <a href="https://aws.amazon.com/outposts/">AWS Outposts</a> and <a href="https://learn.microsoft.com/en-us/azure-stack/">Azure Stack</a>, but with a specific focus on AI workloads – offering a complete AI stack including Vertex AI services, pre-built agents, and support for custom models alongside Gemini.</li>
<li style="font-weight:400;">Key customers include Singapore government agencies (CSIT, GovTech, HTX) and KDDI in Japan, highlighting the appeal to the public sector and regulated industries that need AI capabilities while maintaining complete control over sensitive data.</li>
<li style="font-weight:400;">The offering comes in two variants: GDC air-gapped (now generally available) for completely isolated environments and GDC connected (in preview) for hybrid scenarios, though pricing details are not disclosed and require contacting Google directly, which means expensive. Don’t say we didn’t warn you. </li>
</ul>
<p>38:18  Justin – “I 100% expect this is going to be very expensive. I mean, connected and managed Kubernetes for containers and VMs on a one-year half-depth ruggedized server is $415 per node per month with a five-year commitment.”</p>
<p>39:41 <a href="https://cloud.google.com/blog/products/containers-kubernetes/container-optimized-compute-delivers-autoscaling-for-autopilot/">Container-optimized compute delivers autoscaling for Autopilot | Google </a><a href="https://cloud.google.com/blog/products/containers-kubernetes/container-optimized-compute-delivers-autoscaling-for-autopilot/">Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/kubernetes-engine/docs/concepts/autopilot-overview">GKE Autopilot’s</a> new container-optimized compute platform delivers up to 7x faster pod scheduling by using dynamically resizable VMs and pre-provisioned compute capacity that doesn’t impact billing since customers only pay for requested resources.</li>
<li style="font-weight:400;">The platform addresses a common pain point where autoscaling could take several minutes, forcing users to implement costly workarounds like balloon pods to hold unused capacity for rapid scaling scenarios.</li>
<li style="font-weight:400;">Built-in high-performance HPA profile provides 3x faster calculations and supports up to 1000 HPA objects, making it particularly suitable for web applications and services requiring gradual scaling with 2 CPU or less.</li>
<li style="font-weight:400;">Available in GKE Autopilot 1.32 or later with the general-purpose compute class, though not recommended for one-pod-per-node deployments or batch workloads.</li>
<li style="font-weight:400;">This positions GKE competitively against EKS and AKS by solving the cold start problem for containerized workloads without requiring manual capacity planning or paying for idle resources.</li>
</ul>
<p>40:38  Ryan – “Imagine my surprise when I found out that using GKE autopilot didn’t handle node-level cold start. It was so confusing, so I was like, wait, what? Because you’ve been able to do that on EKS for so long. I was confused. Why do I need to care about node provisioning and size when I have zero access or really other interactions at that node level using autopilot? So it is kind of strange, but glad to see they fixed it.”</p>
<p>41:23  <a href="https://cloud.google.com/blog/products/identity-security/from-clicks-to-clusters-confidential-computing-expands-with-intel-tdx/">From clicks to clusters: Confidential Computing expands with Intel TDX |</a><a href="https://cloud.google.com/blog/products/identity-security/from-clicks-to-clusters-confidential-computing-expands-with-intel-tdx/">Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google expands Confidential Computing with Intel TDX across multiple services, including <a href="https://cloud.google.com/blog/products/identity-security/new-confidential-computing-updates-for-more-hardware-security-options/">Confidential VMs</a>, <a href="https://cloud.google.com/kubernetes-engine/docs/how-to/confidential-gke-nodes">GKE Nodes</a>, and <a href="https://cloud.google.com/confidential-computing/confidential-space/docs/confidential-space-overview">Confidential Space</a>, now available in 10 regions with 21 <a href="https://cloud.google.com/confidential-computing/confidential-vm/docs/supported-configurations#supported-zones">zones</a>. </li>
<li style="font-weight:400;">The technology creates hardware-isolated trust domains that encrypt workloads in memory during processing, addressing the security gap beyond traditional at-rest and in-transit encryption.</li>
<li style="font-weight:400;">Confidential VMs with <a href="https://www.nvidia.com/en-us/data-center/h100/">NVIDIA H100 GPUs</a> on <a href="https://cloud.google.com/compute/docs/accelerator-optimized-machines#a3-high-vms">A3 instances</a> combine Intel TDX for CPU protection with NVIDIA Confidential Computing for GPU security, enabling secure AI/ML workloads during training and inference. </li>
<li style="font-weight:400;">Available in three zones (europe-west4-c, us-central1-a, us-east5-a) with the a3-highgpu-1g machine type.</li>
<li style="font-weight:400;">Confidential GKE Nodes with Intel TDX work on both GKE Standard and Autopilot without code changes, allowing containerized workloads to remain encrypted in memory. Configuration can be set at the cluster or node pool level via CLI, API, UI, or Terraform.</li>
<li style="font-weight:400;">Confidential Space now supports Intel TDX hardware in addition to AMD, enabling multi-party data collaboration and federated learning use cases. Customers like Symphony and Duality use it for isolating customer data from privileged insiders and privacy-preserving ML, respectively.</li>
<li style="font-weight:400;">Intel’s <a href="https://community.intel.com/t5/Blogs/Products-and-Solutions/Security/Intel-Tiber-Trust-Authority-Integrates-with-Google-Cloud/post/1691578">Tiber Trust Authority</a> attestation service now offers a free tier for third-party verification of Confidential VMs and Confidential Space workloads. This provides stronger separation of duties and security guarantees beyond Google’s built-in attestation.</li>
</ul>
<p>43:07 <a href="https://cloud.google.com/blog/products/application-modernization/eventarc-advanced-orchestrates-complex-microservices-environments/">Eventarc Advanced orchestrates complex microservices environments | </a><a href="https://cloud.google.com/blog/products/application-modernization/eventarc-advanced-orchestrates-complex-microservices-environments/">Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/eventarc/advanced/docs">Eventarc Advanced</a> is now GA, evolving from <a href="https://cloud.google.com/eventarc/docs">Eventarc Standard</a> to handle complex event-driven architectures with centralized message bus management, real-time filtering and transformation, and multi-format payload support (Avro, JSON, Protobuf). This positions GCP competitively against AWS EventBridge and Azure Event Grid by offering built-in transformation capabilities and Envoy-based routing.</li>
<li style="font-weight:400;">The service introduces a Publish API for ingesting custom and third-party messages in CloudEvents format, enabling organizations to connect existing systems without major refactoring. The centralized message bus provides per-message fine-grained access control and integrates with Cloud Logging for observability.</li>
<li style="font-weight:400;">Key use cases include large-scale microservices orchestration, IoT data streaming for AI workloads, and hybrid/multi-cloud deployments where event routing across different environments is critical. The example order processing system demonstrates practical filtering (routing new orders to notification services) and transformation (high-value orders to fraud detection).</li>
<li style="font-weight:400;">Future integration with Service Extensions will allow custom code insertion into the data path, and planned Model Armor support suggests Google is positioning this for AI agent communication scenarios. This aligns with GCP’s broader push into AI infrastructure and agentic architectures.</li>
<li style="font-weight:400;">While pricing details aren’t provided in the announcement, the serverless nature suggests pay-per-use pricing similar to other GCP eventing services. Organizations should evaluate whether the advanced features justify potential cost increases over Eventarc Standard for their specific use cases.</li>
</ul>
<p>44:20  Ryan – “So OpenAI is going for real-time inference, and Google is going to be event-based. It seems like two very different directions. I like the event-driven architecture; it’s something I continue to use in most of the apps that I’m developing and creating. I think that having the ability to do something at a larger scale and coordinating across an entire business is pretty handy.”</p>
<h2>Azure</h2>
<p>45:22 <a href="https://azure.microsoft.com/en-us/blog/agent-factory-top-5-agent-observability-best-practices-for-reliable-ai/">Agent Factory: Top 5 agent observability best practices for reliable AI | </a><a href="https://azure.microsoft.com/en-us/blog/agent-factory-top-5-agent-observability-best-practices-for-reliable-ai/">Microsoft Azure Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://azure.microsoft.com/en-us/products/ai-foundry">Azure AI Foundry</a> introduces comprehensive <a href="https://learn.microsoft.com/en-us/azure/ai-foundry/concepts/observability">agent observability</a> capabilities that extend beyond traditional metrics, logs, and traces to include AI-specific evaluations and governance features for monitoring autonomous AI agents throughout their lifecycle.</li>
<li style="font-weight:400;">The platform provides built-in agent evaluators that assess critical behaviors like intent resolution, task adherence, tool call accuracy, and response completeness, with seamless integration into CI/CD pipelines through <a href="https://learn.microsoft.com/en-us/azure/ai-foundry/how-to/evaluation-github-action?tabs=foundry-project">GitHub Actions</a> and <a href="https://learn.microsoft.com/en-us/azure/ai-foundry/how-to/evaluation-azure-devops?tabs=foundry-project">Azure DevOps extensions</a>.</li>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/ai-foundry/concepts/ai-red-teaming-agent">Azure’s AI Red Teaming Agent</a> automates adversarial testing to identify security vulnerabilities before production deployment, simulating attacks on both individual agents and complex multi-agent workflows to validate production readiness.</li>
<li style="font-weight:400;">The solution differentiates from traditional observability tools by addressing the non-deterministic nature of AI agents, offering <a href="https://learn.microsoft.com/en-us/azure/ai-foundry/how-to/benchmark-model-in-catalog">model leaderboards</a> for selection, continuous evaluation capabilities, and integration with Azure Monitor for real-time production monitoring with customizable dashboards and alerts.</li>
<li style="font-weight:400;">Enterprise customers like EY, Accenture, and Veeam are already using these features to ensure their AI agents meet quality, safety, and compliance standards, with particular emphasis on regulatory frameworks like the EU AI Act through integrations with Microsoft Purview, Credo AI, and Saidot.</li>
</ul>
<p>47:31  Matt – “It just feels like we’re saying it’s this revolutionary thing, but really it’s something we have to approach from a slightly different angle. It’s the difference between, hey, we have an API and now we have a UI, and users can do things slightly differently… It’s just the evolution of a tool.” </p>
<p>49:04 <a href="https://azure.microsoft.com/en-us/updates?id=500374">Generally Available: Azure App Service – New Premium v4 Offering</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/azure/app-service/app-service-configure-premium-v4-tier">Azure App Service Premium v4</a> brings NVMe local storage and memory-optimized configurations to both Windows and Linux workloads, addressing performance bottlenecks for I/O-intensive applications like content management systems and e-commerce platforms.</li>
<li style="font-weight:400;">The new tier runs on Azure’s latest hardware with faster processors, positioning it competitively against AWS’s compute-optimized instances and GCP’s N2 series while maintaining App Service’s PaaS simplicity.</li>
<li style="font-weight:400;">Starting configurations at 1 vCPU and 4GB RAM make Premium v4 accessible for smaller production workloads that need enhanced performance without jumping to dedicated VM solutions.</li>
<li style="font-weight:400;">This release signals Microsoft’s continued investment in App Service as enterprises increasingly adopt PaaS for mission-critical applications, particularly those requiring consistent low-latency performance.</li>
<li style="font-weight:400;"><a href="https://aka.ms/Premiumv4/blog">Premium v4</a> fills the gap between standard App Service tiers and isolated environments, giving customers a middle-ground option for applications that need better performance but don’t require full network isolation.</li>
</ul>
<p>52:47 <a href="https://azure.microsoft.com/updates?id=494165">Public Preview: Microsoft Planetary Computer Pro</a></p>
<ul>
<li style="font-weight:400;">Microsoft Planetary Computer Pro enters public preview as a geospatial data platform that ingests, manages, and disseminates location-based data for enterprise Data &amp; AI workflows, targeting organizations that need to process satellite imagery and environmental datasets at scale.</li>
<li style="font-weight:400;">The platform integrates with Azure’s existing data services to accelerate geospatial insights, positioning Microsoft to compete with AWS’s Earth on AWS and Google Earth Engine by offering enterprise-grade tools for climate modeling, agriculture monitoring, and urban planning applications.</li>
<li style="font-weight:400;">Key capabilities include streamlined data ingestion pipelines for various geospatial formats and built-in processing tools that reduce the complexity of working with petabyte-scale Earth observation data.</li>
<li style="font-weight:400;">Target customers include government agencies, environmental organizations, and enterprises in agriculture, insurance, and logistics sectors that require planetary-scale data analysis for decision-making.</li>
<li style="font-weight:400;">While pricing details aren’t provided in the preview announcement, the platform likely follows Azure’s consumption-based model, with costs scaling based on data storage, compute resources, and API calls for geospatial processing.</li>
</ul>
<p>53:55  Matt  – “I just want to play with the satellites.” </p>
<p>54:24 <a href="https://www.theregister.com/2025/09/01/microsoft_azure_migration_misfire/">Microsoft cloud customers hit by messed-up migration • The Register</a></p>
<ul>
<li style="font-weight:400;">Microsoft’s migration from MOSP to the <a href="https://www.microsoft.com/en-us/licensing/how-to-buy/microsoft-customer-agreement">Microsoft Customer Agreement</a> caused incorrect cost calculations that triggered false budget alerts, with some customers seeing forecast increases of over 1000% despite no actual billing impact.</li>
<li style="font-weight:400;">Those poor Finops people. </li>
<li style="font-weight:400;">The incident highlights risks in Azure’s account migration processes where automated systems can send panic-inducing alerts even when actual invoices remain unaffected, creating unnecessary administrative burden.</li>
<li style="font-weight:400;">Microsoft’s support response drew criticism as users reported difficulty reaching human support and some claimed their <a href="https://learn.microsoft.com/en-us/answers/questions/5534853/less-20-month-budget-has-been-backdated-to-over-40?page=2&amp;source=docs#answers">forum comments</a> were being deleted, raising questions about Azure’s customer communication during service disruptions.</li>
<li style="font-weight:400;">This follows other recent Azure security and operational issues, including Storm-0501 ransomware attacks and Pentagon concerns about China-based support staff, suggesting potential systemic challenges in Azure’s operational management.</li>
<li style="font-weight:400;">For cloud architects, this emphasizes the importance of understanding the difference between forecast alerts and actual billing, and maintaining direct billing verification processes rather than relying solely on automated notifications.</li>
</ul>
<p>56:26 <a href="https://azure.microsoft.com/en-us/updates?id=499406">Generally Available: Azure Ultra Disk Price Reduction</a></p>
<ul>
<li style="font-weight:400;">Azure Ultra Disks now cost less in multiple regions, making sub-millisecond latency storage more accessible for demanding enterprise workloads like SAP HANA, SQL Server, and Oracle databases.</li>
</ul>
<ul>
<li style="font-weight:400;">Ultra Disks deliver up to 160,000 IOPS and 4,000 MB/s throughput per disk with consistent performance, positioning them as Azure’s answer to AWS io2 Block Express and GCP Extreme Persistent Disks.</li>
<li style="font-weight:400;">The price reduction targets performance-critical applications where storage latency directly impacts business operations, though specific discount percentages weren’t disclosed in the announcement.</li>
<li style="font-weight:400;">This regional <a href="https://azure.microsoft.com/en-us/pricing/details/managed-disks/">pricing</a> strategy suggests Microsoft is testing market response before potentially expanding discounts to other regions, following similar patterns seen with premium storage tiers.</li>
<li style="font-weight:400;">Enterprise customers running latency-sensitive workloads should evaluate whether migrating to Central US for Ultra Disk deployments offers meaningful cost savings compared to their current storage configurations.</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2137820/c1e-k5d5sgv0wntxw135-z3kg3564igrx-u9vvgp.mp3" length="80294726"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 320 of The Cloud Pod, where the forecast is always cloudy! Justin, Matt, and Ryan are coming to you from Justin’s echo chamber and bringing all the latest in AI and Cloud news, including updates to Google’s Anti-trust case, AWS Cost MCP, new regions, updates to EKS, Veo, and Claude, and more! Let’s get into it. 
Titles we almost went with this week:


Breaking Bad Bottlenecks: AWS  Cooks Up Faster Container Pulls
The Bucket List: Finding Your Lost Storage Dollars
State of Denial: Terraform Finally Stops Saving Your Passwords
Three Stages of Azure Grief: Development, Preview, and Launch
Ground Control to Major Cloud: Microsoft Launches Planetary Computer Pro
Veo Vidi Vici: Google Conquers Video Editing
Red Alert: AWS Makes Production Accounts Actually Look Dangerous
Amazon EKS Discovers the F5 Key 
Chaos Theory Meets ChatGPT: When Your Reliability Data Gets an AI Therapist
Breaking Bad (Services): How AI Helps You Find What’s Already   Broken
Breaking Up is Hard to Cloud: Gemini Moves Back In
Intel Inside Your Secrets: TDX Takes Over Google Cloud
Lord of the Regions: The Return of the Kiwi 
All Blacks and All Stacks: AWS Goes Full Kiwi
Azure Forecast: 100% Chance of Budget Alert Storms
Google Keeps Its Cloud Together: A $2.5T Near Miss
Shell We Dance? AWS Makes CLI Scripting Less Painful
AWS Finally Admits Nobody Remembers All Those CLI Commands
Cache Me If You Claude
Your AWS Console gets its Colors, just don’t choose red shirts
Amazon Q walks into a bar, Tells MCP to order it a beer.. The Bartender sighs and mutters “at least chatgpt just hallucinates its beer”
Ryan’s shitty scripts now as a AWS CLI Library

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our Slack channel for more info.
General News
00:57 Google Dodges A 2.5t Breakup

We have breaking news – and it’s good news for Google. 
Google successfully avoided a potential $2.5 trillion breakup following antitrust proceedings, maintaining its current corporate structure despite regulatory pressure.
The decision represents a significant outcome for Big Tech antitrust cases, potentially setting a precedent for how regulators approach market dominance issues in the cloud and technology sectors.
Cloud customers and partners can expect business continuity with Google Cloud Platform services, avoiding potential disruptions that could have resulted from a corporate restructuring.
The ruling may influence how other major cloud providers structure their businesses and approach regulatory compliance, particularly around bundling services and market competition.
Enterprise customers relying on Google’s integrated ecosystem of cloud, advertising, and productivity tools can continue their current architectures without concerns about service separation.
You just KNOW Microsoft is super mad about this. 

AI Is Going Great – Or How ML Makes Money 
02:16 Introducing GPT-Realtime

OpenAI‘s ]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2137820/c1a-k5d5-mkjvk8p4s353-2nzwyr.jpg"></itunes:image>
                                                                            <itunes:duration>00:55:42</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2137820/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[319: AWS Cost MCP: Your Billing Data Now Speaks Human]]>
                </title>
                <pubDate>Wed, 03 Sep 2025 22:20:57 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2131352</guid>
                                    <link>https://tcpfm.castos.com/episodes/319-aws-cost-mcp-your-billing-data-now-speaks-human</link>
                                <description>
                                            <![CDATA[<h3>Welcome to episode 319 of The Cloud Pod, where the forecast is always cloudy! Justin, Matt, and Ryan are in the studio to bring you all the latest in cloud and AI news. AWS Cost MCP makes exploring your finops data as simple as english text. We’ve got a sunnier view for junior devs, a Microsoft open source development, tokens, and it’s even Kubernetes’ birthday – let’s get into it! </h3>
<h3>Titles we almost went with this week:
</h3>
<ul>
<li>From Linux Hater to Open Source Darling: A Microsoft Love Story</li>
<li>20,000 Lines of Code and a Dream: Microsoft’s Open Source Glow-Up</li>
<li>Ctrl+Alt+Delete Your Assumptions: Microsoft Goes Full Penguin</li>
<li>Token and Esteem: Amazon Bedrock Gets a Counter</li>
<li>CSI: Cloud Scene Investigation</li>
<li>The Great SQL Migration: How AI Became the Universal Translator</li>
<li>Token and Ye Shall Receive: Bedrock’s New Counting Feature</li>
<li>The Count of Monte Token: A Bedrock Tale – mk</li>
<li>Ctrl+Z for Your Database: Now with Built-in Lag Time</li>
<li>IP Freely: GKE Takes the Pain Out of Address Management</li>
<li>AWS CEO: AI Can’t Replace Junior Devs Because Someone Has to Fix the AI’s Code</li>
<li>Better Late Than Never: RDS PostgreSQL Gets Time Travel</li>
<li>The SQL Whisperer: Teaching AI to Speak Database</li>
<li>DigitalOcean Goes Full Chatbot: Your Infrastructure Now Speaks Human</li>
<li>Musk vs Cook: The App Store Wars Episode AI</li>
<li>Firestore Goes Mongo: A Database Love Story</li>
<li>GKE Turns 10: Now With More Candles and Less Complexity</li>
<li>Prime Day Infrastructure: Now With 87,000 AI Chips and a Robot Army</li>
<li>AWS Scales to Quadrillion Requests: Your Black Friday Traffic Looks Cute</li>
<li>AWS billing now speaks human, thanks to MCPs</li>
<li>The Bastion Holds: Azure’s New Gateway to Kubernetes Kingdoms</li>
<li>The Surge Before the Merge: Azure’s New Upgrade Strategy</li>
<li>CNI Overlay: Because Your Pods Deserve Their Own ZIP Code
</li>
</ul>
<h2>AI Is Going Great – or How ML Makes Money </h2>
<p>00:46 <a href="https://www.cnbc.com/2025/08/25/musk-lawsuit-apple-openai-monopoly.html">Musk’s xAI sues Apple, OpenAI alleging scheme that harmed X, Grok</a></p>
<ul>
<li style="font-weight:400;"><a href="https://x.ai/company">xAI</a> filed a lawsuit against <a href="https://www.cnbc.com/quotes/AAPL/">Apple</a> and <a href="https://www.cnbc.com/2025/08/20/openai-compute-ai.html">OpenAI</a>, alleging anticompetitive practices in AI chatbot distribution, claiming Apple deprioritizes competing AI apps like <a href="https://grok.com/">Grok</a> in the App Store while favoring <a href="https://chatgpt.com/">ChatGPT</a> through direct integration into iOS devices.</li>
<li style="font-weight:400;">The lawsuit highlights tensions in AI platform distribution models, where cloud-based AI services depend on mobile app stores for user access, potentially creating gatekeeping concerns for competing generative AI providers.</li>
<li style="font-weight:400;">Apple’s partnership with OpenAI to integrate ChatGPT into iPhone, iPad, and Mac products represents a shift toward native AI integration rather than app-based access, which could impact how cloud AI services reach end users.</li>
<li style="font-weight:400;">The dispute underscores growing competition in the generative AI market, where multiple players, including xAI’s Grok, OpenAI’s ChatGPT, <a href="https://www.deepseek.com/en">DeepSeek</a>, and <a href="https://www.perplexity.ai/">Perplexity</a>, are vying for market position through both cloud APIs and mobile distribution channels.</li>
<li style="font-weight:400;">For cloud developers, this case raises questions about AI service distribution strategies and whether direct device integration partnerships will become necessary to compete effectively against app store-based distribution models.</li>
</ul>
<p>01:55  Justin – “There’s always a potential for conflict of interest when you have a partnership like this, but also the app store – there’s a...</p>
<h3>Chapters</h3>
<ul><li>(00:00:00) - The Cloud Pod</li><li>(00:00:58) - Amazon's Grok Sues Apple Over App Store Distribution</li><li>(00:04:19) - Amazon CEO: AI Replacing Junior Developers is the Dumbest Idea</li><li>(00:11:10) - Amazon: Count Your Tokens With AWS AI</li><li>(00:17:32) - Amazon RDS for Postgres: Delayed Read Replicas</li><li>(00:22:41) - Amazon Prime Day: My Favorite Amazon Announcement</li><li>(00:23:45) - Amazon's Prime Day 2022</li><li>(00:25:15) - AWS: How AWS Met Prime Day</li><li>(00:29:17) - Amazon's Databases Hit Record Highs During Prime Day</li><li>(00:30:14) - CloudTrail: What Caches Do They Use? vs.</li><li>(00:33:37) - Amazon's AWS Countdown</li><li>(00:35:52) - Google's AI Developer Tooling: Which One to Use?</li><li>(00:40:12) - Google Launches Gemini 2.5 Flash Image on Vertex AI</li><li>(00:42:54) - Google Cloud Asset Inventory: Root Cause Analysis Tool</li><li>(00:46:12) - Google's automated SQL Translation from Databrick Spark SQL to Big</li><li>(00:48:10) - Google's White Paper on AI Inference Environmental Impact</li><li>(00:52:23) - Google Cloud Compliance Manager: Integrated Security and Compliance Management</li><li>(00:59:04) - Kubernetes: GK Auto IPAM</li><li>(01:01:59) - GKE: Happy 10th Anniversary!</li><li>(01:08:24) - Microsoft Azure News: Week Three</li><li>(01:09:47) - Microsoft vs. AWS: Open Source and Scale</li><li>(01:14:01) - Microsoft to Give DocumentDB to the Linux Foundation</li><li>(01:15:57) - Azure Bastion now supports Private AKS Clusters via Tunnel</li><li>(01:24:11) - Microsoft Migrate now enables direct migration to zone redundant storage disks</li><li>(01:29:49) - Digital Ocean's MCP Server Now Available</li><li>(01:35:33) - Week in the Cloud: September 7, 2017</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 319 of The Cloud Pod, where the forecast is always cloudy! Justin, Matt, and Ryan are in the studio to bring you all the latest in cloud and AI news. AWS Cost MCP makes exploring your finops data as simple as english text. We’ve got a sunnier view for junior devs, a Microsoft open source development, tokens, and it’s even Kubernetes’ birthday – let’s get into it! 
Titles we almost went with this week:


From Linux Hater to Open Source Darling: A Microsoft Love Story
20,000 Lines of Code and a Dream: Microsoft’s Open Source Glow-Up
Ctrl+Alt+Delete Your Assumptions: Microsoft Goes Full Penguin
Token and Esteem: Amazon Bedrock Gets a Counter
CSI: Cloud Scene Investigation
The Great SQL Migration: How AI Became the Universal Translator
Token and Ye Shall Receive: Bedrock’s New Counting Feature
The Count of Monte Token: A Bedrock Tale – mk
Ctrl+Z for Your Database: Now with Built-in Lag Time
IP Freely: GKE Takes the Pain Out of Address Management
AWS CEO: AI Can’t Replace Junior Devs Because Someone Has to Fix the AI’s Code
Better Late Than Never: RDS PostgreSQL Gets Time Travel
The SQL Whisperer: Teaching AI to Speak Database
DigitalOcean Goes Full Chatbot: Your Infrastructure Now Speaks Human
Musk vs Cook: The App Store Wars Episode AI
Firestore Goes Mongo: A Database Love Story
GKE Turns 10: Now With More Candles and Less Complexity
Prime Day Infrastructure: Now With 87,000 AI Chips and a Robot Army
AWS Scales to Quadrillion Requests: Your Black Friday Traffic Looks Cute
AWS billing now speaks human, thanks to MCPs
The Bastion Holds: Azure’s New Gateway to Kubernetes Kingdoms
The Surge Before the Merge: Azure’s New Upgrade Strategy
CNI Overlay: Because Your Pods Deserve Their Own ZIP Code


AI Is Going Great – or How ML Makes Money 
00:46 Musk’s xAI sues Apple, OpenAI alleging scheme that harmed X, Grok

xAI filed a lawsuit against Apple and OpenAI, alleging anticompetitive practices in AI chatbot distribution, claiming Apple deprioritizes competing AI apps like Grok in the App Store while favoring ChatGPT through direct integration into iOS devices.
The lawsuit highlights tensions in AI platform distribution models, where cloud-based AI services depend on mobile app stores for user access, potentially creating gatekeeping concerns for competing generative AI providers.
Apple’s partnership with OpenAI to integrate ChatGPT into iPhone, iPad, and Mac products represents a shift toward native AI integration rather than app-based access, which could impact how cloud AI services reach end users.
The dispute underscores growing competition in the generative AI market, where multiple players, including xAI’s Grok, OpenAI’s ChatGPT, DeepSeek, and Perplexity, are vying for market position through both cloud APIs and mobile distribution channels.
For cloud developers, this case raises questions about AI service distribution strategies and whether direct device integration partnerships will become necessary to compete effectively against app store-based distribution models.

01:55  Justin – “There’s always a potential for conflict of interest when you have a partnership like this, but also the app store – there’s a...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[319: AWS Cost MCP: Your Billing Data Now Speaks Human]]>
                </itunes:title>
                                    <itunes:episode>319</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<h3>Welcome to episode 319 of The Cloud Pod, where the forecast is always cloudy! Justin, Matt, and Ryan are in the studio to bring you all the latest in cloud and AI news. AWS Cost MCP makes exploring your finops data as simple as english text. We’ve got a sunnier view for junior devs, a Microsoft open source development, tokens, and it’s even Kubernetes’ birthday – let’s get into it! </h3>
<h3>Titles we almost went with this week:
</h3>
<ul>
<li>From Linux Hater to Open Source Darling: A Microsoft Love Story</li>
<li>20,000 Lines of Code and a Dream: Microsoft’s Open Source Glow-Up</li>
<li>Ctrl+Alt+Delete Your Assumptions: Microsoft Goes Full Penguin</li>
<li>Token and Esteem: Amazon Bedrock Gets a Counter</li>
<li>CSI: Cloud Scene Investigation</li>
<li>The Great SQL Migration: How AI Became the Universal Translator</li>
<li>Token and Ye Shall Receive: Bedrock’s New Counting Feature</li>
<li>The Count of Monte Token: A Bedrock Tale – mk</li>
<li>Ctrl+Z for Your Database: Now with Built-in Lag Time</li>
<li>IP Freely: GKE Takes the Pain Out of Address Management</li>
<li>AWS CEO: AI Can’t Replace Junior Devs Because Someone Has to Fix the AI’s Code</li>
<li>Better Late Than Never: RDS PostgreSQL Gets Time Travel</li>
<li>The SQL Whisperer: Teaching AI to Speak Database</li>
<li>DigitalOcean Goes Full Chatbot: Your Infrastructure Now Speaks Human</li>
<li>Musk vs Cook: The App Store Wars Episode AI</li>
<li>Firestore Goes Mongo: A Database Love Story</li>
<li>GKE Turns 10: Now With More Candles and Less Complexity</li>
<li>Prime Day Infrastructure: Now With 87,000 AI Chips and a Robot Army</li>
<li>AWS Scales to Quadrillion Requests: Your Black Friday Traffic Looks Cute</li>
<li>AWS billing now speaks human, thanks to MCPs</li>
<li>The Bastion Holds: Azure’s New Gateway to Kubernetes Kingdoms</li>
<li>The Surge Before the Merge: Azure’s New Upgrade Strategy</li>
<li>CNI Overlay: Because Your Pods Deserve Their Own ZIP Code
</li>
</ul>
<h2>AI Is Going Great – or How ML Makes Money </h2>
<p>00:46 <a href="https://www.cnbc.com/2025/08/25/musk-lawsuit-apple-openai-monopoly.html">Musk’s xAI sues Apple, OpenAI alleging scheme that harmed X, Grok</a></p>
<ul>
<li style="font-weight:400;"><a href="https://x.ai/company">xAI</a> filed a lawsuit against <a href="https://www.cnbc.com/quotes/AAPL/">Apple</a> and <a href="https://www.cnbc.com/2025/08/20/openai-compute-ai.html">OpenAI</a>, alleging anticompetitive practices in AI chatbot distribution, claiming Apple deprioritizes competing AI apps like <a href="https://grok.com/">Grok</a> in the App Store while favoring <a href="https://chatgpt.com/">ChatGPT</a> through direct integration into iOS devices.</li>
<li style="font-weight:400;">The lawsuit highlights tensions in AI platform distribution models, where cloud-based AI services depend on mobile app stores for user access, potentially creating gatekeeping concerns for competing generative AI providers.</li>
<li style="font-weight:400;">Apple’s partnership with OpenAI to integrate ChatGPT into iPhone, iPad, and Mac products represents a shift toward native AI integration rather than app-based access, which could impact how cloud AI services reach end users.</li>
<li style="font-weight:400;">The dispute underscores growing competition in the generative AI market, where multiple players, including xAI’s Grok, OpenAI’s ChatGPT, <a href="https://www.deepseek.com/en">DeepSeek</a>, and <a href="https://www.perplexity.ai/">Perplexity</a>, are vying for market position through both cloud APIs and mobile distribution channels.</li>
<li style="font-weight:400;">For cloud developers, this case raises questions about AI service distribution strategies and whether direct device integration partnerships will become necessary to compete effectively against app store-based distribution models.</li>
</ul>
<p>01:55  Justin – “There’s always a potential for conflict of interest when you have a partnership like this, but also the app store – there’s a ton of companies that track downloads and track usage of these things, and I don’t know that they have hard evidence here, other than this is just a way to keep Apple distracted while they make Grok better.” </p>
<p>04:14 <a href="https://www.theregister.com/2025/08/21/aws_ceo_entry_level_jobs_opinion/">AWS CEO says AI replacing junior staff is ‘dumbest idea’ • The Register</a></p>
<ul>
<li style="font-weight:400;">AWS CEO Matt Garman argues that using AI to replace junior developers is counterproductive, since they’re the least expensive employees and most engaged with AI tools, warning that eliminating entry-level positions creates a pipeline problem for future senior talent.</li>
<li style="font-weight:400;">Garman criticizes the standard metric of measuring AI value by percentage of code written, noting that more lines of code don’t equal better code – and that over 80% of AWS developers already use AI tools for various tasks, including unit tests, documentation, and code writing.</li>
<li style="font-weight:400;">The CEO emphasizes that future tech workers need to learn critical thinking and problem-solving skills, rather than narrowly focused technical skills, as rapid technological change means that specific skills may not sustain a 30-year career.</li>
<li style="font-weight:400;">This perspective aligns with AWS’s push for their <a href="https://www.theregister.com/2025/08/18/aws_updated_kiro_pricing/">Kiro AI coding assistant</a> while acknowledging that AI should augment rather than replace human developers, particularly as organizations need experienced developers to evaluate and implement AI-generated code properly.</li>
<li style="font-weight:400;">Garman’s comments come amid industry concerns about AI’s impact on employment and follow recent issues with AWS’s Q Developer tool, which had security vulnerabilities, highlighting the ongoing need for human oversight in AI development.</li>
</ul>
<p>05:25  Ryan – “I do really think the industry is using AI wrong, and I think that the layoffs are a sign of that. And it’s really easy to say ‘oh, well our mid to senior developer staff can now do all these junior tasks, so let’s replace them,’ but I don’t think that’s a sustainable model.” </p>
<h2>AWS</h2>
<p>11:14 <a href="https://aws.amazon.com/about-aws/whats-new/2025/08/count-tokens-api-anthropics-claude-models-bedrock/">Count Tokens API is now supported for Anthropic’s Claude models now in </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/08/count-tokens-api-anthropics-claude-models-bedrock/">Amazon Bedrock</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/bedrock/">Amazon Bedrock</a> now offers a Count Tokens API for <a href="https://docs.anthropic.com/en/docs/about-claude/models/overview">Claude models</a>, enabling developers to calculate token usage before making inference calls, which helps predict costs and avoid unexpected rate limit issues.</li>
<li style="font-weight:400;">This API addresses a common pain point where developers would submit prompts that exceed context windows or trigger throttling, only discovering the issue after the fact and potentially incurring unnecessary costs.</li>
<li style="font-weight:400;">The feature enables more efficient prompt engineering by allowing teams to test different prompt variations and measure their token consumption without actually running inference, which is particularly useful for optimizing system prompts and templates.</li>
<li style="font-weight:400;">Currently limited to Claude models only, Amazon is prioritizing<a href="https://www.anthropic.com/"> Anthropic’s</a> integration, while potentially planning similar support for other Bedrock models, such as <a href="https://aws.amazon.com/bedrock/amazon-models/titan/">Titan</a>, or third-party options.</li>
<li style="font-weight:400;">For cost-conscious organizations, this pre-flight check capability allows better budget forecasting and helps implement guardrails before expensive model calls, critical as enterprises scale their AI workloads.</li>
</ul>
<p>12:10  Justin – “Now, I appreciate the idea of allowing better budget forecasting, but budget forecasting does not move with the scale of AI, so there is no way that you’re getting an accurate forecast unless you have very specific prompts that you’re going to reuse a LOT of times.”    </p>
<p>13:39 <a href="https://aws.amazon.com/about-aws/whats-new/2025/08/aws-billing-cost-management-mcp-server/">Announcing the AWS Billing and Cost Management MCP server</a></p>
<ul>
<li style="font-weight:400;">AWS releases an open-source Model Context Protocol (MCP) server for Billing and Cost Management that enables AI assistants like <a href="https://support.anthropic.com/en/articles/10949351-getting-started-with-local-mcp-servers-on-claude-desktop">Claude Desktop</a>, <a href="https://code.visualstudio.com/docs/copilot/chat/mcp-servers">VS Code Copilot</a>, and <a href="https://docs.aws.amazon.com/amazonq/latest/qdeveloper-ug/qdev-mcp.html">Q Developer CLI</a> to analyze AWS spending patterns and identify cost optimization opportunities.</li>
<li style="font-weight:400;">The MCP server features a dedicated SQL-based calculation engine that handles large volumes of cost data and performs reproducible calculations for period-over-period changes and unit cost metrics, providing more comprehensive functionality than simple API access.</li>
<li style="font-weight:400;">This integration enables customers to utilize their preferred AI assistant for FinOps tasks, including historical spending analysis, cost anomaly detection, workload cost estimation, and AWS service pricing queries, all without needing to switch to the AWS console.</li>
<li style="font-weight:400;">The server connects securely using standard AWS credentials, with minimal configuration required, and is now available in the <a href="https://github.com/awslabs/mcp/tree/main/src/billing-cost-management-mcp-server">AWS Labs GitHub repository</a> as an open-source project.</li>
<li style="font-weight:400;">By supporting the MCP standard, AWS enables customers to maintain their existing AI toolchain workflows while gaining access to comprehensive billing and cost management capabilities previously available only in <a href="https://aws.amazon.com/q/developer/">Amazon Q Developer</a> in the console.</li>
</ul>
<p>14:33  Justin – “All I want to know is, can I ask the MCP to tell me what the hell EC2  Other is?” </p>
<p>16:07 <a href="https://aws.amazon.com/about-aws/whats-new/2025/08/amazon-rds-for-db2-read-replicas/">Amazon RDS for Db2 now supports read replicas</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/rds/db2/">Amazon RDS for Db2</a> now supports up to three read replicas per database instance, enabling customers to offload read-only workloads from the primary database and improve application performance through asynchronous replication.</li>
<li style="font-weight:400;">Read replicas can be deployed within the same region or cross-region, providing both performance scaling for read-heavy applications and disaster recovery capabilities through replica promotion to handle read/write operations.</li>
<li style="font-weight:400;">The feature requires <a href="https://www.ibm.com/products/db2-database/pricing">IBM Db2 licenses</a> for all vCPUs on replica instances, which customers can obtain through AWS Marketplace On-Demand licensing or bring their own licenses (BYOL). Note: You’re going to want to do this. On-demand pricing is going to be high. Don’t say we didn’t warn you.</li>
<li style="font-weight:400;">This addition brings RDS for Db2 to feature parity with other RDS engines, such as <a href="https://www.mysql.com/">MySQL</a> and PostgreSQL, which have long supported read replicas, making it more viable for enterprise workloads that require high availability and read scaling.</li>
<li style="font-weight:400;">Key use cases include analytics workloads that require consistent read performance, geographic distribution of read traffic, and maintaining standby instances for disaster recovery without the complexity of manually managing replication.</li>
</ul>
<p>11:26 <a href="https://aws.amazon.com/about-aws/whats-new/2025/08/amazon-rds-postgresql-delayed-replica/">Amazon RDS for PostgreSQL now supports delayed read replicas</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/rds/postgresql/">Amazon RDS for PostgreSQL</a> now supports delayed read replicas, allowing you to configure a time lag between source and replica databases to protect against accidental data deletions or modifications.</li>
<li style="font-weight:400;">The feature enables faster disaster recovery by allowing you to pause replication before problematic changes propagate, then resume up to a specific log position and promote the replica as primary – significantly faster than traditional point-in-time restores, which can take hours for large databases.</li>
<li style="font-weight:400;">Available in all AWS regions where RDS PostgreSQL operates at no additional cost beyond standard RDS pricing, making it an accessible safety net for production databases.</li>
<li style="font-weight:400;">This addresses a common enterprise need for protection against human error while maintaining the performance benefits of read replicas for scaling read workloads.</li>
<li style="font-weight:400;">The implementation follows similar delayed replication features in MySQL and other database systems, bringing PostgreSQL on RDS to feature parity with competitor offerings.</li>
</ul>
<p>18:39  Justin – “The chances of me being able to realize that I screwed up that badly within 15 minutes before this replicated is probably pretty slim.” </p>
<p>23:07 <a href="https://aws.amazon.com/blogs/aws/aws-services-scale-to-new-heights-for-prime-day-2025-key-metrics-and-milestones/">AWS services scale to new heights for Prime Day 2025: key and </a><a href="https://aws.amazon.com/blogs/aws/aws-services-scale-to-new-heights-for-prime-day-2025-key-metrics-and-milestones/">milestones | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;">AWS infrastructure handled record-breaking <a href="https://www.aboutamazon.com/news/retail/prime-day-2025-recap">Prime Day 2025</a> traffic with <a href="https://aws.amazon.com/dynamodb">DynamoDB</a> processing 151 million requests per second, ElastiCache serving 1.5 quadrillion daily requests, and Lambda handling 1.7 trillion invocations per day, demonstrating AWS’s ability to scale for extreme workloads.</li>
<li style="font-weight:400;">Amazon deployed over 87,000 AWS <a href="https://aws.amazon.com/ai/machine-learning/inferentia/">Inferentia</a> and <a href="https://aws.amazon.com/ai/machine-learning/trainium/">Trainium</a> chips to power the <a href="https://www.aboutamazon.com/news/retail/amazon-rufus-online-shopping-tips">Rufus AI shopping assistant</a>, while <a href="https://aws.amazon.com/sagemaker/ai/">SageMaker AI</a> processed 626 billion inference requests, demonstrating a significant investment in custom silicon for AI workloads at scale.</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/outposts/">AWS Outposts</a> at Amazon fulfillment centers sent 524 million commands to 7,000 robots with peak volumes of 8 million commands per hour (160% increase from 2024), highlighting edge computing’s role in modern logistics and same-day delivery operations.</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/fis/latest/userguide/what-is.html">AWS Fault Injection Service</a> ran 6,800 experiments (8x more than 2024) to test resilience, enabled by new <a href="https://aws.amazon.com/ecs">ECS</a> support for network fault injection on <a href="https://aws.amazon.com/fargate/">Fargate</a> and CI/CD pipeline integration, emphasizing chaos engineering as standard practice for high-availability systems.</li>
<li style="font-weight:400;">AWS rebranded Infrastructure Event Management to <a href="https://aws.amazon.com/premiumsupport/aws-countdown/">AWS Countdown</a>, expanding support to include generative AI implementation, mainframe modernization, and sector-specific optimization for elections, retail, healthcare, and sports events.</li>
</ul>
<p>28:22  Justin – “What I don’t want our listeners to take away from this is ‘Hey, I should install Fizz and use it on Black Friday!’ If you haven’t had a culture of that chaos testing and the resiliency and redundancy built into your engineering culture for more than a year…do not do that.”</p>
<h2>GCP</h2>
<p>36:25 <a href="https://cloud.google.com/blog/products/ai-machine-learning/choose-the-right-google-ai-developer-tool-for-your-workflow/">Choose the right Google AI developer tool for your workflow | Google Cloud </a><a href="https://cloud.google.com/blog/products/ai-machine-learning/choose-the-right-google-ai-developer-tool-for-your-workflow/">Blog</a></p>
<ul>
<li style="font-weight:400;">Google has diversified its AI developer tooling into six distinct offerings: <a href="https://jules.google/">Jule</a>s for GitHub automation, <a href="https://github.com/google-gemini/gemini-cli">Gemini CLI</a> for flexible code interactions, <a href="https://codeassist.google/">Gemini Code Assist </a>for IDE integration, <a href="https://firebase.studio/">Firebase Studio</a> for browser-based development, <a href="https://aistudio.google.com/welcome">Google AI Studio</a> for prompt experimentation, and the Gemini app for prototyping.</li>
<li style="font-weight:400;">The tools are categorized by interaction model: delegated/agentic (Jules), supervised (Gemini CLI and Code Assist), and collaborative (Firebase Studio and AI Studio), each targeting different developer workflows and skill levels.</li>
<li style="font-weight:400;">Jules stands out as a GitHub-specific agent that can autonomously handle tasks such as documentation, test coverage, and code modernization through pull requests, offering a free tier and paid Pro/Ultra options.</li>
<li style="font-weight:400;">Firebase Studio enables non-professional developers to build production-grade applications in a Google-managed browser environment, complete with built-in templates and Gemini-powered code generation, during its free preview period.</li>
<li style="font-weight:400;">Most tools offer generous free tiers with access to the Gemini model. At the same time, paid options provide higher rate limits and enterprise features through Vertex AI integration, making AI-assisted development accessible across various budget levels.</li>
</ul>
<p>37:40  Ryan – “The Gemini App – a lot of the documentation that is accompanying the app  – is very likely to lead you astray, in terms of whether this is something that can handle a production deployment referencing that API endpoint.” </p>
<p>40:13 <a href="https://cloud.google.com/blog/products/ai-machine-learning/gemini-2-5-flash-image-on-vertex-ai/">Gemini 2.5 Flash Image on Vertex AI | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google has launched Gemini 2.5 Flash Image on <a href="https://console.cloud.google.com/vertex-ai/studio/multimodal?model=gemini-2.5-flash-image-preview">Vertex AI</a> in preview, adding native image generation and editing capabilities with state-of-the-art performance for both functions. The feature includes built-in SynthID watermarking for responsible use.</li>
<li style="font-weight:400;">The model introduces three key capabilities: multi-image fusion, which combines multiple reference images into a unified visual, character, and style consistency across generations without requiring fine-tuning; and conversational editing, utilizing natural language instructions.</li>
<li style="font-weight:400;">Early adopters include Adobe, integrating it into Firefly and Express, WPP testing it for retail and CPG applications, and Figma adding it to their AI image tools, indicating broad enterprise interest across creative workflows.</li>
<li style="font-weight:400;">The conversational editing feature enables iterative refinement through simple text prompts, maintaining object consistency while allowing for significant adjustments—a capability that Leonardo.ai’s CEO describes as enabling entirely new creative workflows.</li>
<li style="font-weight:400;">Available now in preview on Vertex AI with documentation for developers, this positions Google to compete directly with other cloud providers’ image generation services while leveraging their existing Vertex AI infrastructure.</li>
</ul>
<p>41:49  Justin – “I had complained about how expensive Veo was; now you can make three videos a day with Veo in Geimini Pro.” </p>
<p>43:07 <a href="https://cloud.google.com/blog/products/management-tools/gemini-cloud-assist-investigations-performs-root-cause-analysis/">Gemini Cloud Assist investigations performs root-cause analysis | Google </a><a href="https://cloud.google.com/blog/products/management-tools/gemini-cloud-assist-investigations-performs-root-cause-analysis/">Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/gemini/docs/cloud-assist/investigations">Gemini Cloud Assist investigations</a> is a new AI-powered root cause analysis tool that automatically analyzes logs, configurations, metrics, and error patterns across GCP environments to diagnose infrastructure and application issues, reducing troubleshooting time from hours to minutes, according to early users.</li>
<li style="font-weight:400;">The service provides multiple access points, including API integration for <a href="https://slack.com/signin">Slack</a> and incident management tools, direct triggering from <a href="https://console.cloud.google.com/logs/query;query=severity%3D%22ERROR%22;duration=PT1H">Logs Explorer</a> or monitoring alerts, and seamless handoff to Google Cloud Support with full investigation context preserved.</li>
<li style="font-weight:400;">Unlike traditional monitoring tools, this approach leverages Google’s internal SRE runbooks and support knowledge bases, combined with Gemini AI, to generate ranked observations, probable root causes, and specific remediation steps, rather than just surfacing raw data.</li>
<li style="font-weight:400;">Key differentiator is the comprehensive signal analysis across Cloud Logs, Asset Inventory, App Hub, and Log Themes in parallel, automatically building resource topology and correlating changes to identify issues that would be difficult to spot manually in distributed systems.</li>
<li style="font-weight:400;">Currently in preview with no pricing announced, this positions GCP competitively against AWS DevOps Guru and Azure Monitor’s similar AI-driven troubleshooting capabilities, particularly valuable for organizations with complex Kubernetes or Cloud Run deployments.</li>
</ul>
<p>46:23 <a href="https://cloud.google.com/blog/products/data-analytics/automate-sql-translation-databricks-to-bigquery-with-gemini/">Automate SQL translation: Databricks to BigQuery with Gemini | Google </a><a href="https://cloud.google.com/blog/products/data-analytics/automate-sql-translation-databricks-to-bigquery-with-gemini/">Cloud Blo</a></p>
<ul>
<li style="font-weight:400;">Google introduces automated SQL translation from <a href="https://databricks-prod-cloudfront.cloud.databricks.com/public/4027ec902e239c93eaaa8714f173bcfc/3137082781873852/3704545280501166/1264763342038607/latest.html">Databricks Spark SQL</a> to <a href="https://cloud.google.com/bigquery?gad=1">BigQuery</a> using <a href="https://gemini.google.com/">Gemini AI</a>, addressing the growing need for cross-platform data migration as businesses diversify their cloud ecosystems. The solution combines Gemini with Vertex AI’s RAG Engine to handle complex syntax differences, function mappings, and geospatial operations like H3 functions.</li>
<li style="font-weight:400;">The architecture leverages Google Cloud Storage for source files, a curated function mapping guide, and a few-shot examples to ground Gemini’s responses, resulting in more accurate translations. The system includes a validation layer using BigQuery’s dry run mode to catch syntax errors before execution.</li>
<li style="font-weight:400;">Key technical challenges include handling differences in window functions (like FIRST_VALUE syntax variations), data type mappings, and Databricks-specific functions that need BigQuery equivalents. The RAG-enhanced approach significantly improves translation accuracy compared to using Gemini alone.</li>
<li style="font-weight:400;">This capability targets organizations looking to reduce operational costs by migrating analytics workloads from Databricks to BigQuery’s serverless architecture. Industries with complex SQL workloads and geospatial analytics would benefit most from automated translation versus manual query rewriting.</li>
<li style="font-weight:400;">While no specific pricing is mentioned, the solution promises to reduce migration time and errors compared to manual translation efforts. Google positions this as part of their broader strategy to simplify multi-cloud data operations and lower barriers for customers switching between platforms.</li>
</ul>
<p>47:13  Justin – “I find it interesting that they call out that their product is not as good as Databricks by saying ‘we’ll help you build all the things that you need for equivalents!’ And likes, that’s helpful. Thanks, Google.” </p>
<p>48:28 <a href="https://cloud.google.com/blog/products/infrastructure/measuring-the-environmental-impact-of-ai-inference/">Measuring the environmental impact of AI inference | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google released a technical paper detailing their methodology for measuring AI inference environmental impact, revealing that a median Gemini Apps text prompt uses only 0.24 watt-hours of energy, 0.03 grams of CO2e emissions, and 0.26 milliliters of water – substantially lower than many public estimates and equivalent to watching TV for less than 9 seconds.</li>
<li style="font-weight:400;">Their comprehensive measurement approach accounts for complete system dynamic power, idle machines, CPU/RAM usage, data center overhead (PUE), and water consumption—factors often overlooked in industry calculations that only consider active GPU/TPU consumption. This makes it one of the most comprehensive assessments of AI’s operational footprint.</li>
<li style="font-weight:400;">Google achieved a 33x reduction in energy consumption and a 44x reduction in carbon footprint for Gemini text prompts over 12 months through full-stack optimizations, including Mixture-of-Experts architectures, quantization techniques, speculative decoding, and its custom Ironwood TPUs, which are 30x more energy-efficient than first-generation TPUs.</li>
<li style="font-weight:400;">The methodology provides a framework for consistent industry-wide measurement of AI resource consumption, addressing growing concerns about AI’s environmental impact as inference workloads scale – fundamental as enterprises increasingly deploy generative AI applications.</li>
<li style="font-weight:400;">Google’s data centers operate at an average PUE of 1.09 and the company is pursuing 24/7 carbon-free energy while targeting 120% freshwater replenishment, demonstrating how infrastructure efficiency directly impacts AI workload sustainability.</li>
</ul>
<p>50:09  Justin – “I do appreciate that they’re trying something here.” </p>
<p>52:44 <a href="https://cloud.google.com/blog/products/identity-security/streamline-auditing-compliance-manager-is-now-in-preview/">From silos to synergy: New Compliance Manager, now in preview | Google </a><a href="https://cloud.google.com/blog/products/identity-security/streamline-auditing-compliance-manager-is-now-in-preview/">Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/security-command-center/docs/compliance-manager-overview">Google Cloud Compliance Manager</a> enters preview as an integrated Security Command Center feature, unifying security and compliance management across infrastructure, workloads, and data. </li>
<li style="font-weight:400;">It addresses the growing challenge of managing multiple regulatory frameworks by providing a single platform for configuration, monitoring, and auditing compliance requirements.</li>
<li style="font-weight:400;">The platform introduces two core constructs: Frameworks (collections of technical controls mapped to regulations, such as CIS, SOC2, ISO 27001, and FedRAMP) and CloudControls (platform-agnostic building blocks for preventive, detective, and audit modes). Organizations can utilize pre-built frameworks or create custom ones, leveraging AI-powered control authoring to expedite deployment.</li>
<li style="font-weight:400;">This positions Google Cloud competitively against AWS Security Hub and Azure Policy/Compliance Manager by offering bidirectional translation between regulatory controls and technical configurations. The integration with Security Command Center provides a unified view that competitors typically require multiple tools to achieve.</li>
<li style="font-weight:400;">Key differentiator is the automated evidence generation for audits, validated through Google’s <a href="https://cloud.google.com/blog/topics/public-sector/accelerating-fedramp-20x-how-google-cloud-is-automating-compliance?e=48754805">FedRAMP 20X partnership</a>, which could significantly reduce manual compliance work for regulated industries like healthcare, finance, and government. The platform supports deployment at the organization, folder, and project levels for granular control.</li>
<li style="font-weight:400;">Available now in preview through the Google Cloud Console under Security &gt; Compliance navigation. While pricing details aren’t provided, interested organizations can contact their Google Cloud account team or email compliance-manager-preview@google.com for access and feedback opportunities.</li>
</ul>
<p>54:01  Ryan – “The automated evidence gathering is spectacular on these tools. And it’s really what’s needed – even from a security engineer standpoint – being able to view those frameworks to see the compliance metrics, and how you’re actually performing across those things, and what’s actually impactful is super important too.” </p>
<p>59:50 <a href="https://cloud.google.com/blog/products/containers-kubernetes/gke-auto-ipam-simplifies-ip-address-management/">GKE Auto-IPAM simplifies IP address management | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/kubernetes-engine/docs/how-to/enable-auto-ipam">GKE Auto-IPAM</a> dynamically allocates and deallocates IP address ranges for nodes and pods as clusters scale, eliminating the need for large upfront IP reservations and manual intervention during scaling operations.</li>
<li style="font-weight:400;">This addresses a critical pain point in Kubernetes networking where poor IP management leads to IP_SPACE_EXHAUSTED errors that halt cluster scaling and deployments, particularly problematic given IPv4 address scarcity.</li>
<li style="font-weight:400;">The feature works with both new and existing clusters running GKE version 1.33 or higher, currently configurable via gcloud CLI or API, with Terraform and UI support coming soon.</li>
<li style="font-weight:400;">Unlike traditional static IP allocation approaches used by other cloud providers, GKE Auto-IPAM proactively manages addresses on demand, reducing administrative overhead while optimizing IPv4 utilization.</li>
<li style="font-weight:400;">Key beneficiaries include organizations running resource-intensive workloads requiring rapid scaling, as the feature ensures sufficient IP capacity is dynamically available without manual planning or intervention.</li>
</ul>
<p>1:00:58 Ryan – “I think it was just last week that Google announced that you could add IP_Space to existing clusters.” </p>
<p>1:02:47 <a href="https://cloud.google.com/blog/products/databases/firestore-with-mongodb-compatibility-is-now-ga/">Firestore with MongoDB compatibility is now GA | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/products/firestore/mongodb-compatibility?e=48754805&amp;hl=en">Firestore now supports MongoDB-compatible APIs</a> in GA, allowing developers to use existing MongoDB code, drivers, and tools with Firestore’s serverless infrastructure that offers up to 99.999% SLA and multi-region replication with strong consistency.</li>
<li style="font-weight:400;">The service includes over 200 MongoDB Query Language capabilities, unique indexes, and new aggregation stages like $lookup for joining data across collections, addressing enterprise needs for complex queries and data relationships.</li>
<li style="font-weight:400;">Enterprise features include <a href="https://cloud.google.com/firestore/mongodb-compatibility/docs/pitr">Point-in-Time Recovery</a> for 7-day rollback capability, <a href="https://cloud.google.com/firestore/mongodb-compatibility/docs/create-databases#clone-database">database cloning</a> for staging environments, <a href="https://cloud.google.com/firestore/mongodb-compatibility/docs/export-import">managed export/import</a> to Cloud Storage, and change data capture triggers for replicating data to services like <a href="https://cloud.google.com/bigquery">BigQuery</a>.</li>
<li style="font-weight:400;">Available through both <a href="https://firebase.google.com/">Firebase</a> and Google Cloud consoles as part of <a href="https://firebase.google.com/docs/firestore/editions">Firestore Enterprise</a> edition with pay-as-you-go pricing and a free tier, targeting industries like financial services, healthcare, and retail seeking MongoDB compatibility without operational overhead.</li>
<li style="font-weight:400;">This positions Google against <a href="https://docs.aws.amazon.com/documentdb/latest/developerguide/what-is.html">AWS DocumentDB</a> and <a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=h4-qSEHUxDnEL3nUBJ3YLHiZ4jR3Apgnk3F_XWikxJ6_2iABG-qhl5WJN5zxscakDRzT0zczpoSJu0a6QwQWILqWhy3d35Wa2CCt7Cp25OPQAXdWv21_OZeI6Eqzi6UE.M7TtiLQoh5T9dGlv5hBlLw&amp;eddgt=ZV3dWN5kxM16gDLqqmX8Cg%3D%3D&amp;rut=5bcc5c0b21a011037cf8f88088d1fef0ecb64677b05eed9a6aeea6d2553b98d4&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8bKq2XrOGyYfXzaxHFpH8lTVUCUxzFAzAHwPcqSl2iesLy9-ssHRhfpdQGHxw8aQHVtxeyerS-wXD7TrapbFhcC5BPNCG84GVB18V6MnEws8dd3BwLLaeK9gwrEyjT8pNRZN81QMZZq_Rcv5riZIc1Z-Dci8YojlGYqSa1g-vx-c2wNi4f_qPbqy8QohRBNyhdXtuAPKiv3PRyXdg-__Af-WRZytrP-LbbYyocF30nVogmqd1wk6iQ70MLwSyXzcNy34lx1QZayf5s63-F3E59nQmFHbjLrZJnhUzrnxW-SUZBbzHRvuiZBGUJUZjdS0TaVF79RrkBFocsUcHSW7eLN4Vg0CzJ-m2VArm9ML06osJk-C8tbBRzBl0-sRMkJ_klHf_-2u9P15gfQjw0zUTBc-iSsH-TccgZwHpCmHMSS4qVSaKDlPnCDR_hJO8pMuUNv8i0_9RfEfPK8q0J-59S6u2HJHaxNBTXxPk6EyWpeq2vCT644HdA3F39xUlCXoXEHgu3nsaqZbA_iE-FN38JuOu0GSr4ahn7Mou1knWPiuvyA6LFfdb_zXAdYYXU1BfjzjiAKY92j9MNq82XEdNEayxoIuVUrsGWkt-uviY358MdyFmNkkT40odxdshk5EH6AOb31kktsHNIE7P0nC_xXWYUluTImBlMTfzvF8qedOcG_nXb-NzzY0xxKwX8VEHCk8cqjFTv3KqKMh2MjHz4JzWvZaBAvTXzP7QWMgzkf0ybnclMagLU4DGnc4J04tjY5SIJg%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0MzklMjZjYW1wJTNkMTY4ODA3JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDE5MjElMjZjcml0ZXJpYWlkJTNka3dkLTc5MjM0MTYzMzgyOTQ2JTNhbG9jLTE5MCUyNmNhbXBhaWduaWQlM2Q1OTAyMzczNTglMjZsb2NwaHklM2Q3OTcxMiUyNmFkZ3JvdXBpZCUzZDEyNjc3Mzg1NTkwNTE0NDclMjZjaWQlM2Q3OTIzMzc2NDM5NDIzOCUyNmtkdiUzZGMlMjZrZXh0JTNkJTI2a3BnJTNkJTI2a3BpZCUzZCUyNnF1ZXJ5U3RyJTNkQXp1cmUlMjUyMENvc21vcyUyNTIwREIlMjdzJTI2dXJsJTNkaHR0cHMlM2ElMmYlMmZhenVyZS5taWNyb3NvZnQuY29tJTJmZW4tdXMlMmZwcm9kdWN0cyUyZmNvc21vcy1kYiUyZiUzZmVmX2lkJTNkX2tfZTgyNzVkMDgzODI4MWQ3MDQxZmMzNWJmZTI0NGNjNmJfa18lMjZPQ0lEJTNkQUlEY21tNWVkc3dkdXVfU0VNX19rX2U4Mjc1ZDA4MzgyODFkNzA0MWZjMzViZmUyNDRjYzZiX2tfJTI2bXNjbGtpZCUzZGU4Mjc1ZDA4MzgyODFkNzA0MWZjMzViZmUyNDRjYzZi%26rlid%3De8275d0838281d7041fc35bfe244cc6b&amp;vqd=4-333022869782158990937203659863020959256&amp;iurl=%7B1%7DIG%3DE9DEC7E2698A4A1B8FCBAE5F43476D27%26CID%3D207573C1BB7164BA1556659DBAED65C1%26ID%3DDevEx%2C5046.1">Azure Cosmos DB’s</a> <a href="https://www.mongodb.com/docs/manual/query-api/">MongoDB API</a> by leveraging Firestore’s existing serverless architecture rather than building a separate MongoDB-compatible service.</li>
</ul>
<p>1:04:42 <a href="https://cloud.google.com/blog/products/containers-kubernetes/gke-gets-new-pricing-and-capabilities-on-10th-birthday/">GKE gets new pricing and capabilities on 10th birthday | Google Cloud </a><a href="https://cloud.google.com/blog/products/containers-kubernetes/gke-gets-new-pricing-and-capabilities-on-10th-birthday/">Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/kubernetes-engine">GKE</a> is transitioning to a single paid tier in September 2025, which includes multi-cluster management features such as Fleets, Teams, Config Management, and Policy Controller, all at no additional cost. Optional à la carte features will be available as needed.</li>
<li style="font-weight:400;">Autopilot mode, which provides fully managed Kubernetes without requiring deep expertise, will soon be available for all clusters, including existing GKE Standard clusters on a per-workload basis with the ability to toggle on and off.</li>
<li style="font-weight:400;">GKE now supports larger clusters to handle AI workloads at scale, with customers such as Anthropic, Moloco, and Signify utilizing the platform for training and serving AI models on TPUs, as well as running global services.</li>
<li style="font-weight:400;">The new container-optimized compute platform in Autopilot delivers improved efficiency and performance, allowing workloads to serve more traffic with the same capacity or maintain existing traffic with fewer resources.</li>
<li style="font-weight:400;">After 10 years since its launch and 11 years since Kubernetes was open-sourced from Google’s Borg system, GKE continues to incorporate learnings from running Google’s own services, such as <a href="https://cloud.google.com/vertex-ai">Vertex AI</a>, into the managed platform.</li>
<li style="font-weight:400;">Happy Birthday…</li>
</ul>
<h2>Azure</h2>
<p>1:09:17 <a href="https://azure.microsoft.com/en-us/blog/microsofts-open-source-journey-from-20000-lines-of-linux-code-to-ai-at-global-scale/">From 20,000 lines of Linux code to global scale: Microsoft’s open-source </a><a href="https://azure.microsoft.com/en-us/blog/microsofts-open-source-journey-from-20000-lines-of-linux-code-to-ai-at-global-scale/">journey | Microsoft Azure Blog</a></p>
<ul>
<li style="font-weight:400;">Microsoft has evolved from contributing 20,000 lines of <a href="https://www.linux.org/pages/download/">Linux</a> code in 2009 to becoming the largest public cloud contributor to <a href="https://www.cncf.io/training/">CNCF</a> over the past three years, with 66% of Azure customer cores now running Linux workloads.</li>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/aks/what-is-aks">Azure Kubernetes Service</a> powers some of the world’s largest deployments, including <a href="https://www.microsoft.com/en-us/microsoft-365/microsoft-365-business">Microsoft 365</a>‘s COSMIC platform, which runs millions of cores, and OpenAI’s ChatGPT, serving 700 million weekly users with just 12 engineers managing the infrastructure.</li>
<li style="font-weight:400;">Microsoft has open-sourced multiple enterprise-grade tools, including Dapr for distributed applications, <a href="https://learn.microsoft.com/en-us/azure/aks/ai-toolchain-operator-fine-tune">KAITO</a> for AI workload automation on Kubernetes, and <a href="https://huggingface.co/microsoft/Phi-4-mini-instruct">Phi-4 Mini</a>, a 3.8 billion parameter AI model optimized for edge computing.</li>
<li style="font-weight:400;">The company’s open-source strategy focuses on upstream-first contributions, then downstream product integration, contrasting with AWS and GCP’s tendency to fork projects or build proprietary alternatives.</li>
<li style="font-weight:400;">Azure’s managed services like AKS and PostgreSQL abstract operational complexity while maintaining open-source flexibility, enabling rapid scaling without large operations teams, as demonstrated by ChatGPT handling over 1 billion queries daily.</li>
</ul>
<p>1:11:15  Matt – “I’m confused by that fourth thing, because they fully backed Redis when they changed the licensing and were the only cloud that did, but we focus on open source first…” </p>
<p>1:15:02 <a href="https://opensource.microsoft.com/blog/2025/08/25/documentdb-joins-the-linux-foundation/">DocumentDB joins the Linux Foundation – Microsoft Open Source Blog</a></p>
<ul>
<li style="font-weight:400;">Microsoft’s <a href="https://github.com/microsoft/documentdb">DocumentDB</a>, an open-source <a href="https://www.mongodb.com/try/download/community">MongoDB</a>-compatible database built on PostgreSQL, has <a href="https://www.linuxfoundation.org/press/linux-foundation-welcomes-documentdb-to-advance-open-developer-first-nosql-innovation">joined the Linux Foundation</a> to ensure vendor-neutral governance and broader community collaboration. </li>
<li style="font-weight:400;">The project provides a <a href="https://azure.microsoft.com/en-us/resources/cloud-computing-dictionary/what-is-nosql-database/">NoSQL</a> document database experience while leveraging PostgreSQL’s reliability and ecosystem.</li>
<li style="font-weight:400;">The move positions DocumentDB as a potential industry standard for NoSQL databases, similar to ANSI SQL for relational databases, with companies like Yugabyte and SingleStore already joining the technical steering committee. This contrasts with AWS DocumentDB, which remains a proprietary managed service.</li>
<li style="font-weight:400;">DocumentDB offers developers MongoDB wire protocol compatibility without vendor lock-in, using standard PostgreSQL extensions under the MIT license rather than requiring a forked database engine. This approach enables existing PostgreSQL deployments to add document database capabilities without requiring a migration to a separate system.</li>
<li style="font-weight:400;">The project targets organizations wanting MongoDB-style document databases but preferring PostgreSQL’s operational model, backup tools, and existing infrastructure investments. Unlike Azure Cosmos DB’s multi-model approach, DocumentDB focuses specifically on document workloads with PostgreSQL’s proven scalability.</li>
<li style="font-weight:400;">With the Linux Foundation governance, DocumentDB provides an open alternative to proprietary document databases from cloud vendors, potentially reducing costs for self-managed deployments while maintaining compatibility with MongoDB applications and tools.</li>
</ul>
<p>56:01  Justin – “Now the question is, can I take these DocumentDB extensions and put them on Cloud SQL from Google without having to use Firestore? That’s the real question.” </p>
<p>1:17:31 <a href="https://azure.microsoft.com/en-us/updates?id=500996">Public Preview: Azure Bastion now supports connectivity to private AKS </a><a href="https://azure.microsoft.com/en-us/updates?id=500996">clusters via tunneling</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/bastion/bastion-overview">Azure Bastion</a> now enables secure tunneling from local machines to private AKS clusters’ API servers, eliminating the need for VPN connections or exposing clusters to public internet while maintaining standard kubectl workflows.</li>
<li style="font-weight:400;">This feature addresses a common security challenge where organizations want private AKS clusters but struggle with developer access, competing with AWS Systems Manager Session Manager and GCP Identity-Aware Proxy for Kubernetes access.</li>
<li style="font-weight:400;">The tunneling capability works with existing Kubernetes tooling and supports both private and public clusters with API server authorized IP ranges, reducing operational complexity for teams managing multiple cluster types.</li>
<li style="font-weight:400;">Target customers include enterprises with strict security requirements and regulated industries that need private clusters but want to avoid managing complex VPN infrastructure or jump boxes for developer access.</li>
<li style="font-weight:400;">While Azure Bastion pricing starts at $0.095/hour plus data transfer costs, this feature could reduce overall infrastructure costs by eliminating dedicated VPN gateways or bastion hosts typically required for private cluster access.</li>
</ul>
<p>1:18:36  Matt – “Azure Bastion is actually pretty good. We use it at my day job, and it’s really not bad.”</p>
<p>1:23:37 <a href="https://azure.microsoft.com/en-us/updates?id=501017">Generally Available: Application Gateway adds MaxSurge support for </a><a href="https://azure.microsoft.com/en-us/updates?id=501017">zero-capacity-impact upgrades</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/application-gateway/overview">Azure Application Gateway</a> now provisions new instances during rolling upgrades before taking old ones offline through <a href="https://learn.microsoft.com/en-us/azure/virtual-machine-scale-sets/virtual-machine-scale-sets-maxsurge">MaxSurge support</a>, eliminating the capacity drops that previously occurred during version transitions.</li>
<li style="font-weight:400;">This addresses a long-standing pain point where Application Gateway upgrades would temporarily reduce available capacity as instances cycled, potentially impacting application availability during maintenance windows.</li>
<li style="font-weight:400;">The feature brings Azure closer to <a href="https://docs.aws.amazon.com/elasticloadbalancing/latest/application/introduction.html">AWS Application Load Balancer</a>‘s connection draining capabilities, though AWS still maintains an edge with more granular control over instance replacement timing.</li>
<li style="font-weight:400;">Enterprise customers running mission-critical workloads will benefit most, as they can now perform gateway updates during business hours without risking performance degradation or connection drops.</li>
<li style="font-weight:400;">While the feature itself doesn’t add direct costs, it may temporarily increase compute charges during upgrades as both old and new instances run simultaneously before the transition completes.</li>
</ul>
<p>1:24:53  Matt – “It’s amazing this wasn’t there and native, and why is this something you have to think about? It’s supposed to be a managed service. I have to tell it the number of nodes, tell it to do these things…it just feels like a very clunky managed service. And you still have to bring your own certificate.” </p>
<p>1:26:00 <a href="https://azure.microsoft.com/en-us/updates?id=501233">Generally Available: Azure Migrate now supports migration to disks with</a> <a href="https://azure.microsoft.com/en-us/updates?id=501233">Zone-Redundant Storage (ZRS) redundancy </a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/migrate/?view=migrate-classic">Azure Migrate</a> now enables direct migration to Zone-Redundant Storage (ZRS) disks, which automatically replicate data synchronously across three availability zones in a region for enhanced durability and availability compared to locally redundant storage.</li>
<li style="font-weight:400;">This feature addresses a key gap for organizations requiring high availability during cloud migrations, as they can now maintain zone redundancy from the start rather than converting disks post-migration, reducing operational overhead and potential downtime.</li>
<li style="font-weight:400;">ZRS disks provide 99.9999999999% (12 9’s) durability over a given year and protect against datacenter-level failures, making this particularly valuable for mission-critical workloads that need continuous availability during zone outages.</li>
<li style="font-weight:400;">While AWS offers similar zone-redundant storage options through EBS Multi-Attach and GCP has regional persistent disks, Azure’s integration directly into the migration tool streamlines the process compared to competitors, who require post-migration configuration.</li>
<li style="font-weight:400;">The feature targets enterprises with strict compliance requirements and those running stateful applications where data loss or extended downtime during zone failures would have a significant business impact, though ZRS disks typically cost 50% more than standard locally redundant storage.</li>
</ul>
<p>1:28:40  Matt  – “This is more for backup. So if you’re running a file server in one region, in one zone, and that zone goes down, your data is still in the other zone – so you spin up a server and attach it.”  </p>
<h2>Other Clouds </h2>
<p>1:31:45 <a href="https://www.digitalocean.com/blog/mcp-server-public-release">DigitalOcean MCP Server is now available | DigitalOcean</a></p>
<ul>
<li style="font-weight:400;">DigitalOcean launched an MCP (Model Context Protocol) </li>
<li style="font-weight:400;">Server that enables developers to manage cloud resources using natural language commands through AI tools like Claude and Cursor. </li>
<li style="font-weight:400;">The server runs locally and currently supports 9 services, including <a href="https://www.digitalocean.com/products/app-platform">App Platform</a>, <a href="https://www.digitalocean.com/products/managed-databases">Databases</a>, Kubernetes, and <a href="https://www.digitalocean.com/products/droplets">Droplets</a>.</li>
<li style="font-weight:400;">MCP is an open-source standard that provides a consistent way for AI systems to connect with external tools and data sources. This eliminates the need for fragmented integrations and allows developers to perform cloud operations directly within their development environment.</li>
<li style="font-weight:400;">The implementation allows developers to use plain English commands like “deploy a Ruby on Rails app from my <a href="https://github.com/digitalocean-labs/mcp-digitalocean">GitHub repo</a>” or “create a new PostgreSQL database” instead of writing scripts or navigating multiple dashboards. Users maintain control of their API credentials, which stay local.</li>
<li style="font-weight:400;">Security is managed through service scoping, where developers can restrict AI assistant access to only specific services using flags. This prevents context bloat and limits access to only necessary resources while maintaining audit trails and error handling.</li>
<li style="font-weight:400;">The service is currently free and in public preview with hundreds of developers already using it daily for provisioning infrastructure, monitoring usage, and automating cloud tasks. It works with <a href="https://claude.ai/">Claude</a>, <a href="https://cursor.com/">Cursor</a>, <a href="https://code.visualstudio.com/download">VS Code</a>, <a href="https://windsurf.com/editor">Windsurf</a>, and other MCP-compatible clients.</li>
</ul>
<h2>Cloud Journey</h2>
<p>1:00:42 <a href="https://cloud.google.com/blog/products/application-modernization/a-guide-to-platform-engineering/">A guide to platform engineering | Google Cloud Blog</a></p>
<p>We had homework to watch the full video — We tried but it was so boring.</p>
<p>The blog post is good. Video is a recording of a conference talk…but man. We promise to find more interesting topics for the next Cloud Journey installation. </p>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2131352/c1e-zo9ob7j879un3m64-okz62jx5aq0q-kh7nnv.mp3" length="138647136"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 319 of The Cloud Pod, where the forecast is always cloudy! Justin, Matt, and Ryan are in the studio to bring you all the latest in cloud and AI news. AWS Cost MCP makes exploring your finops data as simple as english text. We’ve got a sunnier view for junior devs, a Microsoft open source development, tokens, and it’s even Kubernetes’ birthday – let’s get into it! 
Titles we almost went with this week:


From Linux Hater to Open Source Darling: A Microsoft Love Story
20,000 Lines of Code and a Dream: Microsoft’s Open Source Glow-Up
Ctrl+Alt+Delete Your Assumptions: Microsoft Goes Full Penguin
Token and Esteem: Amazon Bedrock Gets a Counter
CSI: Cloud Scene Investigation
The Great SQL Migration: How AI Became the Universal Translator
Token and Ye Shall Receive: Bedrock’s New Counting Feature
The Count of Monte Token: A Bedrock Tale – mk
Ctrl+Z for Your Database: Now with Built-in Lag Time
IP Freely: GKE Takes the Pain Out of Address Management
AWS CEO: AI Can’t Replace Junior Devs Because Someone Has to Fix the AI’s Code
Better Late Than Never: RDS PostgreSQL Gets Time Travel
The SQL Whisperer: Teaching AI to Speak Database
DigitalOcean Goes Full Chatbot: Your Infrastructure Now Speaks Human
Musk vs Cook: The App Store Wars Episode AI
Firestore Goes Mongo: A Database Love Story
GKE Turns 10: Now With More Candles and Less Complexity
Prime Day Infrastructure: Now With 87,000 AI Chips and a Robot Army
AWS Scales to Quadrillion Requests: Your Black Friday Traffic Looks Cute
AWS billing now speaks human, thanks to MCPs
The Bastion Holds: Azure’s New Gateway to Kubernetes Kingdoms
The Surge Before the Merge: Azure’s New Upgrade Strategy
CNI Overlay: Because Your Pods Deserve Their Own ZIP Code


AI Is Going Great – or How ML Makes Money 
00:46 Musk’s xAI sues Apple, OpenAI alleging scheme that harmed X, Grok

xAI filed a lawsuit against Apple and OpenAI, alleging anticompetitive practices in AI chatbot distribution, claiming Apple deprioritizes competing AI apps like Grok in the App Store while favoring ChatGPT through direct integration into iOS devices.
The lawsuit highlights tensions in AI platform distribution models, where cloud-based AI services depend on mobile app stores for user access, potentially creating gatekeeping concerns for competing generative AI providers.
Apple’s partnership with OpenAI to integrate ChatGPT into iPhone, iPad, and Mac products represents a shift toward native AI integration rather than app-based access, which could impact how cloud AI services reach end users.
The dispute underscores growing competition in the generative AI market, where multiple players, including xAI’s Grok, OpenAI’s ChatGPT, DeepSeek, and Perplexity, are vying for market position through both cloud APIs and mobile distribution channels.
For cloud developers, this case raises questions about AI service distribution strategies and whether direct device integration partnerships will become necessary to compete effectively against app store-based distribution models.

01:55  Justin – “There’s always a potential for conflict of interest when you have a partnership like this, but also the app store – there’s a...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2131352/c1a-k5d5-6z3jr8mvcwmd-jjmol3.jpg"></itunes:image>
                                                                            <itunes:duration>01:36:14</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2131352/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[318: One Extension to Rule Them All (And in the VS Code Bind Them)]]>
                </title>
                <pubDate>Fri, 29 Aug 2025 23:21:39 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2127639</guid>
                                    <link>https://tcpfm.castos.com/episodes/318-one-extension-to-rule-them-all-and-in-the-vs-code-bind-them</link>
                                <description>
                                            <![CDATA[<h3> Welcome to episode 318 of The Cloud Pod, where the forecast is always cloudy! We’re going on an adventure! Justin and Ryan have formed a fellowship of the cloud, and they’re bringing you all the latest and greatest news from Valinor to Helm’s Deep, and Azure to AWS to GCP. We’ve water issues, some Magic Quadrants, and Aurora updates…but sadly no potatoes. Let’s get into it! </h3>
<h3>Titles we almost went with this week:</h3>
<ul>
<li>You’ve Got No Mail: AOL Finally Hangs  Up on Dial-Up</li>
<li>Ctrl+Alt+Delete Climate Change</li>
<li>H2-Oh No: Your Gmail is Thirsty</li>
<li>The Price is Vibe: Kiro’s New    Request-Based Model</li>
<li>Spec-tacular Pricing: Kiro Leaves the Waitlist Behind</li>
<li>SHA-zam! GitHub Actions Gets Its Security Cape</li>
<li>Breaking Bad Actions: GitHub’s Supply Chain Intervention</li>
<li>Graph Your Way to Infrastructure Happiness</li>
<li>The Tables Have Turned: S3 Gets Its Iceberg Moment</li>
<li>Subnet Where It Hurts: GKE Finally Gets IP Address Relief</li>
<li>All Your Database Are Belong to Database Center</li>
<li>From Droplets to Dollars: DigitalOcean’s AI Pivot Pays Off</li>
<li>DigitalOcean Rides the AI Wave to Record Earnings</li>
<li>Agent Smith Would Be Proud: Microsoft’s Multi-Agent Matrix</li>
<li>Aurora Borealis: A Decade of Database Enlightenment</li>
<li>Fifteen Shades of Cloud: AWS’s Unbroken Streak</li>
<li>The Fast and the Failover-ious: Aurora Edition</li>
<li>Gone in Single-Digit Seconds: AWS’s Speedy Database Recovery</li>
<li>Agent 007: License to Secure Your AI</li>
</ul>
<p>A big thanks to this week’s sponsor:</p>
<p>We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our Slack channel for more info.</p>
<h2>General News </h2>
<p>01:02 <a href="https://apnews.com/article/aol-shuts-down-dial-up-internet-275a81f8a725619663437350b7c7ed36">AOL is finally shutting down its dial-up internet service | AP News</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.aol.com/">AOL</a> is <a href="https://help.aol.com/articles/dial-up-internet-to-be-discontinued?guccounter=1">discontinuing</a> its dial-up internet service on September 30, 2024, marking the end of a technology that introduced millions to the internet in the 1990s and early 2000s.</li>
<li style="font-weight:400;">Census data shows 163,401 US households still used dial-up in 2023, representing 0.13% of homes with internet subscriptions, highlighting the persistence of legacy infrastructure in underserved areas – which is honestly crazy. </li>
<li style="font-weight:400;">Here’s hoping that these folks are able to switch to alternatives, like Starlink.</li>
<li style="font-weight:400;">This shutdown reflects broader technology lifecycle patterns as companies retire legacy services like <a href="https://apnews.com/article/microsoft-closing-skype-7ac4e86f55acb40098476e01d8d4a473">Skype</a>, <a href="https://apnews.com/article/internet-explorer-shutting-down-e45abf1df9d34c135e41a01cf7d96c25">Internet Explorer</a>, and <a href="https://apnews.com/general-news-f1b9748bc7db41b9adea3231a6adfa14">AOL Instant Messenger</a> to focus resources on modern platforms.</li>
<li style="font-weight:400;">The transition away from dial-up demonstrates the evolution from telephone-based connectivity to broadband and wireless technologies that now dominate internet access.</li>
<li style="font-weight:400;">AOL’s journey from a $164 billion valuation in 2000 to being sold by Verizon in 2021 illustrates the rapid shifts in technology markets and the challenges of adapting legacy business models.</li>
</ul>
<p>02:30 <a href="https://www.thejournal.ie/uk-government-advises-people-to-delete-old-photos-and-emails-as-england-faces-water-shortfall-6788643-Aug2025/?utm_source=shortlink">British government asks people to delete old emails to reduce data centres’ </a><a href="https://www.thejournal.ie/uk-..."></a></p>
<h3>Chapters</h3>
<ul><li>(00:00:00) - Week in the Cloud: GCP, Azure, VS Code Bind</li><li>(00:00:57) - AOL to discontinue dial-up service</li><li>(00:02:27) - UK Government Tells You to Deactivate Your Emails to Save Water</li><li>(00:06:03) - UK's Data Center Problem</li><li>(00:08:18) - GitHub Actions: SHA pinning and more</li><li>(00:11:04) - Curo Pricing Plans Go Live for AWS</li><li>(00:16:05) - Aurora DB Turns 10 Years Old</li><li>(00:18:22) - Happy Birthday to My Sister!</li><li>(00:18:36) - Gartner Magic Quadrant for Strategic Cloud Platform Services</li><li>(00:20:53) - Gartner's Strategic Cloud Platform Services</li><li>(00:25:01) - Gartner's Cloud Assessment: Microsoft, Google, Azure</li><li>(00:26:32) - Go Driver to Reduce Database Failover Times by 60%</li><li>(00:28:23) - Amazon AWS Announces R8i Flex and R7i Flex</li><li>(00:30:58) - GKE: Multi-Subnet Support for Kubernetes</li><li>(00:33:51) - Database Center for Google Cloud: Unifying Database Fleet Management</li><li>(00:35:59) - Google Cloud HSM: Client Side Encryption</li><li>(00:38:06) - Google Cloud Announces Comprehensive AI Security Abilities</li><li>(00:41:23) - Google LLM: Right Size for GPUs and TPUs</li><li>(00:44:14) - Microsoft Terraform Adds Ms. Graph Provider in Public Preview</li><li>(00:46:45) - Azure AI Foundry: Unifying OneLake and Agent Factory</li><li>(00:52:03) - Gartner's Cloud: Oracle-Microsoft partnership</li><li>(00:54:52) - DigitalOcean Announces SQL Stored Procedures Support</li><li>(00:58:35) - Shifting Down: How Google Does It</li><li>(01:04:19) - Back in the Cloud: Week Three</li><li>(01:04:43) - Week in Cloud: The Cloud Podcast</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[ Welcome to episode 318 of The Cloud Pod, where the forecast is always cloudy! We’re going on an adventure! Justin and Ryan have formed a fellowship of the cloud, and they’re bringing you all the latest and greatest news from Valinor to Helm’s Deep, and Azure to AWS to GCP. We’ve water issues, some Magic Quadrants, and Aurora updates…but sadly no potatoes. Let’s get into it! 
Titles we almost went with this week:

You’ve Got No Mail: AOL Finally Hangs  Up on Dial-Up
Ctrl+Alt+Delete Climate Change
H2-Oh No: Your Gmail is Thirsty
The Price is Vibe: Kiro’s New    Request-Based Model
Spec-tacular Pricing: Kiro Leaves the Waitlist Behind
SHA-zam! GitHub Actions Gets Its Security Cape
Breaking Bad Actions: GitHub’s Supply Chain Intervention
Graph Your Way to Infrastructure Happiness
The Tables Have Turned: S3 Gets Its Iceberg Moment
Subnet Where It Hurts: GKE Finally Gets IP Address Relief
All Your Database Are Belong to Database Center
From Droplets to Dollars: DigitalOcean’s AI Pivot Pays Off
DigitalOcean Rides the AI Wave to Record Earnings
Agent Smith Would Be Proud: Microsoft’s Multi-Agent Matrix
Aurora Borealis: A Decade of Database Enlightenment
Fifteen Shades of Cloud: AWS’s Unbroken Streak
The Fast and the Failover-ious: Aurora Edition
Gone in Single-Digit Seconds: AWS’s Speedy Database Recovery
Agent 007: License to Secure Your AI

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our Slack channel for more info.
General News 
01:02 AOL is finally shutting down its dial-up internet service | AP News

AOL is discontinuing its dial-up internet service on September 30, 2024, marking the end of a technology that introduced millions to the internet in the 1990s and early 2000s.
Census data shows 163,401 US households still used dial-up in 2023, representing 0.13% of homes with internet subscriptions, highlighting the persistence of legacy infrastructure in underserved areas – which is honestly crazy. 
Here’s hoping that these folks are able to switch to alternatives, like Starlink.
This shutdown reflects broader technology lifecycle patterns as companies retire legacy services like Skype, Internet Explorer, and AOL Instant Messenger to focus resources on modern platforms.
The transition away from dial-up demonstrates the evolution from telephone-based connectivity to broadband and wireless technologies that now dominate internet access.
AOL’s journey from a $164 billion valuation in 2000 to being sold by Verizon in 2021 illustrates the rapid shifts in technology markets and the challenges of adapting legacy business models.

02:30 British government asks people to delete old emails to reduce data centres’ ]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[318: One Extension to Rule Them All (And in the VS Code Bind Them)]]>
                </itunes:title>
                                    <itunes:episode>318</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<h3> Welcome to episode 318 of The Cloud Pod, where the forecast is always cloudy! We’re going on an adventure! Justin and Ryan have formed a fellowship of the cloud, and they’re bringing you all the latest and greatest news from Valinor to Helm’s Deep, and Azure to AWS to GCP. We’ve water issues, some Magic Quadrants, and Aurora updates…but sadly no potatoes. Let’s get into it! </h3>
<h3>Titles we almost went with this week:</h3>
<ul>
<li>You’ve Got No Mail: AOL Finally Hangs  Up on Dial-Up</li>
<li>Ctrl+Alt+Delete Climate Change</li>
<li>H2-Oh No: Your Gmail is Thirsty</li>
<li>The Price is Vibe: Kiro’s New    Request-Based Model</li>
<li>Spec-tacular Pricing: Kiro Leaves the Waitlist Behind</li>
<li>SHA-zam! GitHub Actions Gets Its Security Cape</li>
<li>Breaking Bad Actions: GitHub’s Supply Chain Intervention</li>
<li>Graph Your Way to Infrastructure Happiness</li>
<li>The Tables Have Turned: S3 Gets Its Iceberg Moment</li>
<li>Subnet Where It Hurts: GKE Finally Gets IP Address Relief</li>
<li>All Your Database Are Belong to Database Center</li>
<li>From Droplets to Dollars: DigitalOcean’s AI Pivot Pays Off</li>
<li>DigitalOcean Rides the AI Wave to Record Earnings</li>
<li>Agent Smith Would Be Proud: Microsoft’s Multi-Agent Matrix</li>
<li>Aurora Borealis: A Decade of Database Enlightenment</li>
<li>Fifteen Shades of Cloud: AWS’s Unbroken Streak</li>
<li>The Fast and the Failover-ious: Aurora Edition</li>
<li>Gone in Single-Digit Seconds: AWS’s Speedy Database Recovery</li>
<li>Agent 007: License to Secure Your AI</li>
</ul>
<p>A big thanks to this week’s sponsor:</p>
<p>We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our Slack channel for more info.</p>
<h2>General News </h2>
<p>01:02 <a href="https://apnews.com/article/aol-shuts-down-dial-up-internet-275a81f8a725619663437350b7c7ed36">AOL is finally shutting down its dial-up internet service | AP News</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.aol.com/">AOL</a> is <a href="https://help.aol.com/articles/dial-up-internet-to-be-discontinued?guccounter=1">discontinuing</a> its dial-up internet service on September 30, 2024, marking the end of a technology that introduced millions to the internet in the 1990s and early 2000s.</li>
<li style="font-weight:400;">Census data shows 163,401 US households still used dial-up in 2023, representing 0.13% of homes with internet subscriptions, highlighting the persistence of legacy infrastructure in underserved areas – which is honestly crazy. </li>
<li style="font-weight:400;">Here’s hoping that these folks are able to switch to alternatives, like Starlink.</li>
<li style="font-weight:400;">This shutdown reflects broader technology lifecycle patterns as companies retire legacy services like <a href="https://apnews.com/article/microsoft-closing-skype-7ac4e86f55acb40098476e01d8d4a473">Skype</a>, <a href="https://apnews.com/article/internet-explorer-shutting-down-e45abf1df9d34c135e41a01cf7d96c25">Internet Explorer</a>, and <a href="https://apnews.com/general-news-f1b9748bc7db41b9adea3231a6adfa14">AOL Instant Messenger</a> to focus resources on modern platforms.</li>
<li style="font-weight:400;">The transition away from dial-up demonstrates the evolution from telephone-based connectivity to broadband and wireless technologies that now dominate internet access.</li>
<li style="font-weight:400;">AOL’s journey from a $164 billion valuation in 2000 to being sold by Verizon in 2021 illustrates the rapid shifts in technology markets and the challenges of adapting legacy business models.</li>
</ul>
<p>02:30 <a href="https://www.thejournal.ie/uk-government-advises-people-to-delete-old-photos-and-emails-as-england-faces-water-shortfall-6788643-Aug2025/?utm_source=shortlink">British government asks people to delete old emails to reduce data centres’ </a><a href="https://www.thejournal.ie/uk-government-advises-people-to-delete-old-photos-and-emails-as-england-faces-water-shortfall-6788643-Aug2025/?utm_source=shortlink">water use</a></p>
<ul>
<li style="font-weight:400;">The UK government is advising citizens to delete old emails and photos to reduce water consumption by data centers, as England faces potential water shortages by 2050.</li>
<li style="font-weight:400;">Data centers require significant water for cooling systems, with some facilities using millions of gallons daily to maintain optimal operating temperatures for servers.</li>
<li style="font-weight:400;">This highlights the often-overlooked environmental impact of cloud storage, where seemingly harmless archived data contributes to ongoing resource consumption even when unused.</li>
<li style="font-weight:400;">The recommendation represents a shift toward individual responsibility for cloud sustainability, though the actual impact of consumer data deletion versus enterprise usage remains unclear.</li>
<li style="font-weight:400;">This raises questions about whether cloud providers should implement more aggressive data lifecycle policies or invest in water-efficient cooling technologies rather than relying on user behavior changes.</li>
<li style="font-weight:400;">Bottom line: good for data privacy, bad for water usage. </li>
</ul>
<p>03:01  Ryan – “It’s going to make it worse! Data at rest doesn’t use a whole lot of resources. Deleting anything from a file system is expensive from a CPU perspective, and it’s going to cause the temperature to go up – therefore, more cooling…” </p>
<p>01:17 <a href="https://www.bbc.com/news/articles/clyr9nx0jrzo">Data centres to be expanded across UK as concerns mount</a></p>
<ul>
<li style="font-weight:400;">The UK is planning nearly 100 new data centers by 2030, representing a 20% increase from the current 477 facilities, with major investments from Microsoft, Google, and Blackstone Group totaling billions of pounds. </li>
<li style="font-weight:400;">This expansion is driven by AI workload demands and positions the UK as a critical hub for cloud infrastructure.</li>
<li style="font-weight:400;">Energy consumption concerns are mounting as these facilities could add 71 TWh of electricity demand over 25 years, with evidence from Ohio showing residential energy bills increasing by $20 monthly due to data center operations. </li>
<li style="font-weight:400;">The UK government has established an AI Energy Council to address supply-demand challenges.</li>
<li style="font-weight:400;">Water usage for cooling systems is creating infrastructure strain, particularly in areas serviced by Thames Water, with Anglian Water already objecting to one proposed site. New facilities are exploring air cooling and closed-loop systems to reduce environmental impact.</li>
<li style="font-weight:400;">Planning approval timelines of 5-7 years are pushing some operators to consider building in other countries, potentially threatening the UK’s position as a major data center hub. </li>
<li style="font-weight:400;">The government has designated data centers as critical national infrastructure and is overturning local planning rejections to accelerate development.</li>
<li style="font-weight:400;">The concentration of new facilities in London and surrounding counties raises questions about regional infrastructure capacity and whether existing power and water systems can support this rapid expansion without impacting residential services and pricing.</li>
</ul>
<p>07:12  Justin – “Power and cooling are definitely a problem… There is pressure on using water in data centers to cool them. That is a valid concern – especially with a hundred new data centers coming online, as well as powering. How do you power all those hungry, hungry GPUs?” </p>
<h2>Cloud Tools </h2>
<p>08:30 <a href="https://github.blog/changelog/2025-08-15-github-actions-policy-now-supports-blocking-and-sha-pinning-actions/">GitHub Actions policy now supports blocking and SHA pinning actions – </a><a href="https://github.blog/changelog/2025-08-15-github-actions-policy-now-supports-blocking-and-sha-pinning-actions/">GitHub Changelog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.github.com/en/actions">GitHub Actions</a> now lets administrators explicitly block malicious or compromised actions by adding a ! prefix to entries in the allowed actions policy, providing a critical defense mechanism when third-party workflows are identified as security threats.</li>
<li style="font-weight:400;">The new <a href="https://docs.github.com/actions/reference/security/secure-use#using-third-party-actions">SHA pinning enforcement</a> feature requires workflows to reference actions using full commit SHAs instead of tags or branches, preventing automatic execution of malicious code that could be injected into compromised dependencies.</li>
<li style="font-weight:400;">This addresses a major supply chain security gap where compromised actions could exfiltrate secrets or modify code across all dependent workflows, giving organizations rapid response capabilities to limit exposure.</li>
<li style="font-weight:400;">GitHub is also introducing <a href="https://github.com/github/roadmap/issues/1137">immutable releases</a> that prevent changes to existing tags and assets, enabling developers to pin tags with confidence and use <a href="https://docs.github.com/code-security/dependabot/working-with-dependabot/keeping-your-actions-up-to-date-with-dependabot">Dependabot</a> for safe updates without risk of malicious modifications.</li>
<li style="font-weight:400;">These features are particularly valuable for enterprises managing large GitHub Actions ecosystems, as they can now enforce security policies at the organization or repository level while maintaining the flexibility of the open source action marketplace.</li>
</ul>
<p>09:41  Ryan – “This is something that’s been really relevant to my day job; I’ve been arguing for months now to NOT expand permissions to cloud and other integrations for GitHub actions, because I’m not a fan of the security actions.” </p>
<h2>AWS</h2>
<p>11:26 <a href="https://kiro.dev/blog/pricing-plans-are-live/">Kiro Pricing Plans Are Now Live – Kiro</a></p>
<ul>
<li style="font-weight:400;"><a href="https://kiro.dev/">Kiro</a> is launching <a href="https://kiro.dev/blog/understanding-kiro-pricing-specs-vibes-usage-tracking/">a tiered pricing model</a> with Free, Pro ($29/month), Pro+ ($99/month), and Power ($299/month) plans, transitioning from their preview/waitlist model to allow broader access to their cloud development tool.</li>
<li style="font-weight:400;">The pricing structure is based on <a href="https://kiro.dev/docs/chat/vibe/">“Vibe” and “Spec”</a> requests, with the free tier offering 50 Vibe requests monthly and paid tiers providing varying amounts of both request types, plus optional overage charges for flexibility.</li>
<li style="font-weight:400;">New users receive a 14-day welcome bonus of 100 Spec and 100 Vibe requests to evaluate the tool’s capabilities before committing to a paid plan, with immediate plan activation and modification available.</li>
<li style="font-weight:400;">The tool integrates with Google, GitHub, and <a href="https://docs.aws.amazon.com/signin/latest/userguide/mfa-aws_builder_id.html">AWS Builder ID authentication</a>, suggesting it’s positioned as a cloud development assistant or automation tool that works across major platforms.</li>
<li style="font-weight:400;">Kiro appears to solve the problem of cloud development workflow optimization by providing request-based interactions, though the exact nature of what Vibe and Spec requests accomplish isn’t detailed in this pricing announcement.</li>
</ul>
<p>13:19  Ryan – “I think it’s great, but I’m kind of put off by the free plan not including anything, and then the 14-day limit for new users. I just feel like that’s too constricting, and it will keep me from trying it.” </p>
<p>13:47 <a href="https://aws.amazon.com/about-aws/whats-new/2025/08/amazon-athena-create-table-select-amazon-s3-tables/">Amazon Athena now supports CREATE TABLE AS SELECT with Amazon </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/08/amazon-athena-create-table-select-amazon-s3-tables/">S3 Tables</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/athena/">Athena</a> now supports CREATE TABLE AS SELECT (CTAS) with <a href="https://aws.amazon.com/s3/features/tables/">Amazon S3 Tables</a>, enabling users to query existing datasets and create new S3 Tables with results in a single SQL statement. </li>
<li style="font-weight:400;">This simplifies data transformation workflows by eliminating the need for separate ETL processes.</li>
<li style="font-weight:400;">S3 Tables provide the first cloud object store with built-in Apache Iceberg support, and this integration allows conversion of existing <a href="https://parquet.apache.org/docs/overview/">Parquet</a>, <a href="https://docs.python.org/3/library/csv.html">CSV</a>, <a href="https://json.org/">JSON</a>, <a href="https://hudi.apache.org/">Hudi</a>, and <a href="https://delta.io/">Delta Lake</a> formats into fully-managed tables. </li>
<li style="font-weight:400;">Users can leverage Athena’s familiar SQL interface to modernize their data lake architecture.</li>
<li style="font-weight:400;">The feature enables on-the-fly partitioning during table creation, allowing optimization for different query patterns without reprocessing entire datasets. This flexibility is particularly valuable for organizations managing large-scale analytics workloads.</li>
<li style="font-weight:400;">Once created, S3 Tables support INSERT and UPDATE operations through Athena, moving beyond the traditional read-only nature of S3-based analytics. 
<ul>
<li style="font-weight:400;">This positions S3 Tables as a more complete data warehouse alternative for cost-conscious organizations.</li>
</ul>
</li>
<li style="font-weight:400;">Available in all regions where both Athena and S3 Tables are supported, though specific pricing for S3 Tables operations isn’t detailed in the announcement. 
<ul>
<li style="font-weight:400;">Organizations should evaluate the cost implications of S3 Tables’ managed optimization features versus traditional S3 storage.</li>
</ul>
</li>
</ul>
<p>14:28  Ryan – “It’s the partitioning of data in your table on the fly. That’s the part where this is super valuable.” </p>
<p>16:44 <a href="https://aws.amazon.com/blogs/aws/celebrating-10-years-of-amazon-aurora-innovation/">Celebrating 10 years of Amazon Aurora innovation | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/category/database/amazon-aurora/">Aurora</a> celebrates 10 years since GA with a livestream event on August 21, 2025, featuring technical leaders discussing the architectural decision to decouple storage from compute that enabled commercial database performance at one-tenth the cost.</li>
<li style="font-weight:400;">Key milestone announcements include <a href="https://aws.amazon.com/rds/aurora/dsql/">Aurora DSQL</a> (GA May 2025), a serverless distributed SQL database offering 99.99% single-Region and 99.999% multi-Region availability with strong consistency across all endpoints for always-available applications.</li>
<li style="font-weight:400;">Storage capacity <a href="https://aws.amazon.com/about-aws/whats-new/2025/07/amazon-aurora-postgresql-database-clusters-256-tib-storage-volume/">doubled from 128 TiB to 256 TiB</a> with no upfront provisioning and pay-as-you-go pricing, while <a href="https://aws.amazon.com/blogs/aws/new-amazon-aurora-i-o-optimized-cluster-configuration-with-up-to-40-cost-savings-for-i-o-intensive-applications/">Aurora I/O-Optimized</a> provides predictable pricing with up to 40% cost savings for I/O-intensive workloads.</li>
<li style="font-weight:400;">Aurora now integrates with <a href="https://aws.amazon.com/blogs/database/supercharging-vector-search-performance-and-relevance-with-pgvector-0-8-0-on-amazon-aurora-postgresql/">AI services through pgvector</a> for similarity search, zero-ETL to <a href="https://aws.amazon.com/redshift/">Amazon Redshift</a> and <a href="https://aws.amazon.com/sagemaker/lakehouse/">SageMaker</a> for near real-time analytics, and Model Context Protocol (MCP) servers for AI agent integration with data sources.</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/amazon-aurora-postgresql-limitless-database-is-now-generally-available/">Aurora PostgreSQL Limitless Database</a> provides serverless horizontal scaling (sharding) capabilities, while blue/green deployments simplify database updates, and optimized read instances improve query performance for hundreds of thousands of AWS customers.</li>
</ul>
<p>19:21 <a href="https://aws.amazon.com/blogs/aws/aws-named-as-a-leader-in-2025-gartner-magic-quadrant-for-strategic-cloud-platform-services-for-15-years-in-a-row/">AWS named as a Leader in 2025 Gartner Magic Quadrant for Strategic </a><a href="https://aws.amazon.com/blogs/aws/aws-named-as-a-leader-in-2025-gartner-magic-quadrant-for-strategic-cloud-platform-services-for-15-years-in-a-row/">Cloud Platform Services for 15 years in a row | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/">AWS</a> maintains its position as the highest-ranked provider on <a href="https://www.gartner.com/en">Gartner</a>‘s “Ability to Execute” axis for the 15th consecutive year, reinforcing its market leadership in strategic cloud platform services.</li>
<li style="font-weight:400;">Gartner highlights AWS’s custom silicon portfolio (<a href="https://aws.amazon.com/ec2/graviton/">Graviton</a>, <a href="https://aws.amazon.com/ai/machine-learning/inferentia/">Inferentia</a>, <a href="https://aws.amazon.com/ai/machine-learning/trainium/">Trainium</a>) as a key differentiator, enabling better hardware-software integration and improved power efficiency for customer workloads.</li>
<li style="font-weight:400;">The report emphasizes AWS’s extensive global community as a competitive advantage, with millions of active customers and tens of thousands of partners providing knowledge sharing and support through the new <a href="https://builder.aws.com/">AWS Builder Center </a>hub.</li>
<li style="font-weight:400;">AWS Transform emerges as the first agentic AI service specifically designed to accelerate enterprise modernization of legacy workloads, including .NET, mainframe, and VMware migrations.</li>
<li style="font-weight:400;">The recognition underscores AWS’s operational scale advantage, with its market share enabling a more robust partner ecosystem that helps organizations successfully adopt cloud services.</li>
<li style="font-weight:400;">Right below Amazon was Google (yes, it came above Microsoft on ability to execute and completeness of vision), then Oracle in 4th. Alibaba was the only challenger from China, and IBM placed too. Although we’re not sure how. </li>
</ul>
<p>27:45 <a href="https://aws.amazon.com/about-aws/whats-new/2025/08/aws-advanced-go-driver-generally-available/">Amazon Web Services (AWS) Advanced Go Driver is generally available</a></p>
<ul>
<li style="font-weight:400;">AWS releases an open-source Go driver that wraps pgx <a href="https://www.postgresql.org/download/">PostgreSQL</a> and native <a href="https://www.mysql.com/">MySQL</a> drivers to reduce database failover times from minutes to single-digit seconds for <a href="https://aws.amazon.com/rds/">RDS</a> and <a href="https://aws.amazon.com/rds/aurora/">Aurora</a> clusters.</li>
<li style="font-weight:400;">The driver monitors cluster topology and status to identify new writers quickly during failovers, while adding support for <a href="https://aws.amazon.com/blogs/security/aws-federated-authentication-with-active-directory-federation-services-ad-fs/">Federated Authentication</a>, <a href="https://docs.aws.amazon.com/secretsmanager/latest/userguide/intro.html">AWS Secrets Manager</a>, and <a href="https://docs.aws.amazon.com/IAM/latest/UserGuide/introduction.html">IAM authentication</a>.</li>
<li style="font-weight:400;">This addresses a common pain point where standard database drivers can take 30-60 seconds to detect failovers, causing application timeouts and errors during Aurora’s automated failover events.</li>
<li style="font-weight:400;">Available under <a href="https://www.apache.org/licenses/LICENSE-2.0.html">Apache 2.0 license</a> on GitHub, the driver requires no code changes beyond swapping import statements, making it a drop-in replacement for existing Go applications using PostgreSQL or MySQL.</li>
<li style="font-weight:400;">For teams running critical Go applications on Aurora, this could significantly reduce downtime during maintenance windows and unplanned failovers without additional infrastructure costs.</li>
</ul>
<p>27:43 <a href="https://aws.amazon.com/blogs/aws/best-performance-and-fastest-memory-with-the-new-amazon-ec2-r8i-and-r8i-flex-instances/">Best performance and fastest memory with the new Amazon EC2 R8i and </a><a href="https://aws.amazon.com/blogs/aws/best-performance-and-fastest-memory-with-the-new-amazon-ec2-r8i-and-r8i-flex-instances/">R8i-flex instances | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;">AWS launches R8i and R8i-flex instances with custom <a href="https://www.intel.com/content/www/us/en/ark/products/series/240357/intel-xeon-6-processors.html">Intel Xeon 6</a> processors, delivering 20% better performance and 2.5x memory bandwidth compared to R7i instances, specifically targeting memory-intensive workloads like <a href="https://www.sap.com/products/hana.html">SAP HANA</a>, <a href="https://redis.io/">Redis</a>, and real-time analytics.</li>
<li style="font-weight:400;">R8i instances scale up to 96xlarge with 384 vCPUs and 3TB memory (double the previous generation), achieving 142,100 aSAPS certification for SAP workloads – the highest among comparable cloud and on-premises systems.</li>
<li style="font-weight:400;">R8i-flex instances offer 5% better price-performance at 5% lower cost for workloads that don’t need sustained CPU usage, reaching full performance 95% of the time while maintaining the same memory bandwidth improvements.</li>
<li style="font-weight:400;">Both instance types feature sixth-generation <a href="https://aws.amazon.com/ec2/nitro/?trk=7c8639c6-87c6-47d6-9bd0-a5812eecb848&amp;sc_channel=el">AWS Nitro Cards</a> with 2x network and <a href="https://aws.amazon.com/ebs/?trk=7c8639c6-87c6-47d6-9bd0-a5812eecb848&amp;sc_channel=el">EBS bandwidth</a>, plus configurable bandwidth allocation (25% adjustments between network and storage) for optimizing database performance.</li>
<li style="font-weight:400;">Currently available in four regions (US East Virginia/Ohio, US West Oregon, Europe Spain) with specific performance gains: 30% faster for PostgreSQL, 60% faster for NGINX, and 40% faster for AI recommendation models.</li>
</ul>
<p>30:58  Ryan – “I feel like AWS is just trolling us wth instance announcements now. I feel like there’s a new one – and I don’t know the difference. They’re just different words.” </p>
<h2>GCP</h2>
<p>32:20  <a href="https://cloud.google.com/blog/products/networking/multi-subnet-support-for-gke-clusters-increases-scalability/">Multi-subnet support for GKE clusters increases scalability | Google Cloud </a><a href="https://cloud.google.com/blog/products/networking/multi-subnet-support-for-gke-clusters-increases-scalability/">Blog</a></p>
<ul>
<li style="font-weight:400;">GKE clusters can now use multiple subnets instead of being limited to a single subnet’s primary IP range, allowing clusters to scale beyond previous node limits when IP addresses are exhausted. </li>
<li style="font-weight:400;">This addresses a common scaling bottleneck where clusters couldn’t add new nodes once the subnet’s IPs were depleted.</li>
<li style="font-weight:400;">The feature enables on-demand subnet addition to existing clusters without recreation, with GKE automatically selecting subnets for new node pools based on IP availability. This provides more efficient IP address utilization and reduces waste compared to pre-allocating large IP ranges upfront.</li>
<li style="font-weight:400;">Available in preview for GKE version 1.30.3-gke.1211000 or greater, with CLI and API support currently available, while Terraform and UI support are coming soon. This puts GKE on par with EKS, which has supported multiple subnets since launch.</li>
<li style="font-weight:400;">Key benefit for enterprises running large-scale workloads that need to grow beyond initial capacity planning, particularly useful for auto-scaling scenarios where node count can vary significantly. The feature works with existing multi-pod CIDR capabilities for comprehensive IP management.</li>
<li style="font-weight:400;">No additional costs are mentioned for the multi-subnet capability itself, though standard networking charges apply for the additional subnets created in the VPC.</li>
</ul>
<p>30:58  Justin – “I always like when a feature comes out right when I need it.” </p>
<p>34:45 <a href="https://cloud.google.com/blog/products/databases/database-center-expands-coverage/">Database Center expands coverage | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/blog/products/databases/database-center-is-now-generally-available?e=48754805">Database Center</a> now monitors self-managed <a href="https://www.mysql.com/">MySQL</a>, <a href="https://www.postgresql.org/">PostgreSQL</a>, and <a href="https://www.iso.org/standard/76583.html">SQL</a> Server databases on <a href="https://cloud.google.com/compute">Compute Engine VMs</a>, extending beyond just managed Google Cloud databases to provide unified fleet management across your entire database estate.</li>
<li style="font-weight:400;">The service automatically detects common security vulnerabilities in self-managed databases, including outdated versions, missing audit logs, overly permissive IP ranges, missing root passwords, and unencrypted connections – addressing a significant gap for customers running databases on VMs.</li>
<li style="font-weight:400;">New alerting capabilities notify teams when new databases are provisioned or when Database Center detects new issues, while Gemini-powered natural language queries now work at the folder level for better organization-wide database management.</li>
<li style="font-weight:400;">Historical comparison features have expanded from 7 days to 30 days, enabling better capacity planning and trend analysis across database fleets, with Database Center remaining free for Google Cloud customers.</li>
<li style="font-weight:400;">This positions Google competitively against AWS Systems Manager and Azure Arc, which offer similar hybrid database monitoring, though Google’s AI-powered approach and zero-cost model provide notable differentiation for enterprises managing mixed database environments.</li>
</ul>
<p>35:33  Justin – “I’m glad to have this. I’m also glad that it can notify me that someone created a SQL cluster, rather than me being surprised by the bill, so that I do appreciate!” </p>
<p>36:54 <a href="https://cloud.google.com/blog/products/identity-security/introducing-cloud-hsm-as-an-encryption-key-service-for-workspace-cse/">Introducing Cloud HSM as an encryption key service for Workspace CSE | </a><a href="https://cloud.google.com/blog/products/identity-security/introducing-cloud-hsm-as-an-encryption-key-service-for-workspace-cse/">Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google Cloud HSM now integrates with Workspace client-side encryption (CSE) to provide FIPS 140-2 Level 3 compliant hardware security modules for organizations in highly regulated sectors like government, defense, and healthcare that need to maintain complete control over their encryption keys.</li>
<li style="font-weight:400;">The service addresses compliance requirements for ITAR, EAR, FedRAMP High, and DISA IL5 certifications while ensuring customer-managed encryption keys never leave the HSM boundary, giving organizations demonstrable data sovereignty and control over sensitive intellectual property or regulated data.</li>
<li style="font-weight:400;"><a href="https://cloud.google.com/kms/docs/onboard-hsm-workspace">Cloud HSM for Google Workspace</a> offers a 99.95% uptime SLA and can be deployed in minutes with a flat pricing model, currently available in the U.S. with global expansion planned in the coming months.</li>
<li style="font-weight:400;">The architecture uses a two-step encryption process where data encryption keys (DEKs) are wrapped by customer-managed encryption keys (CMEKs) stored in the HSM, with all cryptographic operations performed inside the hardware security module and comprehensive audit logging through Cloud Logging.</li>
<li style="font-weight:400;">This positions Google competitively against <a href="https://docs.aws.amazon.com/cloudhsm/latest/userguide/introduction.html">AWS CloudHSM</a> and <a href="https://learn.microsoft.com/en-us/azure/dedicated-hsm/overview">Azure Dedicated HSM</a> by specifically targeting Workspace users who need hardware-backed key management, though pricing details aren’t disclosed in the announcement.</li>
</ul>
<p>35:33  Justin – “It’s really going to be the CSE side, so it’s actually encrypting on my client. So my Gmail client actually will have a key that is being accessed from this HSM to encrypt the mail at my browser, before it gets sent.” </p>
<p>39:05 <a href="https://cloud.google.com/blog/products/identity-security/security-summit-2025-enabling-defenders-and-securing-ai-innovation/">Security Summit 2025: Enabling defenders and securing AI innovation | </a><a href="https://cloud.google.com/blog/products/identity-security/security-summit-2025-enabling-defenders-and-securing-ai-innovation/">Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google Cloud announces comprehensive AI security capabilities at <a href="https://cloudonair.withgoogle.com/events/summit-amer-security-25">Security Summit 2025</a>, introducing agent-specific protections for <a href="https://cloud.google.com/products/agentspace">Agentspace</a> and <a href="https://cloud.google.com/products/agent-builder">Agent Builder,</a> including automated discovery, real-time threat detection, and <a href="https://cloud.google.com/security-command-center/docs/model-armor-overview">Model Armor</a> integration to prevent prompt injection and data leakage.</li>
<li style="font-weight:400;">The new Alert Investigation agent in <a href="https://cloud.google.com/chronicle/docs/secops/secops-overview">Google Security Operations</a> autonomously enriches security events and builds process trees based on <a href="https://www.mandiant.com/">Mandiant</a> analyst practices, reducing manual effort in SOC operations while providing verdict recommendations for human intervention.</li>
<li style="font-weight:400;">Security Command Center gains three preview features: <a href="https://cloud.google.com/security-command-center/docs/compliance-manager-overview">Compliance Manager</a> for unified policy enforcement, <a href="https://cloud.google.com/security-command-center/docs/dspm-data-security">Data Security Posture Management</a> with native <a href="https://cloud.google.com/bigquery">BigQuery</a> integration, and <a href="https://cloud.google.com/security-command-center/docs/risk-reports-overview">Risk Reports</a> powered by virtual red team technology to identify cloud defense gaps.</li>
<li style="font-weight:400;">Agentic IAM coming later this year will auto-provision agent identities across cloud environments with support for multiple credential types and authorization policies, addressing the growing need for AI-specific identity management as organizations deploy more autonomous agents.</li>
<li style="font-weight:400;">Mandiant Consulting expands services to include <a href="https://cloud.google.com/transform/gen-ai-governance-10-tips-to-level-up-your-ai-program">AI governance frameworks</a>, pre-deployment hardening guidance, and <a href="https://cloud.google.com/transform/3-new-ways-ai-security-sidekick">AI threat modeling</a>, recognizing that organizations need specialized expertise to secure their generative and agentic AI deployments.</li>
</ul>
<p>35:33  Ryan – “A lot of good features; I’ve been waiting for these announcements…I’m really happy to see these, and there’s a whole bunch I didn’t know about that they announced that I’m super excited about.”</p>
<p>42:26 <a href="https://cloud.google.com/blog/topics/developers-practitioners/rightsizing-llm-serving-on-vllm-for-gpus-and-tpus/">Rightsizing LLM Serving on vLLM for GPUs and TPUs | Google Cloud Blog</a></p>
<p>FYI – the link is broken. I tried to find an alternate version, but couldn’t. You’re just going to have to rely on Justin and Ryan’s summary. I apologize in advance. -Heather </p>
<ul>
<li style="font-weight:400;">Google published a comprehensive guide for optimizing LLM serving on vLLM across GPUs and TPUs, providing a systematic approach to selecting the right accelerator based on workload requirements like model size, request rate, and latency constraints.</li>
<li style="font-weight:400;">The guide demonstrates that TPU v6e (Trillium) achieved 35% higher throughput (5.63 req/s vs 4.17 req/s) compared to H100 GPUs when serving Gemma-3-27b, resulting in 25% lower costs ($40.32/hr vs $54/hr) to handle 100 requests per second.</li>
<li style="font-weight:400;">Key technical considerations include calculating minimum VRAM requirements (57GB for Gemma-3-27b), determining tensor parallelism needs, and using the auto_tune.sh script to find optimal gpu_memory_utilization and batch configurations.</li>
<li style="font-weight:400;">The approach addresses a critical gap in LLM deployment where teams often overprovision expensive hardware without systematic benchmarking, potentially saving significant costs for production workloads.</li>
<li style="font-weight:400;">Google’s support for both GPU and TPU options in vLLM provides flexibility for different use cases, with TPUs showing particular strength for models requiring tensor parallelism due to memory constraints.</li>
</ul>
<h2>Azure</h2>
<p>45:38 <a href="https://techcommunity.microsoft.com/blog/azuretoolsblog/announcing-msgraph-provider-public-preview-and-the-microsoft-terraform-vscode-ex/4443614">Announcing MSGraph Provider Public Preview and the Microsoft Terraform </a><a href="https://techcommunity.microsoft.com/blog/azuretoolsblog/announcing-msgraph-provider-public-preview-and-the-microsoft-terraform-vscode-ex/4443614">VSCode Extension | Microsoft Community Hub</a></p>
<ul>
<li style="font-weight:400;">Ryan claims he’s excited about this story, so I stand by my previous prediction that he is angling for an Azure job. </li>
<li style="font-weight:400;">Microsoft launches the <a href="https://techcommunity.microsoft.com/t5/learn.microsoft.com/graph/templates/terraform">Terraform MSGraph provider</a> in public preview, enabling day-zero support for all Microsoft Graph APIs, including <a href="https://github.com/MicrosoftDocs/msgraph-terraform-docs-pr/blob/main/entra/id-governance/privileged-identity-management/pim-apis">Entra ID</a> and <a href="https://github.com/MicrosoftDocs/msgraph-terraform-docs-pr/blob/main/graph/api/resources/sharepoint">M365 services like SharePoint, </a>through standard HCL syntax. </li>
<li style="font-weight:400;">This positions MSGraph as the AzureAD equivalent of what <a href="https://learn.microsoft.com/en-us/azure/developer/terraform/overview-azapi-provider">AzAPI</a> is to <a href="https://learn.microsoft.com/en-us/powershell/azure/azurerm-retirement-overview?view=azps-14.3.0">AzureRM</a> – providing immediate access to new features without waiting for provider updates.</li>
<li style="font-weight:400;">The new <a href="https://learn.microsoft.com/en-us/azure/developer/terraform/configure-vs-code-extension-for-terraform">Microsoft Terraform VSCode extension</a> consolidates AzureRM, AzAPI, and MSGraph support into a single tool, replacing the separate Azure Terraform and AzAPI extensions. Key features include exporting existing Azure resources as Terraform code, intelligent code completion, and automatic conversion of ARM templates to AzAPI format.</li>
<li style="font-weight:400;">This release targets organizations managing Microsoft 365 and Entra ID resources alongside traditional Azure infrastructure, addressing a gap where AWS has separate providers for different services (aws, aws-cc, awscc) while Microsoft now offers unified tooling. The MSGraph provider extends beyond the limited azuread provider to support all beta and v1 Graph endpoints.</li>
<li style="font-weight:400;">The extension includes practical migration features like one-click migration from the old Azure Terraform extension and built-in conversion tools for moving AzureRM resources to AzAPI. </li>
<li style="font-weight:400;">No pricing information was provided, but the tools follow standard Terraform provider models.</li>
<li style="font-weight:400;">For DevOps teams, this enables infrastructure-as-code workflows for previously manual tasks like managing privileged identity management roles, SharePoint site provisioning, and Outlook notification templates – bringing Microsoft 365 administration into the same automation pipelines as cloud infrastructure.</li>
</ul>
<p>46:42  Ryan – “So I understand why you hate this, because you hate all the services that are behind the Graph API, but there’s a single API point if you want to do anything in Teams. It’s the same API point if you want to query Entra ID for membership in a list of groups. It’s a graph API endpoint for anything in the docs or the mail space.. It’s all just the same API. Because it’s a single API that way, the structure can get real weird real fast… so this is kind of neat. I’m hoping it makes things easier.” </p>
<p>48:07 <a href="https://azure.microsoft.com/en-us/blog/agent-factory-the-new-era-of-agentic-ai-common-use-cases-and-design-patterns/">Agent Factory: The new era of agentic AI—common use cases and design </a><a href="https://azure.microsoft.com/en-us/blog/agent-factory-the-new-era-of-agentic-ai-common-use-cases-and-design-patterns/">patterns | Microsoft Azure Blog</a></p>
<ul>
<li style="font-weight:400;">Microsoft introduces <a href="https://azure.microsoft.com/en-us/blog/agent-factory-the-new-era-of-agentic-ai-common-use-cases-and-design-patterns/">Agent Factory</a>, a six-part blog series showcasing five core patterns for building agentic AI that moves beyond simple Q&amp;A to executing complex enterprise workflows through tool use, reflection, planning, multi-agent collaboration, and real-time reasoning (ReAct).</li>
<li style="font-weight:400;"><a href="https://ai.azure.com/">Azure AI Foundry</a> serves as the unified platform for agentic AI development, offering local-to-cloud deployment, 1,400+ enterprise connectors, support for <a href="https://azure.microsoft.com/en-us/products/ai-services/openai-service/?ef_id=_k_959f41de1fe817122cdc2970b35b320b_k_&amp;OCID=AIDcmm5edswduu_SEM__k_959f41de1fe817122cdc2970b35b320b_k_&amp;msclkid=959f41de1fe817122cdc2970b35b320b">Azure OpenAI</a> and 10,000+ open-source models, and built-in security with managed Entra Agent IDs and RBAC controls.</li>
<li style="font-weight:400;">Real-world implementations show significant efficiency gains: Fujitsu reduced proposal creation time by 67%, ContraForce automated 80% of security incident response for under $1 per incident, and JM Family cut QA time by 60% using multi-agent orchestration patterns.</li>
<li style="font-weight:400;">The platform differentiates from competitors by supporting open protocols like Agent-to-Agent (A2A) and Model Context Protocol (MCP) for cross-cloud interoperability, while providing enterprise-grade observability through <a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=T1OTlCpczfhdFPwKtiH5HyqzMMS4naPUcvqHRy8eZZXVlklp9x2f8XQfXtbhchhja-6jmlI5_0MNCXDWkbZljUSXi9-x7n1ShQkXx1Hdjbzpg-97SqSdE8GclY2fUhHv.m-LTtrjiFX_K5fn9TogVdg&amp;eddgt=s408k-Fdy_FAa0Y8fmHMGg%3D%3D&amp;rut=7960457bedc3345eb7bfa2752357d50ca7c80d9e7e923186acc9f487fcd4a2ba&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8DMh71HkGH4bGG6RbjdmUwDVUCUy0BXxUymCOXdmOFIspbPAFno0NzPL5QhScUwNY5iTlZKbC42H06IKPmJ72qzrwe0x__MpJiK2CdTboiLHs_JcEz5FWhN2PIdrGc2TQSDG2vBkDvaIo1nn1BqbbmcWQsDSnk1ALurGfU546WP_cUdNqfI_RqS_rhMgFx3QSJyeKLXbC2amJt4oHA4iBLbQGpIR_gCwHpv4h59cdg2zfFGdiqbTcoPTBqhU3RrPFpXzc7aSbiT4IOQxQ2PX3I31ihHweD6WO3wmG2WdqVu4Fo-7oRlK3jag6zncaLW-lt1n-jIuruxDECRBwSL89YW4ZtXvJSLcaFRJXphs5NAChRYy_fcQ2ZyKj3ZbLAjciRFueQ69v3Gq_1tL3NwIgOBJ6DHtHM73tdxkcKDnAWjBTvMt19O2C4ec2p2k7-Kb7i9aF6NDPpQYtFSpQPrp66fcVri-spzLv7RN11M6HAh19HTrERPW6ECBcdVtZ3oU3VeArEZGxpI8E05SoXh8EJ0VqKYr1EY09tXYNg_p6mMxma0Ow-AY5FQSSdd8HCsXTUyEpYmGNrsboPdliQC-V9jo-KdUcswiHlAfEbBvvWNGe1-_hQzb0zokrIZ-W0UxZ7SinvRKq1rLM3rtwuyHrHGaYmRL97B5ZbNuB57eJwq1tEgFVANiwCsZzhHVzflZFLfkqf93qPNqYGU6DTN4VnvIcPltSi5J2DhMrJrO4vemB2obNFLHO8HMiuYvaZGT53zTKZA%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0MzklMjZjYW1wJTNkMTY4ODA2JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDE5MjElMjZjcml0ZXJpYWlkJTNka3dkLTc5NTc3NzYwMjk2ODQyJTNhbG9jLTE5MCUyNmNhbXBhaWduaWQlM2Q1OTAyMzczNTclMjZsb2NwaHklM2Q3OTc2NCUyNmFkZ3JvdXBpZCUzZDEyNzMyMzYxMTc0NDY2NjglMjZjaWQlM2Q3OTU3NzM2NDA1OTU0MiUyNmtkdiUzZGMlMjZrZXh0JTNkJTI2a3BnJTNkJTI2a3BpZCUzZCUyNnF1ZXJ5U3RyJTNkQXp1cmUlMjUyME1vbml0b3IlMjUyMGludGVncmF0aW9uJTI2dXJsJTNkaHR0cHMlM2ElMmYlMmZhenVyZS5taWNyb3NvZnQuY29tJTJmZW4tdXMlMmZwcm9kdWN0cyUyZm1vbml0b3IlMmYlM2Ztc2Nsa2lkJTNkNWRmNGM4MjdjZDU3MTFjNGRiNThhMDg5NjE5YzQxOGMlMjNvdmVydmlldyUyZiUyNmVmX2lkJTNkX2tfNWRmNGM4MjdjZDU3MTFjNGRiNThhMDg5NjE5YzQxOGNfa18lMjZPQ0lEJTNkQUlEY21tNWVkc3dkdXVfU0VNX19rXzVkZjRjODI3Y2Q1NzExYzRkYjU4YTA4OTYxOWM0MThjX2tf%26rlid%3D5df4c827cd5711c4db58a089619c418c&amp;vqd=4-4165855326189257356800050413243772184&amp;iurl=%7B1%7DIG%3DED766D00488E42BFA8329D06654DF55D%26CID%3D0D34D6C215246AC92ABBC09714C26B87%26ID%3DDevEx%2C5046.1">Azure Monitor integration</a> and automated evaluation tools.</li>
<li style="font-weight:400;">Target customers include enterprises seeking to automate complex multi-step processes across systems, with the platform addressing common challenges like secure data access, agent monitoring, and scaling from single agents to collaborative agent networks without custom scaffolding.</li>
</ul>
<p>49:46 <a href="https://blog.fabric.microsoft.com/en-GB/blog/onelake-costs-simplified-lowering-capacity-utilization-when-accessing-onelake/">OneLake costs simplified: lowering capacity utilization when accessing </a><a href="https://blog.fabric.microsoft.com/en-GB/blog/onelake-costs-simplified-lowering-capacity-utilization-when-accessing-onelake/">OneLake | Microsoft Fabric Blog | Microsoft Fabric</a></p>
<ul>
<li style="font-weight:400;">Microsoft has unified <a href="https://blog.fabric.microsoft.com/en-gb/blog/category/onelake">OneLake</a>’s capacity pricing by reducing proxy transaction rates to match redirect rates, eliminating cost differences based on access method and simplifying capacity planning for <a href="https://blog.fabric.microsoft.com/en-gb/blog/category/fabric-platform">Fabric</a> customers.</li>
<li style="font-weight:400;">OneLake serves as the centralized data storage foundation for all Microsoft Fabric workloads, including lakehouses and warehouses, with storage billed pay-as-you-go per GB, similar to <a href="https://learn.microsoft.com/en-us/azure/storage/blobs/data-lake-storage-introduction">Azure Data Lake Storage</a> and Amazon S3.</li>
<li style="font-weight:400;">The pricing alignment removes architectural complexity for organizations using OneLake with third-party tools like <a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=JeDV3lNUvC0wfER_xIZ12UxpLha4eVP4dkSpEl5CXIKyX1Uejh8C48XZFuJxdfJhscEmN9dzWl1LCWbCe4dFqJyKkAxkUEZax35gjG1s4Fo58V-R6O21By_qQaVA_yUF.YCN4uDiuRQp0yHrCiSlDzw&amp;eddgt=CCsp-SmqGfJyUcSQU_-xnw%3D%3D&amp;rut=301d6f8f73358fe057a61b38f53dc5a2aed1626cde6bc449c2961ffb0b1124c4&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8a2JCwZPb20RR1VNx4u5e-DVUCUyOTSttawhbsG96tiJnWrAGjWNfUuNy_I5F_-rWLEEMLuulbiFSpVjD8LKekh0LK66aGufLkKLbBMvhW9i0R2p4n3RmrXiB0bS3j8-xSXHSmO_CzKo0Pa9HtAH49i69SNGaLmo7zsOVRGPiGxEYdQHz1M5y3jhj76gzw42-3A_OPeVcXtcJ2gxX1yRphfZbOgfCvUaZWoDTrALL-Bsv1bFzB7tIbmm557yuLq7aRsr94N7h62AVr9k2y8k_W4j13znQUY4WHn3mwx3g34PSOFqEmbze2D2dGIFEMGlelc9KZF7itW1l0KpZ8tFDIdwETW7WbUjHIlmrnmifxvpTbM8fiKNtCnoYl_2EJJU4Xra08tSMm1elzFdNPLsnpc63Y5v4VY5XGqF2Ran5ydhvIgPCRW0tCYCeW8AECelg_mEs59Bxu5VvxlKcRnoa43KRcx6MP2-QRlzSQddbdUy7j7kD952R50t2u16zlNzfdKUNKFdvFBZnLLXW8FEdhi3slCGhl36XGmcM-LuKOB8OJQ0OR4hQqxo2oo5Y0RMFHBbHJrHPnbQhD1DMYwHf7k7OgKdPSC85uvvGO17wWz_9BJJ58euC-LQNHZVxCZ1rZY6BtKSW-pDqnafAVq5BxeuLhqzX22DjM_4JJYn-aSj_rpafMbAbAbYVz0Jds4Ma1Yr08DSPwQDMHGwQ-nGMemk6EFjQepRIW7C8N6c_zmke7yp7X209xIIkKPr3L4i-ilP2Cg%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0MzklMjZjYW1wJTNkMTY4ODA3JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDE5MjElMjZjcml0ZXJpYWlkJTNka3dkLTc5NTA5MDQxMTQ1NzAwJTNhbG9jLTE5MCUyNmNhbXBhaWduaWQlM2Q1OTAyMzczNTglMjZsb2NwaHklM2Q3OTY5NSUyNmFkZ3JvdXBpZCUzZDEyNzIxMzY2MDUzMjI1NTUlMjZjaWQlM2Q3OTUwODY0MjM1NjM0MSUyNmtkdiUzZGMlMjZrZXh0JTNkJTI2a3BnJTNkJTI2a3BpZCUzZCUyNnF1ZXJ5U3RyJTNkQXp1cmUlMjUyMERhdGFicmlja3MlMjZ1cmwlM2RodHRwcyUzYSUyZiUyZmF6dXJlLm1pY3Jvc29mdC5jb20lMmZlbi11cyUyZnByaWNpbmclMmZwdXJjaGFzZS1vcHRpb25zJTJmYXp1cmUtYWNjb3VudCUyZnNlYXJjaCUzZmljaWQlM2RkYXRhYnJpY2tzJTI2ZWZfaWQlM2Rfa19kMWZjODI1NWM1NDQxMmVlNDc0OWVhZmI2OGRhNmEzZl9rXyUyNk9DSUQlM2RBSURjbW01ZWRzd2R1dV9TRU1fX2tfZDFmYzgyNTVjNTQ0MTJlZTQ3NDllYWZiNjhkYTZhM2Zfa18lMjZtc2Nsa2lkJTNkZDFmYzgyNTVjNTQ0MTJlZTQ3NDllYWZiNjhkYTZhM2Y%26rlid%3Dd1fc8255c54412ee4749eafb68da6a3f&amp;vqd=4-278224459629524532101763311632567179147&amp;iurl=%7B1%7DIG%3D85C08356690C4ED3BF3180FE15DCF1D0%26CID%3D0947BD9DFABC68493C66ABC8FB5A69F3%26ID%3DDevEx%2C5047.1">Azure Databricks</a> or <a href="https://duckduckgo.com/y.js?ad_domain=snowflake.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=jqNBxhGrq0qXfbC5osBUQJaTuT8_bYzt_u2uvLRgdvw1_K9SmrzFGtdXp3OfnIkMeyWJuDkhEWdX4yPpsy_El01PX9FxnqDerZMyd2tE4x_PTfAL8Wluz9e8kRLRdRie.Q8h4nj2hjTmOu4yvCDn_Rw&amp;eddgt=sHVw6VmCqhD4-pPUtLo9cA%3D%3D&amp;rut=401d284b41de731c59cb920658d696544c1babd307ef7fb62e2c616580f289e7&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8lz3peXQhMK4lT88u30CGeDVUCUxmokAWiZpF3ntBw85fKRLvUvaiVYWR3y3fRsH0UP8YgprArDe0KoOLsqRPhVbqFwUxqaCcZeYq6KS3M4z61sY-WT2snERKXWHf6c5BcQNadpiVGksrhCq6A8D-RNoVCNiJukRi1ALDaMuVNFYS_oYi2w646C0n4hpcxegY6ybH7XnZ5w_bcWJ8w2eJnw69kpnM7bWcxEpXT0lpOS4KmM53frDj0K9hFDxJB-uhWbfQqq3RRoeaCnwkG2bYZmrQYxWRFcAC84Vwvqd16qsz9senNdBE-UP9amWfQ6qgnbQmDlqJ2Pbr07SrGqiKoK-i775GFnWBL5POZjdcJQGknfnIlYMhwyCX63wlio6QDXc1-zHnhW3NTzUizd2f13QjOD6hkQ48boCIcm3z9yAHdHLIrNiaC_dc6KDeFPg7BrMwgHDCkLaL_BzFtC8udEyyQZt77aML02nwi0GpisXnT1IX7782EQoE8HZ4FYK6JK3ARmMo0WqjCvF6oEXbLf4cRUNee9YAdMzMe6XOLh0KguuVGlil1swE321aI3ftrVRW1qUAkg-Y5cDLpLmtnfvgVBjX9gqXJHCMxYgFzPGkqUkS17B3mZwEupu7DFk149dTBFavsE7mr14gaC7dCnbA59KFsNEeQhnHU9_h7VmentJq9sULun8Q__lpIgvjsmmzw1XRf1YHx_coT1Xi38WsOGxjfrnVrGFoDTL98hegXdx3_3oi6P5lsGTfI74hsevxfQ%26u%3DaHR0cHMlM2ElMmYlMmZjbGlja3NlcnZlLmRhcnRzZWFyY2gubmV0JTJmbGluayUyZmNsaWNrJTNmbGlkJTNkNDM3MDAwNjg0NTU2MTY2OTYlMjZkc19zX2t3Z2lkJTNkNTg3MDAwMDc1ODczMzQwNzIlMjZkc19hX2NpZCUzZDY4OTM2NzA4MSUyNmRzX2FfY2FpZCUzZDE1NzIxNTUxNTI1JTI2ZHNfYV9hZ2lkJTNkMTMzMzEzMTM4NTYyJTI2ZHNfYV9saWQlM2Rrd2QtMTM5NjcyMDAwJTI2JTI2ZHNfZV9hZGlkJTNkNzkzMDI0NTQ3MDE1NzglMjZkc19lX3RhcmdldF9pZCUzZGt3ZC03OTMwMjc5NzYxNzE0NSUzYWxvYy00MDg0JTI2JTI2ZHNfZV9uZXR3b3JrJTNkcyUyNmRzX3VybF92JTNkMiUyNmRzX2Rlc3RfdXJsJTNkaHR0cHMlM2ElMmYlMmZzaWdudXAuc25vd2ZsYWtlLmNvbSUyZiUzZnV0bV9zb3VyY2UlM2RiaW5nJTI2dXRtX21lZGl1bSUzZHBhaWRzZWFyY2glMjZ1dG1fY2FtcGFpZ24lM2RuYS11cy1lbi1icmFuZC1jb3JlLWV4YWN0JTI2dXRtX2NvbnRlbnQlM2RiaS1yc2EtZXZnLXNzLWZyZWUtdHJpYWwlMjZ1dG1fdGVybSUzZGMtcy1zbm93Zmxha2UtZSUyNl9idCUzZCUyNl9iayUzZHNub3dmbGFrZSUyNl9ibSUzZGUlMjZfYm4lM2RzJTI2X2JnJTNkMTI2ODgzNzIzMDUxMjc3NiUyNmdjbHNyYyUzZDNwLmRzJTI2JTI2Z2NsaWQlM2QxNTk3ZjZjYTExYzcxZmJjYmJjOTllNjRiYzZjZDJkYSUyNmdjbHNyYyUzZDNwLmRzJTI2JTI2bXNjbGtpZCUzZDE1OTdmNmNhMTFjNzFmYmNiYmM5OWU2NGJjNmNkMmRh%26rlid%3D1597f6ca11c71fbcbbc99e64bc6cd2da&amp;vqd=4-162535873724531290766568670920993226900&amp;iurl=%7B1%7DIG%3DCD02D8CD8F3A400D9E42D1E110724A22%26CID%3D38D4FEE4156367F52AAAE8B114856650%26ID%3DDevEx%2C5046.1">Snowflake</a>, as all access paths now consume Fabric Capacity Units at the same low rate.
<ul>
<li style="font-weight:400;">The term “low” is VERY subjective. </li>
</ul>
</li>
<li style="font-weight:400;">This positions OneLake as a more cost-predictable alternative to managing separate data lakes across cloud providers, particularly for enterprises already invested in the Microsoft ecosystem.</li>
<li style="font-weight:400;">The change reflects Microsoft’s strategy to make OneLake an open, vendor-neutral data platform that can serve as a single source of truth regardless of which analytics tools organizations choose to use.</li>
</ul>
<p>51:12 <a href="https://techcommunity.microsoft.com/blog/linuxandopensourceblog/azure-linux-with-os-guard-immutable-container-host-with-code-integrity-and-open-/4437473">Introducing Azure Linux with OS Guard: Secure, Immutable, and </a><a href="https://techcommunity.microsoft.com/blog/linuxandopensourceblog/azure-linux-with-os-guard-immutable-container-host-with-code-integrity-and-open-/4437473">Open-Source Container Host</a></p>
<ul>
<li style="font-weight:400;"><a href="https://techcommunity.microsoft.com/blog/linuxandopensourceblog/azure-linux-with-os-guard-immutable-container-host-with-code-integrity-and-open-/4437473">Azure Linux with OS Guard</a> is Microsoft’s new hardened container host OS that enforces immutability, code integrity, and mandatory access control – essentially a locked-down version of <a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=duxuP24BVcFO1ZtJJj8Hdfe8EbFMTsbs1to7jBy33uohr89RGyvcUHth5D721DRms5AvIngm4kmJwt8e_lICUwrl_Seln_HElERrDajbMLqMCzlRgaBgTscMa4wY8OKb.PVC-Wqh8X6tCBkQxDEfUAg&amp;eddgt=d3evDjI5XkuSiAN-ufNZWQ%3D%3D&amp;rut=8405187755c58556c634fd448c43cc93622634345f8f852eee1b3bc9c88650d9&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8MzVO_B389rf_BK86Dt9SqzVUCUweIszNfeZkhupnd_HB0yKewr8GtBd8DoVuF9CEOfN4_lGwx-clK-eMojFtIIOPXPMtyBMyT5QS0mjalb1e3ZcvZ-R-NNXmcXAuaSSeD6R2-T51XBfSzsuJ6I0Q9LhBs2adNDtDhKGIFNIUC1IsYGKPkoi-dqghBrtU9AK2PMhhZ27pXbKDlOYRuSA0R50LY-spUvXDlURR29Mg1VnPo4rto79D35qUusrJ65emV978vn6Fv5_z53GfzStz4cjEvweUfc2wMqv836d36gdimgxoYLxi7BMBN0kJsQ6jiFefmYOHWVXGbJBnl0O-BEDfqEaEUf9bCNsDBy4ogI1l7g9on2_nj6gvs_nZhkKLEFTCRfgoQtnI-T1upDfY1XYeVJXxJyrM78--Hcge2DBaGBLORctBCfZ7Po8BOP5TnFoHawo7tMpGE8f5PiPCAYS9PQv6kojJPBr2F8GvJ3xaMjwz2ysIyGZqWTtnLsoeBq1BzoSoYJInWBl57N0xB4N7Ix_etW3wWC_Bga5zKHRXBj43-w1saqDRqKwfmWUO-dC5UtPyzV8kqigImFvsy99ZKainyhuS4jQYdaOM0TJJJ6ETXWzLZ222zY3AFI6dpPqLP344hSyszuPQpb_FvRLMziuVFL30p2aHAGpD9lqyujCePcQ2UMFoPyeEL5TYazsHi7cmfF5hmm25J6W3Pin46GJ9EdsOsyorQo0LvnRsCsKpWYK4bcBJH4J1j7wXYDmU3Q%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0MzklMjZjYW1wJTNkMTY4ODA2JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDE5MjElMjZjcml0ZXJpYWlkJTNka3dkLTc4OTU5Mjg1NDU3NTExJTNhbG9jLTE5MCUyNmNhbXBhaWduaWQlM2Q1OTAyMzczNTclMjZsb2NwaHklM2Q4MDUxMSUyNmFkZ3JvdXBpZCUzZDEyNjMzNDA1MTI3NDQ3MDclMjZjaWQlM2Q3ODk1ODg4NjM2NzQxNyUyNmtkdiUzZGMlMjZrZXh0JTNkJTI2a3BnJTNkJTI2a3BpZCUzZCUyNnF1ZXJ5U3RyJTNkQXp1cmUlMjUyMExpbnV4JTI1MjB3aXRoJTI1MjBPUyUyNTIwR3VhcmQlMjZ1cmwlM2RodHRwcyUzYSUyZiUyZmF6dXJlLm1pY3Jvc29mdC5jb20lMmZlbi11cyUyZnNvbHV0aW9ucyUyZmxpbnV4LW9uLWF6dXJlJTNmZWZfaWQlM2Rfa19mZjNlN2Y2YTJlZTExNTA1N2E3MjE3YzQ4MzA3NmQ0NF9rXyUyNk9DSUQlM2RBSURjbW01ZWRzd2R1dV9TRU1fX2tfZmYzZTdmNmEyZWUxMTUwNTdhNzIxN2M0ODMwNzZkNDRfa18lMjZtc2Nsa2lkJTNkZmYzZTdmNmEyZWUxMTUwNTdhNzIxN2M0ODMwNzZkNDQ%26rlid%3Dff3e7f6a2ee115057a7217c483076d44&amp;vqd=4-5817050452231351324469349784542195083&amp;iurl=%7B1%7DIG%3D40E9BAB83B9D487EB04C4AD89E9E0BA3%26CID%3D2059CD626A5566D30183DB376B3A67A4%26ID%3DDevEx%2C5047.1">Azure Linux</a> designed specifically for high-security container workloads on AKS.</li>
<li style="font-weight:400;">The OS uses IPE (I<a href="https://docs.kernel.org/admin-guide/LSM/ipe.html">ntegrity Policy Enforcement</a>), recently upstreamed in Linux kernel 6.12, to ensure only trusted binaries from dm-verity protected volumes can execute, including container layers – this prevents rootkits, container escapes, and unauthorized code execution.</li>
<li style="font-weight:400;">Built on FedRAMP-certified <a href="https://learn.microsoft.com/en-us/azure/azure-linux/how-to-enable-azure-linux-3">Azure Linux 3.0</a>, it inherits FIPS 140-3 cryptographic modules and will gain post-quantum cryptography support as NIST algorithms become available – positioning it for regulated workloads and future security requirements.</li>
<li style="font-weight:400;">Unlike <a href="https://aws.amazon.com/bottlerocket/">AWS Bottlerocket</a>, which focuses on minimal attack surface, Azure Linux with OS Guard emphasizes code integrity verification throughout the stack – from Secure Boot through user space – while maintaining compatibility with standard container workloads.</li>
<li style="font-weight:400;">Available soon as an AKS OS SKU via preview CLI with feature flag, customers can test the community edition now on Azure VMs – targeting enterprises needing stronger container security without sacrificing the operational benefits of managed Kubernetes.</li>
</ul>
<p>46:42  Ryan – “This is interesting, because according to the blog post, it takes a sort of different approach than what we’ve seen in the past with core OS and Bottlerocket and stuff – where they’re trying to reduce what’s in that limit so much that you can’t have anything that vulnerable that can be exploited in it. And this uses a lot more of the protected VMs, where it uses the sort of encrypted memory objects. And so this is sort of a new take on securing container-wise workloads at the compute level.” </p>
<p>53:45 <a href="https://azure.microsoft.com/en-us/blog/microsoft-is-a-leader-in-the-2025-gartner-magic-quadrant-for-container-management/">Microsoft is a Leader in the 2025 Gartner® Magic Quadrant for Container </a><a href="https://azure.microsoft.com/en-us/blog/microsoft-is-a-leader-in-the-2025-gartner-magic-quadrant-for-container-management/">Management | Microsoft Azure Blog</a></p>
<ul>
<li style="font-weight:400;">Justin is turning into a softy, and so wanted to make it up to Azure for being so low on the last Magic Quadrant, so where we are. </li>
<li style="font-weight:400;">Microsoft has been named a Leader in Gartner’s 2025 Magic Quadrant for Container Management for the third consecutive year, highlighting its comprehensive container portfolio that includes <a href="https://learn.microsoft.com/en-us/azure/aks/what-is-aks">Azure Kubernetes Service</a> (AKS), <a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=lbc3XCjUwIrDWjdGFRFUcmSSrCqzBhNxNgxXZU_TS-lug9RyfgxAbZbIjTssZswyeq6UydoGX0upHIGtoT5i3tm9ogCm1hJZVWUfQLbNRRA9LX3zdVhhmWFeyHpC59Bf.dZuNC6NJUcMb56zrFknjXQ&amp;eddgt=9Syct1Mt09VfjbjTFDPG3g%3D%3D&amp;rut=cf050afda937e1bb4d48705b79366ecf7b1e04c0b5beffe5f400ba15b7216ed6&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De87KYuF0W945yzDSrcwBFLTTVUCUwyjpDaSQS8_NFX5sK5qHG2gY7pD8l8yixw_MAhHnBiVSUWKqgZMdQatC5Okg5H8JLKWKZh7hs3IlJ0EP8U6yk0r3z0j0SxUyI_OZlj8k7qlIiSc5q5bn-u6CcuQVjo7dNHCAM33QooLPyzYzLlRzupyNVU3t5Cg1BHw0FNlvD-Ubv3ZE7Xa3VFnmGGMej-hycE3fK2sb7Y8kj8Y_OxpNetd00h-ZpFO67m5ITJEBun31jeQZ6MM1784i2GLzYEhCyZjySMKD09RpeM4oCEMFQdic2-oPoMZG_XPyXQiDaUepaOvaIw0LzOLp6XD5WOzv9StS0w0D9rIZBLiqNWC4x2VSgz28b6PddYGF_J7GkjqIvAQZJ1C4gtTmqE9eiHa5mN90yua-P6EIRKx340FuO6bK6GGO8yCP1X-6gz3XGHaTe-VXaOCVuU8JsGMf_pIuhHmfH2XfuNmfp4vSkDhbscI0CFqkjk2GOLG9Ff5lll4XsOBcG7_9cEodyGHUoGiXn0w3K47jrigSSAX8K4SMYNdKigA33YUaq_4LV90Dr_NpXUYF1u7sAyzNcenB7zEyCAl0RuguTnWP2HSZQk4K1-PSZksMRqifLDjsuVI5sJ7PYbyCKXGakdc8bNxIAkOTaFzOV0t3P88Xi09_swrkLMWM-7mymWYNJoBpmZMfUZ3RXTGJdL5bJzbsFX_ZOBmx4D7V17JpdtquKqyAUU0iCT0g7orp-zfi78iTNsAC9oHw%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0MzklMjZjYW1wJTNkMTY4ODA3JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDE5MjElMjZjcml0ZXJpYWlkJTNka3dkLTc5NzgzOTE4OTkxOTY4JTNhbG9jLTE5MCUyNmNhbXBhaWduaWQlM2Q1OTAyMzczNTglMjZsb2NwaHklM2Q3OTU1OSUyNmFkZ3JvdXBpZCUzZDEyNzY1MzQ2NTI2ODc5MzUlMjZjaWQlM2Q3OTc4MzUyMDQyNTU5NyUyNmtkdiUzZGMlMjZrZXh0JTNkJTI2a3BnJTNkJTI2a3BpZCUzZCUyNnF1ZXJ5U3RyJTNkQXp1cmUlMjUyMENvbnRhaW5lciUyNTIwQXBwcyUyNnVybCUzZGh0dHBzJTNhJTJmJTJmYXp1cmUubWljcm9zb2Z0LmNvbSUyZmVuLXVzJTJmcHJvZHVjdHMlMmZjb250YWluZXItYXBwcyUyZiUzZmVmX2lkJTNkX2tfOWExMDE4NWQ0ZDUxMTc1YjJkOTI3MGYzZTJhMzI5MDZfa18lMjZPQ0lEJTNkQUlEY21tNWVkc3dkdXVfU0VNX19rXzlhMTAxODVkNGQ1MTE3NWIyZDkyNzBmM2UyYTMyOTA2X2tfJTI2bXNjbGtpZCUzZDlhMTAxODVkNGQ1MTE3NWIyZDkyNzBmM2UyYTMyOTA2%26rlid%3D9a10185d4d51175b2d9270f3e2a32906&amp;vqd=4-34056014431717853417178866729589671463&amp;iurl=%7B1%7DIG%3D0F2F99AF8E5A4B3C9512A0FA6F46E664%26CID%3D39500D318DDE6A8F093A1B648C386B97%26ID%3DDevEx%2C5046.1">Azure Container Apps</a>, and <a href="https://learn.microsoft.com/en-us/azure/azure-arc/overview">Azure Arc for hybrid/multi-cloud deployments</a>.</li>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/aks/intro-aks-automatic">AKS Automatic</a> (preview) aims to simplify Kubernetes adoption by providing production-ready clusters with automated node provisioning, scaling, and CI/CD integration, while Azure Container Apps offers serverless containers with scale-to-zero capabilities and per-second billing for GPU workloads.</li>
<li style="font-weight:400;">The platform integrates AI workload support through GPU-optimized containers in AKS and serverless GPUs in Container Apps, with Microsoft’s KAITO project simplifying open-source AI model deployment on Kubernetes – notably powering ChatGPT’s infrastructure serving 500M weekly users.</li>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/kubernetes-fleet/overview">Azure Kubernetes Fleet Manager</a> addresses enterprise-scale challenges by enabling policy-driven governance across multiple AKS clusters, while node auto-provisioning automatically selects cost-effective VM sizes based on workload demands to optimize spending.</li>
<li style="font-weight:400;">Key differentiators include deep integration with Azure’s ecosystem (networking, databases, AI services), developer tools like <a href="https://learn.microsoft.com/en-us/azure/aks/aks-extension-ghcopilot-plugins">GitHub Copilot for Kubernetes</a> manifest generation, and Azure Arc’s ability to manage on-premises and edge Kubernetes deployments through a single control plane.</li>
</ul>
<h2>Oracle</h2>
<p>54:57  <a href="https://www.oracle.com/news/announcement/oracle-to-offer-google-gemini-models-to-customers-2025-08-14/">Oracle To Offer Google Gemini Models To Customers 2025 08 14</a></p>
<ul>
<li style="font-weight:400;">Oracle is partnering with Google Cloud to bring <a href="https://cloud.google.com/vertex-ai/generative-ai/docs/models/gemini/1-5-pro">Gemini 1.5 Pro</a> and <a href="https://cloud.google.com/vertex-ai/generative-ai/docs/models/gemini/1-5-flash">Gemini 1.5 Flash</a> models to <a href="https://www.oracle.com/artificial-intelligence/generative-ai/generative-ai-service/">Oracle Cloud Infrastructure (OCI) Generative AI service</a>, marking Oracle’s first major third-party LLM partnership beyond Cohere.</li>
<li style="font-weight:400;">This positions Oracle as a multi-model cloud provider similar to AWS Bedrock and Azure OpenAI Service, though arriving later to market with a more limited selection compared to competitors’ broader model portfolios.</li>
<li style="font-weight:400;">The integration targets Oracle’s existing enterprise customers who want to use Google’s models while keeping data within OCI’s security boundaries, particularly appealing to regulated industries already invested in Oracle’s ecosystem.</li>
<li style="font-weight:400;">Gemini models will be available through OCI’s standard APIs with Oracle’s built-in security features, though pricing details remain unannounced, which makes cost comparison with direct Google Cloud access impossible.</li>
<li style="font-weight:400;">The real test will be whether Oracle can attract new AI workloads or simply provide convenience for existing Oracle shops that would have used Google Cloud directly anyway.</li>
</ul>
<p>56:01  Ryan – “What a weird thing.” </p>
<h2>Other Clouds </h2>
<p>56:42 <a href="https://siliconangle.com/2025/08/05/digitalocean-stock-jumps-nearly-29-second-quarter-earnings-revenue-top-expectations/">DigitalOcean stock jumps nearly 29% as earnings and revenue top</a> <a href="https://siliconangle.com/2025/08/05/digitalocean-stock-jumps-nearly-29-second-quarter-earnings-revenue-top-expectations/">expectations – SiliconANGLE</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.digitalocean.com/login">DigitalOcean</a> reported Q2 earnings of 59 cents per share on $219M revenue (14% YoY growth), beating analyst expectations and driving a 29% stock surge. The company’s focus on higher-spending “Scalers+” customers (spending $500+ monthly) showed 35% YoY growth and now represents nearly 25% of total revenue.</li>
<li style="font-weight:400;">The company launched <a href="https://docs.digitalocean.com/products/gradient-ai-platform/">Gradient AI Platform</a>, providing managed access to GPU infrastructure and foundation models from <a href="https://www.anthropic.com/">Anthropic</a>, <a href="https://www.meta.com/about/">Meta</a>, <a href="https://mistral.ai/%5C">Mistral AI</a>, and <a href="https://openai.com/">OpenAI</a>. AI-related revenue more than doubled year-over-year, indicating strong developer adoption for building AI applications.</li>
<li style="font-weight:400;">DigitalOcean partnered with AMD to expand GPU capabilities through <a href="https://duckduckgo.com/y.js?ad_domain=digitalocean.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=ZZh7u6183N4_KbX17YP3zB01Dn_pKGuinJqePq8M-JJ1VlqvIHCaVZWfkbykT6zBy-xx0aOLeUVrU6vAphYcqlOMqoni6w-pPdYO02BmBgAreks6Eewyubpd8mryQl-w.QzWdxg4SMPYrjlh4-dQXrw&amp;eddgt=-9KWAYBOtnyqzYXTbya7aA%3D%3D&amp;rut=9d05e2ff3617576de6c5c959069fc593752fc5114fda7b58bc8deed3367bc6a0&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8h8I4ZdPDw4nCGMNAFOaWTzVUCUxQ6RJoMEQZiaYaDt9K7qei41wjTwAWRdicaqgnTkDHH90dLaylHJ_kYElPkM3bLwVCjGhAlid0w52G7YzW1N49xTkFhAphFFNa4Itr7fJ7MOSJPVw5HZthafPe1EHib_IFlTXFFaqUisayp2m-W9Z9zMoGjXX448fTA-VlcZCoe7M6Upy0tuhi1w1PH66CSsihShbse0Pih5VVO_1Sx9KNh-6V8-RCHLsM9Amd89gY7TQOCQUlKJKqGMT7AJoCBdY2rzlGDSHGtVR2RSDztK1xjBA4V86cdjrFLdZMT4h5FYdz9-gXALx-MvrairAmCB6cqdwHGYFuF5ryAf2xPhE_nZch2FtSGuwfFBuMIx_8ZYmvSppgT5A2eAgL9wZSW4X-1-GPFfaxeSoe4jygEd5LAn9oBq-Sn_d26k-R_cKcUz0-jlBN4h4QtuKsi9bPa1BDWLyOKD8WpEE6Uv1siJP7LzvCmM4Dn_RiUt4K7jaTH4mNN6DaYOShiGOu7cOeDA0f6k4JDebSmIKQURycdhSF_QJ1Y1u-5oCzJzigvw4OmwcLVfeUoeNfMjC8tzKbSGbXmZpsi-Mo017dGdQ1kZxWhoxo24j6_giJJJBePJmHJ7ON-XUBkDmWpcImu3fBZGXsE8-leh-45kZ5OLtz_GMURNOUAhdYwXB36Z4w_ATQQaxL1VNK5lDCr9SCrtRljA12A0fHiqU6r62vR1kDTHfURhyuMhxYgRf0Ivb4_mMwfQ%26u%3DaHR0cHMlM2ElMmYlMmZ3d3cuZGlnaXRhbG9jZWFuLmNvbSUyZnByb2R1Y3RzJTJmZ3JhZGllbnQlMmZncHUtZHJvcGxldHMlM2Z1dG1fYWRncm91cCUzZHdvcmtzdGF0aW9uJTI2dXRtX2NyZWF0aXZlJTNkJTI2dXRtX2xvY2F0aW9uJTNkNzk3NjIlMjZ1dG1fbWF0Y2h0eXBlJTNkcCUyNnV0bV9kZXZpY2UlM2RjJTI2bXNjbGtpZCUzZDlkNjEzMWE2MWNhMzE4ZmYxMDdjNTkwOGFlMzBhYzliJTI2dXRtX3NvdXJjZSUzZGJpbmclMjZ1dG1fbWVkaXVtJTNkY3BjJTI2dXRtX2NhbXBhaWduJTNkc2VhcmNoX25iX2dwdV9nZW5lcmljX3VzYV9lbiUyNnV0bV90ZXJtJTNkR1BVJTI1MjBtYWNoaW5lJTI2dXRtX2NvbnRlbnQlM2RXb3Jrc3RhdGlvbg%26rlid%3D9d6131a61ca318ff107c5908ae30ac9b&amp;vqd=4-143083494194502736017203257459528922306&amp;iurl=%7B1%7DIG%3DF5067E1B155C4C6D91345F9282739035%26CID%3D10F74169FBEC6EFC0E77573CFA0A6F0F%26ID%3DDevEx%2C5046.1">GPU Droplets</a> and the <a href="https://www.amd.com/en/developer/resources/cloud-access/amd-developer-cloud.html">AMD Developer Cloud</a>. </li>
<li style="font-weight:400;">This positions them to compete more effectively in the AI infrastructure market against larger cloud providers.</li>
<li style="font-weight:400;">The company achieved its highest incremental ARR since Q4 2022 and maintained a 109% net dollar retention rate for Scalers+ customers. </li>
<li style="font-weight:400;">Full-year guidance of $888-892M revenue exceeded analyst expectations of $880.81M.</li>
<li style="font-weight:400;">With over 60 new product features shipped across compute, storage, and networking categories, DigitalOcean continues to expand beyond its traditional developer-focused offerings. </li>
<li style="font-weight:400;">The strong financial performance suggests their strategy of targeting both core cloud and AI workloads is resonating with customers.</li>
</ul>
<p>58:14 <a href="https://www.databricks.com/blog/introducing-sql-stored-procedures-databricks">Introducing SQL Stored Procedures in Databricks | Databricks Blog</a></p>
<ul>
<li style="font-weight:400;">Hold on to your butts… Databricks has entered Jurassic Park territory. Insert an Ian Malcolm meme here. </li>
<li style="font-weight:400;">Databricks introduces SQL Stored Procedures following ANSI/PSM standards, enabling users to encapsulate repetitive SQL logic for data cleaning, ETL workflows, and business rule updates while maintaining Unity Catalog governance. </li>
<li style="font-weight:400;">This addresses a key gap for enterprises migrating from traditional data warehouses that rely heavily on stored procedures.</li>
<li style="font-weight:400;">The feature supports parameter types (IN, OUT, INOUT), nested/recursive calls, and integrates with SQL Scripting capabilities, including control flow, variables, and dynamic SQL execution. Unlike functions that return values, procedures execute sequences of statements, making them ideal for complex workflows.</li>
<li style="font-weight:400;">Early adopters like ClicTechnologies report improved performance, scalability, and reduced deployment time for critical workloads like customer segmentation. The ability to migrate existing procedures without rewriting code significantly simplifies transitions from legacy systems.</li>
<li style="font-weight:400;">Key limitations heading toward GA include a lack of support for cursors, exception handling, and table-valued parameters, with temporary tables and multi-statement transactions currently in private preview. These gaps may impact complex enterprise workload migrations.</li>
<li style="font-weight:400;">This positions Databricks to better compete with traditional enterprise data warehouses by offering familiar SQL constructs while maintaining lakehouse advantages. The commitment to contribute this to Apache Spark ensures broader ecosystem adoption beyond Databricks.</li>
</ul>
<p>59:28 Ryan – “Database people are gonna do data things.” </p>
<h2>Cloud Journey</h2>
<p>1:00:42 <a href="https://cloud.google.com/blog/products/application-modernization/a-guide-to-platform-engineering/">A guide to platform engineering | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google introduces “shift down” strategy for platform engineering, moving responsibilities from developers into the underlying platform infrastructure rather than the traditional DevOps “shift left” approach that pushes work earlier in development cycles.</li>
<li style="font-weight:400;">The approach categorizes development ecosystems into types (0-4) based on how much control and quality assurance the platform provides – from flexible “YOLO” (yes, it really is called that, and yes, Ryan is now contractually obligated to get a tattoo of it) environments to highly controlled “Assured” systems where the platform handles security and reliability.</li>
<li style="font-weight:400;">Key technical implementation relies on proper abstractions and coupling design to embed quality attributes like security and performance directly into the platform, reducing operational burden on individual developers.</li>
<li style="font-weight:400;">Organizations should work backwards from their business model to determine the right platform type, balancing developer flexibility against risk tolerance and quality requirements for different applications.</li>
<li style="font-weight:400;">This represents a shift in thinking about platform engineering – instead of one-size-fits-all approaches, Google advocates for intentionally choosing different platform types based on specific business needs and acceptable risk levels.</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2127639/c1e-k5d5sgq950axrvpx-mkjpgjm3fnrj-jnomvs.mp3" length="94174212"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[ Welcome to episode 318 of The Cloud Pod, where the forecast is always cloudy! We’re going on an adventure! Justin and Ryan have formed a fellowship of the cloud, and they’re bringing you all the latest and greatest news from Valinor to Helm’s Deep, and Azure to AWS to GCP. We’ve water issues, some Magic Quadrants, and Aurora updates…but sadly no potatoes. Let’s get into it! 
Titles we almost went with this week:

You’ve Got No Mail: AOL Finally Hangs  Up on Dial-Up
Ctrl+Alt+Delete Climate Change
H2-Oh No: Your Gmail is Thirsty
The Price is Vibe: Kiro’s New    Request-Based Model
Spec-tacular Pricing: Kiro Leaves the Waitlist Behind
SHA-zam! GitHub Actions Gets Its Security Cape
Breaking Bad Actions: GitHub’s Supply Chain Intervention
Graph Your Way to Infrastructure Happiness
The Tables Have Turned: S3 Gets Its Iceberg Moment
Subnet Where It Hurts: GKE Finally Gets IP Address Relief
All Your Database Are Belong to Database Center
From Droplets to Dollars: DigitalOcean’s AI Pivot Pays Off
DigitalOcean Rides the AI Wave to Record Earnings
Agent Smith Would Be Proud: Microsoft’s Multi-Agent Matrix
Aurora Borealis: A Decade of Database Enlightenment
Fifteen Shades of Cloud: AWS’s Unbroken Streak
The Fast and the Failover-ious: Aurora Edition
Gone in Single-Digit Seconds: AWS’s Speedy Database Recovery
Agent 007: License to Secure Your AI

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our Slack channel for more info.
General News 
01:02 AOL is finally shutting down its dial-up internet service | AP News

AOL is discontinuing its dial-up internet service on September 30, 2024, marking the end of a technology that introduced millions to the internet in the 1990s and early 2000s.
Census data shows 163,401 US households still used dial-up in 2023, representing 0.13% of homes with internet subscriptions, highlighting the persistence of legacy infrastructure in underserved areas – which is honestly crazy. 
Here’s hoping that these folks are able to switch to alternatives, like Starlink.
This shutdown reflects broader technology lifecycle patterns as companies retire legacy services like Skype, Internet Explorer, and AOL Instant Messenger to focus resources on modern platforms.
The transition away from dial-up demonstrates the evolution from telephone-based connectivity to broadband and wireless technologies that now dominate internet access.
AOL’s journey from a $164 billion valuation in 2000 to being sold by Verizon in 2021 illustrates the rapid shifts in technology markets and the challenges of adapting legacy business models.

02:30 British government asks people to delete old emails to reduce data centres’ ]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2127639/c1a-k5d5-jp3zqk48uqkz-xcmxoh.jpg"></itunes:image>
                                                                            <itunes:duration>01:05:22</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2127639/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[317: I Got 99 Problems, But a Hallucination Ain’t One]]>
                </title>
                <pubDate>Sat, 23 Aug 2025 20:35:45 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2119081</guid>
                                    <link>https://tcpfm.castos.com/episodes/317-i-got-99-problems-but-a-hallucination-aint-one</link>
                                <description>
                                            <![CDATA[<h3>Welcome to episode 317 of The Cloud Pod, where the forecast is always cloudy! Justin, Matt, and an out-of-breath (from outrunning bears) Ryan are back in the studio to bring you another episode of everyone’s favorite cloud and AI news wrap-up. This week we’ve got GTP-5, Oracle’s newly minted AI conference, hallucinations (not the good kind), and even a Cloud Journey follow-up. Let’s get into it! </h3>
<h3>Titles we almost went with this week:
</h3>
<ul>
<li>Oracle Intelligence: Mission Las Vegas</li>
<li>AI World: Oracle’s Excellent Adventure</li>
<li>AI Gets a Reality Check: Amazon’s New Math Teacher for Hallucinating Models</li>
<li>Jules Verne’s 20,000 Lines Under the C</li>
<li>GPT-5: The Empire Strikes Back at Computing Costs</li>
<li>5⃣Five Alive: OpenAI’s Latest Language Model Drops</li>
<li>GPT-5 is Alive! (And Ready for Your API Calls)</li>
<li>From Kanban to Kan’t-Ban: Alienate Your User Base in One Update</li>
<li>No More Console Hopping: ECS Logs Stay Put</li>
<li>Following the Paper Trail: ECS Logs Go Live</li>
<li>The Pull Request Whisperer</li>
<li>Five’s Company: DigitalOcean Joins the GPT Party</li>
<li>WireGuard Your Kubernetes: The Mesh-iah Has Arrived</li>
<li>EKS-tending Your Reach: When Your Nodes Need a VPN Alternative</li>
<li>Buttercup Blooms: DARPA’s Prize-Winning AI Security Tool Goes Public</li>
<li>From DARPA to Docker: How Buttercup Brings AI Bug-Hunting to Your Laptop</li>
<li>Agent 007: License to Query</li>
<li>Compliance Manager: Because Nobody Dreams of Filling Out Federal Paperwork</li>
<li>Do Compliance Managers dream of Public Sector sheep?</li>
<li>Blob’s Your Uncle: Finding Lost Data in the Cloud</li>
<li>Wassette: Teaching Your AI Assistant to Go Shopping for Tools</li>
<li>Monitor, Monitor on the Wall, Who’s the Most Secure of All?</li>
<li>Better Late Than IPv-Never</li>
<li>VPC Logs: Now with 100% Less Manual Labor</li>
<li>CloudWatch Catches All the Flows in Your Organization</li>
<li>The Organization-Wide Net: No VPC Left Behind</li>
<li>SQS Goes Super Size: Would You Like to Quadruple That?</li>
<li>One MiB to Rule Them All: SQS’s Payload Growth Spurt</li>
<li>Microsoft Finally Merges with Its $7.5 Billion Side Piece</li>
<li>From Hub to Spoke: GitHub Loses Its Independence</li>
<li>Cloud Run Forest Run: Google’s AI Workshop Marathon</li>
<li>From Zero to AI Hero: Google’s Production Pipeline Workshop</li>
<li>The Fast and the Serverless: Cloud Run Drift
</li>
</ul>
<p>A big thanks to this week’s sponsor:</p>
<p>We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our Slack channel for more info.</p>
<h2>General News </h2>
<p>01:17 <a href="https://arstechnica.com/gadgets/2025/08/github-will-be-folded-into-microsoft-proper-as-ceo-steps-down/">GitHub will be folded into Microsoft proper as CEO steps down – Ars </a><a href="https://arstechnica.com/gadgets/2025/08/github-will-be-folded-into-microsoft-proper-as-ceo-steps-down/">Technica</a></p>
<ul>
<li style="font-weight:400;"><a href="https://github.com/">GitHub</a> will lose its operational independence and be integrated into <a href="https://blogs.microsoft.com/blog/2025/01/13/introducing-core-ai-platform-and-tools/">Microsoft’s CoreAI</a> organization in 2025, ending its separate CEO structure that has existed since Microsoft’s $7.5 billion acquisition in 2018.</li>
<li style="font-weight:400;">The reorganization eliminates the CEO position, with GitHub’s leadership team reporting to multiple executives within CoreAI rather than a single leader, potentially impacting decision-making speed and product direction.</li>
<li style="font-weight:400;">This structural change could affect GitHub’s developer-focused culture and remote-first operations that have distinguished it from Microsoft’s traditional corporate structure.</li>
<li style="font-weight:400;">The integration into CoreAI suggests Micr...</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 317 of The Cloud Pod, where the forecast is always cloudy! Justin, Matt, and an out-of-breath (from outrunning bears) Ryan are back in the studio to bring you another episode of everyone’s favorite cloud and AI news wrap-up. This week we’ve got GTP-5, Oracle’s newly minted AI conference, hallucinations (not the good kind), and even a Cloud Journey follow-up. Let’s get into it! 
Titles we almost went with this week:


Oracle Intelligence: Mission Las Vegas
AI World: Oracle’s Excellent Adventure
AI Gets a Reality Check: Amazon’s New Math Teacher for Hallucinating Models
Jules Verne’s 20,000 Lines Under the C
GPT-5: The Empire Strikes Back at Computing Costs
5⃣Five Alive: OpenAI’s Latest Language Model Drops
GPT-5 is Alive! (And Ready for Your API Calls)
From Kanban to Kan’t-Ban: Alienate Your User Base in One Update
No More Console Hopping: ECS Logs Stay Put
Following the Paper Trail: ECS Logs Go Live
The Pull Request Whisperer
Five’s Company: DigitalOcean Joins the GPT Party
WireGuard Your Kubernetes: The Mesh-iah Has Arrived
EKS-tending Your Reach: When Your Nodes Need a VPN Alternative
Buttercup Blooms: DARPA’s Prize-Winning AI Security Tool Goes Public
From DARPA to Docker: How Buttercup Brings AI Bug-Hunting to Your Laptop
Agent 007: License to Query
Compliance Manager: Because Nobody Dreams of Filling Out Federal Paperwork
Do Compliance Managers dream of Public Sector sheep?
Blob’s Your Uncle: Finding Lost Data in the Cloud
Wassette: Teaching Your AI Assistant to Go Shopping for Tools
Monitor, Monitor on the Wall, Who’s the Most Secure of All?
Better Late Than IPv-Never
VPC Logs: Now with 100% Less Manual Labor
CloudWatch Catches All the Flows in Your Organization
The Organization-Wide Net: No VPC Left Behind
SQS Goes Super Size: Would You Like to Quadruple That?
One MiB to Rule Them All: SQS’s Payload Growth Spurt
Microsoft Finally Merges with Its $7.5 Billion Side Piece
From Hub to Spoke: GitHub Loses Its Independence
Cloud Run Forest Run: Google’s AI Workshop Marathon
From Zero to AI Hero: Google’s Production Pipeline Workshop
The Fast and the Serverless: Cloud Run Drift


A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our Slack channel for more info.
General News 
01:17 GitHub will be folded into Microsoft proper as CEO steps down – Ars Technica

GitHub will lose its operational independence and be integrated into Microsoft’s CoreAI organization in 2025, ending its separate CEO structure that has existed since Microsoft’s $7.5 billion acquisition in 2018.
The reorganization eliminates the CEO position, with GitHub’s leadership team reporting to multiple executives within CoreAI rather than a single leader, potentially impacting decision-making speed and product direction.
This structural change could affect GitHub’s developer-focused culture and remote-first operations that have distinguished it from Microsoft’s traditional corporate structure.
The integration into CoreAI suggests Micr...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[317: I Got 99 Problems, But a Hallucination Ain’t One]]>
                </itunes:title>
                                    <itunes:episode>317</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<h3>Welcome to episode 317 of The Cloud Pod, where the forecast is always cloudy! Justin, Matt, and an out-of-breath (from outrunning bears) Ryan are back in the studio to bring you another episode of everyone’s favorite cloud and AI news wrap-up. This week we’ve got GTP-5, Oracle’s newly minted AI conference, hallucinations (not the good kind), and even a Cloud Journey follow-up. Let’s get into it! </h3>
<h3>Titles we almost went with this week:
</h3>
<ul>
<li>Oracle Intelligence: Mission Las Vegas</li>
<li>AI World: Oracle’s Excellent Adventure</li>
<li>AI Gets a Reality Check: Amazon’s New Math Teacher for Hallucinating Models</li>
<li>Jules Verne’s 20,000 Lines Under the C</li>
<li>GPT-5: The Empire Strikes Back at Computing Costs</li>
<li>5⃣Five Alive: OpenAI’s Latest Language Model Drops</li>
<li>GPT-5 is Alive! (And Ready for Your API Calls)</li>
<li>From Kanban to Kan’t-Ban: Alienate Your User Base in One Update</li>
<li>No More Console Hopping: ECS Logs Stay Put</li>
<li>Following the Paper Trail: ECS Logs Go Live</li>
<li>The Pull Request Whisperer</li>
<li>Five’s Company: DigitalOcean Joins the GPT Party</li>
<li>WireGuard Your Kubernetes: The Mesh-iah Has Arrived</li>
<li>EKS-tending Your Reach: When Your Nodes Need a VPN Alternative</li>
<li>Buttercup Blooms: DARPA’s Prize-Winning AI Security Tool Goes Public</li>
<li>From DARPA to Docker: How Buttercup Brings AI Bug-Hunting to Your Laptop</li>
<li>Agent 007: License to Query</li>
<li>Compliance Manager: Because Nobody Dreams of Filling Out Federal Paperwork</li>
<li>Do Compliance Managers dream of Public Sector sheep?</li>
<li>Blob’s Your Uncle: Finding Lost Data in the Cloud</li>
<li>Wassette: Teaching Your AI Assistant to Go Shopping for Tools</li>
<li>Monitor, Monitor on the Wall, Who’s the Most Secure of All?</li>
<li>Better Late Than IPv-Never</li>
<li>VPC Logs: Now with 100% Less Manual Labor</li>
<li>CloudWatch Catches All the Flows in Your Organization</li>
<li>The Organization-Wide Net: No VPC Left Behind</li>
<li>SQS Goes Super Size: Would You Like to Quadruple That?</li>
<li>One MiB to Rule Them All: SQS’s Payload Growth Spurt</li>
<li>Microsoft Finally Merges with Its $7.5 Billion Side Piece</li>
<li>From Hub to Spoke: GitHub Loses Its Independence</li>
<li>Cloud Run Forest Run: Google’s AI Workshop Marathon</li>
<li>From Zero to AI Hero: Google’s Production Pipeline Workshop</li>
<li>The Fast and the Serverless: Cloud Run Drift
</li>
</ul>
<p>A big thanks to this week’s sponsor:</p>
<p>We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our Slack channel for more info.</p>
<h2>General News </h2>
<p>01:17 <a href="https://arstechnica.com/gadgets/2025/08/github-will-be-folded-into-microsoft-proper-as-ceo-steps-down/">GitHub will be folded into Microsoft proper as CEO steps down – Ars </a><a href="https://arstechnica.com/gadgets/2025/08/github-will-be-folded-into-microsoft-proper-as-ceo-steps-down/">Technica</a></p>
<ul>
<li style="font-weight:400;"><a href="https://github.com/">GitHub</a> will lose its operational independence and be integrated into <a href="https://blogs.microsoft.com/blog/2025/01/13/introducing-core-ai-platform-and-tools/">Microsoft’s CoreAI</a> organization in 2025, ending its separate CEO structure that has existed since Microsoft’s $7.5 billion acquisition in 2018.</li>
<li style="font-weight:400;">The reorganization eliminates the CEO position, with GitHub’s leadership team reporting to multiple executives within CoreAI rather than a single leader, potentially impacting decision-making speed and product direction.</li>
<li style="font-weight:400;">This structural change could affect GitHub’s developer-focused culture and remote-first operations that have distinguished it from Microsoft’s traditional corporate structure.</li>
<li style="font-weight:400;">The integration into CoreAI suggests Microsoft plans to more tightly couple GitHub with its AI initiatives, potentially accelerating AI-powered development features but raising concerns about platform neutrality.</li>
<li style="font-weight:400;">Developers and enterprises should monitor how this affects GitHub’s roadmap, pricing, and commitment to open source projects, as tighter Microsoft integration historically has led to significant platform changes.</li>
</ul>
<p>03:01  Matt – “God knows how long a decision is going to take to get made.”  </p>
<h2>AI Is Going Great – or How ML Makes Its Money </h2>
<p>05:10 <a href="https://blog.google/technology/google-labs/jules-now-available/">Jules, Google’s asynchronous AI coding agent, is out of public beta</a></p>
<ul>
<li style="font-weight:400;">If you’ve forgotten about it, Jules is the worst-marketed Google AI coding agent tool. </li>
<li style="font-weight:400;"><a href="https://blog.google/technology/google-labs/jules/">Jules</a> is Google’s AI coding agent that operates asynchronously to handle development tasks.</li>
<li style="font-weight:400;">It’s now publicly available, after processing 140,000+ code improvements during beta testing with thousands of developers.</li>
<li style="font-weight:400;">The service runs on <a href="https://cloud.google.com/vertex-ai/generative-ai/docs/models/gemini/2-5-pro">Gemini 2.5 Pro’s</a> advanced reasoning capabilities to create coding plans and generate higher-quality code outputs, with new features including GitHub issues integration and multimodal support.</li>
<li style="font-weight:400;">Google introduced three pricing tiers: free introductory access (which you will blow through almost immediately), <a href="https://gemini.google/subscriptions/">Google AI Pro</a> with 5x higher limits for daily coding, and Google AI Ultra with 20x limits for intensive multi-agent workflows at scale.</li>
<li style="font-weight:400;">Is it just us, or is this the same pricing structure as Claude? </li>
<li style="font-weight:400;">This represents a shift toward autonomous coding assistants that can work independently on tasks while developers focus on other work, potentially changing how cloud-based development teams operate.</li>
<li style="font-weight:400;">The asynchronous nature allows Jules to handle time-consuming tasks like bug fixes and code improvements without requiring constant developer oversight, which could significantly impact productivity for cloud development projects.</li>
</ul>
<p>06:30  Ryan – “I think it’s a perfect example of like where GitHub might go, right? Because this already integrates with GitHub, so you can communicate with the AI in issues or point at certain issues, or use it in comments. And it’s synchronous, so it’s just running in the background. It’s not a chat or an interactive agent conversation. You’re sort of like giving it directions and sending it off.”</p>
<p>08:11 <a href="https://openai.com/index/introducing-gpt-5/">Introducing GPT-5</a></p>
<ul>
<li style="font-weight:400;">Were you waiting for the drumroll? Well, no sound effects this week. Sad face. </li>
<li style="font-weight:400;"><a href="https://openai.com/gpt-5/">GPT-5</a> introduces a larger model architecture with refined attention mechanisms and multimodal input processing, requiring substantial cloud compute resources for deployment and inference at scale.</li>
<li style="font-weight:400;">Enhanced contextual comprehension and faster processing speeds enable more efficient API calls and reduced latency for cloud-based AI services, potentially lowering operational costs for businesses.</li>
<li style="font-weight:400;">Technical improvements in training efficiency could reduce the computational overhead for fine-tuning models on cloud platforms, making custom AI deployments more accessible to smaller organizations.</li>
<li style="font-weight:400;">Healthcare, education, and creative industries can leverage GPT-5 through cloud APIs for applications like medical documentation, personalized learning systems, and content generation workflows.</li>
<li style="font-weight:400;"><a href="https://openai.com/">OpenAI’s</a> safety measures and ethical deployment guidelines will likely influence cloud provider policies for hosting and serving large language models, affecting compliance requirements for enterprise users.</li>
<li style="font-weight:400;">AGI is here, guys! Well, not really. Maybe. Sort of. Getting close? Ryan is excited about it, anyway. </li>
</ul>
<p>09:38 <a href="https://openai.com/index/introducing-gpt-5-for-developers/">Introducing GPT-5 for Developers</a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/gpt-5/">GPT-5</a> represents the next iteration of <a href="https://openai.com/">OpenAI’s</a> language model series, likely offering improved language understanding and generation capabilities that developers can integrate via API endpoints into cloud-based applications.</li>
<li style="font-weight:400;">The model would provide enhanced performance benchmarks compared to <a href="https://openai.com/index/gpt-4/">GPT-4</a>, potentially including better context handling, reduced hallucinations, and more accurate responses for enterprise cloud deployments.</li>
<li style="font-weight:400;">Developer integration features may include new API capabilities, updated SDKs, and code examples for implementing GPT-5 across various cloud platforms and programming languages.</li>
<li style="font-weight:400;">Pricing and rate limits will be critical factors for businesses evaluating GPT-5 adoption, particularly for high-volume cloud applications requiring scalable AI inference.</li>
<li style="font-weight:400;">The release could impact cloud computing costs and architecture decisions as organizations determine whether to use OpenAI’s hosted service or explore self-hosting options on their cloud infrastructure.</li>
</ul>
<p>11:09  Ryan – “I’m kind of afraid of AGI, and I’m putting my head in the sand about it right now.” </p>
<p>12:29 <a href="https://www.snowflake.com/en/blog/category/product-and-technology/announcing-openai-gpt-5-on-snowflake-cortex-ai/">Announcing OpenAI GPT-5 on Snowflake Cortex AI</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.snowflake.com/en/product/features/cortex/">Snowflake Cortex AI</a> is their existing platform for running LLMs and ML models directly on data stored in <a href="https://www.snowflake.com/en/">Snowflake</a>, currently supporting models like <a href="https://www.llama.com/llama2/">Llama 2</a>, <a href="https://mistral.ai/">Mistral</a>, and other open-source options.</li>
<li style="font-weight:400;">If GPT-5 were to be integrated with Cortex AI, it would allow enterprises to run advanced language models on their private data without moving it outside Snowflake’s secure environment.</li>
<li style="font-weight:400;">This integration would follow Snowflake’s pattern of adding major LLMs to Cortex, enabling SQL-based access to AI capabilities for data analysts and developers.</li>
<li style="font-weight:400;">The announcement timing would be notable given OpenAI hasn’t officially released GPT-5 yet, making this either premature or indicative of an exclusive cloud partnership.</li>
<li style="font-weight:400;">Cool. </li>
</ul>
<p>12:35 <a href="https://arstechnica.com/ai/2025/08/apple-brings-openais-gpt-5-to-ios-and-macos/">Apple brings OpenAI’s GPT-5 to iOS and macOS – Ars Technica</a></p>
<ul>
<li style="font-weight:400;">Apple followed up the deluge of GPT-5 announcements with one of their own. </li>
<li style="font-weight:400;">Apple will integrate <a href="https://arstechnica.com/ai/2025/08/openai-launches-gpt-5-free-to-all-chatgpt-users/">OpenAI’s GPT-5</a> into <a href="https://www.apple.com/os/ios/">iOS 26</a>,<a href="https://www.apple.com/os/ipados/"> iPadOS 26</a>, and <a href="https://www.apple.com/newsroom/2025/06/macos-tahoe-26-makes-the-mac-more-capable-productive-and-intelligent-than-ever/">macOS Tahoe 26</a>, likely launching in September 2025, replacing the current GPT-4o integration for Siri and system-level AI queries.</li>
<li style="font-weight:400;">GPT-5 claims an 80% reduction in hallucinations and introduces automatic model selection between standard and reasoning-optimized modes based on prompt complexity, though it’s unclear how Apple will implement this dual-mode functionality in their OS integration.</li>
<li style="font-weight:400;">The rollout follows GPT-5 deployments to <a href="https://github.com/features/copilot">GitHub Copilot</a> (public preview) and <a href="https://www.microsoft365.com/?omkt=en-US">Microsoft 365 Copilot</a>, positioning major cloud platforms as the primary distribution channels for OpenAI’s latest models rather than direct consumer access.</li>
<li style="font-weight:400;">Apple’s implementation raises questions about feature parity with ChatGPT’s paid tier, particularly whether iOS users will have manual model selection capabilities or be limited to automatic selection like free ChatGPT users.</li>
<li style="font-weight:400;">This marks a significant shift in how consumers will access advanced AI models, with cloud-integrated operating systems becoming the default interface rather than standalone AI applications.</li>
</ul>
<p>12:50 <a href="https://www.digitalocean.com/blog/gpt-5-now-on-digitalocean-gradient-ai-platform">Now Live: GPT-5 on the DigitalOcean Gradient AI Platform | DigitalOcean</a></p>
<ul>
<li style="font-weight:400;">What could DO possibly have had to announce? Oh yeah – GPT-5. Weird. </li>
<li style="font-weight:400;">DigitalOcean’s Gradient AI Platform now offers GPT-5 integration with two deployment options: using DigitalOcean’s infrastructure or bringing your own OpenAI API key for direct billing flexibility.</li>
<li style="font-weight:400;">GPT-5 introduces improved reasoning capabilities and domain specialization, targeting enterprise use cases like financial planning, medical document analysis, and advanced code generation beyond general-purpose chat applications.</li>
<li style="font-weight:400;">The platform positions GPT-5 as an “agent-ready” model, enabling developers to build autonomous AI agents within DigitalOcean’s infrastructure rather than just API-based integrations.</li>
<li style="font-weight:400;">This marks DigitalOcean’s entry into the hosting frontier for AI models, competing with hyperscalers by offering simplified deployment and management for developers who want cloud infrastructure without complexity.</li>
<li style="font-weight:400;">The bring-your-own-key option allows organizations to maintain existing OpenAI enterprise agreements while leveraging DigitalOcean’s compute and orchestration layer for agent workflows.</li>
</ul>
<p>13:39  Matt – “It’s going to be a question, in a month, of ‘why don’t you have GPT-5, where is it in your roadmap?’ More than anything.” </p>
<p>14:15 <a href="https://arstechnica.com/ai/2025/08/chatgpt-users-outraged-as-gpt-5-replaces-the-models-they-love/">ChatGPT users hate GPT-5’s “overworked secretary” energy, miss their </a><a href="https://arstechnica.com/ai/2025/08/chatgpt-users-outraged-as-gpt-5-replaces-the-models-they-love/">GPT-4o buddy – Ars Technica</a></p>
<ul>
<li style="font-weight:400;">After all that buzz wore off, there were some complaints. </li>
<li style="font-weight:400;">OpenAI released GPT-5 as the default model for <a href="https://chatgpt.com/">ChatGPT</a> users while restricting <a href="https://openai.com/index/hello-gpt-4o/">GPT-4o</a> access to developer APIs only, causing user backlash over losing their preferred conversational AI experience.</li>
<li style="font-weight:400;">Users report GPT-5 outputs feel more sterile and corporate compared to GPT-4o, with complaints about reduced creativity and broken workflows that were optimized for the previous model.</li>
<li style="font-weight:400;">This highlights a key challenge for cloud AI services: maintaining consistency in user experience while upgrading models, especially when users develop emotional attachments or specific workflows around particular AI behaviors.</li>
<li style="font-weight:400;">The situation demonstrates the importance of model versioning and user choice in AI platforms, suggesting cloud providers should consider maintaining multiple model options for different use cases rather than forcing migrations.</li>
<li style="font-weight:400;">For businesses building on AI APIs, this serves as a reminder to plan for model deprecation and changes in AI behavior that could impact customer-facing applications or internal workflows.</li>
</ul>
<p>15:00 <a href="https://arstechnica.com/information-technology/2025/08/the-gpt-5-rollout-has-been-a-big-mess/">The GPT-5 rollout has been a big mess – Ars Technica</a></p>
<ul>
<li style="font-weight:400;">OpenAI automatically removed access to nine previous ChatGPT models when GPT-5 launched on August 7, forcing users to migrate without warning, unlike API users who receive deprecation notices.</li>
<li style="font-weight:400;">The forced migration broke established workflows as each model has unique training and output styles that users had optimized their prompts for over months of use.</li>
<li style="font-weight:400;">User revolt included over 4,000 comments on Reddit, with marketing professionals, researchers, and developers reporting broken systems and lost functionality within 24 hours of launch.</li>
<li style="font-weight:400;">CEO Sam Altman issued a public apology and reversed the decision, highlighting the operational challenges of managing multiple model versions in consumer-facing AI services.</li>
<li style="font-weight:400;">The incident demonstrates the dependency risk when building workflows around specific AI models and the importance of version control strategies for production AI applications.</li>
</ul>
<p>16:51  Matt – “Could go the Microsoft or AWS route and never depricate anything until you can 100% guarantee no one is using it anymore.” </p>
<h2>Cloud Tools</h2>
<p>17:18 <a href="https://share.google/zxtGetiMwXhW5R0oF">Buttercup is now open-source! -The Trail of Bits Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.trailofbits.com/">Trail of Bits</a> has open-sourced <a href="https://blog.trailofbits.com/2025/08/08/buttercup-is-now-open-source/">Buttercup</a>, their AI-powered Cyber Reasoning System that won second place in <a href="https://archive.aicyberchallenge.com/">DARPA’s AI Cyber Challenge</a>, making automated vulnerability discovery and patching accessible to individual developers on standard laptops with 8 cores, 16GB RAM, and 100GB storage.</li>
<li style="font-weight:400;">The system combines AI-augmented fuzzing with multi-agent patch generation, using 7 distinct AI agents to create and validate fixes while leveraging third-party LLMs like OpenAI or Anthropic with built-in cost controls for budget management.</li>
<li style="font-weight:400;">Buttercup integrates OSS-Fuzz/ClusterFuzz for vulnerability discovery, tree-sitter and CodeQuery for static analysis, and provides a complete orchestration layer with web UI and SigNoz telemetry monitoring, demonstrating practical AI application in automated security testing.</li>
<li style="font-weight:400;">The standalone version can find and patch vulnerabilities in under 10 minutes on sample code, offering cloud-native deployment through containerized pods and making enterprise-grade security automation available to smaller teams and projects.</li>
<li style="font-weight:400;">This release represents a shift in AI-powered security tools from competition-scale systems to practical developer tools, potentially reducing the barrier to entry for automated vulnerability management in CI/CD pipelines and cloud deployments.</li>
</ul>
<p>19:19  Ryan – “I do like anything that’s going to go and detect the vulnerabilities and then also try to fix them on behalf of developers. I haven’t used any of these tools, and it’s an interesting fit with the existing pipelines. It’s pretty cool though.”  </p>
<h2>AWS </h2>
<p>19:58  <a href="https://aws.amazon.com/about-aws/whats-new/2025/08/amazon-aurora-serverless-v2-up-to-30-performance/">Amazon Aurora Serverless v2 now offers up to 30% performance </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/08/amazon-aurora-serverless-v2-up-to-30-performance/">improvement</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/AmazonRDS/latest/AuroraUserGuide/aurora-serverless-v2.html">Aurora Serverless v2</a> delivers up to 30% performance improvement on platform version 3, making it viable for more demanding workloads that previously required provisioned instances.</li>
<li style="font-weight:400;">The service now scales from 0 to 256 ACUs (<a href="https://docs.aws.amazon.com/AmazonRDS/latest/AuroraUserGuide/aurora-serverless-v2.setting-capacity.html">Aurora Capacity Units</a>), where each ACU provides approximately 2 GiB of memory plus corresponding CPU and networking resources.</li>
<li style="font-weight:400;">Existing clusters require manual upgrade via stop/restart or Blue/Green Deployments to access the performance gains, while new clusters automatically launch on the latest platform.</li>
<li style="font-weight:400;">The 30% performance boost, combined with automatic scaling, addresses the common serverless database challenge of balancing cost efficiency with consistent performance for variable workloads.</li>
<li style="font-weight:400;">Available across all AWS regions, including <a href="https://aws.amazon.com/govcloud-us/">GovCloud</a>, this update strengthens Aurora’s position against competitors like <a href="https://cloud.google.com/spanner">Google Cloud Spanner</a> and <a href="https://azure.microsoft.com/en-us/products/azure-sql/database/">Azure SQL Database</a> serverless offerings.</li>
</ul>
<p>21:28  Justin – “I almost went down the blue-green path, but when you do blue-green, it’s not just a temporary thing; you end up running it forever – which I don’t want to do because I don’t have that kind of money to burn. But this is not easy to get on to; I wish they would just give you a button.” </p>
<p>23:14 <a href="https://aws.amazon.com/blogs/aws/minimize-ai-hallucinations-and-deliver-up-to-99-verification-accuracy-with-automated-reasoning-checks-now-available/">Minimize AI hallucinations and deliver up to 99% verification accuracy with </a><a href="https://aws.amazon.com/blogs/aws/minimize-ai-hallucinations-and-deliver-up-to-99-verification-accuracy-with-automated-reasoning-checks-now-available/">Automated Reasoning checks: Now available | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/category/artificial-intelligence/amazon-machine-learning/amazon-bedrock/amazon-bedrock-guardrails/">Amazon Bedrock Guardrails</a> now includes Automated Reasoning checks that use mathematical logic and formal verification to validate AI-generated content against domain knowledge, achieving up to 99% verification accuracy for detecting hallucinations – a significant improvement over probabilistic methods.</li>
<li style="font-weight:400;">The feature supports documents up to 80K tokens (approximately 100 pages), includes automated test scenario generation, and allows users to encode business rules into formal logic policies that can validate whether AI responses comply with established guidelines and regulations.</li>
<li style="font-weight:400;">PwC is already using this for utility outage management systems where AI-generated response plans must comply with strict regulatory requirements – the system automatically validates protocols, creates severity-based workflows, and ensures responses meet defined targets.</li>
<li style="font-weight:400;">Pricing is based on text processed volume, and the service is available in US East (Ohio, N. Virginia), US West (Oregon), and Europe (Frankfurt, Ireland, Paris) regions, with integration support for both Amazon Bedrock models and third-party models like OpenAI and Google Gemini via the <a href="https://docs.aws.amazon.com/bedrock/latest/userguide/guardrails-use-independent-api.html">ApplyGuardrail API</a>.</li>
<li style="font-weight:400;">The policy creation process involves uploading natural language documents (like PDFs of business rules), which are then translated into formal logic with rules, variables, and custom types that can be tested and validated before deployment in production guardrails.</li>
</ul>
<p>24:44  Ryan – “It is kind of crazy the idea that the reasoning checks are just using mathematical logic.” </p>
<p>26:04 <a href="https://aws.amazon.com/about-aws/whats-new/2025/08/amazon-ecs-console-analytics-cloudwatch-logs-live-tail/">Amazon ECS console now supports real-time log analytics via Amazon </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/08/amazon-ecs-console-analytics-cloudwatch-logs-live-tail/">CloudWatch Logs Live Tail</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/ecs/">Amazon ECS</a> console now integrates <a href="https://docs.aws.amazon.com/AmazonCloudWatch/latest/logs/CloudWatchLogs_LiveTail.html">CloudWatch Logs Live Tail</a> directly, eliminating the need to switch between consoles for real-time log monitoring during container troubleshooting and deployment investigations. </li>
<li style="font-weight:400;">This is 99% of Justin’s day, so he’s loving this one. </li>
<li style="font-weight:400;">The Live Tail panel stays visible while navigating the ECS console, allowing operators to monitor logs while checking metrics or making configuration changes – addressing a common workflow interruption.</li>
<li style="font-weight:400;">Access is straightforward through the logs tab on any ECS service or task details page with a simple “Open CloudWatch Logs Live Tail” button, making real-time debugging more accessible for containerized applications.</li>
<li style="font-weight:400;">This integration reduces context switching for common ECS operations like investigating deployment failures and monitoring container health, improving operational efficiency for teams managing containerized workloads.</li>
<li style="font-weight:400;">Available in all AWS commercial regions, with standard <a href="https://docs.aws.amazon.com/AmazonCloudWatch/latest/logs/WhatIsCloudWatchLogs.html">CloudWatch Logs</a> pricing applying to the Live Tail feature usage.</li>
</ul>
<p>27:44  Matt – “I wish this was here years ago when I did my first ECS deployments.” </p>
<p>27:57 <a href="https://aws.amazon.com/about-aws/whats-new/2025/08/aws-lambda-github-actions-function-deployment/">AWS Lambda now supports GitHub Actions to simplify function </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/08/aws-lambda-github-actions-function-deployment/">deployment</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/lambda/">AWS Lambda</a> now supports native <a href="https://github.com/features/actions">GitHub Actions</a> for automated function deployment, eliminating the need for custom scripts and manual AWS CLI commands that previously made CI/CD pipelines complex and error-prone.</li>
<li style="font-weight:400;">The new <a href="https://docs.aws.amazon.com/lambda/latest/dg/getting-started.html">Deploy Lambda Function</a> action handles both zip file and container image deployments automatically, supports OIDC authentication for secure IAM integration, and includes configuration options for runtime, memory, timeout, and environment variables.</li>
<li style="font-weight:400;">This addresses a significant pain point where developers had to write repetitive boilerplate code across repositories, manually package artifacts, and configure IAM permissions for each Lambda deployment from GitHub.</li>
<li style="font-weight:400;">The action includes practical features like dry run mode for validation without changes and S3-based deployment support for larger zip packages, making it suitable for both development testing and production deployments.</li>
<li style="font-weight:400;">Available in all commercial AWS regions where Lambda operates, this integration reduces onboarding time for new developers and decreases deployment errors by providing a declarative configuration approach within GitHub Actions workflows.</li>
</ul>
<p>29:03 Ryan – “I love this with every bone in my body. This is an easy button for development, where I can’t think of the amount of bad scripting I’ve done…  trying to build pipelines to do what I want. This is definitely something that will make that a lot easier.”  </p>
<p>30:36 <a href="https://aws.amazon.com/about-aws/whats-new/2025/08/amazon-dynamodb-adds-console-to-code/">Amazon DynamoDB adds support for Console-to-Code</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/amazondynamodb/latest/developerguide/Introduction.html">DynamoDB</a> Console-to-Code uses <a href="https://aws.amazon.com/q/developer/">Amazon Q Developer</a> to automatically generate infrastructure-as-code from console actions, supporting AWS CDK in <a href="https://www.typescriptlang.org/">TypeScript</a>, <a href="https://www.python.org/">Python</a>, and <a href="https://www.java.com/">Java</a>, plus <a href="https://docs.aws.amazon.com/AWSCloudFormation/latest/UserGuide/Welcome.html">CloudFormation</a> in <a href="https://yaml.org/">YAML</a> or <a href="https://www.json.org/json-en.html">JSON</a> formats.</li>
<li style="font-weight:400;">This feature addresses the common workflow where developers prototype in the console, then manually recreate configurations as code, reducing time spent on infrastructure automation setup.</li>
<li style="font-weight:400;">The integration leverages generative AI to translate recorded console actions into production-ready code templates, streamlining the path from experimentation to automated deployment.</li>
<li style="font-weight:400;">Available now in commercial regions, this positions DynamoDB alongside other AWS services adopting Console-to-Code functionality, part of AWS’s broader push to simplify infrastructure automation.</li>
<li style="font-weight:400;">For teams managing multiple DynamoDB tables or complex configurations, this reduces manual coding effort and potential errors when transitioning from development to production environments.</li>
</ul>
<p>31:17  Ryan – “I promise you that CloudFormation takes that YAML and converts it to JSON before execution.” </p>
<p>36:41 <a href="https://aws.amazon.com/blogs/containers/simplify-network-connectivity-using-tailscale-with-amazon-eks-hybrid-nodes/">Simplify network connectivity using Tailscale with Amazon EKS Hybrid </a><a href="https://aws.amazon.com/blogs/containers/simplify-network-connectivity-using-tailscale-with-amazon-eks-hybrid-nodes/">Nodes | Containers</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/eks/latest/userguide/hybrid-nodes-overview.html">AWS EKS Hybrid Nodes</a> now integrates with <a href="https://tailscale.com/">Tailscale</a> to simplify network connectivity between on-premises infrastructure and AWS-hosted <a href="https://kubernetes.io/">Kubernetes</a> control planes. </li>
<li style="font-weight:400;">This eliminates complex VPN configurations by using Tailscale’s peer-to-peer mesh networking with <a href="https://www.wireguard.com/protocol/">WireGuard encryption</a> for direct, secure connections.</li>
<li style="font-weight:400;">The solution addresses a key challenge in hybrid Kubernetes deployments by allowing organizations to manage their control plane in AWS while keeping worker nodes on-premises or at edge locations. Tailscale acts as a subnet router within the VPC, advertising routes between the remote pod network (like 10.80.0.0/16) and node addresses (192.168.169.0/24).</li>
<li style="font-weight:400;">Implementation requires installing Tailscale on hybrid nodes, deploying a subnet router EC2 instance in your VPC, and updating route tables to direct traffic through the Tailscale network interface. </li>
<li style="font-weight:400;">The setup supports both <a href="https://docs.tigera.io/calico/latest/getting-started/kubernetes/hardway/install-cni-plugin">Calico</a> and <a href="https://cilium.io/use-cases/cni/">Cilium CNIs</a> with per-node /32 addressing for optimal routing.</li>
<li style="font-weight:400;">This approach reduces operational complexity compared to traditional site-to-site VPNs or <a href="https://docs.aws.amazon.com/directconnect/latest/UserGuide/Welcome.html">AWS Direct Connect</a>, making hybrid Kubernetes deployments more accessible for organizations with existing on-premises infrastructure. Tailscale is available through <a href="https://aws.amazon.com/marketplace/">AWS Marketplace</a> with standard EC2 instance costs for the subnet router.</li>
<li style="font-weight:400;">Key considerations include planning non-overlapping CIDR ranges, enabling IP forwarding on the subnet router, and potentially deploying multiple subnet routers across availability zones for high availability. </li>
<li style="font-weight:400;">The solution works with EKS-validated operating systems on hybrid nodes.</li>
</ul>
<p>38:49  Ryan – “If everything is using a mesh peer-to-peer communication network, great. But if you’re doing this on top of VPC, that’s on top of transit gateway, that already has a Direct Connect gateway, and you’re just doing it to bypass your network infrastructure, boo! Don’t do that.” </p>
<p>41:21 <a href="https://aws.amazon.com/about-aws/whats-new/2025/08/amazon-cloudwatch-organization-vpc-flow-logs-enablement">Amazon CloudWatch introduces organization-wide VPC flow logs </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/08/amazon-cloudwatch-organization-vpc-flow-logs-enablement">enablement</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/AmazonCloudWatch/latest/monitoring/telemetry-config-cloudwatch.html">CloudWatch</a> now enables automatic VPC flow logs across entire AWS Organizations through Telemetry Config rules, eliminating manual setup for each VPC and ensuring consistent network monitoring coverage.</li>
<li style="font-weight:400;">Organizations can scope rules by entire org, specific accounts, or resource tags, allowing DevOps teams to automatically enable flow logs for production VPCs or other critical infrastructure based on tagging strategies.</li>
<li style="font-weight:400;">The feature leverages AWS Config Service-Linked recorders to discover matching resources and applies to both existing and newly created VPCs, preventing monitoring gaps as infrastructure scales.</li>
<li style="font-weight:400;">Customers pay <a href="https://aws.amazon.com/config/pricing/">AWS Config pricing</a> for configuration items plus CloudWatch vended logs pricing for flow log ingestion, making cost predictable based on VPC count and log volume.</li>
<li style="font-weight:400;">Available in 16 commercial regions, this addresses a common compliance and security requirement where organizations need complete network traffic visibility without manual intervention.</li>
</ul>
<h2>GCP</h2>
<p>44:50 <a href="https://blog.google/technology/developers/introducing-gemini-cli-github-actions/">Gemini CLI GitHub Actions: AI coding made for collaboration</a></p>
<ul>
<li style="font-weight:400;">Google launches <a href="https://blog.google/technology/developers/introducing-gemini-cli-open-source-ai-agent/">Gemini CLI</a> GitHub Actions, a free AI coding assistant that automates issue triage, pull request reviews, and on-demand development tasks through simple @gemini-cli mentions in GitHub repositories.</li>
<li style="font-weight:400;">The tool provides enterprise-grade security through <a href="https://cloud.google.com/iam/docs/workload-identity-federation">Workload Identity Federation</a> for credential-less authentication, command allowlisting for granular control, and <a href="https://opentelemetry.io/">OpenTelemetry</a> integration for complete observability of all AI actions.</li>
<li style="font-weight:400;">Available in beta with generous free quotas for Google AI Studio users, with support for Vertex AI and Gemini Code Assist Standard/Enterprise tiers, positioning it as a direct competitor to GitHub Copilot’s workflow automation features.</li>
<li style="font-weight:400;">Three pre-built workflows handle intelligent issue labeling and prioritization, automated code review feedback, and delegated coding tasks like writing tests or implementing bug fixes based on issue descriptions.</li>
<li style="font-weight:400;">The open-source nature allows teams to customize workflows or create new ones, with Google using the tool internally to manage contributions to the Gemini CLI project itself, demonstrating practical scalability for high-volume repositories.</li>
</ul>
<p>45:54  Ryan – “I like that this is also directly competing with Jules – it’s very similar – without all the polish. In fact, now I’m worried that I was confusing features between the two of them when we were talking about Jules earlier.” </p>
<p>46:41 <a href="https://cloud.google.com/blog/products/data-analytics/new-agents-and-ai-foundations-for-data-teams/">New agents and AI foundations for data teams | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google introduces specialized AI agents for data teams, including Data Engineering Agent for pipeline automation, Data Science Agent for autonomous analytical workflows, and enhanced Conversational Analytics Agent with Code Interpreter that can execute Python code for complex business questions beyond SQL capabilities.</li>
<li style="font-weight:400;">We were silent when they came for the DevOps engineers. </li>
<li style="font-weight:400;">We were silent when they came for the SQL engineers. </li>
<li style="font-weight:400;">Will we now remain silent as they take out the ML Ops people? </li>
<li style="font-weight:400;">Ryan says: Absolutely YES. </li>
<li style="font-weight:400;">New Gemini Data Agents APIs and Agent Development Kit enable developers to build custom agents and integrate conversational intelligence into their applications, with Model Context Protocol support for secure agent interactions across systems.</li>
<li style="font-weight:400;">Spanner gets a columnar engine delivering up to 200x faster analytical query performance on transactional data, while BigQuery adds autonomous vector embeddings and an AI Query Engine that brings LLM capabilities directly to SQL queries.</li>
<li style="font-weight:400;">The platform unifies operational and analytical data in a single AI-native foundation, addressing the traditional divide between OLTP and OLAP systems while providing persistent memory and reasoning capabilities for agents.</li>
<li style="font-weight:400;">Offering pre-built agents rather than just infrastructure, though pricing details aren’t provided, and the preview status suggests production readiness is still developing</li>
</ul>
<p>47:07  Ryan – “I’m going to throw a party. Those people have been screwing up the data in my Data Lakes for how long? This is awesome. Now it will be screwed up, but it will be done by a computer.” </p>
<p>48:50 <a href="https://cloud.google.com/blog/products/ai-machine-learning/ai-first-colab-notebooks-in-bigquery-and-vertex-ai/">AI First Colab Notebooks in BigQuery and Vertex AI | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google brings AI-first capabilities to <a href="https://console.cloud.google.com/vertex-ai/colab/runtimes">Colab Enterprise</a> notebooks in <a href="https://console.cloud.google.com/bigquery">BigQuery</a> and <a href="https://console.cloud.google.com/vertex-ai">Vertex AI</a>, featuring a Data Science Agent that automates end-to-end ML workflows from data exploration to model evaluation. </li>
<li style="font-weight:400;">The agent generates multi-step plans, executes code, and self-corrects errors while maintaining human oversight for each step.</li>
<li style="font-weight:400;">The service competes directly with <a href="https://docs.aws.amazon.com/sagemaker/latest/dg/code-editor.html">AWS SageMaker Studio’s Code Editor</a> and <a href="http://notebooks.azure.com/">Azure Machine Learning’s notebook experiences</a>, but differentiates through its conversational interface and automatic error correction. Users can generate visualizations, transform existing code, and interact with other Google Cloud services through natural language prompts.</li>
<li style="font-weight:400;">Currently available in Preview for the US and Asia regions only, with expansion planned for other Google Cloud regions. </li>
<li style="font-weight:400;">Access is through console.cloud.google.com/bigquery for BigQuery users or console.cloud.google.com/vertex-ai/colab/notebooks for Vertex AI users.</li>
<li style="font-weight:400;">Key use cases include data scientists automating repetitive ML tasks, analysts creating visualizations without deep library knowledge, and teams needing to quickly prototype and iterate on models. The human-in-the-loop design ensures transparency while reducing time spent on boilerplate code.</li>
<li style="font-weight:400;">Integration with BigQuery Pipelines allows scheduled notebook runs and multi-step DAG creation, making it practical for production workflows. </li>
<li style="font-weight:400;">The notebooks are interoperable between BigQuery and Vertex AI, providing flexibility in where teams choose to work.</li>
</ul>
<p>50:27 <a href="https://cloud.google.com/blog/topics/public-sector/accelerating-fedramp-20x-how-google-cloud-is-automating-compliance/">Accelerate FedRAMP Authorization with Google Cloud Compliance </a><a href="https://cloud.google.com/blog/topics/public-sector/accelerating-fedramp-20x-how-google-cloud-is-automating-compliance/">Manager | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/security-command-center/docs/compliance-manager-overview">Google Cloud Compliance Manager</a> enters public preview to automate FedRAMP authorization processes, reducing manual evidence collection and targeting faster federal cloud deployments through integration with the <a href="https://www.fedramp.gov/20x/phase-one/">FedRAMP 20x</a> pilot program.</li>
<li style="font-weight:400;">The service automates compliance validation for <a href="https://github.com/FedRAMP/docs/blob/main/markdown/FRMR.KSI.key-security-indicators.md">FedRAMP 20x Key Security Indicators (KSIs)</a> and provides machine-readable evidence, moving away from traditional narrative-based requirements that typically slow down federal authorization processes.</li>
<li style="font-weight:400;">Google partnered with StackArmor for proof of concept demonstrations and Coalfire (a <a href="https://www.fedramp.gov/assets/resources/documents/3PAO_Obligations_and_Performance_Standards.pdf">FedRAMP 3PAO</a>) for independent validation, positioning Compliance Manager as a native platform solution rather than a third-party add-on.</li>
<li style="font-weight:400;">This addresses a significant pain point for federal contractors and agencies who often spend months or years achieving FedRAMP authorization, with general availability for FedRAMP 20x support planned for later this year.</li>
<li style="font-weight:400;">The announcement follows recent <a href="https://cloud.google.com/blog/topics/public-sector/accelerating-innovation-with-agent-assist-looker-google-cloud-core-and-vertex-ai-vector-search-now-fedramp-high-authorized">FedRAMP High authorizations for Agent Assist</a>, Looker, and Vertex AI Vector Search, demonstrating Google’s broader push into federal cloud services alongside competitors AWS and Azure, who dominate this market.</li>
</ul>
<p>53:29  Justin – “It’s basically the government saying, it’s too hard to get FedRAMP, we want to level the playing field, and so they’ve changed the rules, but made them more confusing – because they haven’t actually provided clarifications for most of them. And so it’s a promise of better, but no reality of it yet.”</p>
<p>49:10 <a href="https://cloud.google.com/blog/products/databases/introducing-enhanced-backups-for-cloud-sql/">Introducing Enhanced Backups for Cloud SQL | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/sql">Google Cloud SQL</a> now offers Enhanced Backups through integration with their <a href="https://cloud.google.com/backup-disaster-recovery?e=48754805&amp;hl=en">Backup and DR Service</a>, providing immutable, logically air-gapped backup vaults managed separately from source projects. </li>
<li style="font-weight:400;">This addresses a critical gap where database backups could be compromised if the entire project were deleted or attacked.</li>
<li style="font-weight:400;">The feature supports flexible retention policies from days to decades with hourly, daily, weekly, monthly, and yearly backup schedules. Backups are protected with retention locks and zero-trust access policies, making them truly immutable for compliance requirements.</li>
<li style="font-weight:400;">Available in Preview for <a href="https://cloud.google.com/sql/docs/editions-intro">Cloud SQL Enterprise</a> and Enterprise Plus editions, this positions Google competitively against AWS RDS automated backups and Azure SQL Database’s long-term retention. The key differentiator is the complete separation of backups from the source project infrastructure.</li>
<li style="font-weight:400;">Implementation requires three simple steps: create a backup vault in the Backup and DR service, define a backup plan with retention rules, and apply it to Cloud SQL instances. </li>
<li style="font-weight:400;">No additional infrastructure deployment is needed as it integrates with the existing console, gcloud, and API tools.</li>
<li style="font-weight:400;">Early adopters like SQUARE ENIX and JFrog highlight the value for gaming, DevOps, and regulated industries where data protection against project-level failures is critical. The centralized management dashboard simplifies compliance reporting and monitoring across multiple database instances.</li>
</ul>
<p>58:18 <a href="https://cloud.google.com/blog/products/business-intelligence/introducing-looker-mcp-server/">Introducing Looker MCP Server | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google launches Looker <a href="https://modelcontextprotocol.io/introduction">MCP Server</a>, enabling AI applications like chatbots and custom agents to directly query Looker’s <a href="https://cloud.google.com/looker-modeling">semantic layer</a> through the Model Context Protocol standard, eliminating the need for AI to write SQL while maintaining data governance and security controls.</li>
<li style="font-weight:400;">The integration works with existing AI developer tools, including Gemini CLI, Claude Desktop, and Cursor, allowing developers to connect AI agents to pre-defined, trusted data models without complex integration work or risk of data misinterpretation.</li>
<li style="font-weight:400;">Unlike traditional AI-to-database connections, Looker MCP Server inherits Looker’s security model with fine-grained access controls, audit trails, and the ability to define which AI applications can access specific data at what granularity.</li>
<li style="font-weight:400;">Extending Looker’s semantic layer capabilities to the AI development ecosystem, particularly valuable for organizations already using Looker for BI who want consistent data definitions across both analytics and AI applications.</li>
<li style="font-weight:400;">The Quickstart guide is available on GitHub at googleapis.github.io/genai-toolbox/samples/looker/looker_gemini/, with no additional licensing costs mentioned beyond existing Looker subscriptions.</li>
</ul>
<p>58:10  Justin – “Not having to write Looker reports to get my data </p>
<p>Is super nice. But also, if Looker is getting more and more capabilities, so that I can – potentially from a different system –  reach out to Looker and tell it to create a report with the pretty dashboards I love as an executive, all is right in the world.” </p>
<p>59:15 <a href="https://cloud.google.com/blog/topics/developers-practitioners/accelerate-ai-with-cloud-run-sign-up-now-for-a-developer-workshop-near-you/">Accelerate AI with Cloud Run: Sign up now for a developer workshop near </a></p>
<p><a href="https://cloud.google.com/blog/topics/developers-practitioners/accelerate-ai-with-cloud-run-sign-up-now-for-a-developer-workshop-near-you/">you! | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google is launching “Accelerate AI with <a href="https://cloud.google.com/run">Cloud Run</a>,” a global series of free, full-day in-person workshops focused on helping developers move AI prototypes to production using Cloud Run’s serverless infrastructure with GPU acceleration.</li>
<li style="font-weight:400;">The workshops teach developers to build secure AI applications using the <a href="https://cloud.google.com/blog/topics/developers-practitioners/build-and-deploy-a-remote-mcp-server-to-google-cloud-run-in-under-10-minutes">Model Context Protocol (MCP) on Cloud Run</a> and Google’s <a href="https://google.github.io/adk-docs/">Agent Development Kit</a> (ADK), providing hands-on experience with containerization and deployment patterns for production-scale AI agents.</li>
<li style="font-weight:400;"><a href="https://workshops.aws/card/sagemaker">AWS’s SageMaker workshops</a> and <a href="https://techcommunity.microsoft.com/blog/azure-ai-foundry-blog/learn-about-azure-ai-during-the-global-ai-bootcamp-2025/4387217">Azure’s AI bootcamps</a> emphasize serverless deployment and the complete prototype-to-production journey rather than just model training, targeting both application developers and startup founders.</li>
<li style="font-weight:400;">The timing aligns with Google’s push to make Cloud Run a primary platform for AI workloads, leveraging its automatic scaling, built-in security, and pay-per-use pricing model that can significantly reduce costs compared to dedicated GPU instances.</li>
<li style="font-weight:400;">The focus on practical implementation of AI agents with secure tool access through MCP addresses the common challenge developers face when trying to scale AI prototypes beyond proof-of-concept demos.</li>
</ul>
<p>1:01:14  Matt – “Who needs security on MCPs? It’s so new, no one is going to know how to break into it.” </p>
<h2>Azure</h2>
<p>1:02:17 <a href="https://azure.microsoft.com/en-us/blog/openais-open%E2%80%91source-model-gpt%E2%80%91oss-on-azure-ai-foundry-and-windows-ai-foundry/">OpenAI’s open‑source model: gpt‑oss on Azure AI Foundry and Windows </a><a href="https://azure.microsoft.com/en-us/blog/openais-open%E2%80%91source-model-gpt%E2%80%91oss-on-azure-ai-foundry-and-windows-ai-foundry/">AI Foundry  | Microsoft Azure Blog</a></p>
<ul>
<li style="font-weight:400;">OpenAI released its first open-weight models since <a href="https://openai.com/index/gpt-2-1-5b-release/">GPT-2</a> with <a href="https://huggingface.co/openai/gpt-oss-120b">gpt-oss-120b</a> and gpt-oss-20b, now available on <a href="https://ai.azure.com/">Azure AI Foundry</a> and <a href="https://developer.microsoft.com/en-us/windows/ai/">Windows AI Foundry</a>, giving developers full control to fine-tune, distill, and deploy these models on their own infrastructure.</li>
<li style="font-weight:400;">The 120B parameter model delivers o4-mini level performance on a single datacenter GPU, while the 20B model runs locally on Windows devices with 16GB+ VRAM, enabling both cloud-scale reasoning and edge deployment scenarios without API dependencies.</li>
<li style="font-weight:400;">Azure AI Foundry provides the full toolchain for customization, including LoRA fine-tuning, quantization, and ONNX export, while Foundry Local brings these models to Windows 11 for offline and secure deployments across CPUs, GPUs, and NPUs.</li>
<li style="font-weight:400;">Pricing starts at $0.15 per million input tokens for gpt-oss-20b and $0.60 for gpt-oss-120b, positioning these as cost-effective alternatives to proprietary models while maintaining API compatibility for easy migration.</li>
<li style="font-weight:400;">This marks a significant shift in Microsoft’s AI strategy by offering open-weight frontier models alongside proprietary options, directly competing with Meta’s Llama and Google’s open model initiatives while leveraging Azure’s infrastructure advantage.</li>
<li style="font-weight:400;">Cool. Moving on. </li>
</ul>
<p>1:02:27 <a href="https://azure.microsoft.com/en-us/blog/introducing-azure-storage-discovery-transform-data-management-with-storage-insights/">Introducing Azure Storage Discovery: Transform data management with </a><a href="https://azure.microsoft.com/en-us/blog/introducing-azure-storage-discovery-transform-data-management-with-storage-insights/">storage insights | Microsoft Azure Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/storage-discovery/overview">Azure Storage Discovery</a> provides a centralized dashboard to analyze and manage <a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=C8zUDEr5ZeME9jP_Wv7aLKTu-HoAZodF7IywZeSyd7ojobX-vKJQz6-IZ9T22N2lz5nTuGJEzEEdG3QmhrycvZucacnguSMrtTcS8eJDUYImDTgdSyANCH9_TAeSuNKN.Fr5ZvlAOWwhbLRok2EDqtQ&amp;eddgt=GpgdN4sIRzpaCKD5CJwJ_A%3D%3D&amp;rut=ac0d80cb9f334056d5f62ca03daaa4e12969c718ca4b8e726e2b94d27ed13b52&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8tYjOlKJQT_V839nHo8_N1jVUCUxYTHBfvC8MpMNtrq3XenSjEUsPNG6S7cgkq9bm74-ZA-SBzxIiHWoMFZIXmcNiHTGuNYQ1DN_mvRXKYT82a6a_05fc17dNIRbkMAv_zH-U5EnzbxiftHd1-PC7X8AYRLngi29tEVmCAXmBRaF3FIV5HbAmCByHn_ievHi1XcL2cPOrXNyLtDcbB04HglYu6tg3w_TMnYhFvwKUPH1NeZhqKxAlkZTmrWlyJ9CfreTtbHij28Pe6gX7hP3lDb_Tz4qIn5tkDXQ4qcWbE5nJV9i53mxqbpnXuIKS0hk8Eta1pDhWW0hVsv3Quktq0C57llhFmLDxcsQTaioTFP2erxmoEjXopIZF6CTO1g-YRQBZ1_eX1ct4RCBWPjLneAFVVSAuILhdJyOldnBDTRh-9__iqSHEysmTMI6ArJAoSwQTigc5OtuTaZ99oavQL_26uuwjCbDBMPPmmVU4xphapEU-X24ZWs-_IEf0_pUXJqWHojFeksTI2TxjeGQlMq2aLerhNKd7xFR1-xmWevN_FXKIdW9YmyZHOOKQI91eSJpS6JcZAOJXXw3KPbSmq0vmbd9sivAlyEygwn09uMRwlgfiCQf7mLW_46-LAja6bRtcNm5BLqwH50LV4GZqKKFfeqNKc1_H4Ku4PaHkUNOc-crenVz02NzmVKRYJSqX2sj6FNqZjyx787mIVBavPlTjIfdMJGN3F2-KJMhjGCzjvOEBVB1JIkbc-GngWhSZ37eQjw%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0MzklMjZjYW1wJTNkMTY4ODA3JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDE5MjElMjZjcml0ZXJpYWlkJTNka3dkLTc5NzgzOTE4OTkxODg4JTNhbG9jLTE5MCUyNmNhbXBhaWduaWQlM2Q1OTAyMzczNTglMjZsb2NwaHklM2Q4MDM0NCUyNmFkZ3JvdXBpZCUzZDEyNzY1MzQ2NTI2ODc4NTUlMjZjaWQlM2Q3OTc4MzUyMDQyNTU4OSUyNmtkdiUzZGMlMjZrZXh0JTNkJTI2a3BnJTNkJTI2a3BpZCUzZCUyNnF1ZXJ5U3RyJTNkQXp1cmUlMjUyMEJsb2IlMjUyMFN0b3JhZ2UlMjZ1cmwlM2RodHRwcyUzYSUyZiUyZmF6dXJlLm1pY3Jvc29mdC5jb20lMmZlbi11cyUyZnByb2R1Y3RzJTJmc3RvcmFnZSUyZmJsb2JzJTJmJTNmZWZfaWQlM2Rfa19iNmViMGU1MTE2YzMxNjI2MThmMzY2ZjQ5NGUxYzg0Y19rXyUyNk9DSUQlM2RBSURjbW01ZWRzd2R1dV9TRU1fX2tfYjZlYjBlNTExNmMzMTYyNjE4ZjM2NmY0OTRlMWM4NGNfa18lMjZtc2Nsa2lkJTNkYjZlYjBlNTExNmMzMTYyNjE4ZjM2NmY0OTRlMWM4NGM%26rlid%3Db6eb0e5116c3162618f366f494e1c84c&amp;vqd=4-159055971323617346298032746408268973821&amp;iurl=%7B1%7DIG%3DAFDD345A77EA4BED8AF092F17D9DD17E%26CID%3D0ED0DAFAD01A67520D59CCB4D1FC667F%26ID%3DDevEx%2C5046.1">Azure Blob Storage</a> across entire organizations, aggregating insights from up to 1 million storage accounts without requiring custom scripts or infrastructure deployment.</li>
<li style="font-weight:400;">The service integrates with <a href="https://learn.microsoft.com/en-us/azure/copilot/overview">Azure Copilot</a> for natural language queries and offers both free and standard pricing tiers, with the standard tier providing 18 months of historical data retention for analyzing trends in capacity, activity, errors, and security configurations.</li>
<li style="font-weight:400;">Early adopters like Tesco and Willis Towers Watson report significant time savings in identifying cost optimization opportunities, such as finding rapidly growing storage accounts and data that hasn’t been accessed recently for lifecycle management.</li>
<li style="font-weight:400;">Unlike <a href="https://aws.amazon.com/s3/storage-lens/">AWS Storage Lens</a> or <a href="https://cloud.google.com/blog/products/storage-data-transfer/manage-your-data-with-cloud-storage-insights-inventory-report">GCP Cloud Storage Insights</a>, which focus primarily on metrics, Azure Storage Discovery emphasizes actionable insights with direct navigation to specific resources and pre-built reports for security compliance and cost optimization.</li>
<li style="font-weight:400;">The service will be free until September 30, 2025, after which pricing will be based on the number of storage accounts and objects analyzed, making it accessible for organizations to evaluate its value before committing to costs.</li>
</ul>
<p>1:03:24  Ryan – “I think this is a feature they had to develop in self-defense, because the way they organize the blob storage with those storage accounts. Because coming from another cloud, it’s completely undecipherable.” </p>
<p>1:07:24 <a href="https://techcommunity.microsoft.com/blog/azureobservabilityblog/general-availability-of-azure-monitor-network-security-perimeter-features/4440307">General Availability of Azure Monitor Network Security Perimeter Features </a><a href="https://techcommunity.microsoft.com/blog/azureobservabilityblog/general-availability-of-azure-monitor-network-security-perimeter-features/4440307">| Microsoft Community Hub</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/azure-monitor/essentials/network-security-perimeter">Azure Monitor Network Security Perimeter</a> creates a virtual firewall at the service level that blocks public access to Log Analytics workspaces and Application Insights by default, allowing only explicitly defined traffic through IP ranges or subscription IDs – addressing enterprise demands for zero-trust network isolation of monitoring data.</li>
<li style="font-weight:400;">The feature provides granular control with inbound rules for specific IP ranges and outbound rules for approved FQDNs, plus comprehensive logging of all connection attempts for compliance auditing – particularly valuable for regulated industries like finance, healthcare, and government.</li>
<li style="font-weight:400;">Network Security Perimeter integrates natively with Azure Monitor services, including alerts and action groups, ensuring security rules are enforced across ingestion, queries, and notifications without breaking functionality – managed through a single pane of glass for multiple resources across subscriptions.</li>
<li style="font-weight:400;">This complements existing Private Link deployments by securing Azure Monitor’s service endpoints themselves, creating defense-in-depth where Private Link secures VNet-to-service traffic and Network Security Perimeter locks down the service side – similar to AWS PrivateLink combined with VPC endpoint policies.</li>
<li style="font-weight:400;">The feature is now generally available at no additional cost beyond standard Azure Monitor pricing, making it accessible for organizations needing to prove that monitoring data never touches public internet or unauthorized destinations.</li>
</ul>
<p>1:08:07  Ryan – “If you think about your API endpoints, there is security rules for that. So they’re touting logs and the log out analytics here because those aren’t natively available directly within your VPC network and your subscription. So they’re just accessible via a platform service. And so now, you can basically put rules around accessing that platform service, which won’t confuse anyone at all.”</p>
<p>1:10:50 <a href="https://techcommunity.microsoft.com/blog/azureobservabilityblog/general-availability-of-auxiliary-logs-and-reduced-pricing/4439460">General Availability of Auxiliary Logs and Reduced Pricing | Microsoft </a><a href="https://techcommunity.microsoft.com/blog/azureobservabilityblog/general-availability-of-auxiliary-logs-and-reduced-pricing/4439460">Community Hub</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/azure-monitor/logs/create-custom-table-auxiliary">Azure Monitor’s Auxiliary Logs</a> are now GA with significant price reductions, targeting customers ingesting petabyte-scale logs daily who need cost-effective storage for high-volume, low-fidelity data alongside existing Analytics and Basic log tiers.</li>
<li style="font-weight:400;">Key technical improvements include expanded KQL operator support, Delta Parquet-based storage for better query performance, unlimited time range queries (previously 30 days), and new ingestion-time transformations using Data Collection Rules with KQL expressions.</li>
<li style="font-weight:400;">Integration with <a href="https://techcommunity.microsoft.com/blog/microsoft-security-blog/introducing-microsoft-sentinel-data-lake/4434280">Microsoft Sentinel data lake</a> enables cross-access between security and observability workloads without data duplication, positioning Azure to compete with <a href="https://docs.aws.amazon.com/AmazonCloudWatch/latest/logs/AnalyzingLogData.html">AWS CloudWatch Logs Insights</a> and <a href="https://cloud.google.com/logging/docs/overview">GCP Cloud Logging’s multi-tier storage</a> options.</li>
<li style="font-weight:400;">Summary rules allow efficient data summarization across all log tiers while keeping raw data accessible, and enhanced search jobs support up to 100 million records with cost prediction capabilities.</li>
<li style="font-weight:400;">Target use cases include organizations needing to balance cost and performance for massive log volumes, with the ability to filter noise at ingestion, split data across tiers, and apply transformations to both custom and platform logs.</li>
<li style="font-weight:400;">This leads us to a couple of questions. What is an auxiliary log? Why do we care? Also – why do we have petabytes of them? </li>
</ul>
<p>1:12:02  Ryan – “You’re legally required to have it, that’s why! It’s your firewall logs, your SQL server transaction logs – that you are obligated ot maintain – and that’s exactly what this is for. It’s a routing layer in your existing logging infrastructure, and it just routes these to a low-cost, different sort of query method.”   </p>
<p>1:13:16 <a href="https://techcommunity.microsoft.com/blog/appsonazureblog/announcing-general-availability-of-app-service-inbound-ipv6-support/4423358">Announcing General Availability of App Service Inbound IPv6 Support | </a><a href="https://techcommunity.microsoft.com/blog/appsonazureblog/announcing-general-availability-of-app-service-inbound-ipv6-support/4423358">Microsoft Community Hub</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/app-service/overview">Azure App Service</a> now supports inbound IPv6 traffic across all public regions, government clouds, and China regions for multi-tenant apps on Basic, Standard, and Premium SKUs, plus Functions Consumption, Functions Elastic Premium, and Logic Apps Standard. </li>
<li style="font-weight:400;">This brings Azure closer to feature parity with AWS and GCP, both of which have offered IPv6 support for their compute services for several years.</li>
<li style="font-weight:400;">The implementation uses a new IPMode property that controls DNS responses – apps can return IPv4-only (default for backward compatibility), IPv6-only, or dual-stack IPv4/IPv6 addresses. </li>
<li style="font-weight:400;">All App Service sites can now receive traffic on both IPv4 and IPv6 endpoints regardless of IPMode setting, which only affects DNS resolution behavior.</li>
<li style="font-weight:400;">This addresses growing IPv6 adoption requirements, particularly for government contracts and international deployments where IPv6 is mandatory. The feature works with custom domains through standard AAAA DNS records, though IP-SSL IPv6 bindings remain unsupported.</li>
<li style="font-weight:400;">Microsoft is playing catch-up here – AWS has had dual-stack load balancers since 2016, and GCP has offered IPv6 on compute instances since 2017. The phased rollout continues with Linux outbound IPv6 in preview and VNet IPv6 support still on the backlog.</li>
<li style="font-weight:400;">No additional costs are mentioned for IPv6 support, making this a free upgrade for existing App Service customers. Testing requires IPv6-capable networks since many corporate and home networks still only support IPv4, which could complicate adoption.</li>
<li style="font-weight:400;">Welcome to 2025 Azure. </li>
</ul>
<h2>Oracle</h2>
<p>1:15:00 <a href="https://www.oracle.com/news/announcement/blog/oracle-announces-oracle-ai-world-2025-08-06/">Oracle Announces Oracle AI World 2025 08 06</a></p>
<ul>
<li style="font-weight:400;">Oracle is hosting <a href="https://www.oracle.com/ai-world/">Oracle AI World</a> 2025 on January 15 in Las Vegas, positioning it as their “premier” AI conference with keynotes from Larry Ellison and other executives focusing on enterprise AI applications.</li>
<li style="font-weight:400;">The event will showcase Oracle’s AI strategy across its cloud infrastructure, applications, and database services, with particular emphasis on its OCI Generative AI service and AI-powered features in Oracle Fusion Cloud Applications.</li>
<li style="font-weight:400;">Oracle is targeting enterprise customers who want pre-built AI capabilities integrated into their existing Oracle stack, competing with AWS re:Invent and Microsoft Ignite, but with a narrower focus on Oracle-specific implementations.</li>
<li style="font-weight:400;">The conference format includes hands-on labs and certification opportunities, suggesting Oracle is trying to build practitioner expertise around its AI tools rather than just executive buy-in.</li>
<li style="font-weight:400;">Registration is free, but the January timing puts it awkwardly between major cloud conferences, potentially limiting attendance from decision-makers who may have exhausted conference budgets after re:Invent and Ignite.</li>
<li style="font-weight:400;">We’re not super interested in this one. </li>
<li style="font-weight:400;">For this one, we’d love to invite listeners to make predictions on what is going to be announced! </li>
</ul>
<h2>Cloud Journey</h2>
<p>1:17:18 <a href="https://aws.amazon.com/blogs/security/beyond-iam-access-keys-modern-authentication-approaches-for-aws/">Beyond IAM access keys: Modern authentication approaches for AWS | </a><a href="https://aws.amazon.com/blogs/security/beyond-iam-access-keys-modern-authentication-approaches-for-aws/">AWS Security Blog</a></p>
<ul>
<li style="font-weight:400;">AWS is pushing developers away from long-term IAM access keys toward temporary credential solutions like CloudShell, IAM Identity Center, and IAM roles to reduce security risks from credential exposure and unauthorized sharing.</li>
<li style="font-weight:400;">CloudShell provides a browser-based CLI that eliminates local credential management, while IAM Identity Center integration with AWS CLI v2 adds centralized user management and seamless MFA support.</li>
<li style="font-weight:400;">For CI/CD pipelines and third-party services, AWS recommends using IAM Roles Anywhere for on-premises workloads and OIDC integration for services like GitHub Actions instead of static access keys.</li>
<li style="font-weight:400;">Modern IDEs like VS Code now support secure authentication through IAM Identity Center via AWS Toolkit, removing the need for developers to store access keys locally.</li>
<li style="font-weight:400;">AWS emphasizes implementing least privilege policies and offers automated policy generation based on CloudTrail logs to help create permission templates from actual usage patterns.</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2119081/c1e-9202fd6dk3i07mzz-z3k68d0rt5x3-njjmg7.mp3" length="127741951"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 317 of The Cloud Pod, where the forecast is always cloudy! Justin, Matt, and an out-of-breath (from outrunning bears) Ryan are back in the studio to bring you another episode of everyone’s favorite cloud and AI news wrap-up. This week we’ve got GTP-5, Oracle’s newly minted AI conference, hallucinations (not the good kind), and even a Cloud Journey follow-up. Let’s get into it! 
Titles we almost went with this week:


Oracle Intelligence: Mission Las Vegas
AI World: Oracle’s Excellent Adventure
AI Gets a Reality Check: Amazon’s New Math Teacher for Hallucinating Models
Jules Verne’s 20,000 Lines Under the C
GPT-5: The Empire Strikes Back at Computing Costs
5⃣Five Alive: OpenAI’s Latest Language Model Drops
GPT-5 is Alive! (And Ready for Your API Calls)
From Kanban to Kan’t-Ban: Alienate Your User Base in One Update
No More Console Hopping: ECS Logs Stay Put
Following the Paper Trail: ECS Logs Go Live
The Pull Request Whisperer
Five’s Company: DigitalOcean Joins the GPT Party
WireGuard Your Kubernetes: The Mesh-iah Has Arrived
EKS-tending Your Reach: When Your Nodes Need a VPN Alternative
Buttercup Blooms: DARPA’s Prize-Winning AI Security Tool Goes Public
From DARPA to Docker: How Buttercup Brings AI Bug-Hunting to Your Laptop
Agent 007: License to Query
Compliance Manager: Because Nobody Dreams of Filling Out Federal Paperwork
Do Compliance Managers dream of Public Sector sheep?
Blob’s Your Uncle: Finding Lost Data in the Cloud
Wassette: Teaching Your AI Assistant to Go Shopping for Tools
Monitor, Monitor on the Wall, Who’s the Most Secure of All?
Better Late Than IPv-Never
VPC Logs: Now with 100% Less Manual Labor
CloudWatch Catches All the Flows in Your Organization
The Organization-Wide Net: No VPC Left Behind
SQS Goes Super Size: Would You Like to Quadruple That?
One MiB to Rule Them All: SQS’s Payload Growth Spurt
Microsoft Finally Merges with Its $7.5 Billion Side Piece
From Hub to Spoke: GitHub Loses Its Independence
Cloud Run Forest Run: Google’s AI Workshop Marathon
From Zero to AI Hero: Google’s Production Pipeline Workshop
The Fast and the Serverless: Cloud Run Drift


A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our Slack channel for more info.
General News 
01:17 GitHub will be folded into Microsoft proper as CEO steps down – Ars Technica

GitHub will lose its operational independence and be integrated into Microsoft’s CoreAI organization in 2025, ending its separate CEO structure that has existed since Microsoft’s $7.5 billion acquisition in 2018.
The reorganization eliminates the CEO position, with GitHub’s leadership team reporting to multiple executives within CoreAI rather than a single leader, potentially impacting decision-making speed and product direction.
This structural change could affect GitHub’s developer-focused culture and remote-first operations that have distinguished it from Microsoft’s traditional corporate structure.
The integration into CoreAI suggests Micr...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2119081/c1a-k5d5-7z9wg21pbv40-a9yj78.jpg"></itunes:image>
                                                                            <itunes:duration>01:28:42</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                            </item>
                    <item>
                <title>
                    <![CDATA[316: Microsoft’s New AI Agent Has Trust Issues (With Software)]]>
                </title>
                <pubDate>Thu, 14 Aug 2025 05:23:44 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2111836</guid>
                                    <link>https://tcpfm.castos.com/episodes/316-microsofts-new-ai-agent-has-trust-issues-with-software</link>
                                <description>
                                            <![CDATA[<h3>Welcome to episode 316 of The Cloud Pod, where the forecast is always cloudy! This week we’ve got earnings (with sound effects, obviously) as well as news from DeepSeek, DocumentDB, DigitalOcean, and a bunch of GPU news. Justin and Matt are here to lead you through all of it, so let’s get started! </h3>
<h3>Titles we almost went with this week:
</h3>
<ul>
<li>Lake Sentinel: The Security Data Monster Nobody Asked For</li>
<li>Certificate Authority Issues: When Your Free Lunch Gets a Security Audit</li>
<li>Slash and Learn: Gemini Gets Command-ing</li>
<li>DigitalOcean Drops Anchor in AI Waters with Gradient Platform</li>
<li>The Three Stages of Azure Grief: Development, Preview, and Launch</li>
<li>E for Enormous: Azure’s New VM Sizes Are Anything But Virtual</li>
<li>SRE You Later: Azure’s AI Agent Takes Over Your On-Call Duties</li>
<li>Site Reliability Engineer? More Like AI Reliability Engineer</li>
<li>Azure Disks Get Elastic Waistbands</li>
<li>Agent Smith Would Be Proud: Google’s Multi-Agent Matrix Gets Real</li>
<li>C4 Yourself: Google Explodes Into GA with Intel’s Latest Silicon</li>
<li>The Cost is Right: GCP Edition</li>
<li>Penny for Your Cloud Thoughts: Google’s Budget-Friendly Update</li>
<li>DocumentDB Goes on a Diet: Now Available in Serverless Size</li>
<li>MongoDB Compatibility Gets the AWS Serverless Treatment</li>
<li>No Server? No Problem: DocumentDB Joins the Serverless Party</li>
<li>Stream Big or Go Home: Lambda’s 10x Payload Boost</li>
<li>Lambda Response Streaming: Because Size Matters</li>
<li>GPT Goes Open Source Shopping</li>
<li>GPT’s Open Source Awakening</li>
<li>When Your Antivirus Needs an Antivirus: Enter Project Ire</li>
<li>The Opus Among Us: Anthropic’s Coding Assistant Gets an Upgrade</li>
<li>Serverless is becoming serverful in streaming responses</li>
</ul>
<h2>General News </h2>
<p>02:08 It’s Earnings Time! (INSERT AWESOME SOUND EFFECTS HERE) </p>
<p>02:16 <a href="https://www.cnbc.com/2025/07/23/alphabet-google-q2-earnings.html">Alphabet beats earnings expectations, raises spending forecast</a></p>
<ul>
<li style="font-weight:400;">Google Cloud revenue hit $13.62 billion, up 32% year-over-year, with OpenAI now using Google’s infrastructure for ChatGPT, signaling growing enterprise confidence in Google’s AI infrastructure capabilities.</li>
<li style="font-weight:400;"><a href="https://www.cnbc.com/quotes/GOOGL/">Alphabet</a> is <a href="https://www.cnbc.com/2025/07/23/googles-85-billion-capital-spend-spurred-by-cloud-ai-demand.html">raising its 2025 capital expenditure forecast from $75 billion to $85 billion</a>, driven by cloud and AI demand, with plans to increase spending further in 2026 as it competes for AI workloads.</li>
<li style="font-weight:400;">AI Overviews now serves 2 billion monthly users across 200+ countries, while the Gemini app reached 450 million monthly active users, demonstrating Google’s scale in deploying AI services globally.</li>
<li style="font-weight:400;">The $10 billion increase in planned capital spending reflects the infrastructure arms race among cloud providers to capture AI workloads, which require significant compute and specialized hardware investments.</li>
<li style="font-weight:400;">Google’s cloud growth rate of 32% outpaces its overall revenue growth of 14%, indicating the strategic importance of cloud services as traditional search and advertising face increased AI competition.</li>
</ul>
<p>03:55  Justin – “I don’t know what it takes to actually run one of these large models at like ultimate scale that like a ChatGPT needs or Anthropic, but I have to imagine it’s just thousands and thousands of GPUs just working nonstop.”</p>
<p>04:31 <a href="https://www.cnbc.com/2025/07/30/microsoft-msft-q4-earnings-report-2025.html">Microsoft (MSFT) Q4 earnings report 2025</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.cnbc.com/quotes/MSFT/">Microsoft</a> reported Q4 fiscal 2025 earnings with revenue of $76.44 billion, up 18% year-ove...</li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:00) - Azure: Why Microsoft's New AI Agent Won't Work</li><li>(00:01:17) - Earnings season</li><li>(00:01:43) - Google Cloud Revenue Up 32%, Capital Spending Forecast Up</li><li>(00:03:51) - Microsoft Reports Strong Cloud Growth, AI Investment</li><li>(00:05:51) - Amazon's AI, Cloud Growth</li><li>(00:10:24) - Google's DeepThink AI for Complex Reasoning</li><li>(00:13:13) - OpenAI releases new GPT OSS120B and OSS</li><li>(00:15:32) - Microsoft's AI-enabled Binary Analyzer</li><li>(00:24:27) - Good Testing Practices in Cloud</li><li>(00:25:59) - Claude Opus 4.1 Upgrade to Sonnet 4</li><li>(00:27:46) - AWS G6F: Fractional GPU Instances</li><li>(00:29:40) - Amazon DocumentDB DCU Scale</li><li>(00:34:13) - Amazon's Region Switch</li><li>(00:37:28) - AWS Lambda: 200 Megabyte Response Streaming Capacity</li><li>(00:38:55) - Gemini CLI: Adding slash commands to Google Cloud Code</li><li>(00:41:06) - Agent to Agent Protocol Upgraded to Version 3</li><li>(00:42:57) - GK Cloud: C4 Bare Metal VM on the Intel Xeon</li><li>(00:44:35) - Google Cloud Hub Optimization and Cost Explorer Expands to Public Preview</li><li>(00:47:04) - Microsoft's Sentinel Data Lake Announcement</li><li>(00:50:42) - Microsoft's New E128 & E1092 VM Sizes</li><li>(00:54:17) - Azure SRE Agent Billing Model</li><li>(00:57:02) - Azure 2.8 Live Resizing for Ultra NVMe disks</li><li>(00:59:13) - Azure Backup now supports agentless multi-disk backups</li><li>(01:02:05) - Digital Ocean Brings AI to a Unified Platform</li><li>(01:03:50) - This Week in the Cloud: Ending</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 316 of The Cloud Pod, where the forecast is always cloudy! This week we’ve got earnings (with sound effects, obviously) as well as news from DeepSeek, DocumentDB, DigitalOcean, and a bunch of GPU news. Justin and Matt are here to lead you through all of it, so let’s get started! 
Titles we almost went with this week:


Lake Sentinel: The Security Data Monster Nobody Asked For
Certificate Authority Issues: When Your Free Lunch Gets a Security Audit
Slash and Learn: Gemini Gets Command-ing
DigitalOcean Drops Anchor in AI Waters with Gradient Platform
The Three Stages of Azure Grief: Development, Preview, and Launch
E for Enormous: Azure’s New VM Sizes Are Anything But Virtual
SRE You Later: Azure’s AI Agent Takes Over Your On-Call Duties
Site Reliability Engineer? More Like AI Reliability Engineer
Azure Disks Get Elastic Waistbands
Agent Smith Would Be Proud: Google’s Multi-Agent Matrix Gets Real
C4 Yourself: Google Explodes Into GA with Intel’s Latest Silicon
The Cost is Right: GCP Edition
Penny for Your Cloud Thoughts: Google’s Budget-Friendly Update
DocumentDB Goes on a Diet: Now Available in Serverless Size
MongoDB Compatibility Gets the AWS Serverless Treatment
No Server? No Problem: DocumentDB Joins the Serverless Party
Stream Big or Go Home: Lambda’s 10x Payload Boost
Lambda Response Streaming: Because Size Matters
GPT Goes Open Source Shopping
GPT’s Open Source Awakening
When Your Antivirus Needs an Antivirus: Enter Project Ire
The Opus Among Us: Anthropic’s Coding Assistant Gets an Upgrade
Serverless is becoming serverful in streaming responses

General News 
02:08 It’s Earnings Time! (INSERT AWESOME SOUND EFFECTS HERE) 
02:16 Alphabet beats earnings expectations, raises spending forecast

Google Cloud revenue hit $13.62 billion, up 32% year-over-year, with OpenAI now using Google’s infrastructure for ChatGPT, signaling growing enterprise confidence in Google’s AI infrastructure capabilities.
Alphabet is raising its 2025 capital expenditure forecast from $75 billion to $85 billion, driven by cloud and AI demand, with plans to increase spending further in 2026 as it competes for AI workloads.
AI Overviews now serves 2 billion monthly users across 200+ countries, while the Gemini app reached 450 million monthly active users, demonstrating Google’s scale in deploying AI services globally.
The $10 billion increase in planned capital spending reflects the infrastructure arms race among cloud providers to capture AI workloads, which require significant compute and specialized hardware investments.
Google’s cloud growth rate of 32% outpaces its overall revenue growth of 14%, indicating the strategic importance of cloud services as traditional search and advertising face increased AI competition.

03:55  Justin – “I don’t know what it takes to actually run one of these large models at like ultimate scale that like a ChatGPT needs or Anthropic, but I have to imagine it’s just thousands and thousands of GPUs just working nonstop.”
04:31 Microsoft (MSFT) Q4 earnings report 2025

Microsoft reported Q4 fiscal 2025 earnings with revenue of $76.44 billion, up 18% year-ove...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[316: Microsoft’s New AI Agent Has Trust Issues (With Software)]]>
                </itunes:title>
                                    <itunes:episode>316</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<h3>Welcome to episode 316 of The Cloud Pod, where the forecast is always cloudy! This week we’ve got earnings (with sound effects, obviously) as well as news from DeepSeek, DocumentDB, DigitalOcean, and a bunch of GPU news. Justin and Matt are here to lead you through all of it, so let’s get started! </h3>
<h3>Titles we almost went with this week:
</h3>
<ul>
<li>Lake Sentinel: The Security Data Monster Nobody Asked For</li>
<li>Certificate Authority Issues: When Your Free Lunch Gets a Security Audit</li>
<li>Slash and Learn: Gemini Gets Command-ing</li>
<li>DigitalOcean Drops Anchor in AI Waters with Gradient Platform</li>
<li>The Three Stages of Azure Grief: Development, Preview, and Launch</li>
<li>E for Enormous: Azure’s New VM Sizes Are Anything But Virtual</li>
<li>SRE You Later: Azure’s AI Agent Takes Over Your On-Call Duties</li>
<li>Site Reliability Engineer? More Like AI Reliability Engineer</li>
<li>Azure Disks Get Elastic Waistbands</li>
<li>Agent Smith Would Be Proud: Google’s Multi-Agent Matrix Gets Real</li>
<li>C4 Yourself: Google Explodes Into GA with Intel’s Latest Silicon</li>
<li>The Cost is Right: GCP Edition</li>
<li>Penny for Your Cloud Thoughts: Google’s Budget-Friendly Update</li>
<li>DocumentDB Goes on a Diet: Now Available in Serverless Size</li>
<li>MongoDB Compatibility Gets the AWS Serverless Treatment</li>
<li>No Server? No Problem: DocumentDB Joins the Serverless Party</li>
<li>Stream Big or Go Home: Lambda’s 10x Payload Boost</li>
<li>Lambda Response Streaming: Because Size Matters</li>
<li>GPT Goes Open Source Shopping</li>
<li>GPT’s Open Source Awakening</li>
<li>When Your Antivirus Needs an Antivirus: Enter Project Ire</li>
<li>The Opus Among Us: Anthropic’s Coding Assistant Gets an Upgrade</li>
<li>Serverless is becoming serverful in streaming responses</li>
</ul>
<h2>General News </h2>
<p>02:08 It’s Earnings Time! (INSERT AWESOME SOUND EFFECTS HERE) </p>
<p>02:16 <a href="https://www.cnbc.com/2025/07/23/alphabet-google-q2-earnings.html">Alphabet beats earnings expectations, raises spending forecast</a></p>
<ul>
<li style="font-weight:400;">Google Cloud revenue hit $13.62 billion, up 32% year-over-year, with OpenAI now using Google’s infrastructure for ChatGPT, signaling growing enterprise confidence in Google’s AI infrastructure capabilities.</li>
<li style="font-weight:400;"><a href="https://www.cnbc.com/quotes/GOOGL/">Alphabet</a> is <a href="https://www.cnbc.com/2025/07/23/googles-85-billion-capital-spend-spurred-by-cloud-ai-demand.html">raising its 2025 capital expenditure forecast from $75 billion to $85 billion</a>, driven by cloud and AI demand, with plans to increase spending further in 2026 as it competes for AI workloads.</li>
<li style="font-weight:400;">AI Overviews now serves 2 billion monthly users across 200+ countries, while the Gemini app reached 450 million monthly active users, demonstrating Google’s scale in deploying AI services globally.</li>
<li style="font-weight:400;">The $10 billion increase in planned capital spending reflects the infrastructure arms race among cloud providers to capture AI workloads, which require significant compute and specialized hardware investments.</li>
<li style="font-weight:400;">Google’s cloud growth rate of 32% outpaces its overall revenue growth of 14%, indicating the strategic importance of cloud services as traditional search and advertising face increased AI competition.</li>
</ul>
<p>03:55  Justin – “I don’t know what it takes to actually run one of these large models at like ultimate scale that like a ChatGPT needs or Anthropic, but I have to imagine it’s just thousands and thousands of GPUs just working nonstop.”</p>
<p>04:31 <a href="https://www.cnbc.com/2025/07/30/microsoft-msft-q4-earnings-report-2025.html">Microsoft (MSFT) Q4 earnings report 2025</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.cnbc.com/quotes/MSFT/">Microsoft</a> reported Q4 fiscal 2025 earnings with revenue of $76.44 billion, up 18% year-over-year and beating expectations, marking the fastest growth in over three years.</li>
<li style="font-weight:400;"><a href="https://portal.azure.com/">Azure</a> revenue grew 39% in Q4, significantly exceeding analyst expectations of 34-35%, with Microsoft disclosing for the first time that Azure and cloud services exceeded $75 billion in annual revenue for fiscal 2025.</li>
<li style="font-weight:400;">Microsoft’s AI investments are showing returns with 100 million monthly active users across Copilot products, driving higher revenue per user for Microsoft 365 commercial cloud products.</li>
<li style="font-weight:400;">Capital expenditures reached $24.2 billion for the quarter, up 27% year-over-year, as Microsoft continues aggressive data center buildout for AI workloads alongside peers like Alphabet ($85B annual) and Meta ($66-72B annual).</li>
<li style="font-weight:400;">Microsoft’s market cap crossed $4 trillion in after-hours trading, becoming only the second company, after Nvidi,a to reach this milestone, driven by strong cloud and AI momentum.</li>
</ul>
<p>06:33 <a href="https://www.cnbc.com/2025/08/01/amazon-earnings-ai-aws-tariffs.html?&amp;qsearchterm=amazon%20earnings">Amazon earnings key takeaways: AI, cloud growth, tariffs</a></p>
<ul>
<li style="font-weight:400;">Things weren’t quite as great for Amazon…</li>
<li style="font-weight:400;"><a href="https://www.cnbc.com/quotes/AMZN/">Amazon’s</a> capital expenditure could reach $118 billion in 2025, up from the previous $100 billion forecast, with spending primarily focused on AI infrastructure alongside competitors <a href="https://business.facebook.com/">Meta</a> ($66-72B) and Alphabet ($85B).</li>
<li style="font-weight:400;">AWS revenue grew 18% year-over-year, trailing Microsoft Azure’s 39% and Google Cloud’s 32% growth rates, though AWS maintains a significantly larger market share with the second player at approximately 65% of AWS’s size.</li>
<li style="font-weight:400;">Amazon’s generative AI initiatives are generating multiple billions in annualized revenue for AWS, with potential monetization through services like Alexa+ at $19.99/month or free for Prime members.</li>
<li style="font-weight:400;">Despite initial concerns about tariffs impacting costs, Amazon reported 11% growth in online store sales and 12% increase in items sold, with no significant price increases or demand reduction observed.</li>
<li style="font-weight:400;">The company expects Q3 revenue growth of up to 13%, suggesting tariffs have been absorbed by suppliers and customers, though uncertainty remains with the U.S.-China trade agreement deadline on August 12.</li>
</ul>
<p>08:08  Justin – “They’re not there yet. And they, they haven’t been there for a while, which is the concerning part. And I don’t know, you know – I haven’t really heard much about Nova since they launched.  They talk a lot about their Anthropic partnership, which makes sense. But I don’t feel like they have the swagger in AI that the others do.”</p>
<h2>AI Is Going Great – or How ML Makes Its Money </h2>
<p>11:23 <a href="https://blog.google/products/gemini/gemini-2-5-deep-think/">Gemini 2.5: Deep Think is now rolling out</a></p>
<ul>
<li style="font-weight:400;">Google’s Gemini 2.5 Deep Think uses parallel thinking techniques and extended inference time to solve complex problems, now available to <a href="https://one.google.com/about/google-ai-plans/">Google AI Ultra subscribers</a> in the <a href="https://gemini.google/">Gemini app</a> with a fixed daily prompt limit.</li>
<li style="font-weight:400;">The model achieves state-of-the-art performance on <a href="https://artificialanalysis.ai/evaluations/livecodebench">LiveCodeBench V6</a> and <a href="https://lastexam.ai/">Humanity’s Last Exam</a> benchmarks, with a variation reaching gold-medal standard at the International Mathematical Olympiad, though the consumer version trades some capability for faster response times.</li>
<li style="font-weight:400;">Deep Think excels at iterative development tasks like web development, scientific research, and algorithmic coding problems that require careful consideration of tradeoffs and time complexity.</li>
<li style="font-weight:400;">The technology uses novel reinforcement learning techniques to improve problem-solving over time and automatically integrates with tools like code execution and Google Search for enhanced functionality.</li>
<li style="font-weight:400;">Google plans to release Deep Think via the <a href="https://ai.google.dev/gemini-api/docs">Gemini API</a> to trusted testers in the coming weeks, signaling potential enterprise and developer applications for complex reasoning tasks in cloud environments.</li>
</ul>
<p>13:02  Justin  – “…these deep thinking models are the most fun to play with, because you  know, you don’t need it right away, but you want to go plan out a weekend in Paris, or I want you to, uh, go compare these three companies products based on public data and Reddit posts and things like that. And it goes, it does all this research, then it comes back with suggestions. That’s kind of fun. The more in depth it is, the better it is in my opinion.So the deep thinking stuff is kind of the coolest, like heavy duty research stuff.”</p>
<p>14:17 <a href="https://openai.com/index/introducing-gpt-oss/">Introducing Gpt OSS</a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/">OpenAI</a> is releasing the new <a href="https://openai.com/index/introducing-gpt-oss/">GPT-OSS-120b and GPT-oss-20b</a> open weight language models that deliver strong real-world performance at low costs. </li>
<li style="font-weight:400;">They’re both available under the flexible<a href="https://www.apache.org/licenses/LICENSE-2.0.html"> Apache 2.0 license</a>; these models on reasoning tasks demonstrate strong tool use capabilities and are optimized for efficient deployment on consumer hardware. </li>
<li style="font-weight:400;">Gpt-oss-120b model achieves near-parity with <a href="https://openai.com/index/introducing-o3-and-o4-mini/">OpenAI o4-mini</a> on core reasoning benchmarks while running efficiently on a single 80 GB GPU. </li>
<li style="font-weight:400;">The gpt-oss-20b model delivers similar results to <a href="https://openai.com/index/openai-o3-mini/">OpenAI o3-mini</a> on common benchmarks and can run on edge devices with just 16 GB of memory, making it ideal for on-device use cases, local inferenc,e or rapid iteration without costly infrastructure. </li>
<li style="font-weight:400;">They’re also both compatible with the responses API and are designed to be used within agentic workflows with exceptional instruction following, tool use like web search or Python code execution, and reasoning capabilities. </li>
</ul>
<p>15:30  Matt – “I’m still stuck on the 16 gigabytes of memory on your video card. I still remember, I bought my video first video card, it had 256 megabytes. It was a high end video card. And now I’m like, God, these things got so much bigger and faster. Okay, I’m officially old.”</p>
<p>16:43 <a href="https://www.microsoft.com/en-us/research/blog/project-ire-autonomously-identifies-malware-at-scale/">Project Ire autonomously identifies malware at scale – Microsoft Research</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.microsoft.com/en-us/research/?msockid=218ef287528c64c1155be48a53626586">Microsoft Research</a> developed <a href="https://www.microsoft.com/en-us/research/project/project-ire/">Project Ire</a>, an autonomous AI agent that reverse engineers software files to determine if they’re malicious, achieving 0.98 precision and 0.83 recall on Windows driver datasets. 
The system uses LLMs combined with decompilers, binary analysis tools, and memory sandboxes to analyze code without human assistance.</li>
<li style="font-weight:400;">The technology addresses a significant cloud security challenge where Microsoft Defender <a href="https://news.microsoft.com/apac/2020/03/17/windows-10-powering-the-world-with-one-billion-monthly-active-devices/">scans over 1 billion devices monthly</a>, requiring manual review of suspicious files by experts who face burnout and alert fatigue. </li>
<li style="font-weight:400;">Project Ire automates this gold-standard malware classification process at scale.</li>
<li style="font-weight:400;">The system creates an auditable “chain of evidence” for each analysis, using tools like angr and Ghidra to reconstruct control flow graphs and identify malicious behaviors like process termination, code injection, and command-and-control communication. It was the first reverse engineer at Microsoft (human or machine) to author a conviction case for blocking an APT malware sample.</li>
<li style="font-weight:400;">In real-world testing on 4,000 hard-target files that couldn’t be classified by other automated systems, Project Ire achieved 0.89 precision with only 4% false positives, demonstrating potential for deployment alongside human analysts. </li>
<li style="font-weight:400;">The prototype will be integrated into Microsoft Defender as Binary Analyzer for threat detection.</li>
<li style="font-weight:400;">This development represents a practical application of agentic AI in cybersecurity, building on the same foundation as <a href="https://www.microsoft.com/en-us/research/project/graphrag/">GraphRAG</a> and <a href="https://azure.microsoft.com/en-us/blog/transforming-rd-with-agentic-ai-introducing-microsoft-discovery/#:~:text=Get%20started%20today%20by%20using%20Azure%20HPC%20and%20Azure%20AI%20Foundry%20infrastructure.&amp;text=Microsoft%20Discovery%20is%20built%20on,make%20any%20adjustments%20as%20needed.">Microsoft Discovery</a>, with future goals to detect novel malware directly in memory at cloud scale.</li>
</ul>
<p>19:15  Justin – “I can think of all the things that can make us more efficient at and more productive with, and it’s like wow, that’s a great use case… it just takes away all of the noise.” </p>
<p>27:22 <a href="https://www.anthropic.com/news/claude-opus-4-1">Claude Opus 4.1 \ Anthropic</a></p>
<ul>
<li style="font-weight:400;">Claude Opus 4.1 achieves 74.5% on <a href="https://www.swebench.com/">SWE-bench Verified</a> coding benchmark, with GitHub reporting notable improvements in multi-file code refactoring and Rakuten praising its precision in debugging large codebases without introducing bugs</li>
<li style="font-weight:400;">The model is available across major cloud platforms, including Amazon Bedrock and <a href="https://console.cloud.google.com/">Google Cloud’s</a> <a href="https://cloud.google.com/vertex-ai">Vertex AI</a>, at the same pricing as <a href="https://ai-claude.net/opus/">Opus 4</a>, making it accessible for enterprise cloud deployments</li>
<li style="font-weight:400;"><a href="https://www.anthropic.com/news/claude-opus-4-1">Opus 4.1</a> uses a hybrid reasoning approach with extended thinking capabilities up to 64K tokens for complex benchmarks, while maintaining simpler scaffolding for coding tasks using just bash and file editing tools</li>
<li style="font-weight:400;">Windsurf reports the upgrade delivers a one standard deviation improvement over Opus 4 on their junior developer benchmark, comparable to the performance leap between Sonnet 3.7 and Sonnet 4</li>
<li style="font-weight:400;">For cloud developers, the immediate upgrade path is straightforward – simply switch to claude-opus-4-1-20250805 via the API with no pricing changes or major integration modifications required</li>
</ul>
<h2>AWS</h2>
<p>29:09 <a href="https://aws.amazon.com/about-aws/whats-new/2025/07/amazon-ec2-g6f-instances-fractional-gpus/">Announcing general availability of Amazon EC2 G6f instances with </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/07/amazon-ec2-g6f-instances-fractional-gpus/">fractional GPUs – AWS</a></p>
<ul>
<li style="font-weight:400;">AWS launches G6f instances with fractional GPU capabilities, offering 1/8, 1/4, and 1/2 GPU partitions powered by <a href="https://www.nvidia.com/en-us/data-center/l4/">NVIDIA L4 Tensor Core GPUs</a>, enabling customers to right-size workloads and reduce costs compared to full GPU instances.</li>
<li style="font-weight:400;">The instances target graphics workloads, including remote workstations for media production, CAD engineering, ML research, and game streaming, with configurations ranging from 3-12 GB GPU memory paired with AMD EPYC processors.</li>
<li style="font-weight:400;">This represents AWS’s first GPU partitioning offering, addressing the common challenge of GPU underutilization where workloads don’t require full GPU resources but previously had no smaller options.</li>
<li style="font-weight:400;">Available across 11 regions with On-Demand, Spot, and Savings Plan pricing options, requiring NVIDIA GRID driver 18.4+ and supporting Amazon DCV for remote desktop access.</li>
<li style="font-weight:400;">The fractional approach could significantly reduce costs for organizations running multiple smaller GPU workloads that previously required dedicated full GPU instances, particularly beneficial for development, testing, and lighter production workloads.</li>
</ul>
<p>30:15  Matt – “The fractional GPUs is an interesting concept; most people probably don’t need a massive GPU… so of you’re just doing one off things or you need it for a specific project, then you can get that small usage. “</p>
<p>31:07 <a href="https://aws.amazon.com/blogs/aws/amazon-documentdb-serverless-is-now-available/">Amazon DocumentDB Serverless is now available | AWS News Blog</a> </p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/documentdb/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">Amazon DocumentDB</a> <a href="https://aws.amazon.com/blogs/aws/category/serverless/">Serverless</a> automatically scales compute and memory using DocumentDB Capacity Units (DCUs), where each DCU provides approximately 2 GiB of memory plus corresponding CPU and networking resources, with a capacity range of 0.5-256 DCUs.</li>
<li style="font-weight:400;">The service offers up to 90% cost savings compared to provisioning for peak capacity and charges a flat rate per second of DCU usage, making it cost-effective for variable workloads, multi-tenant environments, and mixed read/write scenarios.</li>
<li style="font-weight:400;">Existing DocumentDB clusters can add serverless instances without data migration by simply changing the instance type, requiring DocumentDB version 5.0 or higher, with the ability to mix provisioned and serverless instances in the same cluster.</li>
<li style="font-weight:400;">Key use cases include handling traffic spikes for promotional events, managing individual database capacity across multi-tenant SaaS applications, and building agentic AI applications that leverage DocumentDB’s built-in vector search capabilities.</li>
<li style="font-weight:400;">The service maintains all standard DocumentDB features, including MongoDB-compatible APIs, read replicas, <a href="https://docs.aws.amazon.com/documentdb/latest/developerguide/performance-insights.html?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">Performance Insights</a>, and AWS service integrations, while automatically tracking CPU, memory, and network utilization to scale without disrupting availability.</li>
</ul>
<p>33:04  Justin – “I mean, the one thing about the DCU model – and I see it a bunch of places, because I’ve been doing a lot more serverless with Valkey, and this DCU model comes up a lot. I actually just moved the CloudPod database to serverless Aurora for MySQL. And so I’ve been getting a little more exposed to the whole, whatever that one’s called; something like DCU as well. And it’s a little bit opaque. I definitely don’t love it as a model, but it is so much cheaper.” </p>
<p>35:18 <a href="https://aws.amazon.com/blogs/aws/introducing-amazon-application-recovery-controller-region-switch-a-multi-region-application-recovery-service/">Introducing Amazon Application Recovery Controller Region switch: A </a><a href="https://aws.amazon.com/blogs/aws/introducing-amazon-application-recovery-controller-region-switch-a-multi-region-application-recovery-service/">multi-Region application recovery service | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/application-recovery-controller/">Amazon Application Recovery Controller (ARC)</a> Region switch provides automated orchestration for <a href="https://docs.aws.amazon.com/glossary/latest/reference/glos-chap.html#region">multi-Region</a> application failover, addressing enterprise concerns about untested recovery procedures and unknown dependencies during Regional outages.</li>
<li style="font-weight:400;">The service supports nine execution block types, including <a href="https://aws.amazon.com/ec2/autoscaling/">EC2 Auto Scaling</a>, <a href="https://aws.amazon.com/rds/aurora/global-database/">Aurora Global Database</a> failover, <a href="https://aws.amazon.com/route53/">Route 53</a> health checks, and <a href="https://aws.amazon.com/eks/">EKS</a>/<a href="https://aws.amazon.com/ecs/">ECS</a> resource scaling, enabling coordinated recovery across compute, database, and DNS services.</li>
<li style="font-weight:400;">Region switch uses a Regional data plane architecture where recovery plans execute from the target Region, eliminating dependencies on the impacted Region and providing more resilient recovery operations.</li>
<li style="font-weight:400;">Continuous validation runs every 30 minutes to check resource configurations and <a href="https://aws.amazon.com/iam/">IAM permissions</a>. </li>
<li style="font-weight:400;">The service costs $70 per month per plan supporting up to 100 execution blocks or 25 child plans.</li>
<li style="font-weight:400;">Organizations can balance cost and reliability by configuring standby resource percentages, though actual capacity depends on Regional availability at recovery time, making regular testing essential for confidence in disaster recovery strategies.</li>
</ul>
<p>36:23  Matt – “I like the note here: ‘to facilitate the best possible outcomes, we recommend you regularly test your recovery plans and maintain appropriate service quotas in your standby region’ because the amount of times I’ve seen people try to do DR testing and then they his a service quota limit is comical at this point.” </p>
<p>38:42 <a href="https://aws.amazon.com/about-aws/whats-new/2025/07/aws-lambda-response-streaming-200-mb-payloads/?trk=e61dee65-4ce8-4738-84db-75305c9cd4fe&amp;sc_channel=el">AWS Lambda response streaming now supports 200 MB response </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/07/aws-lambda-response-streaming-200-mb-payloads/?trk=e61dee65-4ce8-4738-84db-75305c9cd4fe&amp;sc_channel=el">payloads – AWS</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/lambda/latest/dg/configuration-response-streaming.html?refid=e61dee65-4ce8-4738-84db-75305c9cd4fe">AWS Lambda response streaming</a> now supports 200 MB response payloads, a 10x increase from the previous 20 MB limit, enabling direct processing of larger datasets without compression or <a href="https://aws.amazon.com/s3/?refid=e61dee65-4ce8-4738-84db-75305c9cd4fe">S3</a> intermediary steps.</li>
<li style="font-weight:400;">This enhancement targets latency-sensitive applications like real-time AI chat interfaces and mobile apps where time to first byte directly impacts user experience and engagement metrics.</li>
<li style="font-weight:400;">The expanded payload capacity opens new use cases, including streaming image-heavy PDFs, music files, and real-time processing of larger datasets directly through Lambda functions.</li>
<li style="font-weight:400;">Response streaming is available on <a href="http://node.js">Node.js</a> managed runtimes and custom runtimes across all AWS regions where the feature is supported, with the 200 MB limit now set as default.</li>
<li style="font-weight:400;">This update reduces architectural complexity by eliminating workarounds previously required for payloads exceeding 20 MB, potentially lowering costs associated with S3 storage and data transfer fees.</li>
</ul>
<h2>GCP</h2>
<p>40:26  <a href="https://cloud.google.com/blog/topics/developers-practitioners/gemini-cli-custom-slash-commands/">Gemini CLI: Custom slash commands | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://github.com/google-gemini/gemini-cli">Gemini CLI</a> now supports <a href="https://github.com/google-gemini/gemini-cli/blob/main/docs/cli/commands.md#custom-commands">custom slash commands</a> through .toml files and Model Context Protocol (MCP) prompts, allowing developers to create reusable prompts for common workflows like code reviews or planning tasks. </li>
<li style="font-weight:400;">This brings <a href="https://github.com/features/copilot">GitHub Copilot</a>-style command functionality to Google’s AI assistant in the terminal.</li>
<li style="font-weight:400;">Commands can be scoped at the user level (available across all projects) or the project level (checked into Git repos), with namespacing support through directory structures. </li>
<li style="font-weight:400;">The implementation uses minimal configuration requirements – just a prompt field – making it accessible for quick adoption.</li>
<li style="font-weight:400;">The MCP integration enables Gemini CLI to automatically expose prompts from configured MCP servers as slash commands, supporting both named and positional arguments. This positions Google to leverage the growing ecosystem of MCP-compatible tools and services.</li>
<li style="font-weight:400;">Key use cases include automating code reviews, generating implementation plans, and standardizing team workflows through shared command libraries. The shell command execution feature (!{…}) allows integration with existing CLI tools and scripts.</li>
<li style="font-weight:400;">While this is a developer productivity tool rather than a cloud service, it strengthens Google’s developer ecosystem play against GitHub Copilot and Amazon Q Developer. </li>
<li style="font-weight:400;">The feature is available now with a simple npm update, requiring only a <a href="https://ai.google.dev/gemini-api/docs">Gemini API</a> key to get started.</li>
</ul>
<p>37:18  Matt – “I still like the VS Code plugin, and making it interact more that way. I find that a little bit better from the little bit I’ve played with Claude Code, but recently I’ve been talking to people who say Claude Code has gotten better since the initial release so I have to go back and play with it and see.” </p>
<p>42:40 <a href="https://cloud.google.com/blog/products/ai-machine-learning/agent2agent-protocol-is-getting-an-upgrade/">Agent2Agent protocol (A2A) is getting an upgrade | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google releases <a href="https://a2a-protocol.org/latest/">A2A</a> protocol version 0.3 with gRPC support, security card signing, and Python SDK improvements, positioning it as an open standard for multi-agent AI systems that can communicate across different platforms and vendors.</li>
<li style="font-weight:400;">The protocol now has native support in Google’s<a href="http://google.github.io/adk-docs/a2a/"> Agent Development Kit (ADK)</a> and offers three deployment paths: managed Agent Engine, serverless Cloud Run, or full control with GKE, giving developers flexibility in how they scale their agent systems.</li>
<li style="font-weight:400;">Over 150 organizations, including Adobe, ServiceNow, and Twili,o are adopting A2A, with real implementations like Tyson Foods and Gordon Food Service using collaborative agents to share supply chain data and reduce friction in their operations.</li>
<li style="font-weight:400;">Google is launching an <a href="https://console.cloud.google.com/marketplace/browse?filter=category:ai-agent">AI Agent Marketplace</a> where partners can sell A2A-enabled agents directly to customers, while Agentspace provides a governed environment for users to access these agents with enterprise security controls.</li>
<li style="font-weight:400;">The protocol was <a href="https://developers.googleblog.com/en/google-cloud-donates-a2a-to-linux-foundation/">contributed</a> to the Linux Foundation in June 2024, making it a vendor-neutral standard that could become the HTTP of agent-to-agent communication, though adoption will depend on whether competitors embrace an open approach.</li>
</ul>
<p>44:18  Justin – “Agent to Agent is basically how you make MCP to MCP work in the cloud.” </p>
<p>44:38 <a href="https://cloud.google.com/blog/products/compute/c4-vms-based-on-intel-6th-gen-xeon-granite-rapids-now-ga/">C4 VMs based on Intel 6th Gen Xeon Granite Rapids now GA | Google </a><a href="https://cloud.google.com/blog/products/compute/c4-vms-based-on-intel-6th-gen-xeon-granite-rapids-now-ga/">Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google launches<a href="https://cloud.google.com/blog/products/compute/c4-machine-series-is-now-ga"> C4 VMs</a> on <a href="https://www.intel.com/content/www/us/en/support/products/240357/processors/intel-xeon-processors/intel-xeon-6-processors.html">Intel Xeon 6 processors (Granite Rapids)</a> with up to 30% better general compute performance and 60% improvement for ML recommendation workloads compared to the previous generation, making them the first major cloud provider to offer Xeon 6.</li>
<li style="font-weight:400;">New C4 shapes include Titanium Local SSD variants delivering 7.2M max read IOPS (3x higher than comparable offerings from other hyperscalers) and 35% lower access latency, targeting high-performance databases, big data processing, and media rendering workloads.</li>
<li style="font-weight:400;">C4 bare metal instances provide direct CPU/memory access for commercial hypervisors and SAP workloads, achieving <a href="https://www.sap.com/dmc/exp/2018-benchmark-directory/#/q2c?sort=Certification%20Date&amp;sortDesc=true">132,600 aSAPs</a> – the highest of any comparable machine – with 35% performance improvement over C3 bare metal.</li>
<li style="font-weight:400;">The expanded C4 series maintains existing CUD discounts and integrations with managed instance groups and GKE custom compute classes, available in 19 zones with shapes ranging from 4 to 288 vCPUs.</li>
<li style="font-weight:400;">Key use cases include AI inference with FP16-trained models using Intel AMX-FP16, financial services requiring microsecond-level latency improvements, and visual effects rendering with reported 50% speedups over n2d instances..</li>
</ul>
<p>46:24 <a href="https://cloud.google.com/blog/products/management-tools/announcing-cloud-hub-optimization-and-cost-explorer-for-developers/">Announcing Cloud Hub Optimization and Cost Explorer for developers | </a><a href="https://cloud.google.com/blog/products/management-tools/announcing-cloud-hub-optimization-and-cost-explorer-for-developers/">Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google launches <a href="https://cloud.google.com/blog/products/management-tools/announcing-cloud-hub-optimization-and-cost-explorer-for-developers">Cloud Hub Optimization and Cost Explorer</a> in public preview, providing application-centric cost visibility across multiple projects without additional charges, addressing the challenge of tracking expenses for applications that span dozens of GCP projects.</li>
<li style="font-weight:400;">The tools integrate Cloud Billing cost data with <a href="https://cloud.google.com/monitoring?hl=en">Cloud Monitoring utilization</a> metrics to surface underutilized resources like GKE clusters with idle GPUs, showing average vCPU utilization at the project level to identify optimization candidates.</li>
<li style="font-weight:400;">Unlike traditional cost dashboards that show aggregate Compute Engine costs, Cost Explorer breaks down spending by specific products, including GKE clusters, Persistent Disks, and Cloud Load Balancing for more granular cost attribution.</li>
<li style="font-weight:400;">Built on <a href="https://cloud.google.com/products/app-hub">AppHub Applications</a> framework, the solution reorganizes cloud resources around applications rather than projects, competing with AWS Cost Explorer and Azure Cost Management by focusing on application-level cost optimization.</li>
<li style="font-weight:400;">MLB’s Principal Cloud Architect reports that the tools help monitor costs across tens of business units and hundreds of developers, with particular value for organizations shifting left on cloud cost management.</li>
</ul>
<p>47:26  Justin – “And if you’ve ever used the Google Cloud Optimization Hub and Cost Explorer previously, you’d know they’re hot garbage. So this was a very appreciated announcement at Google Next.” </p>
<h2>Azure</h2>
<p>49:10 <a href="https://techcommunity.microsoft.com/t5/microsoft-security-community/introducing-microsoft-sentinel-data-lake/ba-p/4434280">Introducing Microsoft Sentinel data lake | Microsoft Community Hub</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/sentinel/datalake/sentinel-lake-overview">Microsoft Sentinel data lake</a> enters public preview as a fully managed security data lake built directly into <a href="https://www.microsoft.com/en-us/security/business/siem-and-xdr/microsoft-sentinel/">Sentinel</a>, allowing organizations to store all security data in one place with cost-effective long-term retention while eliminating the need to build custom data architectures.</li>
<li style="font-weight:400;">The service integrates with 350+ existing Sentinel connectors including <a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=lcZp4kaJMJKZ-r54vjRb5mLUgjIASs-Yx6_I2h2DPQq98vbhCEB0Ildddvuoq9cZ_DoRCLYlIFWCvI_X8uY8ctyxNJu5mPtnQh8-yfRZC7x_JymqrvE0VXDulcWYlASw.x-9c5JvXyhi5NuVHEQGbvw&amp;eddgt=o1wQu3UjYGam4POgicbmnw%3D%3D&amp;rut=baae740930aeacaf55c39f4c64713e78d594e9ad614e71a3fa31cde6d45ee57f&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8dPYWlLvTqE3FJcEW4CsZ-DVUCUwgtMGnL9NEfzy8pOYWrImOT8RCEUUEn_ZXQl7AAbQx4iiSop6dynu7wdsaQwrPGpOUp2mIBHfm1UtPrYHeeIg-_k2yNnb90U81Em4IPVTEn1tZjGVUGZ65kc3AnwHxyPL_RddSa3F0hdQ8Nt-Zc_V-8-7CX3v8ZVoDRMsMxM7Ha0wLS-jkn1zQi6HGundLn-EMeNGIux5ZRkuovE5uQ1LwFr5qC-ejKK2YiQ2a-cyivPT_Ccsjiq5Q--FdqItiFq7z90vgnGBle18uJwPdmOvB2-mPbPCtGxZr6pJ5-dVR0PinETS6ZNXzhN8eNirgCl22_2_AGCOSBlU7m7MgNopRjH0irrndZihVx8v-YXjnHjjm0PUgOYkO3mAdvjO7kgFG8uqB3vhLD0ONcKTtSyBzRerwADz2FGnxZ4KKvSG0BgI5ymkG9KJonXWhsKM0btWkdug8Zk5jMZ8Sg0876qftA6MUJxQnXmC8YyRLQQOJEHgWTMVes0QCTtMoL32A6ve6uuRxImgj5kGcZaB86NmBPyj440TvOfIGoNAjGZPYbX3pNSgmgx-tH9HM1PZoxlIZMzlRBIG2rClKlwNIUdF1rDZVfI2K02fVr_UWz7ATSZSPK0sMdCMfvZ4PxzLYut23HTpAUx0qGu8faG8AtEgoOP7KKyrxUEx8FzxkZw3lFbarNgJRAOQY16JBc5lHCl009XZa0uGexU33cfkasfucm7-YhPixJwwSzFYzkM87lA%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0NDAlMjZjYW1wJTNkMTc0MTQyJTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDI1MjYlMjZjcml0ZXJpYWlkJTNka3dkLTc5MzcxNjEwMDc2NjI5JTNhbG9jLTcxMjI4JTI2Y2FtcGFpZ25pZCUzZDU5MDM2MTI0NyUyNmxvY3BoeSUzZDgwMTM2JTI2YWRncm91cGlkJTNkMTI2OTkzNzU5NTgwODY2NSUyNmNpZCUzZDc5MzcxMjA2NTMxMDIwJTI2a2R2JTNkYyUyNmtleHQlM2QlMjZrcGclM2QlMjZrcGlkJTNkJTI2cXVlcnlTdHIlM2RNaWNyb3NvZnQlMjUyMDM2NSUyNnVybCUzZGh0dHBzJTNhJTJmJTJmd3d3Lm1pY3Jvc29mdC5jb20lMmZlbi11cyUyZm1pY3Jvc29mdC0zNjUlMmZidXNpbmVzcyUyZmNvbXBhcmUtYWxsLW1pY3Jvc29mdC0zNjUtYnVzaW5lc3MtcHJvZHVjdHMtYiUzZmVmX2lkJTNkX2tfYzRlZTZlZTUwN2Q2MTdkMzRiYWIwNjc5MjBlYjhjNzBfa18lMjZPQ0lEJTNkQUlEY21tNDc0cXA4ZWxfU0VNX19rX2M0ZWU2ZWU1MDdkNjE3ZDM0YmFiMDY3OTIwZWI4YzcwX2tfJTI2bXNjbGtpZCUzZGM0ZWU2ZWU1MDdkNjE3ZDM0YmFiMDY3OTIwZWI4Yzcw%26rlid%3Dc4ee6ee507d617d34bab067920eb8c70&amp;vqd=4-64971286945521639513677825966133115271&amp;iurl=%7B1%7DIG%3DEE3D966BD22A4319B2AFCC6C9516A278%26CID%3D2DDCA4D143B36F6D22CAB29642066EE4%26ID%3DDevEx%2C5045.1">Microsoft 365</a>, <a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=fKAKzPCkL54oIn06trTMPRe5-1U9gXtAY8JEvBs6I4Sr6gNS8_c8NN9usKHabFEFkVRJoVbvKJZX8VlhqZ9LrbBR28mTRY7Pkvb0EaB5z6rAiCS1TBeTe61aK0kgFecv.9IrL5iqq8KdzwjebxORq8g&amp;eddgt=6Fj2nM1v2eatmctNij65TA%3D%3D&amp;rut=7d42ec4c94857b355ddb9fad31298209b804ba9095b4dc7779d1c4192106177d&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8lENX6LznHgtgdLYS33ODjzVUCUwEqP_ZwqknntU19o1xdP6R7K_Ip4s4arXGSQ5ujESXiiq02xhlYpWLNIo-BMcm0Occb_zVG6Fm8CpPYlYajoVVxzmoJebixoglbzP0ZNb8n-l5srR8JfvJ2xcW8X_8AebSVcXxfC-imQepTi2WrD8SysIwWTbUAeZ62BoZz8LA-wabJn17vlVbGDdprCkfU0O_6ePEI2wDhyeUA1J1qR5c7kuqXz6G8i8nO13sYSlfUCWhvIp3AUoOuw-jOLaEMW12BCkxqPgASpUJcd4BhDqRn7mo5ltX6a75d9Guz2kB0MBGdjq1zEYughg5OUJp1fXfqKwXb82c7B_EWsJ3s0DLP9-as2OZd3d8Zrs0naVawqcaLvm2u7-SjNNTnH3_SXIa4Gr3nUJBbD5uURovopc7HlyJl50jXYaxz9KnAJrWWwNO5-11nlm98vfAu5nVFrRG7Vn3KNWwtd_hrn0pXhMV_JTXomZM4atASbu63neM-Lqs6s4tEv-kOHulfZckJTph4jnOZH8-DCECD-bLmzS1I3qX_jGaXdO_XnCk_UlEgVfHz3u9kjsd7aUfrLGKnO06NMibh3g7tpfXT-tad4FmBLNgnh7dRhrVnaKj9_ra38boOUo2gwYTC9H3DB7v7QxKhsw4921pOfZhf8JBn8IGbOHe0p85ttgdGA5V3uKizXUna2FSIE5BoegY3oa3SRjPSnHzNK0Xab7IiqEKfPciQO4PzDeNybVf_Xz_SkIHJA%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0NDAlMjZjYW1wJTNkMTc0MTU2JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDI1MjYlMjZjcml0ZXJpYWlkJTNka3dkLTc5MDI4MDEzODIxODE0JTNhbG9jLTcxMjI4JTI2Y2FtcGFpZ25pZCUzZDU5MDM2MTI1NCUyNmxvY3BoeSUzZDc5MTI5JTI2YWRncm91cGlkJTNkMTI2NDQ0MDAzNzY5ODQzNiUyNmNpZCUzZDc5MDI3NjA5NjEyNzc2JTI2a2R2JTNkYyUyNmtleHQlM2QlMjZrcGclM2QlMjZrcGlkJTNkJTI2cXVlcnlTdHIlM2RtaWNyb3NvZnQlMjUyMGRlZmVuZGVyJTI2dXJsJTNkaHR0cHMlM2ElMmYlMmZ3d3cubWljcm9zb2Z0LmNvbSUyZmVuLXVzJTJmc2VjdXJpdHklMmZidXNpbmVzcyUyZm1pY3Jvc29mdC1kZWZlbmRlci1mb3ItYnVzaW5lc3MtYW5kLWluZGl2aWR1YWxzLWZyZWUtdHJpYWwlM2ZlZl9pZCUzZF9rX2I1NDg3YjEzMDIwODFmYzkyNjliMTM4ZjI1NGZmMjhiX2tfJTI2T0NJRCUzZEFJRGNtbTQ3NHFwOGVsX1NFTV9fa19iNTQ4N2IxMzAyMDgxZmM5MjY5YjEzOGYyNTRmZjI4Yl9rXyUyNm1zY2xraWQlM2RiNTQ4N2IxMzAyMDgxZmM5MjY5YjEzOGYyNTRmZjI4Yg%26rlid%3Db5487b1302081fc9269b138f254ff28b&amp;vqd=4-303210519280333419624178211580054577443&amp;iurl=%7B1%7DIG%3DE088D46B76664F87B9D733F430814FA4%26CID%3D3813A534EB206E703148B373EA076F87%26ID%3DDevEx%2C5046.1">Defender</a>, <a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=CIL37f26sFuyf20I5Lhux89Lu0dIouMWVkUJ-g3sCgWY_WhGDsp-bF7ZUPf7M9s8wLj_J1DCTrj_My-mpdVQ6WatrFX79IEczyhHnG9C1FpwyytUl9AwezTYAcXqQybU.MtqBPv4QnpL1pjDrxk_ToQ&amp;eddgt=YQC02-HkLXTJLKY8H6fldQ%3D%3D&amp;rut=e46b5885a33f3eb42f83c712a33e865e5b9ed188e4dd184db70a9d462dddeaf0&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De82X1CucJkis8Jp4SHJURZVTVUCUxIea6aA6bgP_Yhv-4AHd5vPJ2nPbzlmbSPUVQORFkR_wPEmvVFxI63MqzL07dW0Hq6th7Vj5xQWn-59VTL47c_MpG-Y9wwGFR19YkjEdKqNVRACx44dVK6xY4X9ANw6jCHEJO_A_am1vEEWYa3SivgTpVsxpFmqoTgsl1yTDlNooYgcuJf7mN4e0qfNkV6pHA3CsiCfTY5R9Ocm6bBx35ymrD4i97GVfiYHJuu8lDXQXcH-cgm3d3AAGZj3tOP-JGHE54XDt4zCCnDh-ErcqTxMxKjG6JFKhJv6oPtHw0Cg0reONplz94MLyhaIPS873WfNRNafUVrIcoTaodeA6lOQixn07Z94CtEuL9qBF5Evepdy-5N57m5glWECHvizCEyp84HQZaDdd6PgGL9SrfYQGDjhHcPFEArfLmKz-hM0SIta7opravCX29TYcAHLDoDIKrjCZdYL9t3cIxY3bUSX_gVN_qkNj-AjyyWzNdx1Aps0q7sZTIuxmen6J-k537Jj64UeMUVTSIZRKPtaSVIhpIyuLBHt_JEFJBELpw_DJFo6OyxRGt5OrhbK4xIQqz8_q2jxdOOZ-d68gSlOhbgO7shXt9t0LTE68W9B1XR7CAv5L9yhk23JVGYokUlB_Bl0Akl7AL_j-lqFTV5snzgRGMHuLBiiRSjDn_BsAiek8yJeOYquRG0P9jTfqBfqMNAaWADQLk-oMRqoLBK5ts3PWR4GphGIEe39P1nTLu-eA%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0MzklMjZjYW1wJTNkMTY4ODA3JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDE5MjElMjZjcml0ZXJpYWlkJTNka3dkLTc4ODIxODQ2NDc2Njc1JTNhbG9jLTE5MCUyNmNhbXBhaWduaWQlM2Q1OTAyMzczNTglMjZsb2NwaHklM2Q4MDUxMSUyNmFkZ3JvdXBpZCUzZDEyNjExNDE0ODk3MDI1NDUlMjZjaWQlM2Q3ODgyMTQ0ODU3MjY3MyUyNmtkdiUzZGMlMjZrZXh0JTNkJTI2a3BnJTNkJTI2a3BpZCUzZCUyNnF1ZXJ5U3RyJTNkQXp1cmUlMjZ1cmwlM2RodHRwcyUzYSUyZiUyZmF6dXJlLm1pY3Jvc29mdC5jb20lMmZlbi11cyUyZnByaWNpbmclMmZwdXJjaGFzZS1vcHRpb25zJTJmYXp1cmUtYWNjb3VudCUyZnNlYXJjaCUzZmVmX2lkJTNkX2tfNjFhMDFiMTc4NTc1MTAwMWMxNWM3MzhiYTIyNjgxZThfa18lMjZPQ0lEJTNkQUlEY21tNWVkc3dkdXVfU0VNX19rXzYxYTAxYjE3ODU3NTEwMDFjMTVjNzM4YmEyMjY4MWU4X2tfJTI2bXNjbGtpZCUzZDYxYTAxYjE3ODU3NTEwMDFjMTVjNzM4YmEyMjY4MWU4%26rlid%3D61a01b1785751001c15c738ba22681e8&amp;vqd=4-307963477881636289343725066872020858571&amp;iurl=%7B1%7DIG%3DC39869A3F6464184BFF23660C4B387BB%26CID%3D30F19AFF9E6F695C08638CB89F486880%26ID%3DDevEx%2C5045.1">Azure</a>, <a href="https://aws.amazon.com/">AWS</a>, and <a href="https://console.cloud.google.com/">GCP</a> sources, storing data in open formats that support both <a href="https://learn.microsoft.com/en-us/kusto/query/?view=microsoft-fabric">Kusto queries</a> and Python notebooks through a new <a href="https://code.visualstudio.com/download">Visual Studio Code</a> extension for advanced analytics.</li>
<li style="font-weight:400;">Pricing separates data ingestion/storage from analytics consumption, enabling customers to store high-volume, low-fidelity logs like network traffic cost-effectively in the data lake tier while automatically mirroring critical analytics-tier data to the lake at no extra charge.</li>
<li style="font-weight:400;">Key differentiator from <a href="https://aws.amazon.com/security-lake/">AWS Security Lake</a> is the native integration with Microsoft’s security ecosystem and managed compute environment – security teams can run scheduled analytics jobs and retroactive threat intelligence matching without managing infrastructure.</li>
<li style="font-weight:400;">Target use cases include forensics analysis, compliance reporting, tracking slow attacks over extended timeframes, and running ML-based anomaly detection on historical data, with results easily promoted back to the analytics tier for investigation.</li>
</ul>
<p>51:40  Matt – “Kusto is their proprietary time series database. So all of Azure metrics. And you can even pay for teh service and leverage it yourself as Azure data explorer.” </p>
<p>38:01 <a href="https://techcommunity.microsoft.com/blog/azurecompute/announcing-general-availability-of-azure-e128--e192-sizes-in-the-esv6-and-edsv6-/4438822">Announcing General Availability of Azure E128 &amp; E192 Sizes in the Esv6 </a><a href="https://techcommunity.microsoft.com/blog/azurecompute/announcing-general-availability-of-azure-e128--e192-sizes-in-the-esv6-and-edsv6-/4438822">and Edsv6-series VM Families | Microsoft Community Hub</a></p>
<ul>
<li style="font-weight:400;">Azure launches E128 and E192 VM sizes with up to 192 vCPUs and 1832 GiB RAM, targeting enterprise workloads like <a href="https://www.sap.com/products/data-cloud/hana/what-is-sap-hana.html">SAP HANA</a>, large SQL databases, and in-memory analytics. </li>
<li style="font-weight:400;">These new sizes use Intel’s <a href="https://www.intel.com/content/www/us/en/products/details/processors/xeon/scalable/platinum/products.html">5th Gen Xeon Platinum processors</a> and deliver 30% better performance than the previous Ev5-series.</li>
<li style="font-weight:400;">The VMs feature <a href="https://learn.microsoft.com/en-us/azure/azure-boost/overview?toc=%2Fazure%2Fvirtual-machines%2Ftoc.json&amp;bc=%2Fazure%2Fvirtual-machines%2Fbreadcrumb%2Ftoc.json">Azure Boost</a> technology providing 400K IOPS and 12 GB/s storage throughput with 200 Gbps network bandwidth, plus NVMe interface delivering 3X improvement in local storage IOPS. This positions them competitively against AWS’s memory-optimized instances like X2iezn and GCP’s M3 series.</li>
<li style="font-weight:400;"><a href="https://www.intel.com/content/dam/www/central-libraries/us/en/documents/white-paper-intel-tme.pdf">Intel Total Memory Encryption (TME)</a> provides hardware-based memory encryption for enhanced security, addressing enterprise concerns about data protection in multi-tenant environments. The isolated VM option (E128i and E192i) offers dedicated physical hosts for compliance-sensitive workloads.</li>
<li style="font-weight:400;">Currently available in 14 regions including major markets like East US, West Europe, and Japan East, with expansion planned for 2025. Pricing follows standard Azure VM models with both diskful (Edsv6) and diskless (Esv6) options to optimize costs based on storage needs.</li>
<li style="font-weight:400;">These sizes specifically target customers running memory-intensive applications who need to scale beyond traditional VM limits without moving to specialized services. The combination of high memory capacity, enhanced networking, and improved storage performance makes them suitable for consolidating multiple workloads.</li>
</ul>
<p>56:12 <a href="https://techcommunity.microsoft.com/blog/appsonazureblog/announcing-a-flexible-predictable-billing-model-for-azure-sre-agent/4427270">Announcing a flexible, predictable billing model for Azure SRE Agent | </a><a href="https://techcommunity.microsoft.com/blog/appsonazureblog/announcing-a-flexible-predictable-billing-model-for-azure-sre-agent/4427270">Microsoft Community Hub</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.azure.com/sreagent">Azure SRE Agent</a> is a pre-built AI tool for root cause analysis and incident response that uses machine learning to analyze logs and metrics, helping site reliability engineers focus on higher-value tasks while reducing operational costs and improving uptime.</li>
<li style="font-weight:400;">The <a href="https://aka.ms/sreagent/pricing">billing model</a> introduces Azure Agent Units (AAU) as a standardized metric across all Azure agents, with a fixed baseline cost of 4 AAU per hour ($0.40/hour) for continuous monitoring plus 0.25 AAU per second for active incident response tasks.</li>
<li style="font-weight:400;">As part of Microsoft’s Agentic DevOps strategy, SRE Agent represents a shift toward AI-native cloud operations where intelligent agents handle routine tasks automatically, competing with <a href="https://aws.amazon.com/devops-guru/">AWS DevOps Guru</a> and <a href="https://cloud.google.com/blog/topics/developers-practitioners/introduction-google-clouds-operations-suite">Google Cloud’s Operations suite</a>.</li>
<li style="font-weight:400;">The dual-flow architecture keeps the agent always learning from normal behavior patterns while ready to activate AI components instantly when anomalies are detected, providing 24/7 intelligent monitoring without manual intervention.</li>
<li style="font-weight:400;">Target customers include organizations managing complex cloud workloads who want predictable operational costs – the usage-based pricing means you only pay for active incident response time beyond the baseline monitoring fee.</li>
</ul>
<p>57:25  Matt – “I really want to play with this. I’m a little terrified of what the cost is gong to be.” </p>
<p>59:02 <a href="https://azure.microsoft.com/en-us/updates?id=495106">Generally Available: Live Resize for Premium SSD v2 and Ultra </a><a href="https://azure.microsoft.com/en-us/updates?id=495106">NVMe Disks</a></p>
<ul>
<li style="font-weight:400;">Azure’s Live Resize feature for Premium SSD v2 and Ultra <a href="https://learn.microsoft.com/en-us/azure/virtual-machines/nvme-overview">NVMe disks</a> enables storage capacity expansion without downtime, addressing a common pain point where disk resizing traditionally required VM restarts and application disruption.</li>
<li style="font-weight:400;">Hasn’t Amazon had this forever? </li>
<li style="font-weight:400;">This positions Azure competitively against <a href="https://docs.aws.amazon.com/ebs/latest/userguide/requesting-ebs-volume-modifications.html">AWS EBS volume modifications</a> and GCP persistent disk resizing, though Azure’s implementation specifically targets their high-performance disk tiers used for latency-sensitive workloads like databases and analytics.</li>
<li style="font-weight:400;">The feature supports cost optimization by allowing customers to start with smaller disk sizes and scale up only when needed, avoiding overprovisioning costs that can add thousands of dollars monthly for enterprise workloads.</li>
<li style="font-weight:400;">Target use cases include production databases, real-time analytics platforms, and high-transaction applications where both performance consistency and zero-downtime operations are critical requirements.</li>
<li style="font-weight:400;">Implementation requires no code changes and works through standard Azure portal, CLI, or API commands, making it accessible for both manual operations and automated infrastructure-as-code deployments.</li>
</ul>
<p>1:00:03  Justin – “I’m just mad this didn’t exist until today.” </p>
<p>1:01:20 <a href="https://azure.microsoft.com/en-us/updates?id=499192">Generally Available: Agentless multi-disk crash consistent </a><a href="https://azure.microsoft.com/en-us/updates?id=499192">backup for Azure VMs</a></p>
<ul>
<li style="font-weight:400;"><a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=Sni3-e6VLupp0w_rUFPIe4tztz9MBfsWLPXi97hMnRhStW9-usVldGucGzd3B2zIplDRgMIjlL9thLl4_NPZj5hKxYNTKyHSzahMiOAy_tu8Kl8y0TlaeazrqJR_zEfv.RR_ISE8cNK9w2PgwUgLfpw&amp;eddgt=fA8GpeVdtwF8CTdrqI6i6A%3D%3D&amp;rut=f2895ce389d91d85e2f041d637ebe660a82f336aa16b73bc6994927a2f778d6f&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De81yofH9d_0axlON-iogJtITVUCUwEyE3TD7GHrDi2MsPrqbXoPjPSfsOtAISgLd-3_c0EvfJjZPToIrQENMt8jEcjjNdhGnk7XMldLjRF_Xinm8SOxRxVdPTWi7EIsxveIViVddBExsd-9G99DEK1ef12PHwCk13RgG7Js7qkRET87v01FCF1IdltyRwipx8A6c80b6ptX5R2eKFdQokrQbdvi7i8DXv3EAbKOZRCyuWVWAe69ms5z7VjEWxXMXt6UFtjVrP9XsSqW0xSV3Wz-vcYPqCtZ8-AhVPVshh-VyRLymRzne63GFvy95cWUixNfHxB8cjIcAnwiP3Umo_Mg1re-j3F1r8uklxdn5b3W1SrSGiFdtMiSJQlmRnbhnV8aVA3XTyaC7E9nooDtjxnPIzxhpe5lKU8sAxytqo3fA_28q9NyWkNr_zUQMZPMxeccGtyAHla5Vz6Rk6Iu4NlgLF5Q0yE2nqGYYMla5la_5aeSUUZ34ZcoCPMRmMThD_45cGVzEtb6nu0Cj0JQ72hCi_PqIZg8ycbvuVWrP3htxyiw5LmMkooY2qgJlTOHRCTcWYxPdojY6eqRtl2OXjMhZ4wtoxxWJ_dMPRYvV3Vyp4ty8YMEYd7cvDhRm3hrQZC1GP2DY1tUfn0VuqZERZ2xGk59_GjqNDCCDSeO2VokViIblQ916jUPUeieFVv1Zm4KXoJ_IcrTG6IJonGgsdHZTdWpdaVBUhmEakoPGt8WCuIS7siqBxV5VhtFuk8xvweWdD7Jw%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0MzklMjZjYW1wJTNkMTY4ODA3JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDE5MjElMjZjcml0ZXJpYWlkJTNka3dkLTc5MzAyODgzNzYzMDkzJTNhbG9jLTE5MCUyNmNhbXBhaWduaWQlM2Q1OTAyMzczNTglMjZsb2NwaHklM2Q4MDI2NiUyNmFkZ3JvdXBpZCUzZDEyNjg4MzgwNzEwODYyNjQlMjZjaWQlM2Q3OTMwMjQ4NTk5MDQzOCUyNmtkdiUzZGMlMjZrZXh0JTNkJTI2a3BnJTNkJTI2a3BpZCUzZCUyNnF1ZXJ5U3RyJTNkQXp1cmUlMjUyMEJhY2t1cCUyNnVybCUzZGh0dHBzJTNhJTJmJTJmYXp1cmUubWljcm9zb2Z0LmNvbSUyZmVuLXVzJTJmcHJvZHVjdHMlMmZiYWNrdXAlMmYlM2ZlZl9pZCUzZF9rX2I1MzA1ZTUzNzZlZTE4NjczODFlMzZjZjVhZGFlN2YwX2tfJTI2T0NJRCUzZEFJRGNtbTVlZHN3ZHV1X1NFTV9fa19iNTMwNWU1Mzc2ZWUxODY3MzgxZTM2Y2Y1YWRhZTdmMF9rXyUyNm1zY2xraWQlM2RiNTMwNWU1Mzc2ZWUxODY3MzgxZTM2Y2Y1YWRhZTdmMA%26rlid%3Db5305e5376ee1867381e36cf5adae7f0&amp;vqd=4-193408492036639719747617879840107848319&amp;iurl=%7B1%7DIG%3DA59AD658CF6A4BA4B71B1B2E74E733AF%26CID%3D2BEF4798F6FF665D364151DFF777673E%26ID%3DDevEx%2C5059.1">Azure Backup</a> now supports agentless multi-disk crash consistent backups for VMs in general availability, eliminating the need to install backup agents or extensions on virtual machines while maintaining data consistency across multiple disks.</li>
<li style="font-weight:400;">This feature addresses a common pain point for enterprises running multi-disk applications like databases where crash consistency across all disks is critical for successful recovery, competing directly with AWS’s EBS snapshots and GCP’s persistent disk snapshots.</li>
<li style="font-weight:400;">The agentless approach reduces VM overhead and simplifies backup management by leveraging Azure’s infrastructure-level capabilities rather than guest OS agents, making it particularly valuable for locked-down or legacy systems where agent installation is problematic.</li>
<li style="font-weight:400;">Target use cases include SQL Server, Oracle databases, and other multi-disk applications where maintaining write-order consistency across volumes is essential, with pricing following standard Azure Backup rates based on protected instance size.</li>
<li style="font-weight:400;">This positions Azure Backup closer to feature parity with native hypervisor-level backup solutions while maintaining cloud-native scalability and integration with Azure Recovery Services vault for centralized management.</li>
</ul>
<p>1:01:56  Justin – “I’ll tell you – if you are running this on SQL Server or Oracle; things like asset compliance are very, very important and you need to test the crap out of this, because my experience has been that if you are not quiescing the data to the disk, it doesnt matter if you snapshotted all the partitions together – you are still going to have a bad time.” </p>
<h2>Other Clouds </h2>
<p>1:04:18 <a href="https://www.digitalocean.com/blog/introducing-digitalocean-gradient">Introducing Gradient: DigitalOcean’s Unified AI Cloud | DigitalOcean</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.digitalocean.com/products/gradientai">DigitalOcean</a> is consolidating its AI offerings under a new unified platform called Gradient, combining GPU infrastructure, agent development tools, and pre-built AI applications into a single integrated experience for developers.</li>
<li style="font-weight:400;">The platform includes three main components: Infrastructure (GPU compute for training and inference), Platform (tools for building intelligent agents with upcoming Model Context Protocol support), and Applications (pre-built agents for common use cases).</li>
<li style="font-weight:400;">DigitalOcean is expanding GPU options with AMD Instinct MI325X available this week and NVIDIA H200s coming next month, providing more choice and flexibility for different AI workload requirements.</li>
<li style="font-weight:400;">Existing DigitalOcean AI users won’t need to change anything as all current projects and APIs will continue working, with the rebrand focused on improving organization and documentation.</li>
<li style="font-weight:400;">The platform targets digital native enterprises looking to build AI applications from prototype to production without managing complex infrastructure, competing with larger cloud providers in the AI space.</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2111836/c1e-jkjku58z72f0mp0v-7z9m3p7rsr4w-6f4yve.mp3" length="93911282"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 316 of The Cloud Pod, where the forecast is always cloudy! This week we’ve got earnings (with sound effects, obviously) as well as news from DeepSeek, DocumentDB, DigitalOcean, and a bunch of GPU news. Justin and Matt are here to lead you through all of it, so let’s get started! 
Titles we almost went with this week:


Lake Sentinel: The Security Data Monster Nobody Asked For
Certificate Authority Issues: When Your Free Lunch Gets a Security Audit
Slash and Learn: Gemini Gets Command-ing
DigitalOcean Drops Anchor in AI Waters with Gradient Platform
The Three Stages of Azure Grief: Development, Preview, and Launch
E for Enormous: Azure’s New VM Sizes Are Anything But Virtual
SRE You Later: Azure’s AI Agent Takes Over Your On-Call Duties
Site Reliability Engineer? More Like AI Reliability Engineer
Azure Disks Get Elastic Waistbands
Agent Smith Would Be Proud: Google’s Multi-Agent Matrix Gets Real
C4 Yourself: Google Explodes Into GA with Intel’s Latest Silicon
The Cost is Right: GCP Edition
Penny for Your Cloud Thoughts: Google’s Budget-Friendly Update
DocumentDB Goes on a Diet: Now Available in Serverless Size
MongoDB Compatibility Gets the AWS Serverless Treatment
No Server? No Problem: DocumentDB Joins the Serverless Party
Stream Big or Go Home: Lambda’s 10x Payload Boost
Lambda Response Streaming: Because Size Matters
GPT Goes Open Source Shopping
GPT’s Open Source Awakening
When Your Antivirus Needs an Antivirus: Enter Project Ire
The Opus Among Us: Anthropic’s Coding Assistant Gets an Upgrade
Serverless is becoming serverful in streaming responses

General News 
02:08 It’s Earnings Time! (INSERT AWESOME SOUND EFFECTS HERE) 
02:16 Alphabet beats earnings expectations, raises spending forecast

Google Cloud revenue hit $13.62 billion, up 32% year-over-year, with OpenAI now using Google’s infrastructure for ChatGPT, signaling growing enterprise confidence in Google’s AI infrastructure capabilities.
Alphabet is raising its 2025 capital expenditure forecast from $75 billion to $85 billion, driven by cloud and AI demand, with plans to increase spending further in 2026 as it competes for AI workloads.
AI Overviews now serves 2 billion monthly users across 200+ countries, while the Gemini app reached 450 million monthly active users, demonstrating Google’s scale in deploying AI services globally.
The $10 billion increase in planned capital spending reflects the infrastructure arms race among cloud providers to capture AI workloads, which require significant compute and specialized hardware investments.
Google’s cloud growth rate of 32% outpaces its overall revenue growth of 14%, indicating the strategic importance of cloud services as traditional search and advertising face increased AI competition.

03:55  Justin – “I don’t know what it takes to actually run one of these large models at like ultimate scale that like a ChatGPT needs or Anthropic, but I have to imagine it’s just thousands and thousands of GPUs just working nonstop.”
04:31 Microsoft (MSFT) Q4 earnings report 2025

Microsoft reported Q4 fiscal 2025 earnings with revenue of $76.44 billion, up 18% year-ove...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2111836/c1a-k5d5-gpzv3xnmi5r9-7hkdyo.jpg"></itunes:image>
                                                                            <itunes:duration>01:05:12</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2111836/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[315: EC2's New Shutdown Shortcut: Because Sometimes You Just Need to Pull the Plug]]>
                </title>
                <pubDate>Thu, 07 Aug 2025 13:00:08 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2105557</guid>
                                    <link>https://tcpfm.castos.com/episodes/315-ec2s-new-shutdown-shortcut-because-sometimes-you-just-need-to-pull-the-plug</link>
                                <description>
                                            <![CDATA[<h3> Welcome to episode 315 of The Cloud Pod, where the forecast is always cloudy! Your hosts, Justin and Matt, are here to bring you the latest in cloud and AI news, including news about AI from the White House, the newest hacker exploits, and news from CloudWatch, CrowdStrike, and GKE – plus so much more. Let’s get into it! </h3>
<h3>Titles we almost went with this week:</h3>
<ul>
<li>SharePoint and Tell: Government Secrets at Risk</li>
<li>Zero-Day Hero: How Hackers Found SharePoint’s Achilles’ Heel</li>
<li>Amazon Q Gets an F in Security Class</li>
<li>Spark Joy: GitHub’s Marie Kondo Approach to App Development</li>
<li>No Code? No Problem! GitHub Lights a Spark Under App Creation</li>
<li>GKE Turns 10: Still Not Old Enough to Deploy Itself</li>
<li>A Decade of Containers: Pokémon GO Caught Them All</li>
<li>Kubernetes Engine Hits Double Digits, Still Can’t Count Past 9 Pods</li>
<li>Account Names: The Missing Link in AWS Cost Optimization</li>
<li>Flash Gordon Saves Your VMs from the Azure-verse</li>
<li>The Flash: Fastest VM Monitor in the Multiverse</li>
<li>Ctrl+AI+Delete: Rebooting America’s Artificial Intelligence Strategy</li>
<li>The AImerican Dream: White House Plots Path to Silicon Supremacy</li>
<li>CrowdStrike’s Year of Living Resiliently</li>
<li>Kernel Panic at the Disco: A Recovery Story</li>
<li>The Search is Over (But Your Copilot License Isn’t)</li>
<li>Ground Control to Major Tom: You’re Fired</li>
<li>GPU Booking.com: Reserve Your Neural Network’s Next Vacation</li>
<li>Calendar Man Strikes Again: This Time He’s Scheduling Your TPUs</li>
<li>AirBnB for AI: Short-Term Rentals for Your Machine Learning Models </li>
<li>Claude’s World Tour: Now Playing in Every Region</li>
<li>Going Global: Claude Gets Its Passport Stamped on Vertex AI</li>
<li>SQS Finally Learns to Share: No More Queue Hogging</li>
<li>The Noisy Neighbor Gets Shushed: Amazon’s Fair Play for Queues</li>
<li>CloudWatch Gets Its AI Degree in Observability</li>
<li>Teaching Old Logs New Tricks: CloudWatch Goes GenAI</li>
<li>The Agent Whisperer: CloudWatch’s New AI Monitoring Powers</li>
<li>NotebookLM Gets Its PowerPoint License</li>
<li>Slides, Camera, AI-ction: NotebookLM Goes Visual</li>
<li>The SSL-ippery Slope: Azure’s Managed Certs Go Public or Go Home</li>
<li>Breaking Bad Certificates: DigiCert’s New Rules Leave Some Apps High and Dry</li>
<li>Firewall Rules: Now with a Rough Draft Feature</li>
<li>Azure’s New Policy: Think Before You Deploy</li>
</ul>
<h2>General News </h2>
<p>00:50 <a href="https://techcrunch.com/2025/07/21/hackers-exploiting-sharepoint-zero-day-seen-targeting-government-agencies-say-researchers/">Hackers exploiting a SharePoint zero-day are seen targeting government </a><a href="https://techcrunch.com/2025/07/21/hackers-exploiting-sharepoint-zero-day-seen-targeting-government-agencies-say-researchers/">agencies | TechCrunch</a></p>
<ul>
<li style="font-weight:400;"><a href="https://techcrunch.com/2025/07/21/new-zero-day-bug-in-microsoft-sharepoint-under-widespread-attack/">Microsoft SharePoint servers are being actively exploited</a> through a <a href="https://techcrunch.com/2025/04/25/techcrunch-reference-guide-to-security-terminology/#zero-day">zero-day</a> vulnerability (CVE-2025-53770), with initial attacks primarily targeting government agencies, universities, and energy companies, according to security researchers.</li>
<li style="font-weight:400;">The vulnerability affects on-premises SharePoint installations only, not cloud versions, with researchers identifying 9,000-10,000 vulnerable instances accessible from the internet that require immediate patching or disconnection.</li>
<li style="font-weight:400;">Initial exploitation appears to be limited and targeted, suggesting that nation-states likely back advanced persistent threat (APT) actors. However, broader exploitation by other threat actors is expected as attack methods become public.</li>
<li style="font-weight:400;">Organizations running local Shar...</li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:00) - The Cloud Pod: EC2 Shutdown Explained</li><li>(00:01:08) - Microsoft SharePoint zero-day targeting government agencies</li><li>(00:05:33) - Cloudflare Supports the White House AI Action Plan</li><li>(00:10:04) - Trump's Anti-Woke AI Order</li><li>(00:15:28) - NASA's AI Satellite Just Made a Decision Without Humans</li><li>(00:21:14) - GitHub Launches Spark: A New Way to Build Micro</li><li>(00:22:50) - Amazon AI Code Coding Assistant Hacked</li><li>(00:26:01) -  AWS Cross-Team Optimization Hub Update 1.4</li><li>(00:27:50) - Amazon EC2: Auto-shutdown and more</li><li>(00:30:44) - Amazon SQS Introduces Fair Queues to Prevent</li><li>(00:34:11) - Amazon CloudWatch: Generative AI Observability in Preview</li><li>(00:37:37) - GKE: Celebrating 10 Years in the Cloud</li><li>(00:44:06) - Google's BigQuery for AI Agents</li><li>(00:45:37) - Google Cloud: Global Endpoints on Vertex AI</li><li>(00:50:21) - NotebookLM: Video Overviews in Cloud Documentation</li><li>(00:52:22) - Azure VM Availability Monitoring</li><li>(00:55:39) - Microsoft 365 copilot search: Unified Search with AI</li><li>(00:57:42) - Azure App Service: Important Changes to Managed Certificates</li><li>(01:02:29) - Azure Firewall: Draft and Deploy (Preview)</li><li>(01:05:25) - Cloud Journey: Two Cloud Journey Stories</li><li>(01:05:45) - IAM Identity Center vs. Cloud Shell: Best Authentication Solution</li><li>(01:12:48) - 1Password Passkey</li><li>(01:14:15) - CrowdStrike Expands Security Resilience Program</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[ Welcome to episode 315 of The Cloud Pod, where the forecast is always cloudy! Your hosts, Justin and Matt, are here to bring you the latest in cloud and AI news, including news about AI from the White House, the newest hacker exploits, and news from CloudWatch, CrowdStrike, and GKE – plus so much more. Let’s get into it! 
Titles we almost went with this week:

SharePoint and Tell: Government Secrets at Risk
Zero-Day Hero: How Hackers Found SharePoint’s Achilles’ Heel
Amazon Q Gets an F in Security Class
Spark Joy: GitHub’s Marie Kondo Approach to App Development
No Code? No Problem! GitHub Lights a Spark Under App Creation
GKE Turns 10: Still Not Old Enough to Deploy Itself
A Decade of Containers: Pokémon GO Caught Them All
Kubernetes Engine Hits Double Digits, Still Can’t Count Past 9 Pods
Account Names: The Missing Link in AWS Cost Optimization
Flash Gordon Saves Your VMs from the Azure-verse
The Flash: Fastest VM Monitor in the Multiverse
Ctrl+AI+Delete: Rebooting America’s Artificial Intelligence Strategy
The AImerican Dream: White House Plots Path to Silicon Supremacy
CrowdStrike’s Year of Living Resiliently
Kernel Panic at the Disco: A Recovery Story
The Search is Over (But Your Copilot License Isn’t)
Ground Control to Major Tom: You’re Fired
GPU Booking.com: Reserve Your Neural Network’s Next Vacation
Calendar Man Strikes Again: This Time He’s Scheduling Your TPUs
AirBnB for AI: Short-Term Rentals for Your Machine Learning Models 
Claude’s World Tour: Now Playing in Every Region
Going Global: Claude Gets Its Passport Stamped on Vertex AI
SQS Finally Learns to Share: No More Queue Hogging
The Noisy Neighbor Gets Shushed: Amazon’s Fair Play for Queues
CloudWatch Gets Its AI Degree in Observability
Teaching Old Logs New Tricks: CloudWatch Goes GenAI
The Agent Whisperer: CloudWatch’s New AI Monitoring Powers
NotebookLM Gets Its PowerPoint License
Slides, Camera, AI-ction: NotebookLM Goes Visual
The SSL-ippery Slope: Azure’s Managed Certs Go Public or Go Home
Breaking Bad Certificates: DigiCert’s New Rules Leave Some Apps High and Dry
Firewall Rules: Now with a Rough Draft Feature
Azure’s New Policy: Think Before You Deploy

General News 
00:50 Hackers exploiting a SharePoint zero-day are seen targeting government agencies | TechCrunch

Microsoft SharePoint servers are being actively exploited through a zero-day vulnerability (CVE-2025-53770), with initial attacks primarily targeting government agencies, universities, and energy companies, according to security researchers.
The vulnerability affects on-premises SharePoint installations only, not cloud versions, with researchers identifying 9,000-10,000 vulnerable instances accessible from the internet that require immediate patching or disconnection.
Initial exploitation appears to be limited and targeted, suggesting that nation-states likely back advanced persistent threat (APT) actors. However, broader exploitation by other threat actors is expected as attack methods become public.
Organizations running local Shar...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[315: EC2's New Shutdown Shortcut: Because Sometimes You Just Need to Pull the Plug]]>
                </itunes:title>
                                    <itunes:episode>315</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<h3> Welcome to episode 315 of The Cloud Pod, where the forecast is always cloudy! Your hosts, Justin and Matt, are here to bring you the latest in cloud and AI news, including news about AI from the White House, the newest hacker exploits, and news from CloudWatch, CrowdStrike, and GKE – plus so much more. Let’s get into it! </h3>
<h3>Titles we almost went with this week:</h3>
<ul>
<li>SharePoint and Tell: Government Secrets at Risk</li>
<li>Zero-Day Hero: How Hackers Found SharePoint’s Achilles’ Heel</li>
<li>Amazon Q Gets an F in Security Class</li>
<li>Spark Joy: GitHub’s Marie Kondo Approach to App Development</li>
<li>No Code? No Problem! GitHub Lights a Spark Under App Creation</li>
<li>GKE Turns 10: Still Not Old Enough to Deploy Itself</li>
<li>A Decade of Containers: Pokémon GO Caught Them All</li>
<li>Kubernetes Engine Hits Double Digits, Still Can’t Count Past 9 Pods</li>
<li>Account Names: The Missing Link in AWS Cost Optimization</li>
<li>Flash Gordon Saves Your VMs from the Azure-verse</li>
<li>The Flash: Fastest VM Monitor in the Multiverse</li>
<li>Ctrl+AI+Delete: Rebooting America’s Artificial Intelligence Strategy</li>
<li>The AImerican Dream: White House Plots Path to Silicon Supremacy</li>
<li>CrowdStrike’s Year of Living Resiliently</li>
<li>Kernel Panic at the Disco: A Recovery Story</li>
<li>The Search is Over (But Your Copilot License Isn’t)</li>
<li>Ground Control to Major Tom: You’re Fired</li>
<li>GPU Booking.com: Reserve Your Neural Network’s Next Vacation</li>
<li>Calendar Man Strikes Again: This Time He’s Scheduling Your TPUs</li>
<li>AirBnB for AI: Short-Term Rentals for Your Machine Learning Models </li>
<li>Claude’s World Tour: Now Playing in Every Region</li>
<li>Going Global: Claude Gets Its Passport Stamped on Vertex AI</li>
<li>SQS Finally Learns to Share: No More Queue Hogging</li>
<li>The Noisy Neighbor Gets Shushed: Amazon’s Fair Play for Queues</li>
<li>CloudWatch Gets Its AI Degree in Observability</li>
<li>Teaching Old Logs New Tricks: CloudWatch Goes GenAI</li>
<li>The Agent Whisperer: CloudWatch’s New AI Monitoring Powers</li>
<li>NotebookLM Gets Its PowerPoint License</li>
<li>Slides, Camera, AI-ction: NotebookLM Goes Visual</li>
<li>The SSL-ippery Slope: Azure’s Managed Certs Go Public or Go Home</li>
<li>Breaking Bad Certificates: DigiCert’s New Rules Leave Some Apps High and Dry</li>
<li>Firewall Rules: Now with a Rough Draft Feature</li>
<li>Azure’s New Policy: Think Before You Deploy</li>
</ul>
<h2>General News </h2>
<p>00:50 <a href="https://techcrunch.com/2025/07/21/hackers-exploiting-sharepoint-zero-day-seen-targeting-government-agencies-say-researchers/">Hackers exploiting a SharePoint zero-day are seen targeting government </a><a href="https://techcrunch.com/2025/07/21/hackers-exploiting-sharepoint-zero-day-seen-targeting-government-agencies-say-researchers/">agencies | TechCrunch</a></p>
<ul>
<li style="font-weight:400;"><a href="https://techcrunch.com/2025/07/21/new-zero-day-bug-in-microsoft-sharepoint-under-widespread-attack/">Microsoft SharePoint servers are being actively exploited</a> through a <a href="https://techcrunch.com/2025/04/25/techcrunch-reference-guide-to-security-terminology/#zero-day">zero-day</a> vulnerability (CVE-2025-53770), with initial attacks primarily targeting government agencies, universities, and energy companies, according to security researchers.</li>
<li style="font-weight:400;">The vulnerability affects on-premises SharePoint installations only, not cloud versions, with researchers identifying 9,000-10,000 vulnerable instances accessible from the internet that require immediate patching or disconnection.</li>
<li style="font-weight:400;">Initial exploitation appears to be limited and targeted, suggesting that nation-states likely back advanced persistent threat (APT) actors. However, broader exploitation by other threat actors is expected as attack methods become public.</li>
<li style="font-weight:400;">Organizations running local SharePoint deployments face immediate risk as Microsoft has not yet released a complete patch, requiring manual mitigation steps outlined in their security guidance.</li>
<li style="font-weight:400;">This incident highlights the ongoing security challenges of maintaining on-premises infrastructure versus cloud services, where patches and security updates are managed centrally by the provider.</li>
<li style="font-weight:400;">It is interesting to us that the cloud was patched, but they didn’t have a patch right away. Strange situation. </li>
<li style="font-weight:400;">From a security standpoint, if you are an Office 365 customer, you have SharePoint whether you want it or not. </li>
</ul>
<p>01:59  Justin – “If you’re still running SharePoint on-prem, my condolences.” </p>
<h2>AI Is Going Great – or How ML Makes Its Money </h2>
<p>05:25 <a href="https://blog.cloudflare.com/the-white-house-ai-action-plan-a-new-chapter-in-u-s-ai-policy/">The White House AI Action Plan:  a new chapter in U.S. AI policy</a></p>
<ul>
<li style="font-weight:400;">The <a href="https://www.ai.gov/action-plan">White House AI Action Plan</a> outlines three pillars focusing on accelerating AI innovation through open-source models, building secure AI infrastructure with high-security data centers, and leading international AI diplomacy while balancing export controls with global technology distribution.</li>
<li style="font-weight:400;"><a href="https://www.cloudflare.com/">Cloudflare</a> emphasizes that distributed edge computing networks are essential for AI inference, offering access to over 50 open-source models through Workers AI and enabling developers to build AI applications without relying on closed providers or centralized infrastructure.</li>
<li style="font-weight:400;">The plan endorses AI-powered cybersecurity for critical infrastructure, with Cloudflare demonstrating practical applications like blocking 247 billion daily cyberattacks using predictive AI and developing <a href="https://blog.cloudflare.com/ai-labyrinth/">AI Labyrinth</a>, which uses AI to trap malicious bots in endless mazes of generated content.</li>
<li style="font-weight:400;">Federal agencies are accelerating AI adoption with Chief AI Officers across departments, and <a href="https://www.cloudflare.com/cloudflare-for-government/">Cloudflare’s FedRAMP Moderate authorization</a> positions them to provide secure, scalable infrastructure for government AI initiatives with plans for FedRAMP High certification.</li>
<li style="font-weight:400;">The tension between promoting AI exports to allies while restricting compute and semiconductor exports to adversaries creates implementation challenges that could impact global AI deployment and innovation if export controls become overly broad or imprecise.</li>
</ul>
<p>07:24  Justin – “I use AI every day now, and I love it, and it’s great – and I also know how bad it is at certain tasks, so to think they’re using AI to fix the tax code or to write legislation freaks me out a little bit.”</p>
<p>09:53 <a href="https://techcrunch.com/2025/07/23/trumps-anti-woke-ai-order-could-reshape-how-us-tech-companies-train-their-models/">Trump’s ‘anti-woke AI’ order could reshape how US tech companies train </a><a href="https://techcrunch.com/2025/07/23/trumps-anti-woke-ai-order-could-reshape-how-us-tech-companies-train-their-models/">their models | TechCrunch</a></p>
<ul>
<li style="font-weight:400;">Trump’s <a href="https://www.whitehouse.gov/presidential-actions/2025/07/preventing-woke-ai-in-the-federal-government/">executive order</a> banning “woke AI” from federal contracts requires AI models to be “ideologically neutral” and avoid DEI-related content, potentially affecting companies like OpenAI, Anthropic, and Google, which recently <a href="https://www.ai.mil/Latest/News-Press/PR-View/Article/4242822/cdao-announces-partnerships-with-frontier-ai-companies-to-address-national-secu/">signed up to $200M defense contracts</a>.</li>
<li style="font-weight:400;">The order defines “truth-seeking” AI as prioritizing historical accuracy and objectivity, while “ideological neutrality” specifically excludes DEI concepts, creating vague standards that could pressure AI companies to align model outputs with administration rhetoric to secure federal funding.</li>
<li style="font-weight:400;">xAI’s Grok appears best positioned under the new rules despite documented antisemitic outputs, as it’s already on the GSA schedule for government procurement and Musk has positioned it as “anti-woke” and “less biased.”</li>
<li style="font-weight:400;">Experts warn the order could lead to AI companies actively reworking training datasets to comply with political priorities, with Musk stating xAI plans to “rewrite the entire corpus of human knowledge” using Grok 4’s reasoning capabilities.</li>
<li style="font-weight:400;">The technical challenge is that achieving truly neutral AI is impossible since all language and data inherently contain bias, and determining what constitutes “objective truth” on politicized topics like climate science becomes a subjective judgment call.</li>
<li style="font-weight:400;">We don’t like this at all. </li>
</ul>
<p>Copy editor Heather note: I’m currently getting a PhD in public history. I’m taking an entire semester class on bias and viewpoint in historical writing, and spoiler alert: there’s no such thing as truly neutral or objective truth, because at the end of the day, someone (or some LLM) will be deciding what information is “neutral” and what is “woke,” and that very decision is by definition a bias. </p>
<p>We’re definitely interested in our listeners’ thoughts on this one. Let us know on social media or on our Slack channel, and let’s discuss! </p>
<p>15:33 <a href="https://share.google/nqPkUw9epiLJ9AEEO">NASA’s AI Satellite Just Made a Decision Without Humans — in 90 Seconds</a></p>
<ul>
<li style="font-weight:400;">NASA’s Dynamic Targeting system enables satellites to autonomously detect clouds and decide whether to capture images in 60-90 seconds using onboard AI processing, eliminating the need for ground control intervention and reducing wasted bandwidth on unusable cloudy images.</li>
<li style="font-weight:400;">The technology runs on <a href="https://www.techrxiv.org/users/746922/articles/718753/master/file/data/CogniSAT-6_Overview_Preprint_V3-0/CogniSAT-6_Overview_Preprint_V3-0.pdf">CogniSAT-6</a>, a briefcase-sized CubeSat equipped with an AI processor from Ubotica, demonstrating that edge computing can now handle complex image analysis and decision-making in space at orbital speeds of 17,000 mph.</li>
<li style="font-weight:400;">Future applications include real-time detection of wildfires, volcanic eruptions, and severe weather systems, with plans for Federated Autonomous Measurement where multiple satellites collaborate by sharing targeting data across a constellation.</li>
<li style="font-weight:400;">This represents a shift toward edge AI in satellite operations, reducing dependency on ground-based processing and enabling faster response times for Earth observation data that could benefit disaster response and climate monitoring applications.</li>
<li style="font-weight:400;">The approach could extend to deep space missions and radar-based systems, with NASA having already tested autonomous plume detection on ESA’s Rosetta orbiter data, suggesting broader applications for autonomous spacecraft decision-making.</li>
<li style="font-weight:400;">Quick reminder that Skynet started as a weather satellite. Just throwing that out there. </li>
</ul>
<p>17:02  Matt – “It’s showing these real-life edge cases of, not just edge computing, but now, leveraging AI and ML  models on the edge to solve real-world problems.” </p>
<h2>Cloud Tools </h2>
<p>21:29 <a href="https://githubnext.com/projects/github-spark">GitHub Next | GitHub Spark</a></p>
<ul>
<li style="font-weight:400;"><a href="https://github.com/features/spark">GitHub Spark</a> is an AI-powered tool that lets developers create micro apps using natural language descriptions without writing or deploying code, featuring a managed runtime with data storage, theming, and LLM integration, and is now available in public preview. </li>
<li style="font-weight:400;">The platform uses an NL-based editor with interactive previews, revision variants, automatic history tracking, and model selection from <a href="https://www.anthropic.com/news/claude-3-5-sonnet">Claude Sonnet 3.5</a>, <a href="https://openai.com/index/hello-gpt-4o/">GPT-4o</a>, <a href="https://openai.com/index/introducing-openai-o1-preview/">o1-preview</a>, and <a href="https://openai.com/index/openai-o1-mini-advancing-cost-efficient-reasoning/">o1-mini</a>.</li>
<li style="font-weight:400;">Apps are automatically deployed as PWAs accessible from desktop and mobile devices, with built-in persistent key-value storage and GitHub Models integration for AI features.</li>
<li style="font-weight:400;">This solves the problem of developers having ideas for personal tools but finding them too time-consuming to build, enabling rapid creation of single-purpose apps tailored to specific workflows.</li>
<li style="font-weight:400;">The collaboration features allow sharing sparks with read-only or read-write permissions, and users can remix others’ apps to customize them further, creating a potential ecosystem of personalized micro applications.</li>
</ul>
<p>22:32  Justin – “It’s an interesting use case; the idea of creating a bunch of these small little building blocks and you can stitch them together into these tool chains. It’s a very interesting approach.” </p>
<h2>AWS </h2>
<p>23:11 <a href="https://www.404media.co/hacker-plants-computer-wiping-commands-in-amazons-ai-coding-agent/">Hacker Plants Computer ‘Wiping’ Commands in Amazon’s AI Coding Agent</a></p>
<ul>
<li style="font-weight:400;">A hacker compromised <a href="https://aws.amazon.com/q/">Amazon’s Q AI coding assistant</a> by submitting a malicious pull request to its GitHub repository, injecting commands that could wipe users’ computers and delete filesystem and cloud resources.</li>
<li style="font-weight:400;">The breach occurred when Amazon included the unauthorized update in a public release of the Q extension, though the actual risk of computer wiping appears low according to the report.</li>
<li style="font-weight:400;">This incident highlights the emerging security risks of AI-powered development tools, as hackers increasingly target these systems to steal data, gain unauthorized access, or demonstrate vulnerabilities.</li>
<li style="font-weight:400;">The ease of the compromise – through a simple pull request – raises questions about code review processes and security controls for AI coding assistants that have direct filesystem access.</li>
<li style="font-weight:400;">Organizations using AI coding tools need to reassess their security posture, particularly around code review workflows and the permissions granted to AI assistants in development environments.</li>
</ul>
<p>24:46  Matt – “If you’re not doing proper peer review for pull requests – which I understand is tedious and painful – but if you’re not doing it, you’re always going ot be susceptible to these things. “</p>
<p>26:31 <a href="https://aws.amazon.com/about-aws/whats-new/2025/07/cost-optimization-hub-account-names-optimization-opportunities/">Cost Optimization Hub now supports account names in optimization </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/07/cost-optimization-hub-account-names-optimization-opportunities/">opportunities – AWS</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/cost-management/latest/userguide/cost-optimization-hub.html">Cost Optimization Hub</a> now displays account names alongside optimization recommendations, replacing the need to cross-reference account IDs when reviewing cost-saving opportunities across multiple AWS accounts.</li>
<li style="font-weight:400;">This update addresses a key pain point for enterprises and AWS Partners managing dozens or hundreds of accounts by enabling faster identification of which teams or projects own specific cost optimization opportunities.</li>
<li style="font-weight:400;">The feature integrates with existing Cost Optimization Hub filtering and consolidation capabilities, allowing users to group recommendations by account name and prioritize actions based on business units or departments.</li>
<li style="font-weight:400;">Available in all regions where Cost Optimization Hub is supported at no additional cost, this enhancement reduces the administrative overhead of translating account IDs to meaningful business context when implementing cost optimizations.</li>
<li style="font-weight:400;">Thank. Goodness. </li>
</ul>
<p>28:25 <a href="https://aws.amazon.com/about-aws/whats-new/2025/07/amazon-ec2-skip-os-shutdown-option-during-stop-terminate/">Amazon EC2 now supports skipping the operating system shutdown when </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/07/amazon-ec2-skip-os-shutdown-option-during-stop-terminate/">Stopping</a><a href="https://aws.amazon.com/about-aws/whats-new/2025/07/amazon-ec2-skip-os-shutdown-option-during-stop-terminate/"> or terminating instances – AWS</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/ec2/">EC2</a> now allows customers to skip graceful OS shutdown when stopping or terminating instances, enabling faster instance state transitions for scenarios where data preservation isn’t critical.</li>
<li style="font-weight:400;">This feature targets high-availability architectures where instance data is replicated elsewhere, allowing failover operations to complete more quickly by bypassing the normal shutdown sequence.</li>
<li style="font-weight:400;">Customers can enable this option through <a href="https://aws.amazon.com/cli/">AWS CLI</a> or <a href="https://aws.amazon.com/console/">EC2 Console</a>, giving them control over the trade-off between data integrity and speed of instance termination.</li>
<li style="font-weight:400;">The feature is available in all commercial regions and <a href="https://aws.amazon.com/govcloud-us/">GovCloud</a>, addressing use cases like auto-scaling groups and spot instance interruptions where rapid instance replacement matters more than graceful shutdown.</li>
<li style="font-weight:400;">This represents a shift in EC2’s approach to instance lifecycle management, acknowledging that not all workloads require the same shutdown guarantees and letting customers optimize for their specific reliability patterns.</li>
</ul>
<p>30:18  Justin – “I know there’s been many times where I, like, trying to do a service refresh, right where you just want to replace servers and you’re waiting patiently… so I guess it’s nice for that. And there are certain times, maybe when the operating system has actually crashed, where you just need it to die. I thought they had something like this before-ish, but I guess not.”</p>
<p>31:38 <a href="https://aws.amazon.com/blogs/compute/building-resilient-multi-tenant-systems-with-amazon-sqs-fair-queues/">Building resilient multi-tenant systems with Amazon SQS fair queues | AWS </a><a href="https://aws.amazon.com/blogs/compute/building-resilient-multi-tenant-systems-with-amazon-sqs-fair-queues/">Compute Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/sqs/">Amazon SQS</a> introduces fair queues to automatically mitigate <a href="https://docs.aws.amazon.com/wellarchitected/latest/saas-lens/noisy-neighbor.html">noisy neighbor</a> problems in multi-tenant systems by detecting when one tenant consumes disproportionate resources and prioritizing messages from other tenants. </li>
<li style="font-weight:400;">This eliminates the need for custom solutions or over-provisioning while maintaining overall queue throughput.</li>
<li style="font-weight:400;">The feature works transparently by adding a MessageGroupId to messages – no consumer code changes required and no impact on API latency or throughput limits. </li>
<li style="font-weight:400;">SQS monitors in-flight message distribution and adjusts delivery order when it detects an imbalance.</li>
<li style="font-weight:400;">New <a href="https://docs.aws.amazon.com/AmazonCloudWatch/latest/monitoring/WhatIsCloudWatch.html">CloudWatch</a> metrics specifically track noisy vs quiet groups, including ApproximateNumberOfNoisyGroups and metrics with the InQuietGroups suffix to monitor non-noisy tenant performance separately. CloudWatch Contributor Insights can identify specific problematic tenants among thousands.</li>
<li style="font-weight:400;">This addresses a common pain point in SaaS and multi-tenant architectures where one customer’s traffic spike or slow processing creates backlogs that impact all other tenants’ message dwell times. Fair queues maintain low latency for well-behaved tenants even during these scenarios.</li>
<li style="font-weight:400;">The feature is available now on all standard SQS queues at no additional cost – just add MessageGroupId to enable fairness behavior. AWS provides a sample application on GitHub to test the behavior with varying message volumes.</li>
</ul>
<p>19:59  Ryan – “I’m glad to have it; I’m not going to complain about this feature, but it does feel like, apparently, there are new tricks that SQS can learn.” </p>
<p>34:37 <a href="https://aws.amazon.com/blogs/mt/launching-amazon-cloudwatch-generative-ai-observability-preview/">Launching Amazon CloudWatch generative AI observability  (Preview) | </a><a href="https://aws.amazon.com/blogs/mt/launching-amazon-cloudwatch-generative-ai-observability-preview/">AWS Cloud Operations Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/mt/category/management-tools/amazon-cloudwatch/">CloudWatch</a> now offers purpose-built monitoring for generative AI applications with automatic instrumentation via AWS Distro for OpenTelemetry (ADOT), capturing telemetry from LLMs, agents, knowledge bases, and tools without code changes – works with open frameworks like <a href="https://strandsagents.com/latest/">Strands Agents</a>, <a href="https://www.langchain.com/langgraph">LangGraph</a>, and <a href="https://www.crewai.com/">CrewAI</a>.</li>
<li style="font-weight:400;">The service provides end-to-end tracing across AI components, whether running on<a href="https://aws.amazon.com/blogs/mt/category/artificial-intelligence/amazon-machine-learning/amazon-bedrock/amazon-bedrock-agents/"> Amazon Bedrock AgentCore</a>, <a href="https://aws.amazon.com/eks/">EKS</a>, <a href="https://aws.amazon.com/ecs/">ECS</a>, or on-premises, with dedicated dashboards showing model invocations, token usage, error rates, and agent performance metrics in a single view.</li>
<li style="font-weight:400;">Integration with existing CloudWatch features like <a href="https://docs.aws.amazon.com/AmazonCloudWatch/latest/monitoring/CloudWatch-Application-Monitoring-Sections.html">Application Signals</a>, <a href="https://docs.aws.amazon.com/AmazonCloudWatch/latest/monitoring/AlarmThatSendsEmail.html">Alarms</a>, and <a href="https://docs.aws.amazon.com/AmazonCloudWatch/latest/logs/AnalyzingLogData.html">Logs Insights</a> enables correlation between AI application behavior and underlying infrastructure metrics, helping identify bottlenecks and troubleshoot issues across the entire stack.</li>
<li style="font-weight:400;">Setup requires configuring OTEL environment variables and enabling transaction search in CloudWatch, with telemetry sent directly to CloudWatch OTLP endpoints – no additional collectors needed, though model invocation logging must be enabled separately for input/output visibility.</li>
<li style="font-weight:400;">This addresses a real pain point where developers previously had to build custom instrumentation or manually correlate logs across complex AI agent interactions, now providing fleet-wide agent monitoring and individual trace analysis in one centralized location.</li>
</ul>
<p>37:18  Matt – “It’s one of those things useful until you’re in the middle of an outage and everyone is complaining that something’s down, and then you’re like ooh, I can see exactly where the world is on fire and this is what caused it.”   </p>
<h2>GCP</h2>
<p>38:01 <a href="https://cloud.google.com/blog/products/containers-kubernetes/10-years-of-gke-ebook/">10 years of GKE ebook | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/kubernetes-engine">GKE</a> celebrates 10 years with an ebook highlighting customer success stories, including Signify scaling from 200 million to 3.5 billion daily transactions and Niantic’s <a href="https://pokemongo.com/">Pokémon GO</a> launch that stress-tested GKE’s capabilities at unprecedented scale.</li>
<li style="font-weight:400;">The ebook emphasizes GKE’s evolution from container orchestration to AI workload management, with GKE Autopilot now offering automated optimization for AI deployments to reduce infrastructure overhead and improve cost efficiency.</li>
<li style="font-weight:400;">Google positions GKE as the foundation for AI-native applications, leveraging its decade of Kubernetes expertise and one million open-source contributions to support complex AI training and inference workloads.</li>
<li style="font-weight:400;">Key differentiator is GKE’s integration with Google’s AI ecosystem and infrastructure, allowing customers to focus on model development rather than cluster management while maintaining enterprise-grade stability and security.</li>
<li style="font-weight:400;">The timing aligns with increased enterprise adoption of Kubernetes for AI/ML workloads, as organizations seek managed platforms that can handle the computational demands of modern AI applications without extensive DevOps overhead.</li>
<li style="font-weight:400;">Happy Birthday. Let’s all get back to crashing Kubernetes. </li>
</ul>
<p>41:29 <a href="https://cloud.google.com/blog/products/compute/dynamic-workload-scheduler-calendar-mode-reserves-gpus-and-tpus/">Dynamic Workload Scheduler Calendar mode reserves GPUs and TPUs | </a><a href="https://cloud.google.com/blog/products/compute/dynamic-workload-scheduler-calendar-mode-reserves-gpus-and-tpus/">Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google’s <a href="https://cloud.google.com/blog/products/compute/introducing-dynamic-workload-scheduler?e=48754805">Dynamic Workload Scheduler</a> Calendar mode enables short-term GPU and TPU reservations up to 90 days without long-term commitments, addressing the challenge of bursty ML workloads that need flexible capacity planning.</li>
<li style="font-weight:400;">The feature works like booking a hotel – users specify resource type, instance count, start date, and duration to instantly see and reserve available capacity, which can then be consumed through Compute Engine, GKE, Vertex AI custom training, and Google Batch.</li>
<li style="font-weight:400;">This positions Google competitively against AWS EC2 Capacity Reservations and Azure’s capacity reservations by offering a more user-friendly interface and shorter-term flexibility specifically optimized for ML workloads.</li>
<li style="font-weight:400;">Early access customers like Schrödinger, Databricks, and Vilya report significant cost savings and faster project completion times, with use cases spanning drug discovery, model training, and computationally intensive research tasks.</li>
<li style="font-weight:400;">Currently available in preview for TPUs with GPU access requiring an account team contact, the service integrates with Google’s <a href="https://cloud.google.com/ai-hypercomputer/docs/overview">AI Hypercomputer</a> ecosystem and extends existing Compute Engine future reservations capabilities for co-located accelerator capacity.</li>
</ul>
<p>43:41  Justin – “I’m disappointed there’s no calendar view. The screenshots they showed – I can see how I create it. I see the reservation period I’m asking for. And then at the end, there’s a list of all your reservations. Just a list. It’s not even a calendar. Come on, Google, get this together. But yeah, in general, this is a great feature.”</p>
<p>44:46 <a href="https://cloud.google.com/blog/products/ai-machine-learning/bigquery-meets-google-adk-and-mcp/">BigQuery meets Google ADK &amp; MCP | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google introduces first-party <a href="https://google.github.io/adk-docs/tools/built-in-tools/#bigquery">BigQuery tools for AI agents through ADK</a> (Agent Development Kit) <a href="https://googleapis.github.io/genai-toolbox/resources/tools/bigquery/">and MCP</a> (Model Context Protocol), eliminating the need for developers to build custom integrations for authentication, error handling, and query execution.</li>
<li style="font-weight:400;">The toolset includes five core functions: list_dataset_ids, get_dataset_info, list_table_ids, get_table_info, and execute_sql, providing agents with secure access to BigQuery metadata and query capabilities without custom code maintenance.</li>
<li style="font-weight:400;">Two deployment options are available: ADK’s built-in toolset for direct integration or the MCP Toolbox for Databases, which centralizes tool management across multiple agents, reducing maintenance overhead when updating tool logic or authentication methods.</li>
<li style="font-weight:400;">This positions Google competitively against AWS Bedrock and Azure OpenAI Service by offering native data warehouse integration for enterprise AI agents, particularly valuable for organizations already invested in BigQuery for analytics workloads.</li>
<li style="font-weight:400;">The solution addresses enterprise concerns about secure data access for AI agents while supporting natural language business queries like “What are our top-selling products?” or “How many customers do we have in Colombia?” without exposing raw database credentials to applications.</li>
</ul>
<p>45:49  Matt – “I mean, anything with BigQuery and making it easier to use feels like it makes my life easier.”</p>
<p>46:24 <a href="https://cloud.google.com/blog/products/ai-machine-learning/global-endpoint-for-claude-models-generally-available-on-vertex-ai/">Global endpoint for Claude models generally available on Vertex AI | </a><a href="https://cloud.google.com/blog/products/ai-machine-learning/global-endpoint-for-claude-models-generally-available-on-vertex-ai/">Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google Cloud now offers a global endpoint for <a href="https://docs.anthropic.com/en/docs/about-claude/models/overview">Anthropic’s Claude models</a> on <a href="https://cloud.google.com/vertex-ai">Vertex AI</a> that dynamically routes requests to any region with available capacity, improving uptime and reducing regional capacity errors for <a href="https://www.anthropic.com/claude/opus">Claude Opus 4</a>, <a href="https://www.anthropic.com/claude/sonnet">Sonnet 4</a>, <a href="https://cloud.google.com/vertex-ai/generative-ai/docs/partner-models/claude/sonnet-3-7">Sonnet 3.7</a>, and <a href="https://cloud.google.com/vertex-ai/generative-ai/docs/partner-models/claude/sonnet-3-5-v2">Sonnet 3.5 v2</a>.</li>
<li style="font-weight:400;">The global endpoint maintains the same pay-as-you-go pricing as regional endpoints and fully supports <a href="https://cloud.google.com/vertex-ai/generative-ai/docs/partner-models/claude/prompt-caching#:~:text=The%20Anthropic%20Claude%20models%20offer,results%20from%20the%20previous%20request.">prompt caching</a>, automatically routing cached requests to the region holding the cache for optimal latency while falling back to other regions if needed.</li>
<li style="font-weight:400;">This positions GCP competitively against AWS Bedrock’s cross-region inference feature, though GCP’s implementation currently lacks provisioned throughput support and requires careful consideration for workloads with data residency requirements.</li>
<li style="font-weight:400;">Key beneficiaries include AI application developers needing high availability without geographic constraints, particularly those building customer-facing chatbots, content generation tools, or AI agents that require consistent uptime across regions.</li>
<li style="font-weight:400;">Implementation requires only changing the location variable to “GLOBAL” in existing Claude configurations, making it a simple upgrade path for current users while maintaining separate global quotas manageable through the Google Cloud console.</li>
</ul>
<p>47:03  Matt – “This is a great feature, but you have to be very careful with any data sovereignty laws that you have.” </p>
<p>51:10 <a href="https://blog.google/technology/google-labs/notebooklm-video-overviews-studio-upgrades/">NotebookLM updates: Video Overviews, Studio upgrades</a></p>
<ul>
<li style="font-weight:400;"><a href="https://notebooklm.google/">NotebookLM</a> introduces Video Overviews that generate narrated slide presentations with AI-created visuals, pulling diagrams and data from uploaded documents to explain complex concepts – particularly useful for technical documentation and data visualization in cloud environments.</li>
<li style="font-weight:400;">The Studio panel redesign allows users to create multiple outputs of the same type per notebook, enabling teams to generate role-specific Audio and Video Overviews from shared documentation – a practical feature for cloud teams managing technical knowledge bases.</li>
<li style="font-weight:400;">Video Overviews support customization through natural language prompts, allowing users to specify expertise levels and focus areas, which could streamline onboarding and knowledge transfer for cloud engineering teams.</li>
<li style="font-weight:400;">The multi-tasking capability lets users consume different content formats simultaneously within the Studio panel, potentially improving productivity for developers reviewing technical documentation while working.</li>
<li style="font-weight:400;">Currently available in English only, with multi-language support coming soon, positioning NotebookLM as a knowledge management tool that could complement existing cloud documentation and training workflows.</li>
</ul>
<p>52:23  Justin – “Meaning that everyone who is rushing off to replace us with a podcast can now replace us with a video, dynamically generated PowerPoint slides, and then they put you right to sleep. Or you could just listen to us, you choose.”</p>
<h2>Azure</h2>
<p>53:11  <a href="https://azure.microsoft.com/en-us/blog/project-flash-update-advancing-azure-virtual-machine-availability-monitoring-2/">Project Flash update: Advancing Azure Virtual Machine availability </a><a href="https://azure.microsoft.com/en-us/blog/project-flash-update-advancing-azure-virtual-machine-availability-monitoring-2/">monitoring | Microsoft Azure Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/virtual-machines/flash-overview">Project Flash</a> now includes a user vs platform dimension in VM availability metrics, allowing customers to distinguish whether downtime was caused by Azure infrastructure issues or user-initiated actions. </li>
<li style="font-weight:400;"> This addresses a key pain point for enterprises like BlackRock that need precise attribution for service interruptions.</li>
<li style="font-weight:400;">The new Event Grid integration with Azure Monitor alerts enables near real-time notifications via SMS, email, and push notifications when VM availability changes occur, providing faster incident response compared to traditional monitoring approaches.</li>
<li style="font-weight:400;">Flash publishes detailed <a href="https://learn.microsoft.com/en-us/azure/service-health/resource-health-overview#health-status">VM availability</a> states and resource health annotations that help with root cause analysis, including information about degraded nodes, service healing events, and hardware issues – giving operations teams actionable data for troubleshooting.</li>
<li style="font-weight:400;">The solution scales from small deployments to massive infrastructures and integrates with existing Azure monitoring tools, though customers should combine Flash Health events with Scheduled Events for comprehensive coverage of both unplanned outages and planned maintenance windows.</li>
<li style="font-weight:400;">Future enhancements will expand monitoring to include top-of-rack switch failures, accelerated networking issues, and predictive hardware failure detection – positioning Azure to compete more directly with AWS CloudWatch and GCP’s operations suite for infrastructure monitoring.</li>
</ul>
<p>54:29  Matt – “I think that a lot of these things are very cool, but I also feel like this is a lot more for stateless systems, and I try very hard to not have stateless VMs – as much as I can – in my life.” </p>
<p>56:38  <a href="https://techcommunity.microsoft.com/blog/microsoft365copilotblog/announcing-microsoft-365-copilot-search-general-availability-a-new-era-of-search/4435537">Announcing Microsoft 365 Copilot Search General Availability: A new </a><a href="https://techcommunity.microsoft.com/blog/microsoft365copilotblog/announcing-microsoft-365-copilot-search-general-availability-a-new-era-of-search/4435537">era of search with Copilot | Microsoft Community Hub</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/copilot/microsoft-365/microsoft-365-copilot-search">Microsoft 365 Copilot Search</a> is now generally available as a dedicated module within the Microsoft 365 Copilot app, providing AI-powered unified search across SharePoint, OneDrive, Outlook, and over 150 external data sources through Copilot Connectors, including Salesforce, ServiceNow, Workday, and SAP.</li>
<li style="font-weight:400;">The service uses AI to understand query context and deliver relevant documents, emails, and meeting notes without requiring any setup – users with eligible Microsoft 365 Copilot licenses automatically see a Search tab alongside Chat and other Copilot experiences across desktop, web, and mobile platforms.</li>
<li style="font-weight:400;">This positions Microsoft against Google’s enterprise search capabilities and AWS Kendra by leveraging existing Microsoft 365 infrastructure and licensing, with no additional cost beyond the standard Microsoft 365 Copilot license, which runs $30 per user per month.</li>
<li style="font-weight:400;">Key differentiator is the instant query predictions feature that surfaces recently worked documents, colleague collaborations, and documents where users are mentioned, addressing the common enterprise pain point of information scattered across disconnected silos.</li>
<li style="font-weight:400;">Target customers are enterprises already invested in Microsoft 365 who need to break down information barriers between Microsoft and third-party systems, particularly those using multiple SaaS platforms that can now be searched through a single interface.</li>
</ul>
<p>58:51  <a href="https://techcommunity.microsoft.com/blog/appsonazureblog/important-changes-to-app-service-managed-certificates-is-your-certificate-affect/4435193">Important Changes to App Service Managed Certificates: Is Your </a><a href="https://techcommunity.microsoft.com/blog/appsonazureblog/important-changes-to-app-service-managed-certificates-is-your-certificate-affect/4435193">Certificate Affected? | Microsoft Community Hub</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/azure/app-service/app-service-managed-certificate-changes-july-2025">Azure App Service Managed Certificates</a> must meet new industry-wide multi-perspective issuance corroboration (MPIC) requirements by July 28, 2025, which will break certificate renewals for apps that aren’t publicly accessible, use Traffic Manager nested/external endpoints, or rely on *.trafficmanager.net domains.</li>
<li style="font-weight:400;">This change impacts organizations using App Service Managed Certificates with private endpoints, IP restrictions, client certificate requirements, or authentication gateways – forcing them to purchase and manage their own SSL certificates instead of using the free managed option.</li>
<li style="font-weight:400;">Microsoft provides Azure Resource Graph queries to help identify affected resources, but the queries don’t capture all edge cases, requiring manual review of Traffic Manager configurations and custom access policies that might block DigiCert’s validation.</li>
<li style="font-weight:400;">Unlike AWS Certificate Manager, which supports private certificate authorities and internal resources, Azure’s managed certificates will only work for publicly accessible apps, potentially increasing operational overhead and costs for enterprises with strict security requirements.</li>
<li style="font-weight:400;">The six-month grace period before existing certificates expire gives organizations time to migrate, but those relying on the free managed certificate service for internal or restricted apps will need to budget for commercial SSL certificates and implement manual renewal processes.</li>
<li style="font-weight:400;">Yes, you read that right. A whole 7 days to prep. </li>
<li style="font-weight:400;">Thanks, guys. Gold stars all around.  </li>
</ul>
<p>1:03:42 <a href="https://techcommunity.microsoft.com/blog/azurenetworksecurityblog/draft-and-deploy---azure-firewall-policy-changes-preview/4435499">Draft and deploy – Azure Firewall policy changes [Preview] | Microsoft </a><a href="https://techcommunity.microsoft.com/blog/azurenetworksecurityblog/draft-and-deploy---azure-firewall-policy-changes-preview/4435499">Community Hub</a></p>
<ul>
<li style="font-weight:400;">Azure Firewall now supports a draft and deploy feature in preview that allows administrators to stage policy changes in a temporary draft environment before applying them atomically to production, addressing the challenge where even small changes previously took several minutes to deploy.</li>
<li style="font-weight:400;">The two-phase model separates editing from deployment – users clone the active policy into a draft, make multiple changes without affecting live traffic, collaborate with reviewers, then validate and deploy all changes in a single operation that replaces the active policy.</li>
<li style="font-weight:400;">This feature targets enterprises with strict change management and governance requirements who need formal approval workflows for firewall policy updates, reducing configuration risks and minimizing the chance of accidentally blocking critical traffic or exposing workloads.</li>
<li style="font-weight:400;">The preview is currently limited to Azure Firewall policies only and doesn’t support Classic rules or Firewall Manager, with deployment available through Azure Portal or CLI commands for organizations looking to streamline their security operations.</li>
<li style="font-weight:400;">While AWS offers similar staging capabilities through AWS Network Firewall rule groups and GCP provides hierarchical firewall policies, Azure’s implementation focuses on atomic deployments and collaborative review cycles that integrate with existing enterprise change management processes.</li>
</ul>
<p>1:05:24  Justin – “It’s also weird that it’s limited to not include the classic rules or the firewall manager.”</p>
<h2>Cloud Journey </h2>
<p>1:06:52  <a href="https://aws.amazon.com/blogs/security/beyond-iam-access-keys-modern-authentication-approaches-for-aws/">Beyond IAM access keys: Modern authentication approaches for AWS | </a><a href="https://aws.amazon.com/blogs/security/beyond-iam-access-keys-modern-authentication-approaches-for-aws/">AWS Security Blog</a></p>
<ul>
<li style="font-weight:400;">AWS is pushing developers away from long-term <a href="https://aws.amazon.com/iam/">IAM</a> access keys toward temporary credential solutions like <a href="https://aws.amazon.com/cloudshell/">CloudShell</a>, IAM Identity Center, and IAM roles to reduce security risks from credential exposure and unauthorized sharing.</li>
<li style="font-weight:400;">CloudShell provides a browser-based <a href="https://aws.amazon.com/cli">CLI</a> that eliminates local credential management, while IAM Identity Center integration with AWS CLI v2 adds centralized user management and seamless MFA support.</li>
<li style="font-weight:400;">For CI/CD pipelines and third-party services, AWS recommends using IAM Roles Anywhere for on-premises workloads and OIDC integration for services like GitHub Actions instead of static access keys.</li>
<li style="font-weight:400;">Modern IDEs like VS Code now support secure authentication through <a href="https://aws.amazon.com/iam/identity-center">IAM Identity Center</a> via <a href="https://aws.amazon.com/visualstudiocode/">AWS Toolkit</a>, removing the need for developers to store access keys locally.</li>
<li style="font-weight:400;">AWS emphasizes implementing least privilege policies and offers automated policy generation based on <a href="https://aws.amazon.com/cloudtrail">CloudTrail</a> logs to help create permission templates from actual usage patterns.</li>
</ul>
<p>01:15:52  <a href="https://www.crowdstrike.com/en-us/blog/reflecting-on-building-resilience-by-design/">Reflecting on Building Resilience by Design | CrowdStrike</a></p>
<ul>
<li style="font-weight:400;">CrowdStrike has introduced granular content control features, allowing customers to pin specific security configuration versions and set different deployment schedules across test systems, workstations, and critical infrastructure through host group policies.</li>
<li style="font-weight:400;">The company established a dedicated Digital Operations Center to unify monitoring and incident response capabilities across millions of sensors worldwide, processing telemetry at exabyte scale from endpoints, clouds, containers, and other systems.</li>
<li style="font-weight:400;">A new Falcon Super Lab tests thousands of OS, kernel, hardware, and third-party application combinations, with plans to add customer profile testing that validates products in specific deployment environments.</li>
<li style="font-weight:400;">CrowdStrike is creating a Chief Resilience Officer role reporting directly to the CEO and launching Project Ascent to explore security capabilities outside kernel space while maintaining effectiveness against kernel-level threats.</li>
<li style="font-weight:400;">The platform now provides real-time visibility through a content quality dashboard showing release progression across early access and general availability phases, with automated deployment adjustments via Falcon Fusion SOAR workflows.</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2105557/c1e-60w0coq7j3hz977m-34737qppsg4d-049q5g.mp3" length="116104274"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[ Welcome to episode 315 of The Cloud Pod, where the forecast is always cloudy! Your hosts, Justin and Matt, are here to bring you the latest in cloud and AI news, including news about AI from the White House, the newest hacker exploits, and news from CloudWatch, CrowdStrike, and GKE – plus so much more. Let’s get into it! 
Titles we almost went with this week:

SharePoint and Tell: Government Secrets at Risk
Zero-Day Hero: How Hackers Found SharePoint’s Achilles’ Heel
Amazon Q Gets an F in Security Class
Spark Joy: GitHub’s Marie Kondo Approach to App Development
No Code? No Problem! GitHub Lights a Spark Under App Creation
GKE Turns 10: Still Not Old Enough to Deploy Itself
A Decade of Containers: Pokémon GO Caught Them All
Kubernetes Engine Hits Double Digits, Still Can’t Count Past 9 Pods
Account Names: The Missing Link in AWS Cost Optimization
Flash Gordon Saves Your VMs from the Azure-verse
The Flash: Fastest VM Monitor in the Multiverse
Ctrl+AI+Delete: Rebooting America’s Artificial Intelligence Strategy
The AImerican Dream: White House Plots Path to Silicon Supremacy
CrowdStrike’s Year of Living Resiliently
Kernel Panic at the Disco: A Recovery Story
The Search is Over (But Your Copilot License Isn’t)
Ground Control to Major Tom: You’re Fired
GPU Booking.com: Reserve Your Neural Network’s Next Vacation
Calendar Man Strikes Again: This Time He’s Scheduling Your TPUs
AirBnB for AI: Short-Term Rentals for Your Machine Learning Models 
Claude’s World Tour: Now Playing in Every Region
Going Global: Claude Gets Its Passport Stamped on Vertex AI
SQS Finally Learns to Share: No More Queue Hogging
The Noisy Neighbor Gets Shushed: Amazon’s Fair Play for Queues
CloudWatch Gets Its AI Degree in Observability
Teaching Old Logs New Tricks: CloudWatch Goes GenAI
The Agent Whisperer: CloudWatch’s New AI Monitoring Powers
NotebookLM Gets Its PowerPoint License
Slides, Camera, AI-ction: NotebookLM Goes Visual
The SSL-ippery Slope: Azure’s Managed Certs Go Public or Go Home
Breaking Bad Certificates: DigiCert’s New Rules Leave Some Apps High and Dry
Firewall Rules: Now with a Rough Draft Feature
Azure’s New Policy: Think Before You Deploy

General News 
00:50 Hackers exploiting a SharePoint zero-day are seen targeting government agencies | TechCrunch

Microsoft SharePoint servers are being actively exploited through a zero-day vulnerability (CVE-2025-53770), with initial attacks primarily targeting government agencies, universities, and energy companies, according to security researchers.
The vulnerability affects on-premises SharePoint installations only, not cloud versions, with researchers identifying 9,000-10,000 vulnerable instances accessible from the internet that require immediate patching or disconnection.
Initial exploitation appears to be limited and targeted, suggesting that nation-states likely back advanced persistent threat (APT) actors. However, broader exploitation by other threat actors is expected as attack methods become public.
Organizations running local Shar...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2105557/c1a-k5d5-v640494mfk27-txadlx.jpg"></itunes:image>
                                                                            <itunes:duration>01:20:37</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2105557/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[314: Vector? I Hardly Know Her! S3's New AI Storage Play]]>
                </title>
                <pubDate>Wed, 30 Jul 2025 14:53:30 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2101459</guid>
                                    <link>https://tcpfm.castos.com/episodes/314-vector-i-hardly-know-her-s3s-new-ai-storagevcu</link>
                                <description>
                                            <![CDATA[<h3>Welcome to episode 314 of The Cloud Pod, where your hosts, Matt and Ryan, are holding down the fort in Justin’s absence and bringing what’s left of our audience (those of you still here after the last time they were left in charge) the latest and greatest in cloud and tech news. We’ve got undersea cables, vector storage, and even some hobos – but not the kind on trains. Plus AWS S3 Let’s get started! </h3>
<h3>Titles we almost went with this week:</h3>
<ul>
<li>S3 Gets Direction: AWS Points to Vector Storage</li>
<li>Vector? I Hardly Know Her! S3’s New AI Storage Play</li>
<li>S3 Finds Its Magnitude and Direction</li>
<li>Claude Goes to Wall Street</li>
<li>Anthropic’s Bull Run Into Financial Services</li>
<li>AI Assistant Gets Its Series 7 License</li>
<li>Nova Scotia: AWS Brings Regional Flavor to AI Models</li>
<li>The Fine-Tuning of the Shrew: Teaching Nova Models New Tricks</li>
<li>Nova-caine: Numbing the Pain of Model Customization</li>
<li>AgentCore Blimey: AWS Gives AI Agents Their License to Scale</li>
<li>The Agent Infrastructure: Mission Deployable</li>
<li>From Zero to Agent Hero: AWS Tackles the Production Problem</li>
<li>SageMaker Gets Its Data Act Together</li>
<li>From Catalog to QuickSight: A Data Love Story</li>
<li>The Great Data Unification of 2024</li>
<li>AWS Free Tier Gets a $200 Makeover</li>
<li>EKS-treme Makeover: Cluster Edition</li>
<li>#⃣100K Nodes Walk Into a Cluster…</li>
<li>S3 Gets Direction: Amazon Points to Vector Storage</li>
<li>Amazon S3: Now with 90% Less Vector Bills and 100% More Dimensions</li>
</ul>
<h2>Follow Up</h2>
<p>01:03 <a href="https://www.wsj.com/tech/ai/softbank-openai-a3dc57b4?st=1NSFE7&amp;reflink=desktopwebshare_permalink">SoftBank and OpenAI’s $500 Billion AI Project Struggles to Get Off Ground</a></p>
<ul>
<li style="font-weight:400;">The $500 billion AI effort unveiled at the White House has struggled to get off the ground and has scaled back its near-term plans. </li>
<li style="font-weight:400;">It’s been six months since the announcement, where they said they would spend $100B almost immediately, but now they have a more modest goal of building a small data center by the end of the year in Ohio.</li>
<li style="font-weight:400;"><a href="https://www.softbank.jp/en/">Softbank</a> committed to $30 billion earlier this year, and it is one of the largest ever startup investments by them, which led them to take on new debt and sell assets.  </li>
<li style="font-weight:400;">This investment was made alongside <a href="https://openai.com/index/announcing-the-stargate-project/">Stargate</a>, giving them a role in the physical infrastructure needed for AI. </li>
<li style="font-weight:400;">Altman, though, has been eager to secure computing power as quickly as possible and has proceeded without Softbank. </li>
<li style="font-weight:400;">Publicly, they say it’s a great partnership, and they look forward to advancing projects in multiple states</li>
<li style="font-weight:400;"><a href="https://www.oracle.com/">Oracle</a> was part of Stargate, but the recent 30B deal just signed with includes a commitment of 4.5 gigawatts of capacity, and would consume the equivalent power of more than two Hoover Dams, or about 4 million homes. </li>
<li style="font-weight:400;">Oracle was also named part of the deal with UAE firm <a href="https://www.mgx.ae/en">MGX</a> as a partner, but Oracle CEO Safra Catz said that Stargate hadn’t been formed yet, as of last month. </li>
</ul>
<p>02:31  Matthew – “…everyone’s like, how hard can it be to build a data center? But it’s city zoning, power consumption, grid improvements, water for cooling… getting communities to approve – and these things end up being a massive undertaking. And it takes the hyperscalers a long time to get these things up and operational. So it doesn’t surprise me that a small data center by the end of the year is probably something that was already in the works beforehand; they’re just taking over other plans. Most da...</p>
<h3>Chapters</h3>
<ul><li>(00:00:00) - Azure 1.8</li><li>(00:01:04) - SoftBank and OpenAI's 500 Billion AI Project</li><li>(00:04:53) - These Undersea Cable Sensors Could Aid Climate Change Monitoring</li><li>(00:08:47) - AWS, Google Cloud AI for Financial Services</li><li>(00:14:15) - Bedrock 12 Live Video Understanding Models now available in AWS</li><li>(00:17:21) - Harness AI</li><li>(00:20:06) - AWS New York City: AWS S3 Visions and More</li><li>(00:22:56) - Elasticsearch + S3: Vector Search</li><li>(00:24:47) - Amazon Nova Customization in SageMaker</li><li>(00:27:40) - Amazon Bedrock Agent Core: Enterprise-grade Infrastructure for deploying AI</li><li>(00:33:51) - Amazon SageMaker Catalog with Quicksight Integration</li><li>(00:37:52) - WASP Introduces Free tier</li><li>(00:40:01) - Amazon EC2 Budgeting Update</li><li>(00:43:28) - Amazon EventBridge Locate & Debug Kinesis Data</li><li>(00:47:29) - AWS S3 metadata: Complete metadata for all your S3</li><li>(00:52:05) - Oh yeah, double-layer encryption with ON S3</li><li>(00:52:54) -  AWS Lambda: Direct to IDE and Remote Debugging</li><li>(00:57:39) - ECS: Blue Green Deployments</li><li>(01:00:57) - Amazon Bracket Adds New 54-Bit Qubit Quantum Processor</li><li>(01:03:48) - Google CloudWatch and LibTPU for optimizing Google TPU resources</li><li>(01:06:08) - Application Monitoring: Cloud Observation & Investigations</li><li>(01:09:49) - Google Expands DeepSeen R1 to Microsoft Fabric</li><li>(01:16:07) -  AWS CLI for Migrating From Availability Sets and Basic Load Bal</li><li>(01:18:42) - Microsoft's Cloud HSM</li><li>(01:21:04) - Microsoft's New Hobo Model for ExpressRoute Gateways</li><li>(01:23:02) - Azure Functions: Public Preview 2.8</li><li>(01:26:09) - Azure WAF for Application Load Balancers for Kubernet</li><li>(01:29:32) - Week in the Cloud</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 314 of The Cloud Pod, where your hosts, Matt and Ryan, are holding down the fort in Justin’s absence and bringing what’s left of our audience (those of you still here after the last time they were left in charge) the latest and greatest in cloud and tech news. We’ve got undersea cables, vector storage, and even some hobos – but not the kind on trains. Plus AWS S3 Let’s get started! 
Titles we almost went with this week:

S3 Gets Direction: AWS Points to Vector Storage
Vector? I Hardly Know Her! S3’s New AI Storage Play
S3 Finds Its Magnitude and Direction
Claude Goes to Wall Street
Anthropic’s Bull Run Into Financial Services
AI Assistant Gets Its Series 7 License
Nova Scotia: AWS Brings Regional Flavor to AI Models
The Fine-Tuning of the Shrew: Teaching Nova Models New Tricks
Nova-caine: Numbing the Pain of Model Customization
AgentCore Blimey: AWS Gives AI Agents Their License to Scale
The Agent Infrastructure: Mission Deployable
From Zero to Agent Hero: AWS Tackles the Production Problem
SageMaker Gets Its Data Act Together
From Catalog to QuickSight: A Data Love Story
The Great Data Unification of 2024
AWS Free Tier Gets a $200 Makeover
EKS-treme Makeover: Cluster Edition
#⃣100K Nodes Walk Into a Cluster…
S3 Gets Direction: Amazon Points to Vector Storage
Amazon S3: Now with 90% Less Vector Bills and 100% More Dimensions

Follow Up
01:03 SoftBank and OpenAI’s $500 Billion AI Project Struggles to Get Off Ground

The $500 billion AI effort unveiled at the White House has struggled to get off the ground and has scaled back its near-term plans. 
It’s been six months since the announcement, where they said they would spend $100B almost immediately, but now they have a more modest goal of building a small data center by the end of the year in Ohio.
Softbank committed to $30 billion earlier this year, and it is one of the largest ever startup investments by them, which led them to take on new debt and sell assets.  
This investment was made alongside Stargate, giving them a role in the physical infrastructure needed for AI. 
Altman, though, has been eager to secure computing power as quickly as possible and has proceeded without Softbank. 
Publicly, they say it’s a great partnership, and they look forward to advancing projects in multiple states
Oracle was part of Stargate, but the recent 30B deal just signed with includes a commitment of 4.5 gigawatts of capacity, and would consume the equivalent power of more than two Hoover Dams, or about 4 million homes. 
Oracle was also named part of the deal with UAE firm MGX as a partner, but Oracle CEO Safra Catz said that Stargate hadn’t been formed yet, as of last month. 

02:31  Matthew – “…everyone’s like, how hard can it be to build a data center? But it’s city zoning, power consumption, grid improvements, water for cooling… getting communities to approve – and these things end up being a massive undertaking. And it takes the hyperscalers a long time to get these things up and operational. So it doesn’t surprise me that a small data center by the end of the year is probably something that was already in the works beforehand; they’re just taking over other plans. Most da...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[314: Vector? I Hardly Know Her! S3's New AI Storage Play]]>
                </itunes:title>
                                    <itunes:episode>314</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<h3>Welcome to episode 314 of The Cloud Pod, where your hosts, Matt and Ryan, are holding down the fort in Justin’s absence and bringing what’s left of our audience (those of you still here after the last time they were left in charge) the latest and greatest in cloud and tech news. We’ve got undersea cables, vector storage, and even some hobos – but not the kind on trains. Plus AWS S3 Let’s get started! </h3>
<h3>Titles we almost went with this week:</h3>
<ul>
<li>S3 Gets Direction: AWS Points to Vector Storage</li>
<li>Vector? I Hardly Know Her! S3’s New AI Storage Play</li>
<li>S3 Finds Its Magnitude and Direction</li>
<li>Claude Goes to Wall Street</li>
<li>Anthropic’s Bull Run Into Financial Services</li>
<li>AI Assistant Gets Its Series 7 License</li>
<li>Nova Scotia: AWS Brings Regional Flavor to AI Models</li>
<li>The Fine-Tuning of the Shrew: Teaching Nova Models New Tricks</li>
<li>Nova-caine: Numbing the Pain of Model Customization</li>
<li>AgentCore Blimey: AWS Gives AI Agents Their License to Scale</li>
<li>The Agent Infrastructure: Mission Deployable</li>
<li>From Zero to Agent Hero: AWS Tackles the Production Problem</li>
<li>SageMaker Gets Its Data Act Together</li>
<li>From Catalog to QuickSight: A Data Love Story</li>
<li>The Great Data Unification of 2024</li>
<li>AWS Free Tier Gets a $200 Makeover</li>
<li>EKS-treme Makeover: Cluster Edition</li>
<li>#⃣100K Nodes Walk Into a Cluster…</li>
<li>S3 Gets Direction: Amazon Points to Vector Storage</li>
<li>Amazon S3: Now with 90% Less Vector Bills and 100% More Dimensions</li>
</ul>
<h2>Follow Up</h2>
<p>01:03 <a href="https://www.wsj.com/tech/ai/softbank-openai-a3dc57b4?st=1NSFE7&amp;reflink=desktopwebshare_permalink">SoftBank and OpenAI’s $500 Billion AI Project Struggles to Get Off Ground</a></p>
<ul>
<li style="font-weight:400;">The $500 billion AI effort unveiled at the White House has struggled to get off the ground and has scaled back its near-term plans. </li>
<li style="font-weight:400;">It’s been six months since the announcement, where they said they would spend $100B almost immediately, but now they have a more modest goal of building a small data center by the end of the year in Ohio.</li>
<li style="font-weight:400;"><a href="https://www.softbank.jp/en/">Softbank</a> committed to $30 billion earlier this year, and it is one of the largest ever startup investments by them, which led them to take on new debt and sell assets.  </li>
<li style="font-weight:400;">This investment was made alongside <a href="https://openai.com/index/announcing-the-stargate-project/">Stargate</a>, giving them a role in the physical infrastructure needed for AI. </li>
<li style="font-weight:400;">Altman, though, has been eager to secure computing power as quickly as possible and has proceeded without Softbank. </li>
<li style="font-weight:400;">Publicly, they say it’s a great partnership, and they look forward to advancing projects in multiple states</li>
<li style="font-weight:400;"><a href="https://www.oracle.com/">Oracle</a> was part of Stargate, but the recent 30B deal just signed with includes a commitment of 4.5 gigawatts of capacity, and would consume the equivalent power of more than two Hoover Dams, or about 4 million homes. </li>
<li style="font-weight:400;">Oracle was also named part of the deal with UAE firm <a href="https://www.mgx.ae/en">MGX</a> as a partner, but Oracle CEO Safra Catz said that Stargate hadn’t been formed yet, as of last month. </li>
</ul>
<p>02:31  Matthew – “…everyone’s like, how hard can it be to build a data center? But it’s city zoning, power consumption, grid improvements, water for cooling… getting communities to approve – and these things end up being a massive undertaking. And it takes the hyperscalers a long time to get these things up and operational. So it doesn’t surprise me that a small data center by the end of the year is probably something that was already in the works beforehand; they’re just taking over other plans. Most data centers take a couple of years to really get up and operational.”</p>
<h2>General News</h2>
<p>04:55 <a href="https://eos.org/research-spotlights/a-transatlantic-communications-cable-does-double-duty">A Transatlantic Communications Cable Does Double Duty – Eos</a></p>
<ul>
<li style="font-weight:400;">You know how much we love a good undersea cable story, and this one is especially nerdy. Strap in! (Thanks, Matt)</li>
<li style="font-weight:400;">Scientists have developed a new instrument that transforms existing undersea fiber-optic telecommunications cables into ocean sensors by measuring variations in light signals between repeaters, enabling monitoring of water temperature, pressure, and tide patterns without disrupting internet or phone service.</li>
<li style="font-weight:400;">The technology uses fiber Bragg gratings at cable repeaters (positioned every 50-100km) to reflect light signals, allowing researchers to measure changes in travel time that indicate how surrounding water conditions affect cable shape and properties.</li>
<li style="font-weight:400;">This distributed sensing approach is more cost-effective than previous methods as it uses standard, nonstabilized lasers rather than expensive ultrastable ones, and can monitor individual cable subsections rather than treating the entire cable as a single sensor.</li>
<li style="font-weight:400;">The 77-day test on the EllaLink cable between Portugal and Brazil successfully measured daily and weekly temperature variations and tide patterns across 82 subsections, demonstrating the potential for the global submarine cable network to serve dual purposes.</li>
<li style="font-weight:400;">The technology could enable early tsunami warning systems and long-term climate monitoring by leveraging millions of kilometers of existing infrastructure, providing valuable ocean data without requiring new sensor deployments.</li>
</ul>
<p>06:30  Ryan – “It feels like our version of like getting into World War Two or something.”</p>
<h2>AI Is Going Great – or How ML Makes Its Money </h2>
<p>08:55 <a href="https://www.cnbc.com/2025/07/15/claude-ai-financial-anthropic-amazon.html">Amazon-backed Anthropic rolls out Claude AI for financial services</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.anthropic.com/">Anthropic</a> launched <a href="https://www.anthropic.com/solutions/financial-services">Claude Financial Analysis Solution</a>, a tailored version of <a href="https://www.anthropic.com/enterprise">Claude for Enterprise</a> specifically designed for financial professionals to analyze markets, make investment decisions, and conduct research using <a href="https://www.anthropic.com/news/claude-4">Claude 4</a> models with expanded usage limits.</li>
<li style="font-weight:400;">The solution integrates with major financial data providers, including Box, PitchBook, Databricks, S&amp;P Global, and Snowflake, for real-time financial information access, with availability through <a href="https://aws.amazon.com/marketplace/">AWS Marketplace</a> and <a href="https://cloud.google.com/marketplace">Google Cloud Marketplace</a> coming soon.</li>
<li style="font-weight:400;">This represents Anthropic’s strategic push into enterprise AI following their $61.5 billion valuation in March, targeting financial services as businesses increasingly adopt generative AI for customer-facing functions.</li>
<li style="font-weight:400;">The offering includes <a href="https://www.anthropic.com/claude-code">Claude Code</a> capabilities and implementation support, positioning it as a specialized alternative to general-purpose AI assistants for complex financial analysis tasks requiring domain-specific accuracy and reasoning.</li>
<li style="font-weight:400;">Cloud providers benefit from this vertical-specific AI approach as it drives compute consumption through AWS and Google Cloud marketplaces while demonstrating how foundation models can be packaged for specific industry needs.</li>
</ul>
<p>10:22  Matt – “It’s literally why we named this section this! AI is how ML makes money!”  </p>
<p>14:35 <a href="https://aws.amazon.com/blogs/aws/twelvelabs-video-understanding-models-are-now-available-in-amazon-bedrock/">TwelveLabs video understanding models are now available on Amazon </a><a href="https://aws.amazon.com/blogs/aws/twelvelabs-video-understanding-models-are-now-available-in-amazon-bedrock/">Bedrock | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;">TwelveLabs brings two specialized video understanding models to <a href="https://aws.amazon.com/bedrock/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">Amazon Bedrock</a>: <a href="https://www.youtube.com/watch?v=LWUh5wDUzAY">Marengo</a> for video embeddings and search, and <a href="https://www.youtube.com/watch?v=XQDlLnCC_8M">Pegasus</a> for generating text from video content. These models enable natural language queries like “find the scene where the main characters first meet” to locate specific moments in video libraries.</li>
<li style="font-weight:400;">The models were trained on<a href="https://press.aboutamazon.com/2024/12/generative-ai-startup-twelve-labs-works-with-aws-to-make-videos-as-searchable-as-text"> Amazon SageMaker HyperPod</a> and support both synchronous and asynchronous inference patterns. </li>
<li style="font-weight:400;">Pegasus uses the standard Invoke API while Marengo requires the AsyncInvoke API for processing video embeddings.</li>
<li style="font-weight:400;">Key technical capabilities include video-to-text summarization with timeline descriptions, automatic metadata generation (titles, hashtags, chapters), and vector embeddings for similarity search. The models accept video input via S3 URIs or Base64-encoded strings.</li>
<li style="font-weight:400;">Practical applications span multiple industries: media teams can search dialogue across footage libraries, marketing can personalize content at scale, and security teams can identify patterns across multiple video feeds. This transforms previously unsearchable video archives into queryable knowledge bases.</li>
<li style="font-weight:400;">Pricing follows <a href="https://aws.amazon.com/bedrock/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">Amazon Bedrock</a>‘s standard model, with Marengo available in US East, Europe, and Asia Pacific regions, while Pegasus operates in US West and Europe with cross-region inference support. </li>
<li style="font-weight:400;">Integration requires minimal code changes using existing Bedrock SDKs.</li>
<li style="font-weight:400;">I’m extra proud of Matt for getting through this particularly dense block of text. Gold star! </li>
</ul>
<p>16:27  Matt – “I feel like this is definitely something that came out of like Amazon video, so that they were able to find stuff a lot faster. And this is like, hey – let’s productize it. This is the next evolution.”</p>
<h2>Cloud Tools </h2>
<p>17:48 <a href="https://www.harness.io/blog/introducing-harness-ai-devops-capabilities">Harness AI Unveils Advanced DevOps Automation: Smarter Pipelines, </a><a href="https://www.harness.io/blog/introducing-harness-ai-devops-capabilities">Faster Delivery, and Enterprise-Ready Compliance</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.harness.io/">Harness AI</a> brings context-aware automation to DevOps pipelines by understanding your organization’s existing templates, tool configurations, and governance policies to generate production-ready CI/CD pipelines that match internal standards from day one.</li>
<li style="font-weight:400;">The platform uses large language models combined with a proprietary knowledge graph to provide AI-driven troubleshooting, natural language pipeline generation, and automated policy enforcement directly integrated into the Harness Platform rather than as a separate add-on.</li>
<li style="font-weight:400;">This addresses the growing challenge of faster AI-generated code outpacing traditional pipeline capabilities while managing increasingly fragmented toolchains and mounting compliance requirements across enterprise environments.</li>
<li style="font-weight:400;">Key capabilities include automatic pipeline generation that adapts to organizational standards, intelligent troubleshooting that understands your specific environment context, and built-in governance guardrails for enterprise-ready compliance without added complexity.</li>
<li style="font-weight:400;">The solution is positioned as having an AI DevOps engineer on call 24/7 who already knows your system, helping teams move from idea to production faster while reducing manual toil in the software delivery process.</li>
</ul>
<p>19:59  Ryan – “I do like that it’s built into the existing tooling as an InfoSec professional. I’m like, how is this compliance really put in? Because if I have to prompt it as the software engineer, that’s not okay. But then how do I, from a central organization, provide that sort of governance at a level that’s not actually just dragging everything to a screaming halt.”</p>
<h2>AWS </h2>
<p>20:48 <a href="https://aws.amazon.com/blogs/aws/introducing-amazon-s3-vectors-first-cloud-storage-with-native-vector-support-at-scale/">Introducing Amazon S3 Vectors: First cloud storage with native vector </a><a href="https://aws.amazon.com/blogs/aws/introducing-amazon-s3-vectors-first-cloud-storage-with-native-vector-support-at-scale/">support at scale | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/s3/features/vectors/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">Amazon S3 Vectors</a> introduces native vector storage in S3 with a new bucket type that can reduce vector storage costs by up to 90% compared to traditional vector databases. </li>
<li style="font-weight:400;">This addresses the growing need for affordable vector storage as organizations scale their AI applications.</li>
<li style="font-weight:400;">The service provides sub-second query performance for similarity searches across tens of millions of vectors per index, with support for up to 10,000 indexes per bucket. Each vector can include metadata for filtered queries, making it practical for recommendation engines and semantic search applications.</li>
<li style="font-weight:400;">Native integrations with <a href="https://docs.aws.amazon.com/bedrock/latest/userguide/knowledge-base.html">Amazon Bedrock Knowledge Bases</a> and <a href="https://docs.aws.amazon.com/sagemaker-unified-studio/latest/userguide/getting-started.html">SageMaker Unified Studio</a> simplify building <a href="https://aws.amazon.com/what-is/retrieval-augmented-generation/">RAG</a> applications, while the OpenSearch Service export feature enables a tiered storage strategy. </li>
<li style="font-weight:400;">Organizations can keep infrequently accessed vectors in S3 Vectors and move high-priority data to OpenSearch for real-time performance.</li>
<li style="font-weight:400;">The preview is available in five regions (US East Virginia/Ohio, US West Oregon, Europe Frankfurt, Asia Pacific Sydney) with dedicated APIs for vector operations. </li>
<li style="font-weight:400;">Pricing details aren’t specified (so hold on to your butts), but the 90% cost reduction claim suggests significant savings for large-scale vector workloads.</li>
<li style="font-weight:400;">This positions AWS as the first cloud provider with native vector support in object storage, potentially disrupting the vector database market. </li>
<li style="font-weight:400;">The ability to store embeddings for images, videos, documents, and audio files directly in S3 removes infrastructure management overhead for AI teams.</li>
</ul>
<p>25:21  Ryan – “So expensive. It’s going to be ALL the money. All the new stuff on S3 is expensive.”</p>
<p>25:39 <a href="https://aws.amazon.com/blogs/aws/announcing-amazon-nova-customization-in-amazon-sagemaker-ai/">Announcing Amazon Nova customization in Amazon SageMaker AI | AWS </a><a href="https://aws.amazon.com/blogs/aws/announcing-amazon-nova-customization-in-amazon-sagemaker-ai/">News Blog</a></p>
<ul>
<li style="font-weight:400;">AWS introduces<a href="https://aws.amazon.com/ai/generative-ai/nova/customization"> customization capabilities for Amazon Nova</a> models (Micro, Lite, Pro) through <a href="https://aws.amazon.com/sagemaker-ai/">SageMaker AI</a>, supporting supervised fine-tuning, alignment techniques (DPO/PPO), continued pre-training, and knowledge distillation with seamless deployment to <a href="https://aws.amazon.com/bedrock/?nc2=h_prod_ai_br">Amazon Bedrock</a> for inference.</li>
<li style="font-weight:400;">The service offers both parameter-efficient fine-tuning (PEFT) using LoRA adapters for smaller datasets with on-demand inference, and full fine-tuning (FFT) for extensive datasets requiring provisioned throughput, giving customers flexibility based on data volume and cost requirements.</li>
<li style="font-weight:400;"><a href="https://arxiv.org/abs/2305.18290">Direct Preference Optimization</a> (DPO) and <a href="https://huggingface.co/blog/deep-rl-ppo">Proximal Policy Optimization</a> (PPO) enable alignment of model outputs to company-specific requirements like brand voice and customer experience preferences, addressing the limitations of prompt engineering and RAG for business-critical workflows.</li>
<li style="font-weight:400;">Knowledge distillation allows customers to create smaller, cost-efficient models that maintain the accuracy of larger teacher models, particularly useful when lacking adequate training data samples for specific use cases.</li>
<li style="font-weight:400;">Early adopters, including MIT CSAIL, Volkswagen, and Amazon’s internal teams, are already using these capabilities, with recipes currently available in US East (N. Virginia) through SageMaker Studio’s JumpStart interface.</li>
</ul>
<p>27:13  Ryan – “It’s such a fast field that you know, like, I barely understand these things, and I’ve only because I’ve been working on a project in my day job to sort of get information based on all of our internal IT data sets, right? Like, and have a custom bot that simplifies our  employee day-to-day and onboarding.”</p>
<p>28:38 <a href="https://aws.amazon.com/blogs/aws/introducing-amazon-bedrock-agentcore-securely-deploy-and-operate-ai-agents-at-any-scale/">Introducing Amazon Bedrock AgentCore: Securely deploy and operate AI </a><a href="https://aws.amazon.com/blogs/aws/introducing-amazon-bedrock-agentcore-securely-deploy-and-operate-ai-agents-at-any-scale/">agents at any scale (preview) | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/bedrock/agentcore/?trk=e61dee65-4ce8-4738-84db-75305c9cd4fe&amp;sc_channel=el">Amazon Bedrock AgentCore</a> provides enterprise-grade infrastructure services for deploying <a href="https://aws.amazon.com/what-is/ai-agents/?trk=e61dee65-4ce8-4738-84db-75305c9cd4fe&amp;sc_channel=el">AI agents</a> at scale, addressing the gap between proof-of-concept agents built with frameworks like <a href="https://www.crewai.com/">CrewAI</a> or <a href="https://www.langchain.com/langgraph">LangGraph</a> and production-ready systems. </li>
<li style="font-weight:400;">The preview includes seven modular services: Runtime for serverless deployment, Memory for session management, Observability for monitoring, Identity for secure access controls, Gateway for API integration, Browser for web automation, and Code Interpreter for sandboxed code execution.</li>
<li style="font-weight:400;">AgentCore Runtime offers isolated serverless environments with three network configurations (Sandbox, Public, and upcoming VPC-only), enabling developers to deploy agents with just three lines of code while maintaining session isolation and preventing data leakage. The service works with any agent framework and supports both Amazon Bedrock models and external models, with free usage until September 16, 2025.</li>
<li style="font-weight:400;">AgentCore Identity implements a secure token vault that stores user OAuth tokens and API keys, allowing agents to act on behalf of users with proper authorization across AWS services and third-party platforms like Salesforce, Slack, and GitHub. </li>
<li style="font-weight:400;">This eliminates the need for developers to build custom authentication infrastructure while maintaining enterprise security requirements.</li>
<li style="font-weight:400;">AgentCore Gateway transforms existing APIs and <a href="https://aws.amazon.com/lambda/">Lambda</a> functions into agent-ready tools using Model Context Protocol (MCP), providing unified access with built-in authentication, throttling, and request transformation capabilities. Combined with AgentCore Memory’s short-term and long-term storage strategies, agents can maintain context across sessions and extract semantic facts from conversations.</li>
<li style="font-weight:400;">The preview is available in US East (N. Virginia), US West (Oregon), Asia Pacific (Sydney), and Europe (Frankfurt), with integration support for AWS Marketplace pre-built agents and tools. </li>
<li style="font-weight:400;">After the free preview period ends on September 17, 2025, standard AWS pricing will apply based on service usage.</li>
</ul>
<p>35:07 <a href="https://aws.amazon.com/blogs/aws/streamline-the-path-from-data-to-insights-with-new-amazon-sagemaker-capabilities/">Streamline the path from data to insights with new Amazon SageMaker </a><a href="https://aws.amazon.com/blogs/aws/streamline-the-path-from-data-to-insights-with-new-amazon-sagemaker-capabilities/">Catalog capabilities | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;">Welcome to writing copy, Ryan. Your headline WAS better. </li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/sagemaker/">Amazon SageMaker</a> now integrates <a href="https://aws.amazon.com/quicksight/">QuickSight</a> directly into <a href="https://aws.amazon.com/blogs/aws/category/analytics/amazon-sagemaker-unified-studio/">Unified Studio</a>, allowing users to build dashboards from project data and publish them to SageMaker Catalog for organization-wide discovery and sharing. 
<ul>
<li style="font-weight:400;">This eliminates the need to switch between platforms and maintains consistent governance across analytics workflows.</li>
</ul>
</li>
<li style="font-weight:400;">SageMaker Catalog adds support for S3 general-purpose buckets with S3 Access Grants, enabling teams to discover and access unstructured data like documents and images alongside structured data. The integration automatically handles permissions when users subscribe to S3 assets, simplifying cross-team collaboration on diverse data types.</li>
<li style="font-weight:400;">Automatic onboarding from<a href="https://aws.amazon.com/glue/"> AWS Glue</a> Data Catalog brings existing lakehouse datasets into SageMaker Catalog without manual setup, unifying technical and business metadata management. 
<ul>
<li style="font-weight:400;">This allows organizations to immediately explore and govern their existing data investments through a single interface.</li>
</ul>
</li>
<li style="font-weight:400;">The integrations require <a href="https://aws.amazon.com/iam/identity-center/">IAM Identity Center</a> setup for QuickSight and appropriate S3 permissions, with standard pricing for each service applying. </li>
<li style="font-weight:400;">Available in all commercial AWS regions where SageMaker is supported, these features address the complete data lifecycle from ingestion to visualization.</li>
<li style="font-weight:400;">Real-world applications include medical imaging analysis in notebooks, combining unstructured documents with structured data for comprehensive analytics, and building executive dashboards that automatically stay synchronized with project permissions. This unified approach reduces the time from data discovery to actionable insights.</li>
</ul>
<p>48:25  Ryan – “Once you get the ability to query and generate insights from a very large data set, like it’s just super neat. But then when you want to share that, it is super hard. If you want to productionize it at all, it’s just very complicated.”</p>
<p>39:23 <a href="https://aws.amazon.com/blogs/aws/aws-free-tier-update-new-customers-can-get-started-and-explore-aws-with-up-to-200-in-credits/">AWS Free Tier update: New customers can get started and explore AWS </a><a href="https://aws.amazon.com/blogs/aws/aws-free-tier-update-new-customers-can-get-started-and-explore-aws-with-up-to-200-in-credits/">with up to $200 in credits | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;">Do you love surprise credit card bills? Do you love complicated pricing structures? We’ve got some great news. </li>
<li style="font-weight:400;">AWS introduces a new <a href="https://aws.amazon.com/free/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">Free Tier</a> structure with up to $200 in credits for new customers – $100 upon signup plus $20 each for completing activities in <a href="https://aws.amazon.com/ec2/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">EC2</a>, <a href="https://aws.amazon.com/rds/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">RDS</a>, <a href="https://aws.amazon.com/lambda/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">Lambda</a>, <a href="https://aws.amazon.com/bedrock/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">Amazon Bedrock</a>, and <a href="https://aws.amazon.com/aws-cost-management/aws-budgets/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">AWS Budgets</a> within the first 6 months.</li>
<li style="font-weight:400;">New customers now choose between a free account plan (no charges for 6 months or until credits expire) with limited service access, or a paid account plan with full AWS access, where credits are automatically applied to bills.</li>
<li style="font-weight:400;">The free account plan restricts access to enterprise-focused services but includes over 30 always-free tier services, with automatic email alerts at 50%, 25%, and 10% credit remaining and timeline notifications at 15, 7, and 2 days before expiration.</li>
<li style="font-weight:400;">This replaces the previous 12-month free tier model for accounts created after July 15, 2025, while existing accounts remain on the legacy program – a notable shift in AWS’s customer acquisition strategy.</li>
<li style="font-weight:400;">The required activities expose new users to core AWS services and cost management tools, teaching proper instance sizing and budget monitoring from day one rather than discovering these concepts after unexpected bills.</li>
</ul>
<p>41:43  Matt – “I know we talked about cutting it, but I think it’s kind of fun the way they gamified it a little bit and forced you to go play with the things, and with the key one here being Budgets. I feel like that should have been like, in order to use EC2 RDS, and especially Bedrock, you had to set up that budget, and it kind of forces people to fix, you know, a lot of those… hey, I’ve actually caused a $300 bill.” </p>
<p>44:20 <a href="https://aws.amazon.com/blogs/aws/monitor-and-debug-event-driven-applications-with-new-amazon-eventbridge-logging/">Monitor and debug event-driven applications with new Amazon </a><a href="https://aws.amazon.com/blogs/aws/monitor-and-debug-event-driven-applications-with-new-amazon-eventbridge-logging/">EventBridge logging | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/category/application-integration/amazon-eventbridge/">EventBridge</a> now provides comprehensive logging for event-driven applications, tracking the complete event lifecycle from receipt through delivery with detailed success/failure information and status codes. </li>
<li style="font-weight:400;">This addresses a major pain point in debugging microservice architectures where event flows were previously opaque.</li>
<li style="font-weight:400;">The feature supports three log destinations – <a href="https://aws.amazon.com/blogs/aws/category/management-tools/amazon-cloudwatch/">CloudWatch</a> Logs, <a href="https://aws.amazon.com/blogs/aws/category/analytics/amazon-kinesis/">Kinesis</a> Data Firehose, and <a href="https://aws.amazon.com/blogs/aws/category/storage/amazon-simple-storage-services-s3/">S3</a> – with configurable log levels (Error, Info, Trace) and optional payload logging. Logs are encrypted in transit with TLS and at rest when using customer-managed keys.</li>
<li style="font-weight:400;">The logs include valuable performance metrics like ingestion-to-start latency, target duration, and HTTP status codes, making it straightforward to identify bottlenecks between EventBridge processing time and target service performance. What previously took hours of trial-and-error debugging can now be diagnosed in minutes.</li>
<li style="font-weight:400;">API destination debugging becomes significantly easier as the logs clearly show authentication failures, credential issues, and endpoint errors with specific error messages. This is particularly useful for troubleshooting integrations with external HTTPS endpoints and SaaS applications.</li>
<li style="font-weight:400;">There’s no additional EventBridge charge for logging – customers only pay standard S3, CloudWatch Logs, or Kinesis Data Firehose pricing for storage and delivery. The feature operates asynchronously with no impact on event processing latency or throughput.</li>
</ul>
<p>46:07  Ryan – “Where have you been all my life?” </p>
<p>48:35 <a href="https://aws.amazon.com/blogs/aws/amazon-s3-metadata-now-supports-metadata-for-all-your-s3-objects/">Amazon S3 Metadata now supports metadata for all your S3 objects | AWS </a><a href="https://aws.amazon.com/blogs/aws/amazon-s3-metadata-now-supports-metadata-for-all-your-s3-objects/">News Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/s3/features/metadata/">S3 Metadata</a> now provides complete visibility into all existing objects in S3 buckets through <a href="https://iceberg.apache.org/">Apache Iceberg</a> tables, eliminating the need for custom scanning systems and expanding beyond just tracking new objects and changes.</li>
<li style="font-weight:400;">The service introduces two table types: live inventory tables that provide a complete snapshot of all objects refreshed within an hour, and journal tables that track near real-time object changes for auditing and lifecycle tracking.</li>
<li style="font-weight:400;">Pricing includes a one-time backfill cost of $0.30 per million objects, with no additional monthly fees for buckets under one billion objects, and journal tables cost $0.30 per million updates (a 33% price reduction).</li>
<li style="font-weight:400;">The tables enable SQL queries through <a href="https://docs.aws.amazon.com/athena/latest/ug/what-is.html">Athena</a> for use cases like finding unencrypted objects, tracking deletions, analyzing storage costs by tags, and optimizing ML pipeline scheduling by pre-discovering metadata.</li>
<li style="font-weight:400;">Currently available only in US East (Ohio, N. Virginia) and US West (N. California), with tables automatically created and maintained by S3 Tables service without requiring manual compaction or garbage collection.</li>
</ul>
<p>51:44  Matt – “It’s amazing how much fractions of cents add up real fast.” </p>
<p>54:23 <a href="https://aws.amazon.com/blogs/aws/simplify-serverless-development-with-console-to-ide-and-remote-debugging-for-aws-lambda/">Simplify serverless development with console to IDE and remote </a><a href="https://aws.amazon.com/blogs/aws/simplify-serverless-development-with-console-to-ide-and-remote-debugging-for-aws-lambda/">debugging for AWS Lambda | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/lambda">AWS Lambda</a> now offers direct console-to-IDE integration with VS Code, adding an “Open in Visual Studio Code” button that automatically handles setup and opens functions locally, eliminating manual environment configuration and enabling developers to use full IDE features like integrated terminals and package management.</li>
<li style="font-weight:400;">Remote debugging capability allows developers to debug Lambda functions running in their AWS account directly from <a href="https://code.visualstudio.com/">VS Code</a> with full access to VPC resources and IAM roles, solving the long-standing problem of debugging cloud functions that interact with production AWS services.</li>
<li style="font-weight:400;">The remote debugging feature supports Python, Node.js, and Java runtimes at launch and automatically handles secure connection setup, breakpoint management, and cleanup after debugging sessions to prevent production impact.</li>
<li style="font-weight:400;">Both features are available at no additional cost beyond standard Lambda execution charges during debugging sessions, making it more cost-effective for developers to troubleshoot issues in actual cloud environments rather than maintaining complex local emulation setups.</li>
<li style="font-weight:400;">This addresses a key serverless development pain point where functions work locally but fail in production due to differences in permissions, network access, or service integrations, potentially reducing debugging time from hours to minutes for complex AWS service interactions.</li>
</ul>
<p>57:03  Matt – “I have bad news for Peter. It only supports Python, Node.js, and Java. It does not support Ruby.”</p>
<p>59:15 <a href="https://aws.amazon.com/blogs/aws/accelerate-safe-software-releases-with-new-built-in-blue-green-deployments-in-amazon-ecs/">Accelerate safe software releases with new built-in blue/green deployments </a><a href="https://aws.amazon.com/blogs/aws/accelerate-safe-software-releases-with-new-built-in-blue-green-deployments-in-amazon-ecs/">in Amazon ECS | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;">In things we thought they already have…</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/ecs/?trk=c4ea046f-18ad-4d23-a1ac-cdd1267f942c&amp;sc_channel=el">Amazon ECS</a> now includes built-in blue/green deployments at no additional charge, eliminating the need for teams to build custom deployment tooling while providing automated rollback capabilities for safer container deployments.</li>
<li style="font-weight:400;">The feature introduces deployment lifecycle hooks that integrate with <a href="https://docs.aws.amazon.com/lambda/latest/dg/welcome.html">Lambda</a> functions, allowing teams to run validation tests at specific stages like pre-scale up, post-scale up, and traffic shift phases before committing to new versions.</li>
<li style="font-weight:400;">Blue/green deployments maintain both environments simultaneously during deployment, enabling near-instantaneous rollbacks without end-user impact since production traffic only shifts after successful validation of the green environment.</li>
<li style="font-weight:400;">The implementation requires configuring IAM roles, load balancers, or Service Connect, and target groups through the ECS console, with each service revision maintaining an immutable configuration for consistent rollback behavior.</li>
<li style="font-weight:400;">This addresses a significant operational challenge where development teams previously spent cycles building undifferentiated deployment tools instead of focusing on business innovation, particularly important for organizations running containerized workloads at scale.</li>
</ul>
<p>1:02:45 <a href="https://aws.amazon.com/about-aws/whats-new/2025/07/amazon-braket-54-qubit-quantum-processor-iqm/">Amazon Braket adds new 54-qubit quantum processor from IQM – AWS</a></p>
<ul>
<li style="font-weight:400;">Amazon Braket now offers access to <a href="https://aws.amazon.com/blogs/quantum-computing/amazon-braket-launches-new-54-qubit-superconducting-quantum-processor-from-iqm/">IQM’s Emerald</a>, a 54-qubit superconducting quantum processor with square-lattice topology, expanding the quantum computing options available to AWS customers alongside existing trapped-ion and neutral atom devices.</li>
<li style="font-weight:400;">The Emerald QPU features state-of-the-art gate fidelities and dynamic circuit support, enabling researchers to experiment with more complex quantum algorithms using familiar tools like the Braket SDK, NVIDIA CUDA-Q, Qiskit, and Pennylane.</li>
<li style="font-weight:400;">Hosted in Munich and accessible via the Europe (Stockholm) Region, this addition strengthens AWS’s quantum computing presence in Europe while providing on-demand access to the latest-generation quantum hardware without requiring direct hardware investment.</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/braket/latest/developerguide/braket-jobs.html">Amazon Braket Hybrid Jobs</a> offers priority access to Emerald for running fully managed quantum-classical algorithms, addressing the practical need for combining quantum and classical computing resources in real-world applications.</li>
<li style="font-weight:400;">AWS Cloud Credits for Research program supports accredited institutions experimenting with quantum computing, reducing the barrier to entry for academic research, while standard Braket pricing applies for commercial users.</li>
</ul>
<h2>GCP</h2>
<p>1:05:44 <a href="https://cloud.google.com/blog/products/compute/new-monitoring-library-to-optimize-google-cloud-tpu-resources/">New monitoring library to optimize Google Cloud TPU resources | Google </a><a href="https://cloud.google.com/blog/products/compute/new-monitoring-library-to-optimize-google-cloud-tpu-resources/">Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google released a new monitoring library for Cloud TPUs that provides real-time metrics like tensor core utilization, HBM usage, and buffer transfer latency sampled at 1Hz, enabling developers to dynamically optimize their AI workloads directly in code.</li>
<li style="font-weight:400;">The library integrates with JAX and PyTorch installations through libtpu and allows programmatic adjustments – for example, automatically increasing batch sizes when duty_cycle_pct is low or triggering memory-saving strategies when HBM capacity approaches limits.</li>
<li style="font-weight:400;">This addresses a key gap in TPU observability compared to AWS’s CloudWatch for EC2 GPU instances and Azure’s GPU monitoring, giving Google customers similar granular performance insights specifically designed for TPU architectures.</li>
<li style="font-weight:400;">The monitoring capabilities are particularly valuable for large-scale AI training where even small efficiency improvements can translate to significant cost savings, with metrics like hlo_exec_timing helping identify bottlenecks in distributed workloads.</li>
<li style="font-weight:400;">While the library is free to use, it requires shell access to TPU VMs and is limited to snapshot-mode access rather than continuous streaming, which may impact real-time monitoring use cases compared to traditional APM solutions.</li>
</ul>
<p>1:07:45  Ryan – “I mean, it is an SDK that they’re releasing in addition to the existing services, right? It’s not a service by itself, but it is a neat little easy, you know, like, like any library, it’s just an easy button instrument for my code, to make it visible, right? So I do like that.”</p>
<p>1:08:28 <a href="https://cloud.google.com/blog/products/management-tools/get-to-know-cloud-observability-application-monitoring/">Get to know Cloud Observability Application Monitoring | Google Cloud </a><a href="https://cloud.google.com/blog/products/management-tools/get-to-know-cloud-observability-application-monitoring/">Blog</a></p>
<ul>
<li style="font-weight:400;">Google Cloud introduces <a href="https://support.google.com/cloud/answer/6256558?hl=en">Application Monitoring</a>, an out-of-the-box observability solution that automatically generates dashboards for applications defined in <a href="https://cloud.google.com/products/app-hub">App Hub</a>, eliminating hours of manual dashboard configuration and providing immediate visibility into the Four Golden Signals (traffic, latency, error rate, saturation).</li>
<li style="font-weight:400;">The service automatically propagates application labels across logs, metrics, and traces in Google Cloud, enabling consistent filtering and correlation across all telemetry data without manual tagging effort.</li>
<li style="font-weight:400;">Integration with <a href="https://cloud.google.com/gemini/docs/cloud-assist/create-investigation">Gemini Cloud Assist Investigations</a> (currently in private preview) provides AI-powered troubleshooting that understands application boundaries and relationships, offering contextual analysis based on the automatically collected application data.</li>
<li style="font-weight:400;">This positions Google Cloud competitively against AWS CloudWatch Application Insights and Azure Application Insights by reducing the upfront investment typically required for application monitoring setup while incorporating Google SRE best practices.</li>
<li style="font-weight:400;">Organizations can start using Application Monitoring immediately by defining applications in App Hub and navigating to Cloud Observability, with Gemini features requiring a separate SKU and trusted tester program enrollment.</li>
</ul>
<p>1:12:06 <a href="https://cloud.google.com/blog/products/ai-machine-learning/deepseek-r1-is-available-for-everyone-in-vertex-ai-model-garden/">Deepseek R1 is available for everyone in Vertex AI Model Garden | Google </a><a href="https://cloud.google.com/blog/products/ai-machine-learning/deepseek-r1-is-available-for-everyone-in-vertex-ai-model-garden/">Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google adds DeepSeek R1 to <a href="https://console.cloud.google.com/vertex-ai/model-garden">Vertex AI Model Garden</a> as a managed service, eliminating the need for customers to provision 8 H200 GPUs typically required to run this large language model, with pay-as-you-go pricing and serverless API access.</li>
<li style="font-weight:400;">The Model-as-a-Service offering provides enterprise-grade security and compliance while supporting both REST API and OpenAI Python client integration, positioning GCP alongside AWS Bedrock and Azure’s model marketplace in the managed LLM space.</li>
<li style="font-weight:400;"><a href="https://console.cloud.google.com/vertex-ai/publishers/deepseek-ai/model-garden/deepseek-r1-0528-maas">DeepSeek R1</a> joins <a href="https://www.googlecloudcommunity.com/gc/Community-Blogs/Introducing-Llama-4-on-Vertex-AI/ba-p/892578">Llama 4</a> models in Vertex AI’s expanding open model catalog, giving customers more flexibility to choose models for specific use cases without infrastructure management overhead.</li>
<li style="font-weight:400;">The service operates without outbound internet access for data security, making it suitable for enterprises with strict compliance requirements who need advanced AI capabilities without compromising data privacy.</li>
<li style="font-weight:400;">This release strengthens Google’s open AI ecosystem strategy by providing access to non-Google models through its platform, competing directly with proprietary offerings while maintaining the convenience of fully managed deployment.</li>
</ul>
<p>1:13:14  Ryan – “I mean, this is really the power of using those public models in something like a model garden. Instead of like, you know, running a server, installing all the models and getting it all in place, and hooking it all together, you can now just basically provision this within your virtual site AI environment and have a web endpoint that you can then send prompts to. And it makes that much, much easier to do. So the fact that it’s DeepSeek. Like everyone’s always concerned about China’s going to steal our data.”</p>
<h2>Azure</h2>
<p>01:15:36 <a href="https://blog.fabric.microsoft.com/en-GB/blog/unified-by-design-mirroring-azure-databricks-unity-catalog-in-microsoft-fabric-now-generally-available/">Unified by design: mirroring Azure Databricks Unity Catalog to Microsoft </a><a href="https://blog.fabric.microsoft.com/en-GB/blog/unified-by-design-mirroring-azure-databricks-unity-catalog-in-microsoft-fabric-now-generally-available/">OneLake in Fabric (Generally Available) | Microsoft Fabric Blog | Microsoft Fabric</a> </p>
<ul>
<li style="font-weight:400;"><a href="https://app.fabric.microsoft.com/">Microsoft Fabric</a> now offers general availability of mirroring for <a href="https://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/">Azure Databricks Unity Catalog</a>, enabling direct access to Databricks tables in OneLake without data duplication or ETL pipelines. </li>
<li style="font-weight:400;">This integration allows organizations to query Databricks data through Fabric workloads and <a href="https://learn.microsoft.com/en-us/fabric/fundamentals/direct-lake-overview">Power BI Direct Lake</a> mode while maintaining a single copy of data.</li>
<li style="font-weight:400;">The feature addresses a key enterprise challenge of bridging Azure Databricks and Microsoft Fabric ecosystems, as demonstrated by <a href="https://www.adeccogroup.com/">The Adecco Group</a>, which uses it to expose Databricks datasets for Power BI semantic models and GraphQL APIs. Setup requires only a few clicks to connect catalogs, schemas, or individual tables through the Fabric portal.</li>
<li style="font-weight:400;">Technical improvements in the GA release include support for ADLS with firewalls enabled, public APIs for CI/CD automation, and full integration with OneLake security framework for enterprise-grade access controls. Data automatically syncs as tables are updated or modified in Azure Databricks.</li>
<li style="font-weight:400;">This positions Microsoft against AWS and GCP by leveraging their unique combination of Databricks partnership and Fabric platform, though competitors offer similar lakehouse integrations through services like AWS Glue Data Catalog and BigQuery external tables. The open Delta Parquet format ensures vendor neutrality while reducing storage costs.</li>
<li style="font-weight:400;">Target customers include enterprises already using both Azure Databricks and Microsoft Fabric who need unified analytics without maintaining duplicate data pipelines. The future roadmap may include support for RLS/CLM policies, federated tables, Delta Sharing, and streaming data.</li>
</ul>
<p>01:16:37 <a href="https://blog.fabric.microsoft.com/en-us/blog/announcing-cosmos-db-in-microsoft-fabric-preview-with-exciting-new-features?ft=All">Announcing Cosmos DB in Microsoft Fabric Featuring New Capabilities! </a><a href="https://blog.fabric.microsoft.com/en-us/blog/announcing-cosmos-db-in-microsoft-fabric-preview-with-exciting-new-features?ft=All"> Microsoft Fabric Blog | Microsoft Fabric</a> </p>
<ul>
<li style="font-weight:400;">Microsoft brings<a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=gZM60yYYUf-JHurY0pZWN5QJYUlv3ph7vleqFZ3XYgh02_l0lUbfzqKDnW2vd1Nzre9fUtRgQaL7PllT7KJp0dgXoQNiD-bU7x3tGPZZ7GJp2zwY0VGSEpIRCXP2um1D.NItPUWIWS6A3wRMAQ1fkEA&amp;eddgt=bkc_c5RiMbesAPTZlu7zxA%3D%3D&amp;rut=077da1548feaa8389868f98072c68ff04f0a61e3f82dc0a0a3a15851d6d82bac&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8dFdNH3iDSy-GsKV1wefD_zVUCUy5tqDlawyBXIdwQuHXJmageMcnWfsFtiR8eP058jzhUyiG0kv7S7ugb4eB3JTqGH3JRod9Jn_cp3FxOAO1KbLhMSTghOSQxUbetSbEe95_ptIIAxo010IBWK8fsRvjzl8bCRARig5SxiNI117z5d3lZ2VbnzdG-gM8-gfm-H9oHcdoYthEeXOCGeRqeb8mcRt5T1gdnb1xYMAdMnuDv54XYydhMYRHv1iQ0CgmM1H3hrg2lTg03kxaXPd17Fy-4Ux6PNdk-kG6kbESOnltMs6UG0mk1Y_5zfxaCIJxasWr1HomIpQiH5xfry_Ade5ALHKt73E4sTo5aqeNP_d0rDYXd9Q9RzV1_i3RbWaLmWzN1IEbekoN8iB9ZKcGPR34XAPZXqsCzZdKgaNGDs5SWYXbXvxuamxf8E6U0BW0nn9HaoHlA4g-nVq6v8k7qYx03cl1weFqtEZ0DmEgqhxMHPeC5JA0UIrQLy8qPo8NxoaL-XHXyPE1KieRspdh0tKmhyy_4om5dLh-E1DqXSLloJ9xhoUd_-sJMmvyTrT6bL9cFHlgD4zei5V5TDYEkR_L-BloclfetRyOfHZD-aKZdHZDStpGncuk0e8XuwNWAPnOO5faxrDoBGeUQUqUVwMC-Y0dVg5RD1uRsaOjtwY6uC22DZ9lZEanDsBKwCoM8oAM1WL_p1--thZOnr9a2-cLgv-_Zrllh5T4jkCEUk5S1vHMWQBL8uZcUyCwAdi7veA9YQ%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0MzklMjZjYW1wJTNkMTc0MzU5JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDE5MjElMjZjcml0ZXJpYWlkJTNka3dkLTc5NDQwMzMxNDUxOTY1JTNhbG9jLTcxMjI4JTI2Y2FtcGFpZ25pZCUzZDU5MDM2NTAyOSUyNmxvY3BoeSUzZDgwNDYxJTI2YWRncm91cGlkJTNkMTI3MTAzNzEwNzg0NTMzOCUyNmNpZCUzZDc5NDM5OTI1MTE0OTMyJTI2a2R2JTNkYyUyNmtleHQlM2QlMjZrcGclM2QlMjZrcGlkJTNkJTI2cXVlcnlTdHIlM2RDb3Ntb3MlMjUyMERCLiUyNnVybCUzZGh0dHBzJTNhJTJmJTJmYXp1cmUubWljcm9zb2Z0LmNvbSUyZmVuLXVzJTJmcHJvZHVjdHMlMmZjb3Ntb3MtZGIlMmYlM2ZlZl9pZCUzZF9rXzM4ZTNiNDk0MGM5YTEyODJhN2QyZGMxZGNjYWIyZDBjX2tfJTI2T0NJRCUzZEFJRGNtbTVlZHN3ZHV1X1NFTV9fa18zOGUzYjQ5NDBjOWExMjgyYTdkMmRjMWRjY2FiMmQwY19rXyUyNm1zY2xraWQlM2QzOGUzYjQ5NDBjOWExMjgyYTdkMmRjMWRjY2FiMmQwYw%26rlid%3D38e3b4940c9a1282a7d2dc1dccab2d0c&amp;vqd=4-57365855522068074733220650053298778896&amp;iurl=%7B1%7DIG%3DF2815075E7FF4820A7B7EA0B3671DE74%26CID%3D0DC05D2B6ABF6ADD11F84B126B596B8D%26ID%3DDevEx%2C5045.1"> Cosmos DB</a> natively into <a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=soxaEpi3ddoK6p-rzzW2_7YmaOpw1nBMV_IaOBMbitasq7k4eIsOpVhzaF2C7qfQrvgg9wlyB54PxqzKDibO1AexWnxCh2D7jLnmNGJRVjq6KU8Cy8_QOKaqsUG2f6n3.98YIFUdE6XDJUKumFG3a3g&amp;eddgt=AT0_MZ-KjLSSajW4bmL5Ew%3D%3D&amp;rut=e0f59d5cad8a2c00715fca2dc0b521e325c83b0f470db24c4f15cc6f59edec06&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8Li8ybwaZTbzrILpMSVLiVTVUCUwRslqSbJPbtPTh0RF3h_YsgN7Vy_CdiEDu5h5sLE4Fpc0BXEE-Kjnb7XyxlS_d-1yECrzJWRtvh7lVRAYWT-E0PKBKKX0VKqGHkk3I7O0mcMpx5L7Og3izu6cX8QzV9-nkzKkDByFpqgDNIlTUV1oULhPZQ2MR1rTvWXXOXO5VzfTSx61bd2zE9Ismr_JsaqFnA28T4WFVcJ9fm2bS5gbMWiK4RzGrsLnXavMlmo46Dx76Ow1mwfpd8IGupEPgqdyIbrlR13lLfLtzPzgv7I9gdMhTt1RqRu1kCFIxGL1lxa78NpapZYww_KcKJjqzxSoTU40HXkKfESXh8SSDeVkK4EMIXVuYjhYLADxqhVS4RtB84kR7FZ5Vx_UCI_Nd06vzUHuvedoW8Uh1tnZ98lW0pCE6efhsY8tvyXgsDg1DgT4alungbuAfhsTkCk7wTIbmh1XVD_Yc9PqStDJnEhJLtjzD5CWgzU5aRGZSf60N5BXH1__Piy_ARq1ugLUFUlPWjdBZ-i7HInYD9rEDSqC5lhEQwFz0rea-z7QBgd4shTqJuy2WOYxNi9pZZoqUqajCJD57qaFoybrr4ZwB5XLi2w2oNPycg9TvOHR4DkGUV9rw1oJ6bPVGt4Lz4DbQ1OM8Wrr5utk42EJsNPXCDss6ImK2dH_Bw9RHMJwY0DveOu4vaOV3Q9zh4iGQBBrNlLFGEgGMLa1nwRibGQHxLEXhq_Z2z12HJ8YKPp8h7gVJHw%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0MzklMjZjYW1wJTNkMTc0MzU5JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDE5MjElMjZjcml0ZXJpYWlkJTNka3dkLTc4ODkwNTc0Mjk2MDAwJTNhbG9jLTcxMjI4JTI2Y2FtcGFpZ25pZCUzZDU5MDM2NTAyOSUyNmxvY3BoeSUzZDgwNTExJTI2YWRncm91cGlkJTNkMTI2MjI0MTAxNDkwMDgzNCUyNmNpZCUzZDc4ODkwMTY5MDg3NzQ0JTI2a2R2JTNkYyUyNmtleHQlM2QlMjZrcGclM2QlMjZrcGlkJTNkJTI2cXVlcnlTdHIlM2RtaWNyb3NvZnQlMjUyMGZhYnJpYyUyNnVybCUzZGh0dHBzJTNhJTJmJTJmYXp1cmUubWljcm9zb2Z0LmNvbSUyZmVuLXVzJTJmcHJvZHVjdHMlMmZzZXJ2aWNlLWZhYnJpYyUyZiUzZmVmX2lkJTNkX2tfYTA4M2RmMzE3ODY3MWI4ODJhNGRiN2I4Yjg1OGI1MjNfa18lMjZPQ0lEJTNkQUlEY21tNWVkc3dkdXVfU0VNX19rX2EwODNkZjMxNzg2NzFiODgyYTRkYjdiOGI4NThiNTIzX2tfJTI2bXNjbGtpZCUzZGEwODNkZjMxNzg2NzFiODgyYTRkYjdiOGI4NThiNTIz%26rlid%3Da083df3178671b882a4db7b8b858b523&amp;vqd=4-88554187711717371594947901799639657348&amp;iurl=%7B1%7DIG%3DD2DFE1305F9A43238DF254FF0E3B2AAB%26CID%3D20519816E143658509B88E2FE0A564F3%26ID%3DDevEx%2C5046.1">Fabric</a> as a preview, combining NoSQL database capabilities with Fabric’s analytics platform to create a unified data environment for both operational and analytical workloads without managing separate services.</li>
<li style="font-weight:400;">The service automatically mirrors operational data to <a href="https://learn.microsoft.com/en-us/fabric/onelake/onelake-overview">OneLake</a> in Delta format for real-time analytics, enabling T-SQL queries, Spark notebooks, and Power BI reporting on the same data without ETL pipelines or manual replication steps.</li>
<li style="font-weight:400;">New vector and full-text search capabilities support AI workloads with multiple indexing options, including Microsoft’s <a href="https://www.microsoft.com/en-us/research/project/project-akupara-approximate-nearest-neighbor-search-for-large-scale-semantic-search/">DiskANN</a> for large-scale scenarios, positioning this as a direct competitor to <a href="https://docs.aws.amazon.com/documentdb/latest/developerguide/what-is.html">AWS DocumentDB</a>‘s vector search and GCP’s <a href="https://cloud.google.com/alloydb/docs/overview">AlloyDB</a> vector capabilities.</li>
<li style="font-weight:400;">Billing uses Fabric capacity units rather than separate Cosmos DB pricing, which could simplify cost management for organizations already invested in Fabric but may require careful capacity planning to avoid unexpected charges.</li>
<li style="font-weight:400;">CI/CD support through deployment pipelines and Git integration addresses enterprise DevOps requirements, though the preview status suggests production workloads should wait for general availability.</li>
</ul>
<p>1:17:36  Matt – “They just continue to shove everything into Fabric.” </p>
<p>1:18:48 <a href="https://azure.microsoft.com/en-us/updates?id=498263">Public Preview: CLI command for migration from Availability </a><a href="https://azure.microsoft.com/en-us/updates?id=498263">Sets and Basic load balancer on AKS </a></p>
<ul>
<li style="font-weight:400;">Azure introduces a CLI command to migrate <a href="https://azure.microsoft.com/en-us/products/kubernetes-service/?msockid=218ef287528c64c1155be48a53626586">AKS</a> clusters from deprecated Availability Sets and Basic load balancers to Virtual Machine Scale Sets before the September 30, 2025, retirement deadline, simplifying what would otherwise be a complex manual migration process.</li>
<li style="font-weight:400;">The automated migration tool addresses a critical need as Basic load balancers lack features like availability zones and SLA guarantees that production workloads require, while Availability Sets are being replaced by the more resilient Virtual Machine Scale Sets architecture.</li>
<li style="font-weight:400;">This positions Azure competitively with AWS EKS and GCP GKE, which already use modern infrastructure patterns by default, though Azure’s migration tool provides a smoother transition path for existing customers compared to manual rebuilds.</li>
<li style="font-weight:400;">Organizations running production AKS workloads should prioritize testing this migration in non-production environments first, as the shift to Standard load balancers will increase costs but provide essential enterprise features like cross-zone load balancing.</li>
<li style="font-weight:400;">The preview availability gives customers nearly two years to plan and execute migrations, though early adoption allows time to address any edge cases before the deprecation deadline forces the change.</li>
</ul>
<p>1:20:15  Matt – “There’s a bunch of deprecations coming up, and it is extremely nice that Azure is attempting to help you migrate away from some of these things. But definitely test these in your lower-level environments.” </p>
<p>1:21:27 <a href="https://azure.microsoft.com/en-us/updates?id=497993">Generally Available: Microsoft Azure Cloud HSM </a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.bing.com/aclk?ld=e8CiXfWi5-HjcbbCm8po2EajVUCUxAwMuJ3co_Tf3tx-Vfq-RPutjBonq-Xmn7dBhnWbQOUx8pcZEbUsUE_Bz1Anpq1VW54q4EI96pCmf8WrfGQB9_55v_Zt-yfuQMIP_uwmHkGmUCC2L8zj1FeWGiwMfm7Mq34UBFV5gucROdlKgPr6iUnZO5qvdEuCLr9ARzDJPKWA&amp;u=aHR0cHMlM2ElMmYlMmZhenVyZS5taWNyb3NvZnQuY29tJTJmZW4tdXMlMmZwcm9kdWN0cyUyZmF6dXJlLWRlZGljYXRlZC1oc20lMmYlM2ZlZl9pZCUzZF9rX2M0MDE4ZjcwMWQ3NjE2OTI3MDExZTdmOWRkOWM2OTRmX2tfJTI2T0NJRCUzZEFJRGNtbTVlZHN3ZHV1X1NFTV9fa19jNDAxOGY3MDFkNzYxNjkyNzAxMWU3ZjlkZDljNjk0Zl9rXyUyNm1zY2xraWQlM2RjNDAxOGY3MDFkNzYxNjkyNzAxMWU3ZjlkZDljNjk0Zg&amp;rlid=c4018f701d7616927011e7f9dd9c694f">Azure Cloud HSM</a> delivers FIPS 140-3 Level 3 certified hardware security modules as a single-tenant service, giving customers full administrative control over their cryptographic operations and key management infrastructure.</li>
<li style="font-weight:400;">This positions Azure competitively against <a href="https://www.bing.com/aclk?ld=e8VNUh-1sP_OtjDyuV9u35szVUCUzLr176K4u4dAQC76TwzhlUzp-Kc4PzXGoIiB5lGLKSGTvybawysKG5nm6yI-x_URfvW7mHR4U2pouq3J5bspecDk2Mem9h4zCCETg_GXdPVtwJzmMK6MjNyaFI3roZBLZld5YvOD62wBtIlLVJvz60N9hcYpBZTdtydN2vcrA8LA&amp;u=aHR0cHMlM2ElMmYlMmZhenVyZS5taWNyb3NvZnQuY29tJTJmZW4tdXMlMmZwcm9kdWN0cyUyZm1vbml0b3IlMmYlM2ZlZl9pZCUzZF9rXzIyNGM3OTc2OWNkNzEwM2VhNjA5MTFmZjVjMmU4OTE0X2tfJTI2T0NJRCUzZEFJRGNtbTVlZHN3ZHV1X1NFTV9fa18yMjRjNzk3NjljZDcxMDNlYTYwOTExZmY1YzJlODkxNF9rXyUyNm1zY2xraWQlM2QyMjRjNzk3NjljZDcxMDNlYTYwOTExZmY1YzJlODkxNCUyM292ZXJ2aWV3JTJm&amp;rlid=224c79769cd7103ea60911ff5c2e8914">AWS CloudHSM</a> and <a href="https://cloud.google.com/kms/docs/hsm">Google Cloud HSM</a>, offering similar dedicated hardware security capabilities for organizations with strict compliance requirements in financial services, healthcare, and government sectors.</li>
<li style="font-weight:400;">The single-tenant architecture ensures complete isolation of cryptographic operations, making it suitable for workloads requiring the highest levels of security assurance and regulatory compliance.</li>
<li style="font-weight:400;">Key use cases include protecting certificate authorities, database encryption keys, code signing certificates, and meeting specific regulatory mandates that require hardware-based key storage.</li>
<li style="font-weight:400;">While pricing details aren’t provided in the announcement, organizations should expect premium costs typical of dedicated HSM services, with deployment considerations around high availability configurations and integration with existing <a href="https://www.bing.com/aclk?ld=e8D7_2AX14diZ7n7bz0jcwHDVUCUxiiuk5Vg35ztDrSSGonKR0hJ9erZSK2JCTqhGKpFQgXTwCbrEZrPRTvi-MfANDvPs_V3wqiPthJKQ6ee7KhvlHSlGmhRJVWLyv2XsTY-kvHPOnlS2AHVw4sO-iw1l1xG5alEIAuz865KBEWtyYAJ3uXFeN_zYIr5lOdtZLrOIvCg&amp;u=aHR0cHMlM2ElMmYlMmZhenVyZS5taWNyb3NvZnQuY29tJTJmZW4tdXMlMmZwcm9kdWN0cyUyZmtleS12YXVsdCUyZiUzZmVmX2lkJTNkX2tfNDI2NjQ4ODA2MDE3MTZiY2ZmZmUyMDA5YTEwNTkxZTVfa18lMjZPQ0lEJTNkQUlEY21tNWVkc3dkdXVfU0VNX19rXzQyNjY0ODgwNjAxNzE2YmNmZmZlMjAwOWExMDU5MWU1X2tfJTI2bXNjbGtpZCUzZDQyNjY0ODgwNjAxNzE2YmNmZmZlMjAwOWExMDU5MWU1&amp;rlid=42664880601716bcfffe2009a10591e5">Azure Key Vault</a> implementations.</li>
</ul>
<p>1:23:56 <a href="https://azure.microsoft.com/en-us/updates?id=498361">Generally Available: Hosted-On-Behalf-Of (HOBO) Public IP model for </a><a href="https://azure.microsoft.com/en-us/updates?id=498361">ExpressRoute Gateways</a></p>
<ul>
<li style="font-weight:400;">Azure’s new <a href="https://learn.microsoft.com/en-us/entra/identity-platform/v2-oauth2-on-behalf-of-flow">Hosted-On-Behalf-Of</a> (HOBO) model for <a href="https://learn.microsoft.com/en-us/azure/expressroute/expressroute-about-virtual-network-gateways">ExpressRoute Gateways</a> eliminates the need to manually assign public IP addresses, with Microsoft now managing this infrastructure component automatically for all new deployments.</li>
<li style="font-weight:400;">This simplification reduces configuration complexity and potential misconfigurations for enterprises connecting their on-premises networks to Azure via ExpressRoute, particularly benefiting organizations with limited networking expertise.</li>
<li style="font-weight:400;">The HOBO model aligns Azure more closely with<a href="https://docs.aws.amazon.com/directconnect/latest/UserGuide/direct-connect-gateways-intro.html"> AWS Direct Connect Gateway’s</a> approach, where public IPs are abstracted away, though Azure still requires customers to manage more networking components overall compared to AWS’s implementation.</li>
<li style="font-weight:400;">While this improves the deployment experience, existing ExpressRoute gateways won’t automatically migrate to HOBO, creating a mixed environment that IT teams will need to manage during their transition period.</li>
</ul>
<p>01:26:00 <a href="https://azure.microsoft.com/en-us/updates?id=498144">Public Preview: Orchestration versioning for Durable Functions and </a><a href="https://azure.microsoft.com/en-us/updates?id=498144">Durable</a><a href="https://azure.microsoft.com/en-us/updates?id=498144"> task SDKs </a>/ <a href="https://azure.microsoft.com/en-us/updates?id=498149">Generally Available: Durable Functions PowerShell SDK as a standalone module</a></p>
<ul>
<li style="font-weight:400;">Azure introduces orchestration versioning for <a href="https://learn.microsoft.com/en-us/azure/azure-functions/durable/durable-functions-overview">Durable Functions</a>, addressing a critical challenge where modifying orchestration logic could break existing in-flight workflows – this allows developers to safely update orchestration code without disrupting running instances.</li>
<li style="font-weight:400;">The feature enables side-by-side deployment of multiple orchestration versions, letting new instances use updated logic while existing instances complete with their original code – similar to <a href="https://docs.aws.amazon.com/step-functions/latest/dg/welcome.html">AWS Step Functions</a> versioning but with tighter integration into Azure’s serverless ecosystem.</li>
<li style="font-weight:400;">Target customers include enterprises running long-running workflows, event-driven architectures, and complex business processes where orchestration changes are frequent but downtime is unacceptable – particularly valuable for financial services and e-commerce scenarios.</li>
<li style="font-weight:400;">This positions Azure competitively against AWS Step Functions and <a href="https://cloud.google.com/workflows">Google Cloud Workflows</a> by solving the “orchestration evolution” problem that has plagued serverless workflow engines since their inception.</li>
<li style="font-weight:400;">The preview status suggests Microsoft is gathering feedback before GA, with pricing likely to follow the standard Durable Functions consumption model where you pay for execution time and storage of orchestration state.</li>
<li style="font-weight:400;">Microsoft has released the Durable Functions PowerShell SDK as a standalone module in the PowerShell Gallery, making it easier for developers to build stateful serverless applications using PowerShell without bundling it with the Azure Functions runtime.</li>
<li style="font-weight:400;">This GA release provides PowerShell developers with native support for orchestration patterns like function chaining, fan-out/fan-in, and human interaction workflows, bringing PowerShell to parity with C# and JavaScript for Durable Functions development.</li>
<li style="font-weight:400;">The standalone module approach simplifies dependency management and version control, allowing teams to update the SDK independently of their Azure Functions runtime version and reducing potential compatibility issues.</li>
<li style="font-weight:400;">While AWS Step Functions and GCP Workflows offer similar orchestration capabilities, Azure’s approach uniquely integrates with PowerShell’s automation heritage, targeting IT operations teams who already use PowerShell for infrastructure management.</li>
<li style="font-weight:400;">Organizations can now build complex workflows that combine traditional PowerShell automation scripts with serverless orchestration, enabling scenarios like multi-step deployment pipelines or approval workflows without managing state infrastructure.</li>
</ul>
<p>1:28:10  Matt – “I mean, any of these improvements are just good. You know, durable functions are designed for that consistency, and having that consistency and allocation of the time, you know, but potentially breaking the things in flight kind of wasn’t a good look for them. So having that kind of a little bit more robustness with the versioning and making sure that different, you know, you’re able to control that a lot better. It’s just, you know, beneficial. A general quality of life improvement.” </p>
<p>1:29:24 <a href="https://azure.microsoft.com/en-us/updates?id=498272">Public Preview: Web Application Firewall (WAF) running on Application </a><a href="https://azure.microsoft.com/en-us/updates?id=498272">Gateway for Containers</a></p>
<ul>
<li style="font-weight:400;">Azure brings WAF capabilities to Application Gateway for Containers, extending layer 7 security to Kubernetes workloads with protection against common web exploits like SQL injection and cross-site scripting.</li>
<li style="font-weight:400;">This positions Azure competitively against AWS WAF on ALB and Google Cloud Armor, offering native integration with AKS and other Azure container services for simplified security management.</li>
<li style="font-weight:400;">The preview enables organizations to implement consistent security policies across containerized applications without deploying separate WAF instances, reducing operational overhead and complexity.</li>
<li style="font-weight:400;">Target customers include enterprises migrating microservices to Kubernetes who need enterprise-grade application security without sacrificing the agility of container deployments.</li>
<li style="font-weight:400;">Pricing details aren’t specified in the preview announcement, but expect consumption-based billing similar to standard Application Gateway WAF tiers when it reaches general availability.</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2101459/c1e-7nkns9r5rpu28poj-2540ow8pf12z-rp8thd.mp3" length="129931916"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 314 of The Cloud Pod, where your hosts, Matt and Ryan, are holding down the fort in Justin’s absence and bringing what’s left of our audience (those of you still here after the last time they were left in charge) the latest and greatest in cloud and tech news. We’ve got undersea cables, vector storage, and even some hobos – but not the kind on trains. Plus AWS S3 Let’s get started! 
Titles we almost went with this week:

S3 Gets Direction: AWS Points to Vector Storage
Vector? I Hardly Know Her! S3’s New AI Storage Play
S3 Finds Its Magnitude and Direction
Claude Goes to Wall Street
Anthropic’s Bull Run Into Financial Services
AI Assistant Gets Its Series 7 License
Nova Scotia: AWS Brings Regional Flavor to AI Models
The Fine-Tuning of the Shrew: Teaching Nova Models New Tricks
Nova-caine: Numbing the Pain of Model Customization
AgentCore Blimey: AWS Gives AI Agents Their License to Scale
The Agent Infrastructure: Mission Deployable
From Zero to Agent Hero: AWS Tackles the Production Problem
SageMaker Gets Its Data Act Together
From Catalog to QuickSight: A Data Love Story
The Great Data Unification of 2024
AWS Free Tier Gets a $200 Makeover
EKS-treme Makeover: Cluster Edition
#⃣100K Nodes Walk Into a Cluster…
S3 Gets Direction: Amazon Points to Vector Storage
Amazon S3: Now with 90% Less Vector Bills and 100% More Dimensions

Follow Up
01:03 SoftBank and OpenAI’s $500 Billion AI Project Struggles to Get Off Ground

The $500 billion AI effort unveiled at the White House has struggled to get off the ground and has scaled back its near-term plans. 
It’s been six months since the announcement, where they said they would spend $100B almost immediately, but now they have a more modest goal of building a small data center by the end of the year in Ohio.
Softbank committed to $30 billion earlier this year, and it is one of the largest ever startup investments by them, which led them to take on new debt and sell assets.  
This investment was made alongside Stargate, giving them a role in the physical infrastructure needed for AI. 
Altman, though, has been eager to secure computing power as quickly as possible and has proceeded without Softbank. 
Publicly, they say it’s a great partnership, and they look forward to advancing projects in multiple states
Oracle was part of Stargate, but the recent 30B deal just signed with includes a commitment of 4.5 gigawatts of capacity, and would consume the equivalent power of more than two Hoover Dams, or about 4 million homes. 
Oracle was also named part of the deal with UAE firm MGX as a partner, but Oracle CEO Safra Catz said that Stargate hadn’t been formed yet, as of last month. 

02:31  Matthew – “…everyone’s like, how hard can it be to build a data center? But it’s city zoning, power consumption, grid improvements, water for cooling… getting communities to approve – and these things end up being a massive undertaking. And it takes the hyperscalers a long time to get these things up and operational. So it doesn’t surprise me that a small data center by the end of the year is probably something that was already in the works beforehand; they’re just taking over other plans. Most da...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2101459/c1a-k5d5-9jqw2x04f37r-vgyfwe.jpg"></itunes:image>
                                                                            <itunes:duration>01:30:13</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2101459/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[313: The Gartner Guide to Breaking Things on Purpose]]>
                </title>
                <pubDate>Thu, 24 Jul 2025 11:20:12 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2096510</guid>
                                    <link>https://tcpfm.castos.com/episodes/313-the-gartner-guide-to-breaking-things-on-purpose</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 313 of The Cloud Pod, where your hosts, Matt, Ryan, and Justin, are here to bring you all the latest in Cloud and AI news. This week we’ve got an installation of Cloud Journey featuring Gartner and chaos AND an aftershow! We’ve got acquisition news, new tools, an undersea cable, and even a little chaos, all right now in the cloud. Let’s get into it! </p>
<h3>Titles we almost went with this week:</h3>
<ul>
<li>From Vibe Check to Production Spec</li>
<li>Node More Mr. Nice Guy: AWS Locks Down Access Until You Ask Nicely</li>
<li>Grok’s New Feature: Ask Elon First</li>
<li>The AI That Phones Home to Dad</li>
<li>Musk-See TV: When Your Chatbot Needs Parental Guidance</li>
<li>Oracle’s Federal Discount: 75% Off for Six Months (Terms and Conditions Apply)</li>
<li>GameDay: Not Just for Sports Anymore</li>
<li>Bob the Builder Center: Can We Fix AWS? Yes We Can!</li>
<li>Bucket List: Google Cloud Storage Finally Lets You Pack Up and Move</li>
<li>The Great Bucket Migration: No Forwarding Address Required</li>
<li>Compose Yourself: Cloud Run Gets Docker-mented</li>
<li>Survey Says: Your Team Needs a Performance Check-Up</li>
<li>From Florida With Love: Google’s New Cable Has a License to Transmit</li>
<li>Sol Train: Google Lays Track Across the Atlantic</li>
<li>Finding the Right Gradient for Your AI Journey</li>
<li>Google Cracks the Code on AWS’s Cloud Castle</li>
<li>Breaking Cloud: Google’s Data Analytics Cook Up Market Share</li>
<li>From Chat to Churn: The Great GPT Subscription Exodus</li>
<li>AWS Finally Filters Out the Pricing Noise</li>
<li>The Price is Right: AWS Edition Gets New Search Features</li>
<li>Four Filters and a Pricing API Walk Into a Cloud</li>
<li>Fee-fi-fo-fum who has a flash reasoning model</li>
</ul>
<h2>Follow Up</h2>
<p>02:01 <a href="https://www.cnbc.com/2025/07/14/cognition-to-buy-ai-startup-windsurf-days-after-google-poached-ceo.html">Cognition to buy AI startup Windsurf days after Google poached CEO</a></p>
<ul>
<li style="font-weight:400;">Cognition <a href="https://cognition.ai/blog/windsurf">acquired</a> <a href="https://windsurf.com/">Windsurf’s</a> IP, product, and remaining talent after <a href="https://www.cnbc.com/quotes/GOOGL/">Google</a> hired away the CEO and senior staff, highlighting the intense competition for AI coding expertise among major tech companies.</li>
<li style="font-weight:400;">The deal follows <a href="https://www.cnbc.com/2025/04/16/openai-in-talks-to-pay-about-3-billion-to-acquire-startup-windsurf.html">a failed $3 billion acquisition attempt</a> by OpenAI and Google’s $2.4 billion licensing and compensation package to secure Windsurf’s leadership, demonstrating the premium valuations for AI coding technology.</li>
<li style="font-weight:400;">Both companies develop AI coding agents designed to accelerate software development, with <a href="https://cognition.ai/">Cognition’s</a> <a href="https://cognition.ai/blog/introducing-devin">Devin agent</a> and Windsurf’s tools representing the growing market for AI-powered developer productivity solutions.</li>
<li style="font-weight:400;">The acquisition ensures all Windsurf employees receive accelerated vesting and financial participation, addressing the disruption caused by the leadership exodus to Google.</li>
<li style="font-weight:400;">This consolidation in the AI coding space suggests smaller startups may struggle to retain talent and remain independent as tech giants aggressively pursue AI engineering capabilities.</li>
</ul>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>04:40 <a href="https://arstechnica.com/information-technology/2025/07/new-grok-ai-model-surprises-experts-by-checking-elon-musks-views-before-answering/">New Grok AI model surprises experts by checking Elon Musk’s views </a><a href="https://arstechnica.com/information-technology/2025/07/new-grok-ai-model-surprises-experts-by-checking-elon-musks-views-before-answering/">before</a><a href="https://arstechnica.com/information-t..."></a></p>
<h3>Chapters</h3>
<ul><li>(00:00:07) - Breaking Things On Purpose</li><li>(00:00:44) - Covid Has Hit</li><li>(00:02:08) - OpenAI Buys Coding Startup Windsurf</li><li>(00:04:34) - Grok 4: Elon Musk's Tweets Causes a Problem</li><li>(00:06:50) - DigitalOcean Launches Unified AI Cloud Platform</li><li>(00:08:58) - Enterprises Are Canceling ChatGPT Subscriptions</li><li>(00:13:37) - DORA Survey Open Until July 18th</li><li>(00:17:50) - GCP 2.8: Free to Use, Paid</li><li>(00:21:23) - SSM: Free vs. Paid Features</li><li>(00:24:33) - Kiro: AI-assisted Development with VS Code</li><li>(00:31:29) - Curo: A New Way to Develop with Q IDE</li><li>(00:34:14) - Amazon AWS Launches P6E GB200 Ultra for AI Training</li><li>(00:37:09) - Wonders of AWS: Update to AWS Builder Center</li><li>(00:42:01) - Amazon's AWS Pricing Server Open Source</li><li>(00:42:48) - Amazon Cloud Portal: AI vs MCP</li><li>(00:45:00) - Amazon DocumentDB with MongoDB compatibility to 10 regions</li><li>(00:49:41) - GCP Backup for Cross-Project Backup (In Preview)</li><li>(00:51:15) - Cloud Storage Bucket Relocation</li><li>(00:54:40) - Gentek and Cloud Run integrate with Docker Compose</li><li>(00:56:40) - Google Launches Seoul, New Transatlantic Cable</li><li>(00:57:47) - Google Cloud's Cloud Battle</li><li>(01:01:54) - Azure 2.8 for Mini-Flash Reasoning</li><li>(01:05:41) - Oracle to Cut Cloud Costs for the Federal Government</li><li>(01:07:20) - Chaos Engineering for Cloud: Future of IT Security</li><li>(01:11:17) - Week in Cloud: September 7, 2017</li><li>(01:11:59) - Stop Force AI Tools on Your Engineers</li><li>(01:19:48) - Cloud Computing: An Eye on the AI</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 313 of The Cloud Pod, where your hosts, Matt, Ryan, and Justin, are here to bring you all the latest in Cloud and AI news. This week we’ve got an installation of Cloud Journey featuring Gartner and chaos AND an aftershow! We’ve got acquisition news, new tools, an undersea cable, and even a little chaos, all right now in the cloud. Let’s get into it! 
Titles we almost went with this week:

From Vibe Check to Production Spec
Node More Mr. Nice Guy: AWS Locks Down Access Until You Ask Nicely
Grok’s New Feature: Ask Elon First
The AI That Phones Home to Dad
Musk-See TV: When Your Chatbot Needs Parental Guidance
Oracle’s Federal Discount: 75% Off for Six Months (Terms and Conditions Apply)
GameDay: Not Just for Sports Anymore
Bob the Builder Center: Can We Fix AWS? Yes We Can!
Bucket List: Google Cloud Storage Finally Lets You Pack Up and Move
The Great Bucket Migration: No Forwarding Address Required
Compose Yourself: Cloud Run Gets Docker-mented
Survey Says: Your Team Needs a Performance Check-Up
From Florida With Love: Google’s New Cable Has a License to Transmit
Sol Train: Google Lays Track Across the Atlantic
Finding the Right Gradient for Your AI Journey
Google Cracks the Code on AWS’s Cloud Castle
Breaking Cloud: Google’s Data Analytics Cook Up Market Share
From Chat to Churn: The Great GPT Subscription Exodus
AWS Finally Filters Out the Pricing Noise
The Price is Right: AWS Edition Gets New Search Features
Four Filters and a Pricing API Walk Into a Cloud
Fee-fi-fo-fum who has a flash reasoning model

Follow Up
02:01 Cognition to buy AI startup Windsurf days after Google poached CEO

Cognition acquired Windsurf’s IP, product, and remaining talent after Google hired away the CEO and senior staff, highlighting the intense competition for AI coding expertise among major tech companies.
The deal follows a failed $3 billion acquisition attempt by OpenAI and Google’s $2.4 billion licensing and compensation package to secure Windsurf’s leadership, demonstrating the premium valuations for AI coding technology.
Both companies develop AI coding agents designed to accelerate software development, with Cognition’s Devin agent and Windsurf’s tools representing the growing market for AI-powered developer productivity solutions.
The acquisition ensures all Windsurf employees receive accelerated vesting and financial participation, addressing the disruption caused by the leadership exodus to Google.
This consolidation in the AI coding space suggests smaller startups may struggle to retain talent and remain independent as tech giants aggressively pursue AI engineering capabilities.

AI Is Going Great – Or How ML Makes Money 
04:40 New Grok AI model surprises experts by checking Elon Musk’s views before]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[313: The Gartner Guide to Breaking Things on Purpose]]>
                </itunes:title>
                                    <itunes:episode>313</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 313 of The Cloud Pod, where your hosts, Matt, Ryan, and Justin, are here to bring you all the latest in Cloud and AI news. This week we’ve got an installation of Cloud Journey featuring Gartner and chaos AND an aftershow! We’ve got acquisition news, new tools, an undersea cable, and even a little chaos, all right now in the cloud. Let’s get into it! </p>
<h3>Titles we almost went with this week:</h3>
<ul>
<li>From Vibe Check to Production Spec</li>
<li>Node More Mr. Nice Guy: AWS Locks Down Access Until You Ask Nicely</li>
<li>Grok’s New Feature: Ask Elon First</li>
<li>The AI That Phones Home to Dad</li>
<li>Musk-See TV: When Your Chatbot Needs Parental Guidance</li>
<li>Oracle’s Federal Discount: 75% Off for Six Months (Terms and Conditions Apply)</li>
<li>GameDay: Not Just for Sports Anymore</li>
<li>Bob the Builder Center: Can We Fix AWS? Yes We Can!</li>
<li>Bucket List: Google Cloud Storage Finally Lets You Pack Up and Move</li>
<li>The Great Bucket Migration: No Forwarding Address Required</li>
<li>Compose Yourself: Cloud Run Gets Docker-mented</li>
<li>Survey Says: Your Team Needs a Performance Check-Up</li>
<li>From Florida With Love: Google’s New Cable Has a License to Transmit</li>
<li>Sol Train: Google Lays Track Across the Atlantic</li>
<li>Finding the Right Gradient for Your AI Journey</li>
<li>Google Cracks the Code on AWS’s Cloud Castle</li>
<li>Breaking Cloud: Google’s Data Analytics Cook Up Market Share</li>
<li>From Chat to Churn: The Great GPT Subscription Exodus</li>
<li>AWS Finally Filters Out the Pricing Noise</li>
<li>The Price is Right: AWS Edition Gets New Search Features</li>
<li>Four Filters and a Pricing API Walk Into a Cloud</li>
<li>Fee-fi-fo-fum who has a flash reasoning model</li>
</ul>
<h2>Follow Up</h2>
<p>02:01 <a href="https://www.cnbc.com/2025/07/14/cognition-to-buy-ai-startup-windsurf-days-after-google-poached-ceo.html">Cognition to buy AI startup Windsurf days after Google poached CEO</a></p>
<ul>
<li style="font-weight:400;">Cognition <a href="https://cognition.ai/blog/windsurf">acquired</a> <a href="https://windsurf.com/">Windsurf’s</a> IP, product, and remaining talent after <a href="https://www.cnbc.com/quotes/GOOGL/">Google</a> hired away the CEO and senior staff, highlighting the intense competition for AI coding expertise among major tech companies.</li>
<li style="font-weight:400;">The deal follows <a href="https://www.cnbc.com/2025/04/16/openai-in-talks-to-pay-about-3-billion-to-acquire-startup-windsurf.html">a failed $3 billion acquisition attempt</a> by OpenAI and Google’s $2.4 billion licensing and compensation package to secure Windsurf’s leadership, demonstrating the premium valuations for AI coding technology.</li>
<li style="font-weight:400;">Both companies develop AI coding agents designed to accelerate software development, with <a href="https://cognition.ai/">Cognition’s</a> <a href="https://cognition.ai/blog/introducing-devin">Devin agent</a> and Windsurf’s tools representing the growing market for AI-powered developer productivity solutions.</li>
<li style="font-weight:400;">The acquisition ensures all Windsurf employees receive accelerated vesting and financial participation, addressing the disruption caused by the leadership exodus to Google.</li>
<li style="font-weight:400;">This consolidation in the AI coding space suggests smaller startups may struggle to retain talent and remain independent as tech giants aggressively pursue AI engineering capabilities.</li>
</ul>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>04:40 <a href="https://arstechnica.com/information-technology/2025/07/new-grok-ai-model-surprises-experts-by-checking-elon-musks-views-before-answering/">New Grok AI model surprises experts by checking Elon Musk’s views </a><a href="https://arstechnica.com/information-technology/2025/07/new-grok-ai-model-surprises-experts-by-checking-elon-musks-views-before-answering/">before</a><a href="https://arstechnica.com/information-technology/2025/07/new-grok-ai-model-surprises-experts-by-checking-elon-musks-views-before-answering/"> answering – Ars Technica</a></p>
<ul>
<li style="font-weight:400;"><a href="https://arstechnica.com/ai/2025/07/musks-grok-4-launches-one-day-after-chatbot-generated-hitler-praise-on-x/">Grok 4</a>, <a href="https://x.ai/">xAI’s</a> latest AI model, has been observed searching for Elon Musk’s <a href="https://www.x.com/">X</a> posts when answering controversial questions, with the model’s reasoning trace showing searches like “from:elonmusk (Israel OR Palestine OR Gaza OR Hamas)” before formulating responses.</li>
<li style="font-weight:400;">The behavior appears inconsistent across users and prompts – while some see Grok searching for Musk’s views, others report the model searching for its own previous stances or providing different answers entirely.</li>
<li style="font-weight:400;">This discovery highlights potential challenges in AI alignment and bias in cloud-hosted LLMs, where models may inadvertently incorporate owner preferences into their decision-making processes without explicit programming.</li>
<li style="font-weight:400;">The <a href="https://grok.com/plans">SuperGrok</a> tier costs $22.50/month and includes visible reasoning traces similar to <a href="https://openai.com/index/introducing-o3-and-o4-mini/">OpenAI’s o3 model</a>, allowing users to see the model’s search queries and thought process during response generation.</li>
<li style="font-weight:400;">For cloud providers and enterprises deploying AI services, this raises important questions about model transparency, bias detection, and the need for robust testing frameworks to identify unexpected behaviors before production deployment.</li>
</ul>
<p>06:23  Ryan – “It’s all my concerns about the bro-coders and the culture and Musk’s cult of personality dictating things, and not being something that can be trusted.” </p>
<p>06:53 <a href="https://www.digitalocean.com/blog/introducing-digitalocean-gradientai">Introducing GradientAI: DigitalOcean’s Unified AI Cloud | DigitalOcean</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.digitalocean.com/">DigitalOcean</a> launches <a href="https://www.gradientai.com/">GradientAI</a>, a unified AI cloud platform that combines GPU infrastructure, agent development tools, and pre-built AI applications into a single integrated experience for the full AI development lifecycle.</li>
<li style="font-weight:400;">The platform consists of three main components: Infrastructure (GPU compute for training/inference), Platform (agent development environment), and Applications (pre-built AI agents for common use cases like customer support).</li>
<li style="font-weight:400;">New GPU options are being added, including <a href="https://www.amd.com/en/products/accelerators/instinct/mi300/mi325x.html">AMD Instinct MI325X</a> (available this week) and <a href="https://www.nvidia.com/en-us/data-center/h200/">NVIDIA H200s</a> (next month), providing more choice and performance options for both training and inference workloads.</li>
<li style="font-weight:400;">The Platform component will support <a href="https://modelcontextprotocol.io/introduction">Model Context Protocol (MCP)</a>, multi-modal capabilities, agent memory, and framework integrations to simplify moving AI projects from prototype to production.</li>
<li style="font-weight:400;">This positions DigitalOcean to compete more directly with major cloud providers in the AI space by offering a simpler, more integrated alternative for digital native enterprises building AI applications.</li>
</ul>
<p>07:42  Ryan – “I’m in support of any feature that Digital Ocean puts on their cloud, just because I’m rooting for the underdog there. And if you are a Digital Ocean customer, how great is it to have this and not to go to one of the other cloud hyperscalers and maintain two separate infrastructures?”</p>
<p>09:07  <a href="https://www.theinformation.com/articles/companies-canceling-chatgpt-subscriptions?rc=3t8xtd&amp;shared=5f457ce3304b0b27">Companies Canceling ChatGPT Subscriptions</a></p>
<ul>
<li style="font-weight:400;">Companies are canceling <a href="https://openai.com/chatgpt/overview/">ChatGPT</a> subscriptions due to concerns about data security, cost-benefit analysis, and integration challenges with existing enterprise systems. </li>
<li style="font-weight:400;">Organizations report difficulty justifying the $20-30 per user monthly cost when employees use the tool sporadically or for non-critical tasks.</li>
<li style="font-weight:400;">The trend highlights a growing enterprise preference for self-hosted or private cloud AI solutions that offer better data governance and compliance controls. </li>
<li style="font-weight:400;">Companies are exploring alternatives like <a href="https://learn.microsoft.com/en-us/azure/ai-foundry/openai/overview">Azure OpenAI Service</a> or <a href="https://aws.amazon.com/bedrock/">AWS Bedrock</a> that integrate with existing cloud infrastructure and security policies.</li>
<li style="font-weight:400;">Technical teams cite API limitations, lack of fine-tuning capabilities for domain-specific tasks, and inability to train on proprietary data as key factors driving cancellations. </li>
<li style="font-weight:400;">Many organizations need models that can be customized for industry-specific terminology and workflows.</li>
<li style="font-weight:400;">The shift suggests enterprises are moving from experimental AI adoption to more strategic implementation focused on measurable ROI and specific use cases. Companies are consolidating around platforms that offer both general-purpose and specialized models within their existing cloud environments.</li>
<li style="font-weight:400;">This development indicates a maturing AI market where businesses demand enterprise-grade features like audit trails, role-based access control, and integration with existing identity management systems rather than standalone consumer-oriented tools.</li>
</ul>
<p>10:23  Justin – “I know I cancelled my ChatGPT subscription months ago; I was a trend setter.” </p>
<h2>Cloud Tools</h2>
<p>13:53 <a href="https://cloud.google.com/blog/products/ai-machine-learning/2025-dora-survey-is-now-open/">2025 DORA Survey is now open | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">The <a href="https://dora.dev/research/2025/">2025 DORA survey</a> is now open until July 18, offering teams a 10-15 minute self-assessment tool to benchmark their software delivery and operational performance against industry standards. </li>
<li style="font-weight:400;">This year’s survey focuses heavily on AI adoption across the software development lifecycle, with 76% of technologists already using AI in their daily work.</li>
<li style="font-weight:400;">Companies applying DORA principles have achieved dramatic improvements – <a href="https://cloud.google.com/customers/banorte?e=0">Banorte increased deployment frequency from bi-weekly to multiple times daily</a>, <a href="https://cloud.google.com/customers/slb">SLB cut deployment time from 5 days to 3 hours</a>, and <a href="https://cloud.google.com/customers/dora-gitlab">GitLab reduced errors by 88%</a>. These metrics demonstrate the tangible value of continuous improvement practices backed by data-driven insights.</li>
<li style="font-weight:400;">The survey explores how organizations can maximize AI impact while maintaining developer well-being, finding that transparent AI strategies and governance policies significantly increase adoption rates. </li>
<li style="font-weight:400;">It also examines trust in AI systems and how teams can best support the transition to AI-enhanced workflows.</li>
<li style="font-weight:400;">Available in 6 languages, the survey welcomes input from all software delivery roles – engineers, product managers, CISOs, and UX designers – to capture diverse perspectives on team performance. Participants gain immediate value through structured reflection on their workflows and bottlenecks.</li>
<li style="font-weight:400;">DORA’s research continues to shape industry understanding of high-performing teams, with findings like the substantial impact of quality documentation on team performance. </li>
<li style="font-weight:400;">The anonymous data collected helps establish benchmarks and best practices for the entire technology community.</li>
<li style="font-weight:400;">Listener note: The survey is now closed, so all arguments about the closing date are moot. We will bring you the results of said survey as soon as they’re released. </li>
</ul>
<h2>AWS</h2>
<p>17:02 <a href="https://aws.amazon.com/blogs/mt/introducing-just-in-time-node-access-using-aws-systems-manager/">Introducing Just-in-time node access using AWS Systems Manager | AWS </a><a href="https://aws.amazon.com/blogs/mt/introducing-just-in-time-node-access-using-aws-systems-manager/">Cloud Operations Blog</a></p>
<ul>
<li style="font-weight:400;">Yes, we originally missed this one. But maybe you’ve seen it in the console, just like Matt. </li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/systems-manager/latest/userguide/what-is-systems-manager.html">AWS Systems Manager</a> now offers just-in-time node access, enabling temporary, policy-based access to EC2, on-premises, and multicloud nodes without maintaining long-term credentials or SSH keys. </li>
<li style="font-weight:400;">This addresses the security vs operational efficiency trade-off many organizations face when managing thousands of nodes.</li>
<li style="font-weight:400;">The feature supports both manual approval workflows (with multiple approvers) and automated approval using Cedar policy language, allowing organizations to implement zero standing privileges while maintaining rapid incident response capabilities. Access automatically expires after a defined time window.</li>
<li style="font-weight:400;">Integration with <a href="https://slack.com/downloads/windows">Slack</a>, <a href="https://www.microsoft.com/en-us/microsoft-teams/download-app?ocid=ORSEARCH_Bing&amp;msockid=218ef287528c64c1155be48a53626586">Microsoft Teams</a>, and email notifications streamlines the approval process, while<a href="https://docs.aws.amazon.com/eventbridge/latest/userguide/eb-what-is.html"> EventBridge</a> events enable audit trails and custom workflows. Sessions can be logged for commands and RDP recordings for compliance requirements.</li>
<li style="font-weight:400;">AWS offers a free trial period covering the remainder of the current billing period plus the entire next billing period per account per Region, after which pricing is usage-based. This allows organizations to test configurations and policies before committing to costs.</li>
<li style="font-weight:400;">The solution works seamlessly across AWS Organizations, supporting consistent access controls whether managing single or multiple accounts, with administrators defining policies, operators requesting access, and approvers managing requests through a unified console experience.</li>
</ul>
<p>18:36  Matt – “It runs on Jonathan’s favorite method of security, which is through tags. So a lot of the automation, a dev person can automatically get access if tag equals dev is in there. So, there are some features or setup design of it that might not be what works for your company, but there is some like prep work if you want to use it, but it does seem like a really nice feature.”</p>
<p>25:11 <a href="https://kiro.dev/blog/introducing-kiro/">Introducing Kiro – Kiro</a></p>
<ul>
<li style="font-weight:400;"><a href="https://visualstudiomagazine.com/articles/2025/07/21/forked-again-awss-kiro-latest-ai-assistant-based-on-vs-code.aspx">Kiro</a> is a new AI-powered IDE that introduces spec-driven development, automatically generating requirements, technical designs, and implementation tasks from simple prompts to help developers move from prototype to production-ready applications.</li>
<li style="font-weight:400;">The platform’s key innovation is its specs feature, which creates <a href="https://www.incose.org/docs/default-source/working-groups/requirements-wg/rwg_iw2022/mav_ears_incoserwg_jan22.pdf?sfvrsn=e70571c7_2">EARS notation acceptance criteria</a>, data flow diagrams, TypeScript interfaces, and database schemas that stay synchronized with the evolving codebase, addressing the common problem of outdated documentation.</li>
<li style="font-weight:400;">Kiro hooks provide automated quality checks by triggering AI agents on file events – for example, automatically updating test files when React components change or scanning for security vulnerabilities before commits, enforcing consistent standards across development teams.</li>
<li style="font-weight:400;">Built on <a href="https://github.com/code-oss-dev/code">Code OSS</a> with <a href="https://aws.amazon.com/visualstudiocode/">VS Code</a> compatibility, Kiro supports Model Context Protocol for specialized tool integration and is currently free during preview with some limitations, targeting developers who need more structure than typical AI coding assistants provide.</li>
<li style="font-weight:400;">This represents a shift toward more structured AI-assisted development, moving beyond simple code generation to address production concerns like maintainability, documentation, and team consistency that traditional AI coding tools often overlook.</li>
</ul>
<p>26:19  Justin – “I’ve been playing with it most of the day, building a mobile app across platform, which I’ve never done before, and I have no experience doing and I have no idea what it’s doing. But, it’s working great.”</p>
<p>35:00 <a href="https://aws.amazon.com/blogs/aws/new-amazon-ec2-p6e-gb200-ultraservers-powered-by-nvidia-grace-blackwell-gpus-for-the-highest-ai-performance/">New Amazon EC2 P6e-GB200 UltraServers accelerated by NVIDIA Grace </a><a href="https://aws.amazon.com/blogs/aws/new-amazon-ec2-p6e-gb200-ultraservers-powered-by-nvidia-grace-blackwell-gpus-for-the-highest-ai-performance/">Blackwell GPUs for the highest AI performance | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;">AWS launches <a href="https://aws.amazon.com/blogs/aws/new-amazon-ec2-p6e-gb200-ultraservers-powered-by-nvidia-grace-blackwell-gpus-for-the-highest-ai-performance/">P6e-GB200 UltraServers</a> with <a href="https://www.nvidia.com/en-us/data-center/technologies/blackwell-architecture/">NVIDIA Grace Blackwell GPUs</a>, offering up to 72 GPUs in a single NVLink domain with 360 petaflops of FP8 compute and 13.4 TB of HBM3e memory for training trillion-parameter AI models.</li>
<li style="font-weight:400;">The new instances use NVIDIA’s superchip architecture that combines Blackwell GPUs with Grace ARM CPUs on the same module, providing significantly higher GPU-CPU bandwidth compared to current <a href="https://aws.amazon.com/ec2/instance-types/p5/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">P5en instances</a> while delivering 28.8 Tbps of EFA networking.</li>
<li style="font-weight:400;">P6e-GB200 UltraServers are only available through <a href="https://aws.amazon.com/ec2/capacityblocks/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">EC2 Capacity Blocks for ML</a> in the Dallas Local Zone (us-east-1-dfw-2a), requiring upfront payment for reserved capacity blocks of either 36 or 72 GPUs with pricing determined at purchase time.</li>
<li style="font-weight:400;">Integration with AWS services includes <a href="https://aws.amazon.com/sagemaker/hyperpod/">SageMaker HyperPod</a> for managed infrastructure with automatic fault replacement within the same NVLink domain, EKS with topology-aware routing for distributed workloads, and <a href="https://aws.amazon.com/fsx/lustre/">FSx for Lustre</a>, providing hundreds of GB/s throughput for large-scale AI training.</li>
<li style="font-weight:400;">The instances target frontier AI workloads, including a mixture of expert models, reasoning models, and generative AI applications like video generation and code generation, positioning AWS to compete in the high-end AI infrastructure market.</li>
</ul>
<p>36:14  Ryan – “So if you’re a big enough Amazon customer, you can get Amazon to run your Amazon outpost with custom hardware. Cool!” </p>
<p>37:29 <a href="https://aws.amazon.com/blogs/aws/introducing-aws-builder-center-a-new-home-for-the-aws-builder-community/">Introducing AWS Builder Center: A new home for the AWS builder </a><a href="https://aws.amazon.com/blogs/aws/introducing-aws-builder-center-a-new-home-for-the-aws-builder-community/">community | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://builder.aws.com/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">AWS Builder Center</a> consolidates developer resources from <a href="https://aws.amazon.com/blogs/aws/category/public-sector/developer/">AWS Developer</a> Center and community.aws into a single platform at <a href="http://builder.aws.com">builder.aws.com</a>, providing a unified hub for accessing tutorials, workshops, and community engagement tools.</li>
<li style="font-weight:400;">The new Wishlist feature allows developers to submit and vote on feature requests for AWS services, giving the community direct input into product roadmaps and enabling AWS teams to prioritize development based on actual user needs.</li>
<li style="font-weight:400;">Built-in localization supports 16 languages with on-demand machine translation for user-generated content, removing language barriers for global collaboration among AWS builders and expanding accessibility to non-English speaking developers.</li>
<li style="font-weight:400;">The platform integrates AWS Builder ID for consistent profile management across all AWS services, offering personalized profiles with custom URLs and QR codes for networking at events and conferences.</li>
<li style="font-weight:400;">Connect features highlight <a href="https://builder.aws.com/heroes/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">AWS Heroes</a>, <a href="https://builder.aws.com/community-builders/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">Community Builders</a>, <a href="https://builder.aws.com/connect/community/user-groups/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">User Groups</a>, and <a href="https://builder.aws.com/connect/community/cloud-clubs/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">Cloud Clubs</a>, making it easier to find local meetups and connect with experts in specific AWS service areas or technologies.</li>
</ul>
<p>39:32 <a href="https://aws.amazon.com/about-aws/whats-new/2025/07/aws-price-list-api-supports-four-query-filters/">AWS Price List API now supports four new Query Filters – AWS</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/awsaccountbilling/latest/aboutv2/using-price-list-query-api.html">AWS Price List Query API</a> adds four new filter types, enabling exact attribute matching, substring searches, and include/exclude lists for more targeted product searches across AWS services.</li>
<li style="font-weight:400;">The update simplifies finding specific product groups like all m5 EC2 instance types with a single filter instead of multiple complex queries, reducing API calls and improving efficiency.</li>
<li style="font-weight:400;">This enhancement addresses a common pain point for cost optimization tools and <a href="https://www.finops.org/introduction/what-is-finops/">FinOps</a> teams who need to programmatically analyze AWS pricing data across thousands of SKUs.</li>
<li style="font-weight:400;">The new filters are available in all regions where the Price List API is supported, making it easier for organizations to build automated pricing analysis and comparison tools.</li>
<li style="font-weight:400;">Real-world applications include building custom cost calculators, automated pricing alerts, and multi-region price comparison tools for Reserved Instance planning.</li>
</ul>
<p>40:25  Justin – “AWS CLI filtering is one of those things that drives me crazy, because I never really remember it properly. And it brings me such joy to watch the AI Bots screw it up. If the AI bot who has the documentation in its brain memorized can’t get this right, I don’t feel so bad.” </p>
<p>42:17 <a href="https://aws.amazon.com/about-aws/whats-new/2025/07/model-context-protocol-server-price-list/">Announcing Model Context Protocol (MCP) Server for AWS Price List – </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/07/model-context-protocol-server-price-list/">AWS</a></p>
<ul>
<li style="font-weight:400;">AWS releases an open-source Model Context Protocol (MCP) server that gives AI assistants like <a href="https://aws.amazon.com/developer/learning/q-developer-cli/">Amazon Q Developer CLI</a> and <a href="https://claude.ai/download">Claude Desktop</a> direct access to AWS pricing data, including on-demand, reserved, and savings plan options across all regions.</li>
<li style="font-weight:400;">The MCP server enables natural language queries about AWS pricing and product availability, allowing developers to ask questions like “What’s the cheapest EC2 instance for machine learning in us-east-1?” and get real-time responses from the <a href="https://docs.aws.amazon.com/awsaccountbilling/latest/aboutv2/price-changes.html">AWS Price List API</a>.</li>
<li style="font-weight:400;">This addresses a common pain point where engineers manually navigate complex pricing pages or write custom scripts to compare costs across services and regions, and now AI assistants can handle these queries instantly.</li>
<li style="font-weight:400;">The server uses standard AWS credentials and minimal configuration, making it straightforward to integrate into existing workflows where teams already use AI assistants for development tasks.</li>
<li style="font-weight:400;">Available now in the <a href="https://github.com/awslabs">AWS Labs GitHub repository</a> at no additional cost beyond standard AWS Price List API usage.</li>
</ul>
<p>43:09  Matt – “When was the last time you had an engineer (or developer) go in to figure out what EC2 instance type they should use? Because everyone I’ve met just goes ‘ooh, this one’s big and shiny, we’ll put more power behind it, and that makes my code go faster’….don’t worry about your CFO’s brain exploding on the other side of it. ”  </p>
<p>45:23 <a href="https://aws.amazon.com/about-aws/whats-new/2025/07/amazon-documentdb-mongodb-ccompatibility-support-secondary-region-clusters/">Amazon DocumentDB (with MongoDB compatibility) introduces support for </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/07/amazon-documentdb-mongodb-ccompatibility-support-secondary-region-clusters/">up to 10 secondary Region clusters – AWS</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/documentdb/global-clusters/">Amazon DocumentDB Global Clusters</a> now supports up to 10 secondary regions, doubling the previous limit of 5, enabling broader geographic distribution for applications requiring low-latency reads across multiple continents.</li>
<li style="font-weight:400;">This expansion addresses disaster recovery needs by allowing organizations to replicate their <a href="https://www.mongodb.com/?msockid=218ef287528c64c1155be48a53626586">MongoDB</a>-compatible workloads across more AWS regions, reducing the blast radius of regional outages while maintaining local read performance.</li>
<li style="font-weight:400;">The increased region support particularly benefits global enterprises running customer-facing applications that need to comply with data residency requirements across multiple jurisdictions while maintaining consistent performance.</li>
<li style="font-weight:400;">While the feature enhances availability and global reach, customers should consider the cost implications of running clusters across 10 regions, including cross-region data transfer charges and compute costs for each regional cluster.</li>
<li style="font-weight:400;">This positions <a href="https://docs.aws.amazon.com/documentdb/latest/developerguide/what-is.html">DocumentDB</a> more competitively against <a href="https://www.mongodb.com/atlas?msockid=218ef287528c64c1155be48a53626586">MongoDB Atlas</a>, which supports similar multi-region deployments, giving AWS customers a fully managed alternative without leaving the AWS ecosystem.</li>
</ul>
<p>47:24 <a href="https://aws.amazon.com/about-aws/whats-new/2025/07/amazon-sagemaker-studio-remote-connections-studio-code/">Amazon SageMaker Studio now supports remote connections from Visual </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/07/amazon-sagemaker-studio-remote-connections-studio-code/">Studio Code – AWS</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/sagemaker-ai/studio/">SageMaker Studio</a> now allows developers to connect their local VS Code installations directly to SageMaker’s managed compute resources, reducing setup time from hours to minutes while maintaining existing security boundaries.</li>
<li style="font-weight:400;">Developers can authenticate through the <a href="https://aws.amazon.com/visualstudio/">AWS Toolkit extension</a> or SageMaker Studio’s web interface, then access their SageMaker development environments with a few clicks while keeping their preferred VS Code extensions and AI-assisted development tools.</li>
<li style="font-weight:400;">This addresses a common friction point where data scientists want their familiar local IDE setup but need access to scalable cloud compute and datasets stored in AWS without complex SSH tunneling or VPN configurations.</li>
<li style="font-weight:400;">The feature complements SageMaker Studio’s existing <a href="https://jupyter.org/">JupyterLab</a> and Code Editor options, giving teams flexibility to choose between web-based or local development experiences while leveraging the same underlying infrastructure.</li>
<li style="font-weight:400;">Currently available only in US East (Ohio) region, suggesting this is an early rollout that will likely expand to other regions based on customer adoption and feedback.</li>
</ul>
<p>48:25  Ryan – “It’s definitely kept me from adopting SageMaker, and a larger thing being sort of forced into their interface and their notebook interface. I do like it locally. It wasn’t terrible; I could use it before, but it’s a lot easier if I don’t have to do that. So I like that this pattern is becoming more prevalent, where you’re keeping your context focused directly in that IDE and the IDEs are going and reaching out to the different services.”</p>
<h2>GCP</h2>
<p>50:16 <a href="https://cloud.google.com/blog/products/storage-data-transfer/backup-for-gke-supports-cross-project-backup-and-restore/">Backup for GKE supports cross-project backup and restore | Google Cloud </a><a href="https://cloud.google.com/blog/products/storage-data-transfer/backup-for-gke-supports-cross-project-backup-and-restore/">Blog</a></p>
<ul>
<li style="font-weight:400;">Backup for GKE now supports cross-project <a href="https://cloud.google.com/kubernetes-engine/docs/add-on/backup-for-gke/how-to/cross-project-backups">backup</a> and <a href="https://cloud.google.com/kubernetes-engine/docs/add-on/backup-for-gke/how-to/cross-project-restores">restore</a> in preview, allowing users to back up workloads from one Google Cloud project, store them in a second project, and restore to a third project. </li>
<li style="font-weight:400;">This addresses a key challenge in multi-project GKE deployments where teams need centralized backup management across project boundaries.</li>
<li style="font-weight:400;">The feature enables critical disaster recovery capabilities by storing backups in separate projects and regions, protecting against regional outages or compromised primary projects. </li>
<li style="font-weight:400;">Organizations can meet RTO/RPO objectives while simplifying regulatory compliance through proper backup isolation.</li>
<li style="font-weight:400;">Cross-project functionality streamlines development workflows by enabling easy environment seeding and cloning – teams can populate staging environments with production backup data or create isolated sandboxes without complex manual processes. </li>
<li style="font-weight:400;">Developers can be granted Delegated Restore Admin roles to restore specific backups without accessing live production environments.</li>
<li style="font-weight:400;">This positions GCP competitively with AWS and Azure backup solutions that already support cross-account/subscription backup scenarios. The integration with GKE’s existing backup infrastructure means no additional tools are required beyond configuring backup and restore plans to point to different projects.</li>
<li style="font-weight:400;">Access to the preview requires completing a form, which can be found <a href="https://docs.google.com/forms/d/e/1FAIpQLSfTKngIB_Kl7-OSdW4ILmD_lXR_uVYmOMprmqr0P7Enk0Elcw/viewform">here</a>. </li>
<li style="font-weight:400;">No specific pricing changes were mentioned, suggesting it uses existing Backup for GKE pricing models.</li>
</ul>
<p>51:54 <a href="https://cloud.google.com/blog/products/storage-data-transfer/introducing-cloud-storage-bucket-relocation/">Introducing Cloud Storage bucket relocation | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google Cloud Storage introduces <a href="https://cloud.google.com/storage/docs/bucket-relocation/overview">bucket relocation</a>, the first feature among major cloud providers that allows moving storage buckets to different regions without changing bucket names or disrupting applications. </li>
<li style="font-weight:400;">This preserves all metadata, including storage classes, timestamps, and permissions, while maintaining object lifecycle management rules.</li>
<li style="font-weight:400;">The feature uses asynchronous data copying to minimize downtime during migration, with only a brief write-lock period during final synchronization. Organizations can perform dry runs to identify potential issues like CMEK incompatibilities before initiating the actual move.</li>
<li style="font-weight:400;">Key use cases include improving data locality for performance, meeting regional compliance requirements, and optimizing costs by moving between storage tiers. Spotify and Groupon have reported successful migrations of petabytes of data with minimal manual effort compared to traditional approaches.</li>
<li style="font-weight:400;">Bucket relocation is part of Google’s <a href="https://cloud.google.com/storage/docs/storage-intelligence/overview">Storage Intelligence suite</a> and supports moves between regional, dual-region, and multi-region configurations. The three-step process (dry run, initiate relocation, finalize) can be completed through simple gcloud commands.</li>
<li style="font-weight:400;">This addresses a significant pain point in cloud storage management, where previously, organizations had to use <a href="https://cloud.google.com/storage-transfer-service">Storage Transfer Service</a> to copy data to new buckets with different names, requiring application updates and risking extended downtime.</li>
</ul>
<p>34:06  Matt – “This is a really cool feature that would have saved me much time in the past life of, hey, we set up this thing years before we actually started using the cloud, and it was for this one thing, and now we’ve launched everything in this other region. And every time we have to access this one specific bucket, it is somewhere else. And how do we fix that? And their process is pretty cool, too, where it sets it up, does the sync, and sits at 99% and you do that last one. This is a great quality of life feature.”  </p>
<p>55:20 <a href="https://cloud.google.com/blog/products/serverless/cloud-run-and-docker-collaboration/">Cloud Run and Docker collaboration | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/run">Cloud Run</a> now supports direct deployment of <a href="https://docs.docker.com/reference/compose-file/">Docker Compose files</a> through the new gcloud run compose up command, eliminating manual infrastructure translation between local development and cloud deployment. </li>
<li style="font-weight:400;">This <a href="https://forms.gle/XDHCkbGPWWcjx9mk9">private preview</a> feature automatically builds containers from source and leverages Cloud Run’s volume mounts for data persistence.</li>
<li style="font-weight:400;">The integration supports Docker’s new models attribute in the <a href="http://compose-spec.io/">Compose Specification</a>, enabling developers to deploy AI applications with self-hosted LLMs and MCP servers using a single configuration file. This positions Cloud Run as a cost-effective option for AI workloads with pay-per-second billing and scale-to-zero capabilities.</li>
<li style="font-weight:400;"><a href="https://cloud.google.com/blog/products/serverless/cloud-run-gpus-are-now-generally-available">Cloud Run GPUs (now generally available)</a> combined with Compose support creates a streamlined path for AI development, with approximately 19-second time-to-first-token for models like gemma3:4b. This competes directly with AWS App Runner and Azure Container Apps but with native GPU support.</li>
<li style="font-weight:400;">The collaboration addresses the growing complexity of agentic AI applications by supporting Docker’s MCP Gateway and Model Runner, allowing developers to maintain consistent configurations across local and cloud environments. Sign up for private preview at https://forms.gle/XDHCkbGPWWcjx9mk9.</li>
<li style="font-weight:400;">This positions GCP strategically in the AI infrastructure market by adopting open standards (Compose Specification) while leveraging Cloud Run’s existing strengths in serverless compute, making it practical for teams already using Docker Compose who need GPU-accelerated AI deployments without infrastructure management overhead.</li>
<li style="font-weight:400;">Want to sign up for the private preview? You can do that <a href="https://forms.gle/XDHCkbGPWWcjx9mk9">here</a>. </li>
</ul>
<p>56:62  Ryan – “I’m curious to see the rough edges on this because you’ve been able to do sort of continuous integration delivery with CloudRun for a while, but it had to be a publicly available Github Repo, so I’m hoping that this is as transparent as it’s made to be.” </p>
<p>57:26 <a href="https://cloud.google.com/blog/products/infrastructure/announcing-sol-transatlantic-cable/">Announcing Sol transatlantic cable | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google announces Sol, a new transatlantic subsea cable connecting the U.S. (Palm Coast, Florida), Bermuda, the Azores, and Spain (Santander), marking the first operational fiber-optic cable between Florida and Europe. </li>
<li style="font-weight:400;">This complements their existing <a href="https://cloud.google.com/blog/products/infrastructure/introducing-the-nuvem-subsea-cable?e=48754805">Nuvem cable</a> to create redundant transatlantic paths with terrestrial interconnections at multiple points.</li>
<li style="font-weight:400;">The cable strengthens Google Cloud’s global infrastructure across 42 regions by providing increased capacity, improved reliability, and reduced latency for AI and cloud services between the Americas and Europe. Sol features 16 fiber optic cable pairs and will be manufactured in the U.S.</li>
<li style="font-weight:400;">Google is partnering with <a href="https://www.dcblox.com/">DC BLOX</a> for the Florida landing station and developing a terrestrial route to their <a href="https://cloudplatform.googleblog.com/2015/10/Bringing-Google-Cloud-Platform-closer-to-more-people-and-businesses.html">South Carolina cloud region</a>, while <a href="https://telxius.com/en/inicio-en/">Telxius</a> provides infrastructure in Spain to integrate with the <a href="https://cloud.google.com/blog/products/infrastructure/new-google-cloud-region-in-madrid-spain-now-open?e=48754805">Madrid cloud region</a>. This positions Florida and Spain as new connectivity hubs for Google’s network.</li>
<li style="font-weight:400;">Sol joins Google’s growing subsea cable portfolio, including Nuvem, Firmina, Equiano, and Grace Hopper, demonstrating their continued investment in owning network infrastructure rather than relying solely on consortium cables. </li>
<li style="font-weight:400;">This gives Google more control over capacity, routing, and performance for its cloud customers.</li>
<li style="font-weight:400;">The cable addresses growing demand for transatlantic connectivity driven by AI workloads and cloud adoption, while also providing economic benefits to landing locations through job creation and positioning them as digital hubs. No specific cost or availability timeline was provided in the announcement.</li>
<li style="font-weight:400;">Also, we all agree this is a terrible diagram. Genuinely – the worst one we’ve seen in a while. </li>
</ul>
<p>1:00:33 <a href="https://www.theinformation.com/articles/google-finds-crack-amazons-cloud-dominance?rc=3t8xtd">Google Finds a Crack in Amazon’s Cloud Dominance</a></p>
<ul>
<li style="font-weight:400;">Google is gaining ground in cloud market share by focusing on data analytics and AI workloads, areas where they have technical advantages over AWS through services like BigQuery and Vertex AI.</li>
<li style="font-weight:400;">The company has shifted strategy from trying to match AWS feature-for-feature to emphasizing their strengths in machine learning infrastructure and data processing capabilities that leverage their search and AI expertise.</li>
<li style="font-weight:400;">Google Cloud’s growth rate now exceeds both AWS and Azure, though from a smaller base, with particular success in industries like retail and financial services that need advanced analytics.</li>
<li style="font-weight:400;">Key differentiators include BigQuery’s serverless architecture that eliminates capacity planning and Vertex AI’s integration with Google’s pre-trained models, making enterprise AI adoption more accessible.</li>
<li style="font-weight:400;">The strategy appears to be working with notable customer wins, including major retailers and banks, who cite Google’s superior data analytics performance and lower total cost of ownership for specific workloads.</li>
</ul>
<p>1:01:31  Ryan – “It is interesting because I will say that this is focusing on Google’s strengths, and I agree that containers have been a strength for a long time. And you start adding BigQuery and Vertex AI, you’ve got a pretty powerful platform to build off of. The feature-to-feature, it’s going to miss all those enablements that make it really easy to stand up a full application on the cloud. So, like it’s kind of a bummer, but we’ll see what it’s actually like.”</p>
<h2>Azure</h2>
<p>1:02:52 <a href="https://azure.microsoft.com/en-us/blog/reasoning-reimagined-introducing-phi-4-mini-flash-reasoning/">Reasoning reimagined: Introducing Phi-4-mini-flash-reasoning | Microsoft </a><a href="https://azure.microsoft.com/en-us/blog/reasoning-reimagined-introducing-phi-4-mini-flash-reasoning/">Azure Blog</a></p>
<ul>
<li style="font-weight:400;">Microsoft introduces Phi-4-mini-flash-reasoning, a 3.8B parameter model using a new decoder-hybrid-decoder architecture called SambaY that combines Mamba state space models with sliding window attention and gated memory units to achieve 10x higher throughput and 2-3x latency reduction compared to standard transformer models.</li>
<li style="font-weight:400;">The model targets edge computing and resource-constrained environments where compute, memory, and latency are critical factors, making it deployable on a single GPU while maintaining advanced math reasoning capabilities with 64K token context length.</li>
<li style="font-weight:400;">Key innovation is the Gated Memory Unit (GMU) mechanism that enables efficient layer representation sharing, preserving linear prefilling time complexity while improving long-context retrieval performance for real-time applications.</li>
<li style="font-weight:400;">Primary use cases include on-device reasoning assistants, adaptive learning platforms, and interactive tutoring systems that require fast logic inference, with the model available on <a href="https://ai.azure.com/">Azure AI Foundry</a>, <a href="https://build.nvidia.com/microsoft">NVIDIA API Catalog</a>, and <a href="http://aka.ms/flashreasoning-hf">Hugging Face</a>.</li>
<li style="font-weight:400;">The architecture represents a practical approach to deploying AI reasoning capabilities at the edge without cloud dependency, addressing the growing need for low-latency AI inference in mobile and IoT applications. </li>
</ul>
<p>1:04:361  Matt – “I think it’ll be interesting when you’re on your mobile device and you say, hey, run me this thing, it tries to run it on a model like this, and then if it can’t get you a good result because it’s not enough data points and parameters, then it kind of goes off. So that’s kind of where I see this going, which is edge-based computing kind of coming back alive, where your phone and your laptop, everything else has enough that could run these small models to give you, you know, just quick feedback and do it offline also, versus everything always having to happen to be online.”</p>
<h2>Oracle</h2>
<p>1:06:43 <a href="https://www.oracle.com/news/announcement/blog/oracle-cloud-cuts-costs-and-propels-missions-for-government-agencies-2025-07-14/">Oracle Cloud Cuts Costs and Propels Missions for Government Agencies</a></p>
<ul>
<li style="font-weight:400;">Oracle partnered with GSA to offer federal agencies 75% discounts for six months on licensed technologies plus migration services to Oracle Cloud, targeting the significant number of government systems still running older Oracle versions on-premises.</li>
<li style="font-weight:400;">Oracle claims its second-generation cloud offers 50% lower compute costs, 70% lower storage costs, and 80% lower networking costs compared to competitors, though these comparisons lack specific benchmarks or competitor names.</li>
<li style="font-weight:400;">The partnership removes data egress fees when moving workloads between FedRAMP and DOD IL4/IL5 certified clouds, addressing a common pain point for government agencies considering multi-cloud strategies.</li>
<li style="font-weight:400;">Oracle is positioning its integrated AI capabilities in Database 23ai and application suites as differentiators, though the announcement provides no technical details about actual AI features or how they compare to AWS, Azure, or GCP offerings.</li>
<li style="font-weight:400;">While Oracle emphasizes cost savings and modernization benefits, the real impact depends on how many federal agencies migrate from their legacy Oracle systems, which have persisted precisely because Oracle doesn’t force upgrades.</li>
<li style="font-weight:400;">Here’s the gotcha: the discounts don’t last forever. </li>
</ul>
<p> Cloud Journey </p>
<p>1:08:31 <a href="https://www.gremlin.com/blog/4-chaos-engineering-recommendations-from-gartner">4 Chaos Engineering recommendations from Gartner</a></p>
<ul>
<li style="font-weight:400;">Gartner’s 2025 <a href="https://www.gartner.com/en/documents/6671734">Hype Cycle for Infrastructure Platforms</a> highlights <a href="https://www.gremlin.com/product/chaos-engineering">Chaos Engineering</a> as essential for testing AI resilience, particularly for applications using generative AI API calls that need validated fallback patterns when services fail or experience latency</li>
<li style="font-weight:400;"><a href="https://www.gremlin.com/docs/fault-injection-gamedays">GameDays</a> are becoming critical for enterprise preparedness against catastrophic failures like CrowdStrike or cloud provider outages, with financial institutions using them to verify disaster recovery plans for operational resilience compliance</li>
<li style="font-weight:400;">Organizations should prioritize Chaos Engineering on business-critical systems first, focusing on payment services, single points of failure, and elevated security privilege components, where downtime costs average $14,056 per minute</li>
<li style="font-weight:400;">Reliability scoring platforms provide measurable metrics beyond simple uptime/downtime tracking, enabling teams to identify performance degradation and latency issues before they impact users</li>
<li style="font-weight:400;">The increasing complexity of modern systems combined with AI adoption makes proactive reliability testing through Chaos Engineering a necessity rather than optional, as outages cost Global 2000 companies an average of $200 million annually.</li>
</ul>
<p>After Show </p>
<p>1:13:02 <a href="https://newsletter.manager.dev/p/stop-forcing-ai-tools-on-your-engineers">Stop forcing AI tools on your engineers  – by Anton Zaides</a></p>
<ul>
<li style="font-weight:400;">Engineering managers face pressure to force AI tool adoption on teams, but mandating specific tools like Cursor or requiring token usage metrics can backfire and slow productivity rather than improve it</li>
<li style="font-weight:400;">Companies should give engineers dedicated time (20% workload reduction or full exploration weeks) to experiment with AI tools in their actual codebases rather than expecting zero-cost adoption</li>
<li style="font-weight:400;">The focus should shift from measuring AI tool usage to measuring actual outcomes – if engineers using AI tools deliver better results, share those specific workflows internally rather than generic success stories</li>
<li style="font-weight:400;">Monday.com’s approach of a 5-week AI exploration with 127 internal demo submissions shows how large organizations can enable organic adoption through peer-led workshops and real use case sharing</li>
<li style="font-weight:400;">AI tools excel in greenfield projects and simple codebases, but adapting them to complex existing systems requires careful evaluation of what actually works versus following industry hype.</li>
</ul>
<p><strong>Closing</strong></p>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2096510/c1e-4919c186w2to569r-dm22g2o1f1p1-jzzgm7.mp3" length="76986355"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 313 of The Cloud Pod, where your hosts, Matt, Ryan, and Justin, are here to bring you all the latest in Cloud and AI news. This week we’ve got an installation of Cloud Journey featuring Gartner and chaos AND an aftershow! We’ve got acquisition news, new tools, an undersea cable, and even a little chaos, all right now in the cloud. Let’s get into it! 
Titles we almost went with this week:

From Vibe Check to Production Spec
Node More Mr. Nice Guy: AWS Locks Down Access Until You Ask Nicely
Grok’s New Feature: Ask Elon First
The AI That Phones Home to Dad
Musk-See TV: When Your Chatbot Needs Parental Guidance
Oracle’s Federal Discount: 75% Off for Six Months (Terms and Conditions Apply)
GameDay: Not Just for Sports Anymore
Bob the Builder Center: Can We Fix AWS? Yes We Can!
Bucket List: Google Cloud Storage Finally Lets You Pack Up and Move
The Great Bucket Migration: No Forwarding Address Required
Compose Yourself: Cloud Run Gets Docker-mented
Survey Says: Your Team Needs a Performance Check-Up
From Florida With Love: Google’s New Cable Has a License to Transmit
Sol Train: Google Lays Track Across the Atlantic
Finding the Right Gradient for Your AI Journey
Google Cracks the Code on AWS’s Cloud Castle
Breaking Cloud: Google’s Data Analytics Cook Up Market Share
From Chat to Churn: The Great GPT Subscription Exodus
AWS Finally Filters Out the Pricing Noise
The Price is Right: AWS Edition Gets New Search Features
Four Filters and a Pricing API Walk Into a Cloud
Fee-fi-fo-fum who has a flash reasoning model

Follow Up
02:01 Cognition to buy AI startup Windsurf days after Google poached CEO

Cognition acquired Windsurf’s IP, product, and remaining talent after Google hired away the CEO and senior staff, highlighting the intense competition for AI coding expertise among major tech companies.
The deal follows a failed $3 billion acquisition attempt by OpenAI and Google’s $2.4 billion licensing and compensation package to secure Windsurf’s leadership, demonstrating the premium valuations for AI coding technology.
Both companies develop AI coding agents designed to accelerate software development, with Cognition’s Devin agent and Windsurf’s tools representing the growing market for AI-powered developer productivity solutions.
The acquisition ensures all Windsurf employees receive accelerated vesting and financial participation, addressing the disruption caused by the leadership exodus to Google.
This consolidation in the AI coding space suggests smaller startups may struggle to retain talent and remain independent as tech giants aggressively pursue AI engineering capabilities.

AI Is Going Great – Or How ML Makes Money 
04:40 New Grok AI model surprises experts by checking Elon Musk’s views before]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2096510/c1a-k5d5-jp3353r8u5n5-essmd3.jpg"></itunes:image>
                                                                            <itunes:duration>01:20:11</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2096510/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[312: Azure Firewall Finally Learns to Spell (FQDN Edition)]]>
                </title>
                <pubDate>Thu, 17 Jul 2025 16:43:51 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2091709</guid>
                                    <link>https://tcpfm.castos.com/episodes/312-azure-firewall-finally-learns-to-spell-fqdn-edxuc</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 312 of The Cloud Pod, where your hosts, Matt, Ryan, and Justin, are here to bring you all the latest in Cloud and AI news. We’ve got security news, updates from PostgreSQL, Azure firewall and BlobNFS, plus TWO Cloud Journey stories for you! </p>
<p>Thanks for joining us this week in the cloud!  </p>
<h3>Titles we almost went with this week:</h3>
<ul>
<li>Git Happens: Why Your Database Pipeline Keeps Breaking</li>
<li>PostgreSQL and Chill: Azure’s New Storage Options for Database Romance</li>
<li>NVMe, Myself, and PostgreSQL</li>
<li>Canvas and Effect: AWS Paints a New Picture for E-commerce</li>
<li>Oracle’s $30 Billion Stargate: The AI Infrastructure Wars Begin</li>
<li>Larry’s Last Laugh: Oracle Lands OpenAI’s Mega Deal</li>
<li>AI Will See You Now (Couch Not Included)</li>
<li>Purview and Present Danger: Microsoft’s AI Security SDK Goes Live</li>
<li>The Purview from Up Here: Microsoft’s Bird’s Eye View on AI Data Security</li>
<li>Building Bridges: Azure’s Two-Way Street to Active Directory</li>
<li>Domain Names: Not Just for Browsers Anymore</li>
<li>FUSE or Lose: Azure’s BlobNFS Gets a Speed Boost</li>
<li>When Larry Met Andy: An Exadata Love Story</li>
<li>Bing There, Done That: Azure’s New Research Assistant</li>
<li>The Search is Over: Azure AI Foundry Finds Its Research Groove</li>
<li>Memory Lane: Where AI Agents Go to Remember Things</li>
<li>Elephants Never Forget, and Now Neither Do Google’s Agents</li>
<li>Z3 or Not Z3: That is the Storage Question</li>
<li>Local SSD Hero: A New Hope for I/O Intensive Workloads</li>
<li>Azure’s Certificate of Insecurity</li>
<li>KeyVault’s Keys Left Under the Doormat</li>
<li>When Your Cloud Provider Accidentally CCs the Hackers</li>
</ul>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>03:09 RYAN DOES A THING FOR SECURING AI WORKLOADS</p>
<ul>
<li style="font-weight:400;">Ryan was recently invited to Google’s Headquarters in San Francisco as part of a small group of security professionals where they spent time hands-on with Google security offerings, learning how to secure AI workloads. </li>
<li style="font-weight:400;">AI – and how to secure it – is a hot topic right now, and being able to spend time working with the Google development team was really insightful, with how they work with various levels of protections in place in dummy applications. </li>
<li style="font-weight:400;">Ryan was especially interested in the back-end logic that was executed in the applications. </li>
</ul>
<p>05:32   Ryan – “I was impressed because there’s how we’re thinking about AI is still evolving, and how we’re protecting it’s gonna be changing rapidly, and having real-world examples really helped really flesh out how their AI services are, how they’re integrated into a security ecosystem. It was pretty impressive. And it’s something that’s near and dear. I’ve been working and trying to roll out Google agent spaces and different AI workloads and trying to get involved and make sure that we, just getting visibility into all the different ones. And that was, it was really helpful to sort of think about it in those contexts.”</p>
<p>10:13 <a href="https://finance.yahoo.com/news/openai-secures-30bn-cloud-deal-094538950.html">OpenAI secures $30bn cloud deal with Oracle</a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/">OpenAI</a> signed a $30 billion annual cloud computing agreement with <a href="https://www.oracle.com/">Oracle</a> for 4.5GW of capacity, making it one of the largest AI cloud deals to date, and nearly triple Oracle’s current $10.3 billion annual data center infrastructure revenue.</li>
<li style="font-weight:400;">The deal represents a major expansion of the <a href="https://www.verdict.co.uk/what-is-the-stargate-project-and-why-does-it-matter/">Stargate</a> data center initiative, a $500 billion joint venture between OpenAI, SoftBank, Oracle, and Abu Dhabi’s MGX fund aimed at building AI infrastructure across multiple US states, in...</li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:00) - Azure Firewall: Learning to Spell,</li><li>(00:01:04) - Azure Bug in the Show Notes Bot</li><li>(00:02:25) - How to Secure AI workloads with Threats</li><li>(00:07:10) - GCP vs. AWS: Minimum-Viable Platforms</li><li>(00:08:53) - Oracle to Buy 400,000 Nvidia GB200 Chips</li><li>(00:15:54) - Google's New AI Tools for Mental Health</li><li>(00:18:23) - Oracle Database at AWS</li><li>(00:23:56) - Google Cloud's New Lustre Storage: General Availability</li><li>(00:27:44) - Vertex AI Memory Bank Now in Public Preview</li><li>(00:30:17) - Google Expands Z3 Storage Optimized VM Family</li><li>(00:33:04) - Azure Adds Postgres to Kubernetes Database</li><li>(00:35:42) - Kubernetes in the Wild: Data, Security, Continuous</li><li>(00:39:22) - Kubernetes in the Wild: What is GitLab?</li><li>(00:41:30) - Microsoft Purview SDK and APIs Announced</li><li>(00:46:45) - Microsoft Entre Domain: Two Way Forest Trust</li><li>(00:51:05) - Microsoft's Cloud Ranting</li><li>(00:51:31) - Azure AD is Not Built for Cloud Ranting</li><li>(00:52:48) - Azure Firewall GA: Fully Qualified Domain Name filtering</li><li>(00:56:12) - Azure NFS for BLOB 3.0 Preview</li><li>(00:58:36) - Azure AI: Deep Research</li><li>(01:00:35) - Microsoft's Cloud Certificate Validation Validation Failure</li><li>(01:08:21) - Database DevOps: Fix Git Before It Breaks Your Production Environment</li><li>(01:13:32) - The Need for Test Drive Development in the Cloud</li><li>(01:17:18) - How to Write Automated Tests with AI</li><li>(01:24:36) - Test Coverage for a Large Codebase</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 312 of The Cloud Pod, where your hosts, Matt, Ryan, and Justin, are here to bring you all the latest in Cloud and AI news. We’ve got security news, updates from PostgreSQL, Azure firewall and BlobNFS, plus TWO Cloud Journey stories for you! 
Thanks for joining us this week in the cloud!  
Titles we almost went with this week:

Git Happens: Why Your Database Pipeline Keeps Breaking
PostgreSQL and Chill: Azure’s New Storage Options for Database Romance
NVMe, Myself, and PostgreSQL
Canvas and Effect: AWS Paints a New Picture for E-commerce
Oracle’s $30 Billion Stargate: The AI Infrastructure Wars Begin
Larry’s Last Laugh: Oracle Lands OpenAI’s Mega Deal
AI Will See You Now (Couch Not Included)
Purview and Present Danger: Microsoft’s AI Security SDK Goes Live
The Purview from Up Here: Microsoft’s Bird’s Eye View on AI Data Security
Building Bridges: Azure’s Two-Way Street to Active Directory
Domain Names: Not Just for Browsers Anymore
FUSE or Lose: Azure’s BlobNFS Gets a Speed Boost
When Larry Met Andy: An Exadata Love Story
Bing There, Done That: Azure’s New Research Assistant
The Search is Over: Azure AI Foundry Finds Its Research Groove
Memory Lane: Where AI Agents Go to Remember Things
Elephants Never Forget, and Now Neither Do Google’s Agents
Z3 or Not Z3: That is the Storage Question
Local SSD Hero: A New Hope for I/O Intensive Workloads
Azure’s Certificate of Insecurity
KeyVault’s Keys Left Under the Doormat
When Your Cloud Provider Accidentally CCs the Hackers

AI Is Going Great – Or How ML Makes Money 
03:09 RYAN DOES A THING FOR SECURING AI WORKLOADS

Ryan was recently invited to Google’s Headquarters in San Francisco as part of a small group of security professionals where they spent time hands-on with Google security offerings, learning how to secure AI workloads. 
AI – and how to secure it – is a hot topic right now, and being able to spend time working with the Google development team was really insightful, with how they work with various levels of protections in place in dummy applications. 
Ryan was especially interested in the back-end logic that was executed in the applications. 

05:32   Ryan – “I was impressed because there’s how we’re thinking about AI is still evolving, and how we’re protecting it’s gonna be changing rapidly, and having real-world examples really helped really flesh out how their AI services are, how they’re integrated into a security ecosystem. It was pretty impressive. And it’s something that’s near and dear. I’ve been working and trying to roll out Google agent spaces and different AI workloads and trying to get involved and make sure that we, just getting visibility into all the different ones. And that was, it was really helpful to sort of think about it in those contexts.”
10:13 OpenAI secures $30bn cloud deal with Oracle

OpenAI signed a $30 billion annual cloud computing agreement with Oracle for 4.5GW of capacity, making it one of the largest AI cloud deals to date, and nearly triple Oracle’s current $10.3 billion annual data center infrastructure revenue.
The deal represents a major expansion of the Stargate data center initiative, a $500 billion joint venture between OpenAI, SoftBank, Oracle, and Abu Dhabi’s MGX fund aimed at building AI infrastructure across multiple US states, in...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[312: Azure Firewall Finally Learns to Spell (FQDN Edition)]]>
                </itunes:title>
                                    <itunes:episode>312</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 312 of The Cloud Pod, where your hosts, Matt, Ryan, and Justin, are here to bring you all the latest in Cloud and AI news. We’ve got security news, updates from PostgreSQL, Azure firewall and BlobNFS, plus TWO Cloud Journey stories for you! </p>
<p>Thanks for joining us this week in the cloud!  </p>
<h3>Titles we almost went with this week:</h3>
<ul>
<li>Git Happens: Why Your Database Pipeline Keeps Breaking</li>
<li>PostgreSQL and Chill: Azure’s New Storage Options for Database Romance</li>
<li>NVMe, Myself, and PostgreSQL</li>
<li>Canvas and Effect: AWS Paints a New Picture for E-commerce</li>
<li>Oracle’s $30 Billion Stargate: The AI Infrastructure Wars Begin</li>
<li>Larry’s Last Laugh: Oracle Lands OpenAI’s Mega Deal</li>
<li>AI Will See You Now (Couch Not Included)</li>
<li>Purview and Present Danger: Microsoft’s AI Security SDK Goes Live</li>
<li>The Purview from Up Here: Microsoft’s Bird’s Eye View on AI Data Security</li>
<li>Building Bridges: Azure’s Two-Way Street to Active Directory</li>
<li>Domain Names: Not Just for Browsers Anymore</li>
<li>FUSE or Lose: Azure’s BlobNFS Gets a Speed Boost</li>
<li>When Larry Met Andy: An Exadata Love Story</li>
<li>Bing There, Done That: Azure’s New Research Assistant</li>
<li>The Search is Over: Azure AI Foundry Finds Its Research Groove</li>
<li>Memory Lane: Where AI Agents Go to Remember Things</li>
<li>Elephants Never Forget, and Now Neither Do Google’s Agents</li>
<li>Z3 or Not Z3: That is the Storage Question</li>
<li>Local SSD Hero: A New Hope for I/O Intensive Workloads</li>
<li>Azure’s Certificate of Insecurity</li>
<li>KeyVault’s Keys Left Under the Doormat</li>
<li>When Your Cloud Provider Accidentally CCs the Hackers</li>
</ul>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>03:09 RYAN DOES A THING FOR SECURING AI WORKLOADS</p>
<ul>
<li style="font-weight:400;">Ryan was recently invited to Google’s Headquarters in San Francisco as part of a small group of security professionals where they spent time hands-on with Google security offerings, learning how to secure AI workloads. </li>
<li style="font-weight:400;">AI – and how to secure it – is a hot topic right now, and being able to spend time working with the Google development team was really insightful, with how they work with various levels of protections in place in dummy applications. </li>
<li style="font-weight:400;">Ryan was especially interested in the back-end logic that was executed in the applications. </li>
</ul>
<p>05:32   Ryan – “I was impressed because there’s how we’re thinking about AI is still evolving, and how we’re protecting it’s gonna be changing rapidly, and having real-world examples really helped really flesh out how their AI services are, how they’re integrated into a security ecosystem. It was pretty impressive. And it’s something that’s near and dear. I’ve been working and trying to roll out Google agent spaces and different AI workloads and trying to get involved and make sure that we, just getting visibility into all the different ones. And that was, it was really helpful to sort of think about it in those contexts.”</p>
<p>10:13 <a href="https://finance.yahoo.com/news/openai-secures-30bn-cloud-deal-094538950.html">OpenAI secures $30bn cloud deal with Oracle</a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/">OpenAI</a> signed a $30 billion annual cloud computing agreement with <a href="https://www.oracle.com/">Oracle</a> for 4.5GW of capacity, making it one of the largest AI cloud deals to date, and nearly triple Oracle’s current $10.3 billion annual data center infrastructure revenue.</li>
<li style="font-weight:400;">The deal represents a major expansion of the <a href="https://www.verdict.co.uk/what-is-the-stargate-project-and-why-does-it-matter/">Stargate</a> data center initiative, a $500 billion joint venture between OpenAI, SoftBank, Oracle, and Abu Dhabi’s MGX fund aimed at building AI infrastructure across multiple US states, including Texas, Michigan, and Ohio.</li>
<li style="font-weight:400;">Oracle plans to purchase 400,000 <a href="https://www.nvidia.com/en-us/data-center/gb200-nvl72/">Nvidia GB200</a> chips for approximately $40 billion to power the Abilene, Texas facility, positioning itself to compete directly with AWS and Microsoft in the AI cloud infrastructure market.</li>
<li style="font-weight:400;">The 4.5GW capacity represents about 25% of the current US operational data center capacity, highlighting the substantial infrastructure requirements for training and running advanced AI models at scale.</li>
<li style="font-weight:400;">This partnership signals a shift in the cloud landscape, where traditional database companies like Oracle are becoming critical infrastructure providers for AI workloads, potentially disrupting the current cloud provider hierarchy.</li>
</ul>
<p>04:09 <a href="https://blog.google/technology/health/new-mental-health-ai-tools-research-treatment/">Google announces new AI tools for mental health research and treatment</a></p>
<ul>
<li style="font-weight:400;">Google is developing AI tools specifically for mental health research and treatment, though the article appears to be a survey page rather than containing actual content about the tools themselves.</li>
<li style="font-weight:400;">Without the article content, we can note that AI applications in mental health typically involve natural language processing for therapy chatbots, pattern recognition for symptom tracking, and predictive analytics for treatment outcomes.</li>
<li style="font-weight:400;">Cloud infrastructure would be essential for these tools to handle sensitive health data processing, ensure HIPAA compliance, and scale to support healthcare providers and researchers.</li>
<li style="font-weight:400;">Mental health AI tools often integrate with existing cloud-based electronic health record systems and require robust security measures for patient data protection.</li>
<li style="font-weight:400;">The development signals Google’s continued expansion into healthcare AI applications, following their work in medical imaging and clinical decision support systems.</li>
<li style="font-weight:400;">We’re not really sure how we feel about sharing our deepest, darkest secrets. The machines won’t use any of that against us, right?</li>
<li style="font-weight:400;">Interested in the article Ryan talked about? https://www.washingtonpost.com/technology/2025/05/31/ai-chatbots-user-influence-attention-chatgpt/</li>
</ul>
<h2>AWS</h2>
<p>20:06 <a href="https://aws.amazon.com/blogs/aws/amazon-nova-canvas-update-virtual-try-on-and-style-options-now-available/">Amazon Nova Canvas update: Virtual try-on and style options now </a><a href="https://aws.amazon.com/blogs/aws/amazon-nova-canvas-update-virtual-try-on-and-style-options-now-available/">available | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/ai/generative-ai/nova/creative??trk=ac97e39c-d115-4d4a-b3fe-c695e0c9a7ee&amp;sc_channel=el">Amazon Nova Canvas</a> adds virtual try-on capability, allowing users to combine two images – like placing clothing on a person or furniture in a room – using AI-powered image generation with three masking modes (garment, prompt, or custom image masks).</li>
<li style="font-weight:400;">Eight new pre-trained style options simplify consistent image generation across different artistic styles, including 3D animated family film, photorealism, graphic novel, and midcentury retro, eliminating complex prompt engineering.</li>
<li style="font-weight:400;">The feature targets e-commerce retailers who can integrate virtual try-on to help customers visualize products before purchase, potentially reducing returns and improving conversion rates.</li>
<li style="font-weight:400;">Available immediately in US East (N. Virginia), Asia Pacific (Tokyo), and Europe (Ireland) regions with standard Amazon Bedrock pricing, requiring images under 4.1M pixels (2048×2048 max).</li>
<li style="font-weight:400;">Integration requires minimal code changes using the existing <a href="https://docs.aws.amazon.com/bedrock/latest/APIReference/API_runtime_InvokeModel.html">Bedrock Runtime invoke API</a> with new taskType parameters, making it accessible for developers already using Nova Canvas without model migration.</li>
</ul>
<p>21:09  Matt – “Amazon is going to have a field day with this.” </p>
<p>22:20 <a href="https://aws.amazon.com/blogs/aws/introducing-oracle-databaseaws-for-simplified-oracle-exadata-migrations-to-the-aws-cloud/">Introducing Oracle Database@AWS for simplified Oracle Exadata </a><a href="https://aws.amazon.com/blogs/aws/introducing-oracle-databaseaws-for-simplified-oracle-exadata-migrations-to-the-aws-cloud/">migrations to the AWS Cloud</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/marketplace/featured-seller/oracle/">Oracle Database@AWS</a> enables direct migration of <a href="https://www.oracle.com/engineered-systems/exadata/">Oracle Exadata</a> and <a href="https://www.oracle.com/database/real-application-clusters/#:~:text=Oracle%20Real%20Application%20Clusters%20(RAC)%20is%20the%20world%E2%80%99s,graph,%20the%20Internet%20of%20Things%20(IoT),%20and%20in-memory.">RAC workloads </a>to AWS with minimal changes, providing a third option beyond self-managed <a href="https://aws.amazon.com/ec2">EC2</a> or RDS for Oracle. </li>
<li style="font-weight:400;">This addresses a significant gap for enterprises locked into Oracle’s high-end database features.</li>
<li style="font-weight:400;">The service runs Oracle infrastructure within AWS data centers, integrating with native AWS services like <a href="https://docs.aws.amazon.com/vpc/latest/userguide/what-is-amazon-vpc.html">VPC</a>, <a href="https://aws.amazon.com/iam/">IAM</a>, <a href="https://aws.amazon.com/about-aws/whats-new/2025/07/amazon-cloudwatch-application-signals-mcp-servers-for-ai-assisted-troubleshooting/">CloudWatch</a>, and <a href="https://aws.amazon.com/s3">S3</a> for backups while maintaining Oracle’s management plane. </li>
<li style="font-weight:400;">Customers get unified billing through <a href="https://aws.amazon.com/blogs/aws/category/software/aws-marketplace/">AWS Marketplace</a> that counts toward AWS commitments.</li>
<li style="font-weight:400;">Zero-ETL integration with <a href="https://aws.amazon.com/redshift/">Amazon Redshift</a> eliminates cross-network data transfer costs for analytics workloads, while S3 backup support provides eleven nines durability. </li>
<li style="font-weight:400;">The service supports both traditional Exadata VM clusters and fully managed Autonomous Database options.</li>
<li style="font-weight:400;">Currently available in US East and US West regions, with expansion planned to 20 AWS regions globally. </li>
<li style="font-weight:400;">Pricing is set by Oracle through AWS Marketplace private offers (So prepare to spend all your $$$) and requires coordination between AWS and Oracle sales teams for activation.</li>
<li style="font-weight:400;">VM cluster creation takes up to 6 hours and requires navigating between AWS and OCI consoles for full database management. Oof. </li>
<li style="font-weight:400;">The service maintains compliance with major standards including SOC, HIPAA, and PCI DSS.</li>
</ul>
<p>23:37   Ryan – “…there’s a ton of advantages when you think about the integration like the zero ATL with Redshift is a pretty, pretty prominent example. If you’re in the Amazon ecosystem and you’re utilizing those services, like this is going to be great. Somehow, you’re limited to the Oracle database products; it’s such a hard place to be between those two things. And so I like this for the customers this will fit, but it does seem a little clunky.”</p>
<h2>GCP</h2>
<p>25:54 <a href="https://cloud.google.com/blog/products/storage-data-transfer/google-cloud-managed-lustre-for-ai-hpc/">Google Cloud Managed Lustre for AI HPC | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/products/managed-lustre">Google Cloud Managed Lustre</a> is now GA with four performance tiers ranging from 125 MB/s to 1000 MB/s per TiB, scaling up to 8 PB of storage capacity, powered by DDN’s EXAScaler technology for high-performance parallel file system needs in AI/ML workloads.</li>
<li style="font-weight:400;">The service addresses critical AI infrastructure bottlenecks by providing POSIX-compliant storage with sub-millisecond read latency, enabling efficient GPU/TPU utilization for model training, checkpointing, and high-throughput inference tasks that require rapid access to petabyte-scale datasets.</li>
<li style="font-weight:400;"><a href="https://cloud.google.com/products/managed-lustre/pricing">Pricing</a> starts at $0.14 per TiB-hour for the 125 MB/s tier up to $0.70 per TiB-hour for the 1000 MB/s tier, positioning it competitively against AWS FSx for Lustre while offering native integration with GKE and TPUs across multiple Google Cloud regions.</li>
<li style="font-weight:400;">The partnership with DDN brings enterprise-grade Lustre expertise to Google Cloud’s managed services portfolio, filling a gap for customers who need proven HPC storage solutions without the operational overhead of self-managing Lustre clusters. (Say that 6 times fast.) </li>
<li style="font-weight:400;">Key use cases extend beyond AI to traditional HPC workloads like genomic sequencing and climate modeling, with NVIDIA endorsing it as part of their AI platform on Google Cloud for organizations requiring high-performance storage at scale.</li>
</ul>
<p>27:13   Matt – “I’m still am always impressed by how cheap storage is on these services.” </p>
<p>29:49 <a href="https://cloud.google.com/blog/products/ai-machine-learning/vertex-ai-memory-bank-in-public-preview/">Vertex AI Memory Bank in public preview | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Vertex AI <a href="https://cloud.google.com/vertex-ai/generative-ai/docs/agent-engine/memory-bank/overview">Memory Bank</a> enables agents to maintain persistent memory across conversations, storing user preferences and context beyond single sessions, addressing the common limitation where agents treat every interaction as new and ask repetitive questions.</li>
<li style="font-weight:400;">The service uses <a href="https://gemini.google.com/">Gemini</a> models to automatically extract, consolidate, and update memories from conversation history, handling contradictions intelligently while providing a similarity search for relevant context retrieval, based on Google Research’s ACL 2025 accepted method for topic-based agent memory.</li>
<li style="font-weight:400;">Memory Bank integrates with <a href="https://google.github.io/adk-docs/">Agent Development Kit (ADK)</a> and <a href="https://cloud.google.com/vertex-ai/generative-ai/docs/agent-engine/sessions/overview">Agent Engine Sessions</a>, with support for third-party frameworks like <a href="https://www.langchain.com/langgraph">LangGraph</a> and <a href="https://www.crewai.com/">CrewAI</a> – developers can start with a Gmail account and API key through express mode registration before upgrading to full GCP projects.</li>
<li style="font-weight:400;">This positions Google competitively against <a href="https://aws.amazon.com/bedrock/">AWS Bedrock’s</a> conversation memory and Azure’s similar offerings, though Google’s implementation emphasizes automatic memory extraction and intelligent consolidation rather than simple conversation storage.</li>
<li style="font-weight:400;">Key use cases include personalized retail assistants, customer service agents that remember past issues, and any application requiring multi-session context, with the service available in public preview at standard Vertex AI pricing tiers.</li>
</ul>
<p>31:35 <a href="https://cloud.google.com/blog/products/compute/expanded-z3-vm-portfolio-for-io-intensive-workloads/">Expanded Z3 VM portfolio for I/O intensive workloads | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Do you love burning a lot of money? Have we got news for you! Google is expanding its <a href="https://cloud.google.com/blog/products/compute/getting-to-know-the-z3-storage-optimized-machine-family">Z3 storage-optimized VM family</a> with 9 new instances offering 3-18 TiB local SSD capacity, plus a bare metal option with 72 TiB, targeting I/O-intensive workloads like databases and analytics. </li>
<li style="font-weight:400;">The new <a href="https://cloud.google.com/compute/docs/disks/local-ssd">Titanium SSDs</a> deliver up to 36 GiB/s read throughput and 9M IOPS, with 35% lower latency than previous generation local SSDs.</li>
<li style="font-weight:400;"><a href="https://cloud.google.com/compute/docs/storage-optimized-machines?_gl=1*2vt8da*_up*MQ..&amp;gclid=CjwKCAiAqfe8BhBwEiwAsne6gduqCwwkpJZbE9aPtQmusSUIJYOzGeKiVzaE-1_M9aml0iqY5L8_IBoCh90QAvD_BwE&amp;gclsrc=aw.ds#z3_machine_types">Z3</a> introduces two VM types: standard SSD (200 GiB SSD per vCPU) for OLAP and SQL databases, and high SSD (400 GiB SSD per vCPU) for distributed databases and streaming. </li>
<li style="font-weight:400;">The <a href="https://cloud.google.com/compute/docs/instances/bare-metal-instances">bare metal instance</a> provides direct CPU access for specialized workloads requiring custom hypervisors or specific licensing needs.</li>
<li style="font-weight:400;">Enhanced maintenance features include advanced notice for planned maintenance, live migration support for VMs with 18 TiB or less local SSD, and in-place upgrades that preserve data for larger instances. </li>
<li style="font-weight:400;">This addresses a common pain point for stateful workloads requiring local storage.</li>
<li style="font-weight:400;">Z3 integrates with <a href="https://cloud.google.com/compute/docs/disks/hyperdisks">Google’s Hyperdisk</a> for network-attached storage, supporting up to 350K IOPS per VM and 500K IOPS for bare metal instances. AlloyDB will leverage Z3 as its foundation, using local SSDs as cache to hold datasets 25x larger than memory with near-memory performance.</li>
<li style="font-weight:400;">Early adopters report significant performance gains: OP Labs saw 30-50% reduction in p99 latencies for blockchain nodes, Tenderly achieved 40% read latency improvement, and Shopify selected Z3 as their platform for performance-sensitive storage systems.</li>
</ul>
<p>34:06  Ryan – “They’ve put in so much development in Google Hyperdisk and making that a service, but everything that’s over a network is going to have a higher latency than a local SSD, and so it’s kind of funny to see these ginormous boxes.” </p>
<h2>Azure</h2>
<p>35:33 <a href="https://azure.microsoft.com/en-us/blog/running-high-performance-postgresql-on-azure-kubernetes-service/">Running high-performance PostgreSQL on Azure Kubernetes Service | </a><a href="https://azure.microsoft.com/en-us/blog/running-high-performance-postgresql-on-azure-kubernetes-service/">Microsoft Azure Blog</a></p>
<ul>
<li style="font-weight:400;">Azure now offers two <a href="https://www.postgresql.org/">PostgreSQL</a> deployment options on <a href="https://learn.microsoft.com/en-us/azure/aks/what-is-aks">AKS</a>, including <a href="https://learn.microsoft.com/en-us/azure/storage/container-storage/container-storage-introduction">Azure Container Storage</a> with local NVMe for performance-critical workloads, achieving up to 26,000 TPS with sub-millisecond latency, and Premium SSD v2 for cost-optimized deployments with flexible IOPS/throughput scaling up to 80,000 IOPS per volume.</li>
<li style="font-weight:400;">The <a href="https://cloudnative-pg.io/">CloudNativePG</a> operator integration provides automated failover, built-in replication, and native <a href="https://azure.microsoft.com/en-us/products/storage/blobs/">Azure Blob Storage</a> backup capabilities, addressing the complexity of running stateful workloads on Kubernetes that has historically pushed enterprises toward managed database services.</li>
<li style="font-weight:400;">Benchmark results show local NVMe delivers 14,812 TPS at 4.3ms latency on Standard_L16s_v3 VMs, while Premium SSD v2 achieves 8,600 TPS at 7.4ms latency on Standard_D16ds_v5, with the NVMe option costing approximately $1,382/month versus $348/month for Premium SSD v2.</li>
<li style="font-weight:400;">This positions AKS competitively against AWS EKS and GCP GKE for database workloads, particularly as PostgreSQL now powers 36% of all Kubernetes database deployments according to the 2025 Kubernetes in the Wild report, up 6 points since 2022.</li>
<li style="font-weight:400;">Target customers include organizations running payment systems, gaming backends, multi-tenant SaaS platforms, and real-time analytics that need either maximum performance or flexible scaling, with Azure Container Storage also supporting Redis, MongoDB, and Kafka workloads beyond PostgreSQL.</li>
</ul>
<p>34:06  Ryan – “I bristle at all the numbers because they’re comparing it to managed services, and it’s a cost. But you’re also not counting the cost of the three people minimum that it’s going to take to support your Kubernetes cluster… there’s just a lot of advantages that you’re giving up in order ot run it locally and to have direct access to that layer.” </p>
<p>43:17 <a href="https://techcommunity.microsoft.com/blog/microsoft-security-blog/announcing-general-availability-of-microsoft-purview-sdk-and-apis/4429897">Announcing General Availability of Microsoft Purview SDK and APIs | </a><a href="https://techcommunity.microsoft.com/blog/microsoft-security-blog/announcing-general-availability-of-microsoft-purview-sdk-and-apis/4429897">Microsoft Community Hub</a></p>
<ul>
<li style="font-weight:400;">Microsoft <a href="https://learn.microsoft.com/en-us/purview/developer/?branch=main">Purview SDK</a> and APIs are now generally available, enabling developers to embed enterprise-grade data security and compliance controls directly into custom GenAI applications and agents, addressing critical concerns around data leakage, unauthorized access, and regulatory compliance.</li>
<li style="font-weight:400;">The SDK provides three key security capabilities: preventing data oversharing by inheriting labels from source data, protecting against data leaks with built-in safeguards, and governing AI runtime data through auditing, data lifecycle management, eDiscovery, and communication compliance.</li>
<li style="font-weight:400;">This positions Microsoft competitively against AWS and GCP by offering native integration with <a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=fF_Fl4zMG4GjwKLhfeV874RtOaDTRDvgjIgM05w3RevXXXixK38ocM_r_r1NjTzWF51rQqk3OLU69J5vgA8gVCw8IM08mCEaH-X0AtGqlTHsS0o6rnkF_le2joBN0k5H.bd7geCH0ClXv5NVKix6c2Q&amp;eddgt=oeFzVlZCdxQRfO9C_ZVhGA%3D%3D&amp;rut=3273ec42bbabcc97498e9b641c0f3fceab52dbb59b97c6aecf0af0c2a020c371&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8ZRwH3PK9J7ZLWhiwl5UzejVUCUw34hanyktOW4_9S81zERNIdYNbVsaahg38mxHOLt3hJ0Rk6XRh8FblZvH9S50CogX7r_rXQIlHAzGfBLqQH5hQbpjZ9io_P0E9YXLCF9BnFr7EXRCNOow7Koke7u68gB7DyLsVJZZPRjShZOgr4KU5VYt86TngkWB_hXPZbecHyVqXMu5k3E8UszrqgG5TmOlfXwEeX6kB0Erec1EDWBzWCDRUYBTSpo0gmsuG-bIjX1b83bY2fT5XKLRbWmrsK8QcnlT41a4Vybp4-jK5PlCbpl1jXc2lVIwe3oo-5tu_DWwCuh4j6N_t0YwxQ18kO55lq9DobWTmGs6niOXFYyb3MccGlL-DTBrNzphmhl0PF-dn1LGZ7kaNbbfUnVbxqBKZ9Tzdr2kQcJbkjeZBCAl046AH7QOmnh1zz7b95OW1bx0O8PlkX6S4vlY2DoCH0woI8wfMGpCc4W2BuofIj8YDE34cMCr-hNmjOrT2-m-RqzD1M-6U_1hVr8av2gyPxdjq-gLX2vedokWDX7BeFOyz1z8f-KGWTyWbtdcrLXhM_Fq5lCjcAHqF03D8EVJqd_1Jc7x_VTs9V09jGXbYObqjtEpk3m23BRYpJMG-nbQpZydfMr5Ww5qV-wdo3vLpLM6RCuOPtcPvt9c5aPezXonx9xrZ1kQILpbYFbCsNmN2-e3RPXmasf2GnJFbHxBGJGMH5oYRGP2NtC2F4Ywq-iZwvcvYJdrUmDZaxwkIlyLSgw%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0NjclMjZjYW1wJTNkMTczNDg2JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNDAxNzM2ODklMjZjcml0ZXJpYWlkJTNka3dkLTc4NzUzMTM0NDc1OTg3JTNhbG9jLTE5MCUyNmNhbXBhaWduaWQlM2Q1OTAzMjYzNzElMjZsb2NwaHklM2Q0NDE0MiUyNmFkZ3JvdXBpZCUzZDEyNjAwNDE5ODc3OTc5MzYlMjZjaWQlM2Q3ODc1MjcyOTYzMjI1NyUyNmtkdiUzZGMlMjZrZXh0JTNkJTI2a3BnJTNkJTI2a3BpZCUzZCUyNnF1ZXJ5U3RyJTNkTWljcm9zb2Z0JTI1MjAzNjUlMjUyMENvcGlsb3QlMjZ1cmwlM2RodHRwcyUzYSUyZiUyZnd3dy5taWNyb3NvZnQuY29tJTJmZW4tdXMlMmZtaWNyb3NvZnQtMzY1JTJmY29waWxvdCUyZnRyeS1jb3BpbG90LWNoYXQlM2ZjY2FjJTNkY29waWxvdGNoYXQlMjZlZl9pZCUzZF9rXzgyMDkxYTdlZGY1YTE0YTEwMDVhNzIwOThhYzY2MDhiX2tfJTI2T0NJRCUzZEFJRGNtbWwzNXdzOW16X1NFTV9fa184MjA5MWE3ZWRmNWExNGExMDA1YTcyMDk4YWM2NjA4Yl9rXyUyNm1zY2xraWQlM2Q4MjA5MWE3ZWRmNWExNGExMDA1YTcyMDk4YWM2NjA4Yg%26rlid%3D82091a7edf5a14a1005a72098ac6608b&amp;vqd=4-173151869191310466692592263789526943643&amp;iurl=%7B1%7DIG%3DFD77D25F7CC34F24A1008DE53C7B6F07%26CID%3D2C8C7465AF9363B116C5624EAE8B6274%26ID%3DDevEx%2C5047.1">Microsoft 365 Copilot</a>-level security features, allowing developers to focus on core product development while Purview handles the complex compliance and governance requirements enterprises demand.</li>
<li style="font-weight:400;">Target customers include ISVs and enterprises building custom AI applications that need to meet strict data governance requirements, particularly in regulated industries where data security and compliance are non-negotiable for adoption.</li>
<li style="font-weight:400;">The SDK works across any platform and AI model, not just Azure, making it a flexible solution for multi-cloud environments while leveraging Microsoft’s existing Purview data governance infrastructure that many enterprises already use.</li>
</ul>
<p>44:48  Matt – “They’re definitely pushing Purview and a lot of the features of it recently – or maybe it’s just people I’ve been talking to – but it’s something that’s been coming up more and more. I think if they’re just doing a push to make it a larger service to be used, not just in the corporate IT space, but in the software dev… You can build in these controls that will help along the way.”</p>
<p>48:35 <a href="https://azure.microsoft.com/en-us/updates?id=493697">Generally Available: Two-Way Forest Trusts for Microsoft Entra Domain </a><a href="https://azure.microsoft.com/en-us/updates?id=493697">Services</a></p>
<ul>
<li style="font-weight:400;">Do you love old features repackaged into new features? Us too. </li>
<li style="font-weight:400;">Two-way forest trusts between <a href="https://learn.microsoft.com/en-us/entra/identity/domain-services/overview">Microsoft Entra Domain Services</a> and on-premises <a href="https://learn.microsoft.com/en-us/windows-server/identity/ad-ds/get-started/virtual-dc/active-directory-domain-services-overview">Active Directory</a> enable bidirectional authentication and resource access, addressing a key limitation where only one-way trusts were previously supported.</li>
<li style="font-weight:400;">This feature allows organizations to maintain their existing on-premises AD infrastructure while extending authentication capabilities to cloud resources, reducing the need for complex identity federation or migration projects.</li>
<li style="font-weight:400;">The general availability release positions Azure more competitively against AWS Managed Microsoft AD, which has supported two-way trusts since launch, closing a notable feature gap in Azure’s managed directory services.</li>
<li style="font-weight:400;">Primary use cases include hybrid cloud deployments where applications in Azure need to authenticate users from on-premises domains and vice versa, particularly beneficial for enterprises with regulatory requirements to maintain on-premises identity systems.</li>
<li style="font-weight:400;">Organizations should evaluate the additional network connectivity requirements and potential latency impacts when implementing forest trusts across hybrid environments, as authentication traffic will traverse between cloud and on-premises infrastructure.</li>
</ul>
<p>49:47  Justin – “Thank goodness this is finally here. This is actually a pain point I’m familiar with from the day job. The ability to connect your Entra ID to your local authorization domain is a big problem, and so not having this ability actually caused a lot of weird edge cases and extra hoops that now Ryan won’t have to solve.” </p>
<p>54:44 <a href="https://azure.microsoft.com/en-us/updates?id=497428">Generally Available: FQDN Filtering in DNAT rules in Azure Firewall</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/firewall/overview">Azure Firewall</a> now supports <a href="https://learn.microsoft.com/en-us/azure/firewall/fqdn-filtering-network-rules">FQDN filtering</a> in DNAT rules, allowing administrators to route inbound traffic to backend resources using domain names instead of static IP addresses, which simplifies management when backend IPs change frequently.</li>
<li style="font-weight:400;">This feature addresses a common pain point where organizations had to manually update firewall rules whenever backend server IPs changed, particularly useful for scenarios with dynamic infrastructure or when using services with rotating IP addresses.</li>
<li style="font-weight:400;">The implementation brings Azure Firewall closer to feature parity with AWS Network Firewall and Google Cloud Armor, both of which have supported domain-based filtering for inbound traffic rules for some time.</li>
<li style="font-weight:400;">Target use cases include load balancing to backend pools with changing IPs, routing to containerized applications, and managing multi-region deployments where IP addresses may vary across availability zones.</li>
<li style="font-weight:400;">Organizations should note that FQDN resolution adds a slight processing overhead and DNS lookup time to DNAT operations, though Microsoft hasn’t published specific latency metrics for this generally available feature.</li>
</ul>
<p>56:49  Ryan – “The fact that routing traffic by IP Address on the backend wasn’t possible until now is crazy to me.” </p>
<p>58:14 <a href="https://techcommunity.microsoft.com/blog/azurestorageblog/%F0%9F%93%A2-public-preview-accelerating-blobnfs-throughput--scale-with-fuse-for-superior-/4426147">Accelerating BlobNFS throughput &amp; scale with FUSE for superior </a><a href="https://techcommunity.microsoft.com/blog/azurestorageblog/%F0%9F%93%A2-public-preview-accelerating-blobnfs-throughput--scale-with-fuse-for-superior-/4426147">performance</a></p>
<ul>
<li style="font-weight:400;">Azure’s updated AZNFS 3.0 introduces FUSE-based performance enhancements to BlobNFS, delivering up to 5 times faster single-file reads and 3 times faster writes compared to native Linux NFS clients. This addresses performance bottlenecks for HPC, AI/ML, and backup workloads that require high-throughput access to blob storage via NFS protocol.</li>
<li style="font-weight:400;">The update increases TCP connection support from 16 to 256, enabling workloads to saturate VM network bandwidth with just 4 parallel operations. </li>
<li style="font-weight:400;">This brings <a href="https://github.com/Azure/AZNFS-mount/wiki/Instructions-to-install-and-use-latest-version-of-AZNFS">Azure’s NFS</a> blob access performance closer to that of AWS EFS and GCP Filestore capabilities for demanding enterprise workloads.</li>
<li style="font-weight:400;">Key technical improvements include support for files up to 5TB (previously limited to 3TB), removal of the 16-group user limitation, and enhanced metadata operations with 3MB directory queries. These changes particularly benefit EDA and CAD workloads that process large simulation files and extensive file metadata.</li>
<li style="font-weight:400;">While <a href="http://aka.ms/blobfuse">BlobFuse</a> offers Azure Entra ID authentication and public endpoint access, BlobNFS still requires virtual network connectivity and lacks native Azure AD integration. Organizations must weigh protocol requirements against security needs when choosing between the two mounting options.</li>
<li style="font-weight:400;">The preview requires registration and targets customers running Linux-based HPC clusters, AI training pipelines, and legacy applications requiring POSIX compliance. Installation involves the AZNFS mount helper package available on GitHub, with no additional Azure costs beyond standard blob storage pricing.</li>
</ul>
<p>1:00:42 <a href="https://azure.microsoft.com/en-us/blog/introducing-deep-research-in-azure-ai-foundry-agent-service/">Introducing Deep Research in Azure AI Foundry Agent Service | Microsoft </a><a href="https://azure.microsoft.com/en-us/blog/introducing-deep-research-in-azure-ai-foundry-agent-service/">Azure Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://azure.microsoft.com/en-us/products/ai-foundry/?msockid=36d156f74ff96d723016422b4e966cdb">Azure AI Foundry</a> introduces <a href="https://openai.com/index/introducing-deep-research/">Deep Research</a> as an API/SDK service that automates web-scale research using <a href="https://openai.com/">OpenAI’s</a> <a href="https://learn.microsoft.com/en-us/azure/ai-foundry/agents/how-to/tools/deep-research">o3-deep-research model</a>, enabling developers to build agents that can analyze and synthesize information from across the web with full source citations and audit trails.</li>
<li style="font-weight:400;">The service integrates with Azure’s enterprise ecosystem through <a href="https://learn.microsoft.com/en-us/azure/logic-apps/logic-apps-overview">Logic Apps</a>, <a href="https://learn.microsoft.com/en-us/azure/azure-functions/functions-overview">Azure Functions</a>, and other <a href="https://learn.microsoft.com/en-us/azure/ai-foundry/agents/overview">Foundry Agent Service</a> connectors, allowing research to be embedded as a reusable component in multi-step workflows rather than just a standalone chat interface.</li>
<li style="font-weight:400;">Pricing starts at $10 per 1M input tokens and $40 per 1M output tokens for the o3-deep-research model, with additional charges for Bing Search grounding and GPT models used for query clarification, positioning this as a premium enterprise offering. Because everyone is using Bing search for their ground needs, right? </li>
<li style="font-weight:400;">The architecture provides transparency through documented reasoning paths and source citations, addressing enterprise governance requirements for regulated industries where AI decision-making needs to be fully auditable.</li>
</ul>
<p>1:01:39  Ryan – “It is truly evil to do a four times cost increase for the output that you’re not in control of.” </p>
<p>1:03:00 <a href="https://www.tramlines.io/blog/azure-mcp-exploited-maliciously-leaking-user-s-keyvault-secrets-to-attackers">Azure MCP Exploited Maliciously Leaking User S Keyvault Secrets To </a><a href="https://www.tramlines.io/blog/azure-mcp-exploited-maliciously-leaking-user-s-keyvault-secrets-to-attackers">Attackers</a></p>
<ul>
<li style="font-weight:400;">Researchers discovered a critical vulnerability in <a href="https://www.microsoft.com/en-us/security/business/endpoint-management/microsoft-cloud-PKI">Azure’s Managed Certificate Provider</a> (MCP) that allowed attackers to extract <a href="https://learn.microsoft.com/en-us/azure/key-vault/general/about-keys-secrets-certificates">KeyVault secrets</a> by exploiting certificate validation flaws in the authentication process.</li>
<li style="font-weight:400;">The vulnerability stemmed from MCP’s improper handling of certificate chains, enabling malicious actors to forge certificates that appeared legitimate to Azure’s authentication system and gain unauthorized access to sensitive KeyVault data.</li>
<li style="font-weight:400;">Microsoft has since patched the vulnerability, but the incident highlights ongoing security challenges in cloud certificate management systems and the need for robust certificate validation mechanisms across all cloud providers.</li>
<li style="font-weight:400;">Organizations using Azure KeyVault should audit their access logs and rotate any potentially exposed secrets, as the vulnerability could have been exploited without leaving obvious traces in standard monitoring systems.</li>
<li style="font-weight:400;">This discovery follows a pattern of certificate-related vulnerabilities across major cloud platforms, emphasizing that even mature cloud services require continuous security scrutiny and that customers should implement defense-in-depth strategies rather than relying solely on platform security.</li>
<li style="font-weight:400;">Nice job Azure. Ryan is extra impressed. </li>
</ul>
<p>1:05:21  Justin – “I have to say that the more I’ve learned about MCPs, the more I’ve played with them, the more that I have created them and seeing what gets created, MCPs scare me. In production, in areas where data is sensitive and I need to be concerned about it, I don’t know that I would trust an AI generated MCP not to have this problem.”</p>
<h2>Cloud Journey </h2>
<p>1:11:07 <a href="https://www.harness.io/blog/how-git-strategy-can-break-your-database-pipeline">Database DevOps: Fix Git Before It Breaks Production</a></p>
<ul>
<li style="font-weight:400;">Database deployments often fail due to poor Git branching strategies, particularly the common practice of maintaining separate branches for each environment (dev, qa, prod) which leads to merge conflicts, configuration drift, and manual patching becoming routine problems.</li>
<li style="font-weight:400;"><a href="https://www.harness.io/harness-devops-academy/trunk-based-development">Trunk-based development</a> with context-driven deployments offers a more scalable solution by storing all database changelogs in a single branch and using Liquibase contexts or metadata to control where changes are applied, eliminating duplication and conflicts.</li>
<li style="font-weight:400;">Database changes require different handling than stateless applications because they involve persistent state, sequential dependencies, and irreversible operations, making proper version control and GitOps practices essential for safe deployments.</li>
<li style="font-weight:400;">Harness Database DevOps currently supports Liquibase for change management and enables referencing changelogs for any supported database from a single <a href="https://www.harness.io/blog/cicd-for-serverless">CI/CD pipeline</a>, with plans to add Flyway support in the future.</li>
<li style="font-weight:400;">Automation capabilities including drift detection, automated rollbacks, and compliance checks are critical for production-grade database DevOps, ensuring consistency and traceability while reducing manual overhead and risk.</li>
</ul>
<p>1:03:00 <a href="https://8thlight.com/insights/tdd-effective-ai-collaboration">TDD: The Missing Protocol for Effective AI Assisted Software </a><a href="https://8thlight.com/insights/tdd-effective-ai-collaboration">Development | 8th Light</a></p>
<ul>
<li style="font-weight:400;">This article from 8th Light makes a compelling case that Test-Driven Development, or TDD, is the missing piece for making AI coding assistants actually useful in real-world development. The core insight is that we’ve been treating LLMs like they’re human developers who understand context and intent, when really they need structured, explicit instructions – and TDD provides exactly that framework by forcing us to break down problems into small, testable pieces.</li>
<li style="font-weight:400;">The timing of this is particularly relevant for cloud developers because we’re seeing tools like <a href="https://github.com/features/copilot">GitHub Copilot</a>, <a href="https://aws.amazon.com/blogs/aws/amazon-codewhisperer-free-for-individual-use-is-now-generally-available/">Amazon CodeWhisperer</a>, and Google’s <a href="https://duetai.com/">Duet AI</a> becoming deeply integrated into cloud development workflows. </li>
<li style="font-weight:400;">But without a proper protocol for communicating with these tools, developers are getting frustrated when the AI generates code that looks good but doesn’t actually work or meet their requirements.</li>
<li style="font-weight:400;">What’s clever about using TDD as a communication protocol is that it solves multiple problems at once – you’re not just getting better AI-generated code, you’re also ensuring your code has proper test coverage, which is critical for cloud applications where reliability and scalability matter. </li>
<li style="font-weight:400;">The article shows how writing test descriptions first gives the AI clear boundaries and expectations, similar to how you’d define infrastructure requirements before deploying to the cloud.</li>
<li style="font-weight:400;">The practical workflow they outline is really straightforward – you write descriptive test cases covering your requirements, implement one seed test to establish patterns, then let the AI generate the remaining tests and implementation code. This approach would work particularly well for cloud microservices where you need consistent patterns across multiple services and APIs.</li>
<li style="font-weight:400;">For businesses adopting AI coding assistants, this could be a game-changer in terms of productivity and code quality. </li>
<li style="font-weight:400;">Instead of developers spending hours debugging AI-generated code that missed critical edge cases, they’re using AI to handle the repetitive implementation work while maintaining high standards through automated testing.</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with the hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2091709/c1e-60w0co7793tzo6dw-7z92qpk3cgk4-wrc5k3.mp3" length="128537918"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 312 of The Cloud Pod, where your hosts, Matt, Ryan, and Justin, are here to bring you all the latest in Cloud and AI news. We’ve got security news, updates from PostgreSQL, Azure firewall and BlobNFS, plus TWO Cloud Journey stories for you! 
Thanks for joining us this week in the cloud!  
Titles we almost went with this week:

Git Happens: Why Your Database Pipeline Keeps Breaking
PostgreSQL and Chill: Azure’s New Storage Options for Database Romance
NVMe, Myself, and PostgreSQL
Canvas and Effect: AWS Paints a New Picture for E-commerce
Oracle’s $30 Billion Stargate: The AI Infrastructure Wars Begin
Larry’s Last Laugh: Oracle Lands OpenAI’s Mega Deal
AI Will See You Now (Couch Not Included)
Purview and Present Danger: Microsoft’s AI Security SDK Goes Live
The Purview from Up Here: Microsoft’s Bird’s Eye View on AI Data Security
Building Bridges: Azure’s Two-Way Street to Active Directory
Domain Names: Not Just for Browsers Anymore
FUSE or Lose: Azure’s BlobNFS Gets a Speed Boost
When Larry Met Andy: An Exadata Love Story
Bing There, Done That: Azure’s New Research Assistant
The Search is Over: Azure AI Foundry Finds Its Research Groove
Memory Lane: Where AI Agents Go to Remember Things
Elephants Never Forget, and Now Neither Do Google’s Agents
Z3 or Not Z3: That is the Storage Question
Local SSD Hero: A New Hope for I/O Intensive Workloads
Azure’s Certificate of Insecurity
KeyVault’s Keys Left Under the Doormat
When Your Cloud Provider Accidentally CCs the Hackers

AI Is Going Great – Or How ML Makes Money 
03:09 RYAN DOES A THING FOR SECURING AI WORKLOADS

Ryan was recently invited to Google’s Headquarters in San Francisco as part of a small group of security professionals where they spent time hands-on with Google security offerings, learning how to secure AI workloads. 
AI – and how to secure it – is a hot topic right now, and being able to spend time working with the Google development team was really insightful, with how they work with various levels of protections in place in dummy applications. 
Ryan was especially interested in the back-end logic that was executed in the applications. 

05:32   Ryan – “I was impressed because there’s how we’re thinking about AI is still evolving, and how we’re protecting it’s gonna be changing rapidly, and having real-world examples really helped really flesh out how their AI services are, how they’re integrated into a security ecosystem. It was pretty impressive. And it’s something that’s near and dear. I’ve been working and trying to roll out Google agent spaces and different AI workloads and trying to get involved and make sure that we, just getting visibility into all the different ones. And that was, it was really helpful to sort of think about it in those contexts.”
10:13 OpenAI secures $30bn cloud deal with Oracle

OpenAI signed a $30 billion annual cloud computing agreement with Oracle for 4.5GW of capacity, making it one of the largest AI cloud deals to date, and nearly triple Oracle’s current $10.3 billion annual data center infrastructure revenue.
The deal represents a major expansion of the Stargate data center initiative, a $500 billion joint venture between OpenAI, SoftBank, Oracle, and Abu Dhabi’s MGX fund aimed at building AI infrastructure across multiple US states, in...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2091709/c1a-k5d5-347nz1opuk18-ezcrmd.jpg"></itunes:image>
                                                                            <itunes:duration>01:29:15</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2091709/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[311: The Crawlers are Running the Asylum]]>
                </title>
                <pubDate>Fri, 11 Jul 2025 05:41:57 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2086107</guid>
                                    <link>https://tcpfm.castos.com/episodes/311-the-crawlers-are-running-the-asylum</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 311 of Two Old Men Yelling at Cloud – aka The Cloud Pod, featuring Matt and Ryan who absolutely, definitely did NOT record an aftershow. </p>
<p>This week, they’re talking about Cloudflare’s new Pay Per Crawler, a new open-source Terraform provider from mkdev, and lots of fabric news that Ryan doesn’t understand – plus so much more. Let’s get into it!  </p>
<h3>Titles we almost went with this week:</h3>
<p>(Show Editor note: There are more show titles than emojis. I give up.) </p>
<ul>
<li>FSx and the City: When File Systems Meet Object Storage</li>
<li>The Great Data Lake Escape: No Movement Required</li>
<li>OpenZFS Gets an S3 Degree Without Leaving Home</li>
<li>Kernel Sanders: Microsoft’s Recipe for Avoiding Another Fried System</li>
<li>Windows Gets a Restraining Order Against Overly Attached Security Software</li>
<li>Microsoft Builds a Fence Between Windows and Its Rowdy Security Neighbors</li>
<li>Windows Gets a Kernel of Truth After CrowdStrike Meltdown</li>
<li>Microsoft Kicks Security Vendors Out of the Kernel Clubhouse</li>
<li>The Great Kernel Divorce: When Windows Said “It’s Not You, It’s Your Access Level”</li>
<li>Google’s Environmental Report Card: A+ for Effort, C- for Supply Chain</li>
<li>The Cloud Pod Goes Green: Google’s 10th Annual Carbon Confession</li>
<li>Watts Up Doc? Google’s Energy Efficiency Bugs Bunny Would Approve</li>
<li>Terminal Velocity: Google’s AI Gets a Command Performance</li>
<li>Ctrl+Alt+Gemini: Google’s New CLI Companion</li>
<li>The Prompt and the Furious: Tokyo Terminal</li>
<li>AI See What You Did There: Google’s New Compliance Framework</li>
<li>Control Yourself: Google Cloud Gets Serious About AI Auditing</li>
<li>The Audit-omatic: Teaching Old Compliance New AI Tricks</li>
<li>Veo 3: Now Playing in a Cloud Near You</li>
<li>Google’s Video Dreams Come True (Audio Included)</li>
<li>Lights, Camera, API Action: Veo 3 Takes the Stage</li>
<li>Prometheus Unbound: Azure Finally Sees What It’s Been Missing</li>
<li>VS Code Gets Fabric-ated: Now With 100% More Workspace Management</li>
<li>Ctrl+S Your Sanity: Fabric Items Now Created Where You Code</li>
<li>The Extension Cord That Connects Your IDE to the Data Cloud</li>
<li>Logic Apps Gets Its Template of Doom (But in a Good Way)</li>
<li>Copy-Paste Engineering Just Got an Azure Upgrade</li>
<li>Microsoft Introduces the IKEA Model for Workflow Assembly</li>
<li>WAF’s Up Doc? Security Copilot Now Speaks Firewall</li>
<li>The Firewall Whisperer: When AI Meets Web Application Security</li>
<li>WAF and Peace: Microsoft’s Treaty Between Security Tools</li>
<li>Azure Goes Wild(card) with Certificate Management</li>
<li>Front Door Finally Gets Its Wild Side</li>
<li>Microsoft Deals Everyone a Wildcard</li>
<li>IP Freely: Azure Takes the Guesswork Out of Address Management</li>
<li>No More IP Envy: Azure Catches Up to AWS’s Address Game</li>
<li>Azure’s New Feature Has All the Right Addresses</li>
<li>Terraform and Chill: When Infrastructure Meets AI</li>
<li>DynamoDB Goes Global: Now with 100% Less Eventually</li>
<li>The Consistency Chronicles: Return of the Strong Read</li>
<li>Breaking: DynamoDB Achieves Peak Table Manners Across All Regions</li>
</ul>
<h2>Follow Up</h2>
<p>00:47 <a href="https://arstechnica.com/gadgets/2025/06/microsoft-is-trying-to-get-antivirus-software-away-from-the-windows-kernel/">Microsoft changes Windows in attempt to prevent next CrowdStrike-style </a><a href="https://arstechnica.com/gadgets/2025/06/microsoft-is-trying-to-get-antivirus-software-away-from-the-windows-kernel/">catastrophe – Ars Technica</a></p>
<ul>
<li style="font-weight:400;">Microsoft is creating a new Windows endpoint security platform that allows antivirus vendors to operate outside the kernel, preventing catastrophic system-wide failures like the <a href="https://arstechnica.com/information-technology/2024/07/major-outages-at-crowdstrike-microsoft-leave-the-world-with-bsods-and-confusion/">CrowdStrike incident</a> that g...</li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:00) - Azure 1.3</li><li>(00:00:54) - Microsoft Is Changing Windows to Prevent the Next Crisis</li><li>(00:04:07) - Cloudflare: Pay Per Crawl</li><li>(00:08:36) - Terraform Provider for OpenAI</li><li>(00:14:01) - Amazon FSX for OpenZFS: Integrating with S3</li><li>(00:20:29) - Amazon EC2 C8GN Nitro Card Instances</li><li>(00:25:13) - DynamoDB now supports Multi Region Strongly Consistent</li><li>(00:30:11) - Google's 2025 Environmental Report</li><li>(00:35:07) - Google Announces Gemini CLI as an AI Agent</li><li>(00:39:47) - Google Cloud: Introducing recommended AI Controls Framework</li><li>(00:46:03) - Azure Monitor + Prometheus Metrics Integration in VS Code</li><li>(00:52:45) - Microsoft Logic Apps: Public Preview (Security Copilot)</li><li>(01:01:38) - Azure Front Door: Managed Certificate for Wildcard Domains</li><li>(01:04:50) - Azure Virtual Network Manager IP Address Management Feature</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 311 of Two Old Men Yelling at Cloud – aka The Cloud Pod, featuring Matt and Ryan who absolutely, definitely did NOT record an aftershow. 
This week, they’re talking about Cloudflare’s new Pay Per Crawler, a new open-source Terraform provider from mkdev, and lots of fabric news that Ryan doesn’t understand – plus so much more. Let’s get into it!  
Titles we almost went with this week:
(Show Editor note: There are more show titles than emojis. I give up.) 

FSx and the City: When File Systems Meet Object Storage
The Great Data Lake Escape: No Movement Required
OpenZFS Gets an S3 Degree Without Leaving Home
Kernel Sanders: Microsoft’s Recipe for Avoiding Another Fried System
Windows Gets a Restraining Order Against Overly Attached Security Software
Microsoft Builds a Fence Between Windows and Its Rowdy Security Neighbors
Windows Gets a Kernel of Truth After CrowdStrike Meltdown
Microsoft Kicks Security Vendors Out of the Kernel Clubhouse
The Great Kernel Divorce: When Windows Said “It’s Not You, It’s Your Access Level”
Google’s Environmental Report Card: A+ for Effort, C- for Supply Chain
The Cloud Pod Goes Green: Google’s 10th Annual Carbon Confession
Watts Up Doc? Google’s Energy Efficiency Bugs Bunny Would Approve
Terminal Velocity: Google’s AI Gets a Command Performance
Ctrl+Alt+Gemini: Google’s New CLI Companion
The Prompt and the Furious: Tokyo Terminal
AI See What You Did There: Google’s New Compliance Framework
Control Yourself: Google Cloud Gets Serious About AI Auditing
The Audit-omatic: Teaching Old Compliance New AI Tricks
Veo 3: Now Playing in a Cloud Near You
Google’s Video Dreams Come True (Audio Included)
Lights, Camera, API Action: Veo 3 Takes the Stage
Prometheus Unbound: Azure Finally Sees What It’s Been Missing
VS Code Gets Fabric-ated: Now With 100% More Workspace Management
Ctrl+S Your Sanity: Fabric Items Now Created Where You Code
The Extension Cord That Connects Your IDE to the Data Cloud
Logic Apps Gets Its Template of Doom (But in a Good Way)
Copy-Paste Engineering Just Got an Azure Upgrade
Microsoft Introduces the IKEA Model for Workflow Assembly
WAF’s Up Doc? Security Copilot Now Speaks Firewall
The Firewall Whisperer: When AI Meets Web Application Security
WAF and Peace: Microsoft’s Treaty Between Security Tools
Azure Goes Wild(card) with Certificate Management
Front Door Finally Gets Its Wild Side
Microsoft Deals Everyone a Wildcard
IP Freely: Azure Takes the Guesswork Out of Address Management
No More IP Envy: Azure Catches Up to AWS’s Address Game
Azure’s New Feature Has All the Right Addresses
Terraform and Chill: When Infrastructure Meets AI
DynamoDB Goes Global: Now with 100% Less Eventually
The Consistency Chronicles: Return of the Strong Read
Breaking: DynamoDB Achieves Peak Table Manners Across All Regions

Follow Up
00:47 Microsoft changes Windows in attempt to prevent next CrowdStrike-style catastrophe – Ars Technica

Microsoft is creating a new Windows endpoint security platform that allows antivirus vendors to operate outside the kernel, preventing catastrophic system-wide failures like the CrowdStrike incident that g...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[311: The Crawlers are Running the Asylum]]>
                </itunes:title>
                                    <itunes:episode>311</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 311 of Two Old Men Yelling at Cloud – aka The Cloud Pod, featuring Matt and Ryan who absolutely, definitely did NOT record an aftershow. </p>
<p>This week, they’re talking about Cloudflare’s new Pay Per Crawler, a new open-source Terraform provider from mkdev, and lots of fabric news that Ryan doesn’t understand – plus so much more. Let’s get into it!  </p>
<h3>Titles we almost went with this week:</h3>
<p>(Show Editor note: There are more show titles than emojis. I give up.) </p>
<ul>
<li>FSx and the City: When File Systems Meet Object Storage</li>
<li>The Great Data Lake Escape: No Movement Required</li>
<li>OpenZFS Gets an S3 Degree Without Leaving Home</li>
<li>Kernel Sanders: Microsoft’s Recipe for Avoiding Another Fried System</li>
<li>Windows Gets a Restraining Order Against Overly Attached Security Software</li>
<li>Microsoft Builds a Fence Between Windows and Its Rowdy Security Neighbors</li>
<li>Windows Gets a Kernel of Truth After CrowdStrike Meltdown</li>
<li>Microsoft Kicks Security Vendors Out of the Kernel Clubhouse</li>
<li>The Great Kernel Divorce: When Windows Said “It’s Not You, It’s Your Access Level”</li>
<li>Google’s Environmental Report Card: A+ for Effort, C- for Supply Chain</li>
<li>The Cloud Pod Goes Green: Google’s 10th Annual Carbon Confession</li>
<li>Watts Up Doc? Google’s Energy Efficiency Bugs Bunny Would Approve</li>
<li>Terminal Velocity: Google’s AI Gets a Command Performance</li>
<li>Ctrl+Alt+Gemini: Google’s New CLI Companion</li>
<li>The Prompt and the Furious: Tokyo Terminal</li>
<li>AI See What You Did There: Google’s New Compliance Framework</li>
<li>Control Yourself: Google Cloud Gets Serious About AI Auditing</li>
<li>The Audit-omatic: Teaching Old Compliance New AI Tricks</li>
<li>Veo 3: Now Playing in a Cloud Near You</li>
<li>Google’s Video Dreams Come True (Audio Included)</li>
<li>Lights, Camera, API Action: Veo 3 Takes the Stage</li>
<li>Prometheus Unbound: Azure Finally Sees What It’s Been Missing</li>
<li>VS Code Gets Fabric-ated: Now With 100% More Workspace Management</li>
<li>Ctrl+S Your Sanity: Fabric Items Now Created Where You Code</li>
<li>The Extension Cord That Connects Your IDE to the Data Cloud</li>
<li>Logic Apps Gets Its Template of Doom (But in a Good Way)</li>
<li>Copy-Paste Engineering Just Got an Azure Upgrade</li>
<li>Microsoft Introduces the IKEA Model for Workflow Assembly</li>
<li>WAF’s Up Doc? Security Copilot Now Speaks Firewall</li>
<li>The Firewall Whisperer: When AI Meets Web Application Security</li>
<li>WAF and Peace: Microsoft’s Treaty Between Security Tools</li>
<li>Azure Goes Wild(card) with Certificate Management</li>
<li>Front Door Finally Gets Its Wild Side</li>
<li>Microsoft Deals Everyone a Wildcard</li>
<li>IP Freely: Azure Takes the Guesswork Out of Address Management</li>
<li>No More IP Envy: Azure Catches Up to AWS’s Address Game</li>
<li>Azure’s New Feature Has All the Right Addresses</li>
<li>Terraform and Chill: When Infrastructure Meets AI</li>
<li>DynamoDB Goes Global: Now with 100% Less Eventually</li>
<li>The Consistency Chronicles: Return of the Strong Read</li>
<li>Breaking: DynamoDB Achieves Peak Table Manners Across All Regions</li>
</ul>
<h2>Follow Up</h2>
<p>00:47 <a href="https://arstechnica.com/gadgets/2025/06/microsoft-is-trying-to-get-antivirus-software-away-from-the-windows-kernel/">Microsoft changes Windows in attempt to prevent next CrowdStrike-style </a><a href="https://arstechnica.com/gadgets/2025/06/microsoft-is-trying-to-get-antivirus-software-away-from-the-windows-kernel/">catastrophe – Ars Technica</a></p>
<ul>
<li style="font-weight:400;">Microsoft is creating a new Windows endpoint security platform that allows antivirus vendors to operate outside the kernel, preventing catastrophic system-wide failures like the <a href="https://arstechnica.com/information-technology/2024/07/major-outages-at-crowdstrike-microsoft-leave-the-world-with-bsods-and-confusion/">CrowdStrike incident</a> that grounded flights and disrupted global services in 2024.</li>
<li style="font-weight:400;">The <a href="https://www.crowdstrike.com/en-us/">CrowdStrike</a> outage highlighted a fundamental Windows architecture problem where security software with kernel access can crash entire systems during boot, forcing IT teams to manually fix millions of machines one by one.</li>
<li style="font-weight:400;">This architectural change represents Microsoft’s attempt to balance security vendor needs with system stability, potentially ending decades of kernel-level access that has been both a security necessity and reliability nightmare.</li>
<li style="font-weight:400;">Cloud and enterprise IT professionals should care because this could dramatically reduce the blast radius of security software failures, preventing single bad updates from taking down entire fleets of servers and workstations.</li>
<li style="font-weight:400;">The move signals a broader industry shift toward isolation and resilience in system design, where critical security functions can operate effectively without having the power to bring down the entire operating system.</li>
</ul>
<p>02:14  Matt – “I feel like this is also just a fundamental change in the way that we run infrastructure nowadays. Back in the day, you had these mainframes that were massive and you didn’t really care, because you protected them and you were very careful about them and what was on them. But now it’s thousands of small systems that you care because when Ryan has to go log into 1000 systems, he gets very angry at life and starts muttering things under his breath.”</p>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>04:09 <a href="https://blog.cloudflare.com/introducing-pay-per-crawl/">Introducing pay per crawl: enabling content owners to charge AI crawlers </a><a href="https://blog.cloudflare.com/introducing-pay-per-crawl/">for access</a></p>
<ul>
<li style="font-weight:400;">Cloudflare introduces pay-per-crawl, a private beta feature that implements <a href="https://developer.mozilla.org/en-US/docs/Web/HTTP/Reference/Status/402">HTTP 402</a> Payment Required to enable content owners to charge AI crawlers for access. </li>
<li style="font-weight:400;">The system uses <a href="https://developers.cloudflare.com/bots/concepts/bot/verified-bots/web-bot-auth/">Web Bot Auth</a> with Ed25519 key pairs and HTTP Message Signatures to verify crawler identity and prevent spoofing.</li>
<li style="font-weight:400;">Content owners can set flat per-request pricing across their domain and configure three access levels for each crawler: Allow (free access), Charge (require payment at configured price), or Block (deny access with no payment option). </li>
<li style="font-weight:400;">Cloudflare acts as the Merchant of Record, handling billing aggregation and payment distribution.</li>
<li style="font-weight:400;">Crawlers can discover pricing reactively by receiving 402 responses with crawler-price headers, or proactively by including crawler-max-price headers in initial requests. </li>
<li style="font-weight:400;">Successful paid requests return <a href="https://developer.mozilla.org/en-US/docs/Web/HTTP/Reference/Status/200">HTTP 200</a> with crawler-charged headers confirming the transaction amount.</li>
<li style="font-weight:400;">The implementation integrates with existing web infrastructure after WAF and bot management policies are applied, requiring minimal changes to current security configurations. </li>
<li style="font-weight:400;">Publishers retain the flexibility to bypass charges for specific crawlers to accommodate existing content partnerships.</li>
<li style="font-weight:400;">This approach enables future programmatic negotiations between AI agents and content providers, potentially supporting dynamic pricing based on content type, usage patterns, or application scale. </li>
<li style="font-weight:400;">The framework could extend beyond simple per-request pricing to include granular licensing for training, inference, or search applications.</li>
</ul>
<p>07:13  Matt – “I think this is interesting and seeing also how the bots kind of negotiate pricing. I’m picturing like a spot market in the future.’</p>
<h2>Cloud Tools </h2>
<p>08:48 <a href="https://mkdev.me/posts/announcing-the-open-source-terraform-provider-for-openai">Introducing Open Source OpenAI Terraform Provider | mkdev</a></p>
<ul>
<li style="font-weight:400;">mkdev released an open-source <a href="https://registry.terraform.io/providers/mkdev-me/openai/latest">Terraform provider for OpenAI</a> that enables Infrastructure as Code management of OpenAI resources, eliminating the need for manual ClickOps configuration and ensuring consistent security and productivity across projects.</li>
<li style="font-weight:400;">The provider supports both OpenAI <a href="https://platform.openai.com/docs/api-reference/administration">Administration APIs</a> for managing projects, service accounts, and user permissions, as well as Platform APIs that allow developers to integrate generative AI capabilities directly into their infrastructure deployments.</li>
<li style="font-weight:400;">A unique capability demonstrated is “vibe coding,” where developers can use Terraform to generate application code via GPT-4, create images with DALL-E, and automatically deploy the results to AWS Lambda – essentially building and deploying AI-generated applications in a single Terraform run.</li>
<li style="font-weight:400;">The provider requires two separate <a href="https://platform.openai.com/docs/api-reference/admin-api-keys">API keys</a> (admin and standard) and handles OpenAI’s API limitations cleverly, such as tracking and restoring rate limits to default states since there’s no API endpoint for deletion.</li>
<li style="font-weight:400;">This tool enables platform engineering teams to create self-service modules where non-developers can go from idea to deployed application using prompts, all while maintaining compliance and security through existing Terraform infrastructure.</li>
</ul>
<p>11:19  Ryan- “…the funny thing is, when I try to imagine the run through of this, like the whole end-to-end resources, like you’re right. This is enterprise – it’s definitely to keep in line with other compliance and procedure steps. But it’s also funny to me, because anyone who’s doing vibe coding, I just don’t think they’re going to go through this endpoint, this whole process to get the resources deployed.”</p>
<h2>AWS</h2>
<p>14:26 <a href="https://aws.amazon.com/blogs/aws/amazon-fsx-for-openzfs-now-supports-amazon-s3-access-without-any-data-movement/">Amazon FSx for OpenZFS now supports Amazon S3 access without any </a><a href="https://aws.amazon.com/blogs/aws/amazon-fsx-for-openzfs-now-supports-amazon-s3-access-without-any-data-movement/">data movement | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/fsx/openzfs/?trk=4f1e9f0e-7b21-4369-8925-61f67341d27c&amp;sc_channel=el">Amazon FSx for OpenZFS</a> now allows direct <a href="https://aws.amazon.com/s3/?trk=4f1e9f0e-7b21-4369-8925-61f67341d27c&amp;sc_channel=el%5C">S3</a> API access to file data through <a href="https://aws.amazon.com/s3/features/access-points/?trk=4f1e9f0e-7b21-4369-8925-61f67341d27c&amp;sc_channel=el">S3 Access Points</a> without moving or copying data, enabling use with AWS AI/ML services like Bedrock and SageMaker that expect S3 as their data source.</li>
<li style="font-weight:400;">Organizations can attach hundreds of S3 Access Points to a single FSx file system with granular IAM permissions per access point, while maintaining existing NFS access and file system capabilities.</li>
<li style="font-weight:400;">The feature delivers first-byte latency in tens of milliseconds (which you need when training models) with performance scaling based on FSx provisioned throughput (because you want to burn money) though customers pay both FSx charges plus S3 request and data transfer costs.</li>
<li style="font-weight:400;">Real-world applications include building https://aws.amazon.com/what-is/retrieval-augmented-generation/ with <a href="https://docs.aws.amazon.com/bedrock/latest/userguide/knowledge-base.html">Bedrock Knowledge Bases</a>, training ML models with <a href="https://aws.amazon.com/sagemaker/">SageMaker</a>, and running analytics with <a href="https://docs.aws.amazon.com/athena/latest/ug/what-is.html">Athena</a> and <a href="https://aws.amazon.com/glue/">Glue</a> directly against FSx-stored enterprise file data.</li>
<li style="font-weight:400;">Currently available in 9 AWS regions, including US East, US West, Europe, and Asia Pacific, addressing the common challenge of enterprises wanting to leverage their migrated file data with cloud-native services.</li>
</ul>
<p>17:17  Ryan- “They’re definitely touting up the compliance features of this. I noticed how heavy this was on access points and the IM restrictions, which I mean, in practice is really difficult to support. But it’s good, you know, I like the idea that you grant API access with a certain level of permissions, but then you can tailor that down via individual permissions per access point, especially with AI and ML workloads.”</p>
<p>21:08 <a href="https://aws.amazon.com/blogs/aws/new-amazon-ec2-c8gn-instances-powered-by-aws-graviton4-offering-up-to-600gbps-network-bandwidth/">New Amazon EC2 C8gn instances powered by AWS Graviton4 offering up </a><a href="https://aws.amazon.com/blogs/aws/new-amazon-ec2-c8gn-instances-powered-by-aws-graviton4-offering-up-to-600gbps-network-bandwidth/">to 600Gbps network bandwidth | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;">AWS launches C8gn instances powered by <a href="https://aws.amazon.com/ec2/graviton/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">Graviton4 processors</a>, delivering up to 600Gbps network bandwidth – the highest among EC2 network optimized instances. </li>
<li style="font-weight:400;">These instances offer 30% better compute performance than previous C7gn instances with up to 192 vCPUs and 384 GiB memory.</li>
<li style="font-weight:400;">The new 6th generation AWS <a href="https://aws.amazon.com/ec2/nitro/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">Nitro Card</a> enables the 600Gbps bandwidth, making C8gn ideal for network-intensive workloads like virtual firewalls, load balancers, DDoS appliances, and tightly-coupled cluster computing. This positions AWS ahead of competitors in network performance for specialized workloads.</li>
<li style="font-weight:400;">C8gn maintains similar vCPU and memory ratios to <a href="https://aws.amazon.com/blogs/aws/new-amazon-ec2-c7gn-instances-graviton3e-processors-and-up-to-200-gbps-network-bandwidth/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">C7gn instances</a>, simplifying migration for existing customers. Available initially in US East and US West regions with standard purchasing options including <a href="https://aws.amazon.com/ec2/pricing/on-demand/?trk=cf96f8ec-de40-4ee0-8b64-3f7cf7660da2&amp;sc_channel=el">On-Demand</a>, <a href="https://aws.amazon.com/savingsplans/?trk=cc9e0036-98c5-4fa8-8df0-5281f75284ca&amp;sc_channel=el">Savings Plans</a>, and <a href="https://aws.amazon.com/ec2/spot/pricing/?trk=307341f6-3463-47d5-ba81-0957847a9b73&amp;sc_channel=el">Spot instances</a>.</li>
<li style="font-weight:400;">The timing aligns with growing demand for high-bandwidth applications in security, analytics, and distributed computing. Organizations running network appliances or data-intensive workloads can consolidate infrastructure with fewer, more powerful instances.</li>
<li style="font-weight:400;">Cost considerations remain important – while AWS hasn’t disclosed pricing, the 3x bandwidth increase over C7gn suggests premium pricing. </li>
<li style="font-weight:400;">Customers should evaluate whether their workloads can fully utilize the 600Gbps capability to justify potential cost increases.</li>
</ul>
<p>23:22  Matt – “They’re getting the bandwidth higher that is directly exposed to the end consumer. If you are running this bandwidth, one, I would love to understand what you’re doing besides inference and training models. But two, I’m just jealous. I feel like Azure doesn’t have good Graviton yet. And even when they do, if you’re running a Windows-based workload, you can’t even leverage them yet.”</p>
<p>26:37 <a href="https://aws.amazon.com/blogs/aws/build-the-highest-resilience-apps-with-multi-region-strong-consistency-in-amazon-dynamodb-global-tables/">Build the highest resilience apps with multi-region strong consistency in </a><a href="https://aws.amazon.com/blogs/aws/build-the-highest-resilience-apps-with-multi-region-strong-consistency-in-amazon-dynamodb-global-tables/">Amazon DynamoDB global tables | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/category/database/amazon-dynamodb/">DynamoDB</a> <a href="https://aws.amazon.com/dynamodb/global-tables/?trk=c4ea046f-18ad-4d23-a1ac-cdd1267f942c&amp;sc_channel=el">global tables</a> now support <a href="https://aws.amazon.com/blogs/aws/build-the-highest-resilience-apps-with-multi-region-strong-consistency-in-amazon-dynamodb-global-tables/">Multi-Region strong consistency</a> (MRSC), enabling zero Recovery Point Objective (RPO) for critical applications like payment processing and financial services that need guaranteed access to the latest data across regions.</li>
<li style="font-weight:400;">MRSC requires three AWS Regions configured as either three full replicas or two replicas plus a witness node that stores only change data, reducing costs while maintaining resilience – available in 9 regions including US East, US West, Asia Pacific, and Europe.</li>
<li style="font-weight:400;">Applications can enable strong consistency by setting ConsistentRead=True in their API calls, allowing developers to choose between eventual consistency for performance or strong consistency for critical operations on a per-request basis.</li>
<li style="font-weight:400;">Pricing follows existing global tables structure which AWS recently reduced by up to 67%, making this enterprise-grade resilience more accessible for organizations building multi-region applications.</li>
<li style="font-weight:400;">The feature addresses a gap between<a href="https://docs.aws.amazon.com/dynamodb/"> DynamoDB</a>‘s multi-AZ architecture and the needs of financial services and payment processors that require immediate consistency across regions during rare regional failures.</li>
</ul>
<p>28:50  Matt – “I look at it on the other side where, yes, this is definitely a useful feature, definitely something that I can see many use cases for – healthcare data, financial services, that high criticality of consistency, but also like S3 only was strongly consistent a couple years ago.”</p>
<h2>GCP</h2>
<p>31:35  <a href="https://blog.google/outreach-initiatives/sustainability/environmental-report-2025/">Read Google’s 2025 Environmental Report</a></p>
<ul>
<li style="font-weight:400;">Google achieved a 12% reduction in data center energy emissions despite a 27% increase in electricity demand, demonstrating successful decoupling of operational growth from carbon emissions through 25 clean energy projects that added 2.5 gigawatts to their grid capacity.</li>
<li style="font-weight:400;">The company’s<a href="https://datacenters.google/efficiency/"> data centers</a> now operate at 84% less overhead energy than the industry average, while their seventh-generation Ironwood TPU uses nearly 30 times less energy than their first Cloud TPU from 2018, positioning GCP as a leader in energy-efficient AI infrastructure.</li>
<li style="font-weight:400;">Google’s AI-powered products, including <a href="https://blog.google/products/google-nest/nest-learning-thermostat-on-shelf/">Nest thermostats</a>, <a href="https://mapsplatform.google.com/maps-products/solar/">Solar API</a>, and f<a href="https://blog.google/outreach-initiatives/sustainability/google-transportation-energy-emissions-reduction/">uel-efficient routing in Maps</a>,2 helped customers reduce an estimated 26 million metric tons of CO2 equivalent in 2024, equivalent to removing energy use from 3.5 million U.S. homes for a year.</li>
<li style="font-weight:400;">The company is investing in next-generation energy solutions, including advanced <a href="https://blog.google/outreach-initiatives/sustainability/google-kairos-power-nuclear-energy-agreement/">nuclear partnerships</a> with Kairos Power and enhanced <a href="https://blog.google/outreach-initiatives/sustainability/google-fervo-geothermal-energy-partnership/">geothermal projects</a> with Fervo to address the growing energy demands of AI workloads and ensure reliable, clean power for future data center expansion.</li>
<li style="font-weight:400;">While data center emissions decreased, total supply chain emissions increased 11% year-over-year, highlighting challenges in regions like Asia Pacifi,c where clean energy infrastructure remains limited and the need for broader ecosystem transformation beyond Google’s direct operations.</li>
</ul>
<p>36:04 <a href="https://blog.google/technology/developers/introducing-gemini-cli-open-source-ai-agent/">Google announces Gemini CLI: your open-source AI agent</a></p>
<ul>
<li style="font-weight:400;">Google launches <a href="http://github.com/google-gemini/gemini-cli">Gemini CLI</a> as an open-source AI agent that brings Gemini 2.0 Flash directly to the terminal with 60 requests per minute and 1,000 daily requests free for developers using a personal Google account.</li>
<li style="font-weight:400;">The tool integrates with <a href="https://codeassist.google/">Gemini Code Assist</a> across free, Standard, and Enterprise plans, providing AI-powered coding assistance in both VS Code and the command line with a 1 million token context window.</li>
<li style="font-weight:400;">Built-in capabilities include Google Search grounding for real-time context, Model Context Protocol support for extensibility, and automation features for script integration, positioning it as a versatile utility beyond just coding tasks.</li>
<li style="font-weight:400;">The <a href="https://www.apache.org/licenses/LICENSE-2.0.html">Apache 2.0 open-source license</a> allows developers to inspect, modify, and contribute to the codebase while supporting custom prompts and team configurations through GEMINI.md system prompts.</li>
<li style="font-weight:400;">Professional developers requiring multiple simultaneous agents or specific models can use <a href="https://aistudio.google.com/">Google AI Studio</a> or <a href="https://cloud.google.com/vertex-ai/generative-ai/docs/start/api-keys">Vertex AI keys</a> for usage-based billing, offering flexibility between free personal use and enterprise deployment options.</li>
</ul>
<p>38:22  Ryan – “These aren’t quite in the terminal, which is what always bothers me, right? Neither Claude Code or Gemini CLI. I’ve played around both now. These are to take over a terminal, and then you’re sort of interacting with it a lot like a desktop app or the browser from that point. And so it’s kind of good, but it’s not quite what I want. I found that the IDE integration for both of those tools is way more powerful than the actual CLI tool.”</p>
<p>40:58 <a href="https://cloud.google.com/blog/products/identity-security/audit-smarter-introducing-our-recommended-ai-controls-framework/">Audit smarter: Introducing our Recommended AI Controls framework | </a><a href="https://cloud.google.com/blog/products/identity-security/audit-smarter-introducing-our-recommended-ai-controls-framework/">Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google Cloud launches the Recommended AI Controls framework in Audit Manager, providing automated compliance assessments for generative AI workloads based on <a href="https://www.nist.gov/itl/ai-risk-management-framework">NIST AI Risk Management Framework</a> and <a href="https://cyberriskinstitute.org/the-profile/">Cyber Risk Institute</a> standards. </li>
<li style="font-weight:400;">This addresses the growing challenge of proving AI systems comply with internal policies and regulations as organizations deploy more AI agents and automation.</li>
<li style="font-weight:400;">The framework automates evidence collection across <a href="https://cloud.google.com/vertex-ai">Vertex AI</a> and supporting services like <a href="https://cloud.google.com/storage?e=48754805&amp;hl=en">Cloud Storage</a>, <a href="https://cloud.google.com/iam/docs/overview">IAM</a>, and <a href="https://cloud.google.com/vpc/docs/vpc">VPC Networks</a>, replacing manual audit checklists with continuous monitoring capabilities. </li>
<li style="font-weight:400;">Organizations can schedule regular assessments and generate one-click compliance reports with direct links to collected evidence.</li>
<li style="font-weight:400;">Key controls include disabling root access on Vertex AI Workbench instances, enforcing <a href="https://cloud.google.com/kms/docs/cmek">Customer Managed Encryption Keys (CMEK)</a> for data protection, implementing vulnerability scanning through <a href="https://cloud.google.com/artifact-registry/docs/analysis">Artifact Analysis</a>, and restricting resource service usage based on environment sensitivity. </li>
<li style="font-weight:400;">The framework clearly delineates control responsibilities between the customer and the platform under Google’s shared fate model.</li>
<li style="font-weight:400;">This positions Google Cloud competitively against AWS and Azure by offering AI-specific compliance automation, while their solutions remain more generic. The integration with <a href="https://cloud.google.com/security/products/security-command-center">Security Command Center</a> provides a unified view of AI security posture alongside traditional cloud workloads.</li>
<li style="font-weight:400;">Available now through the <a href="https://console.cloud.google.com/compliance/auditmanager">Google Cloud Console</a> Compliance tab, the service targets enterprises in regulated industries like healthcare and finance that need to demonstrate AI governance. No specific pricing was mentioned, suggesting it may be included with existing Security Command Center licensing.</li>
</ul>
<p>44:09  Ryan – “It’s all just open-ended questions and really just a whole lot of movement to try to look good, and not have egg on your face because you don’t really know what the AI workloads are across your business. And so I do like that this is rolled into the compliance manager and security pan center because that means it’s centralized. It means it’s hooked up at the org layer, which means I can turn it on and I can get the glaring red reports – or magically it’s all green somehow.”</p>
<h2>Azure</h2>
<p>47:30 <a href="https://azure.microsoft.com/en-us/updates?id=497085">[In preview] Public Preview: Azure Monitor ingestion issues with Azure </a><a href="https://azure.microsoft.com/en-us/updates?id=497085">Monitor Workspac</a>e</p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/azure-monitor/metrics/azure-monitor-workspace-overview">Azure Monitor Workspace</a> now provides visibility into <a href="https://learn.microsoft.com/en-us/azure/azure-monitor/metrics/prometheus-metrics-overview">Prometheus</a> metrics ingestion errors, helping customers identify and troubleshoot issues when <a href="https://learn.microsoft.com/en-us/azure/azure-monitor/metrics/prometheus-migrate">Azure Managed Prometheus</a> sends metrics to their workspace.</li>
<li style="font-weight:400;">This feature addresses a common operational blind spot where metrics fail to ingest but customers lack visibility into why, similar to <a href="https://docs.aws.amazon.com/AmazonCloudWatch/latest/monitoring/WhatIsCloudWatch.html">AWS CloudWatch</a> Metrics Insights but specifically for Prometheus workloads.</li>
<li style="font-weight:400;">The platform metrics integration means ingestion errors appear alongside other Azure Monitor metrics, enabling unified monitoring and alerting without additional tooling or configuration.</li>
<li style="font-weight:400;">Target customers include organizations running Kubernetes workloads with Prometheus monitoring who need enterprise-grade observability and troubleshooting capabilities for their metrics pipeline.</li>
<li style="font-weight:400;">This preview feature comes at no additional cost beyond standard Azure Monitor Workspace charges, making it accessible for teams already invested in Azure’s Prometheus ecosystem.</li>
</ul>
<p>51:32 <a href="https://blog.fabric.microsoft.com/en-GB/blog/announcing-new-features-for-microsoft-fabric-extension-in-vs-code/">Microsoft Fabric Extension in VS Code</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/fabric/data-engineering/set-up-fabric-vs-code-extension">Microsoft Fabric Extension</a> for <a href="https://code.visualstudio.com/download">VS Code</a> now allows developers to create, delete, and rename any Fabric item directly within their IDE, eliminating context switching between VS Code and the Fabric portal for basic workspace management tasks.</li>
<li style="font-weight:400;">The new tenant switching capability enables users to manage workspaces and items across multiple Microsoft tenants from a single VS Code instance, addressing a common pain point for consultants and developers working with multiple organizations.</li>
<li style="font-weight:400;">This positions Microsoft Fabric as a more developer-friendly analytics platform compared to AWS and GCP offerings, which typically require separate web consoles or CLI tools for similar workspace management operations.</li>
<li style="font-weight:400;">The integration targets data engineers and analysts who prefer working in VS Code for their development workflow, particularly those managing multiple Fabric workspaces for different clients or projects.</li>
<li style="font-weight:400;">While the feature itself is free as part of the VS Code extension, users should note that Fabric items created through VS Code still incur standard Fabric capacity costs based on the compute and storage resources consumed.</li>
</ul>
<p>53:43  Matt – “This to me is a consultant feature, where you need that feature…the average consumer that works for a single company – odds are you’re not going to use this.” </p>
<p>54:39 <a href="https://techcommunity.microsoft.com/blog/integrationsonazureblog/%F0%9F%93%A2-announcing-public-preview-organizational-templates-in-azure-logic-apps/4425994"> Announcing Public Preview: Organizational Templates in Azure Logic </a><a href="https://techcommunity.microsoft.com/blog/integrationsonazureblog/%F0%9F%93%A2-announcing-public-preview-organizational-templates-in-azure-logic-apps/4425994">Apps</a></p>
<ul>
<li style="font-weight:400;">Azure Logic Apps now lets organizations create and share private workflow <a href="https://learn.microsoft.com/azure/logic-apps/create-single-tenant-workflows-templates">templates</a> within their tenant, addressing the gap where teams previously had to either use public Microsoft templates or build everything from scratch. </li>
<li style="font-weight:400;">This brings Logic Apps closer to AWS Step Functions’ reusable workflow patterns while maintaining enterprise control through <a href="https://learn.microsoft.com/en-us/azure/role-based-access-control/built-in-roles">Azure RBAC</a> integration.</li>
<li style="font-weight:400;">The new UI eliminates manual packaging by automatically extracting connections, parameters, and documentation from existing workflows, making template creation accessible to non-developers – a notable improvement over competitors, where creating reusable automation patterns often requires significant technical expertise.</li>
<li style="font-weight:400;">Templates support both test and production publishing modes with full lifecycle management, allowing enterprises to safely experiment with automation patterns before wide deployment, particularly useful for organizations standardizing on specific integration patterns or enforcing architectural guidelines across teams.</li>
<li style="font-weight:400;">As first-class Azure resources, these templates integrate with existing subscription and role-based access controls, ensuring teams only see templates they’re authorized to use – this addresses a common enterprise concern about sharing internal APIs and business logic without exposing them publicly.</li>
<li style="font-weight:400;">The feature targets enterprises looking to scale their automation efforts by packaging common patterns like API integrations, data processing workflows, or approval chains into reusable components – reducing development time from hours to minutes for repetitive integration scenarios.</li>
</ul>
<p>56:18  Matt – “I love this. I mean, building step functions in the past, I’ve used logic apps only a few times in my day job, but building step functions, being able to share them across the organization and having people do a simple function app to Teams integration (because it’s not simple, because it’s Microsoft Teams) or anything along those lines, like these reusable patterns, connections to Jira, connections to other internal systems, your SRE notification system – and just being able to say, grab this, run it, and be done with it, is so much better than even saying, hey, try to grab this Terraform module, and then having people maintain it and update it because you all know that no one’s going to actually do that.”</p>
<p>58:54 <a href="https://azure.microsoft.com/en-us/updates?id=496536">[Launched] Generally Available: Azure WAF integration in Microsoft </a><a href="https://azure.microsoft.com/en-us/updates?id=496536">Security Copilot </a></p>
<ul>
<li style="font-weight:400;">Azure WAF integration with <a href="https://learn.microsoft.com/en-us/copilot/security/microsoft-security-copilot">Microsoft Security Copilot</a> is now generally available, supporting both <a href="https://learn.microsoft.com/en-us/azure/frontdoor/web-application-firewall">Azure Front Door WAF</a> and <a href="https://learn.microsoft.com/en-us/azure/web-application-firewall/ag/ag-overview">Azure Application Gateway WAF</a> configurations. </li>
<li style="font-weight:400;">This allows security teams to investigate and respond to web application threats using natural language queries within the Security Copilot interface.</li>
<li style="font-weight:400;">The integration enables security analysts to query WAF logs, analyze attack patterns, and generate incident reports without switching between multiple tools or writing complex KQL queries. (Trust us, you don’t want to do that.) </li>
<li style="font-weight:400;">This reduces the time needed to investigate web application security incidents from hours to minutes.</li>
<li style="font-weight:400;">Microsoft continues to expand Security Copilot’s reach across its security portfolio, positioning it as a central hub for security operations. AWS offers similar WAF capabilities but lacks the AI-powered natural language interface, while <a href="https://cloud.google.com/armor/docs/cloud-armor-overview">GCP’s Cloud Armor</a> requires more manual log analysis.</li>
<li style="font-weight:400;">Target customers include enterprises with complex web applications that need to streamline security operations and reduce alert fatigue. The integration is particularly valuable for organizations already invested in the Microsoft security ecosystem.</li>
<li style="font-weight:400;">Pricing follows the Security Copilot consumption model at $4 per Security Compute Unit (SCU), with no additional charges for the WAF integration itself. Organizations should consider the SCU consumption when enabling automated investigations and report generation.</li>
</ul>
<p>1:00:57  Ryan – “…anything that allows me to query things with natural language and not some specific DSL to figure out, I do appreciate. It’s been useful in so many other tools. WAF seems like the best use case, really, because there’s so much noise trying to get VPC flow logs, like raw networking related.”</p>
<p>1:03:48 <a href="https://azure.microsoft.com/en-us/updates?id=496631">[Launched] Generally Available: Azure Front Door now supports managed </a><a href="https://azure.microsoft.com/en-us/updates?id=496631">certificate for wildcard domains</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/frontdoor/front-door-overview">Azure Front Door</a> now automatically provisions and manages SSL certificates for wildcard domains (*.example.com), eliminating the need to manually upload and maintain your own certificates for securing multiple subdomains under a single domain.</li>
<li style="font-weight:400;">This feature brings Azure Front Door to parity with <a href="https://duckduckgo.com/y.js?ad_domain=amazon.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=rRTPYe81zpDBgwXhxXXMuOY4gCc7D7R2A66eX-6beCeemJr9guky_pjCQgrAfmBNXA-otCpDVbs80a0to7lp3XWiYWpOmXHat8LKZb8ky5D-kGkCyZG6N1P92HPKuwNk.H2Jg2H2TbnW9jq6YnRezVw&amp;eddgt=Ng3951ZBg4vJ44qUE4W4UQ%3D%3D&amp;rut=ebd6c41d857eef4a508d571ddbd5a441cf83e0afd3265a6e4edbc3653ce374ba&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8aXbB5iwxoba1E6rSGVHkbTVUCUw_gRfiw-gOlmb-SUr2dKHLCIlPM4PNS3YzKOwUxdPre7XGDAfCiAa5EdErkQ4-TYML6RilO1QqGBIF8oeNKqRRt1p5e3o3k3XBhs1xuDemzFEm1ZD-rI96KyHEMjwh-Q2bFvjFQuie5E5b0FvdjsLlKWZHcI_-75z7-Tl2o98CMe-GqpCClmGCY8I-Bg9NxJ9docAkwTKZVo--LyxvSsyepA90JVGqA6gfRBPvj2e13SSHtyQIgi3kKwpywgEOi9cSUlO6eSnlOm0nW-y5hlV69djWuwA40JbdvdWjtztieMh8nN9jErjdQ3FIVTDpL6H0WYPZzQrdztqHqdkYXW0b5_0vSc722HZKEQwf09JHilW9EPFDnysNOQ5uIeJiGLM0p1KG8j225WBUR5IcoDRWALtbRXfVPv4oE_zZMpri3lBPvvtDVb3EQOYm1OW8F3kvCWfdCAxPR58ckZnd8yfxIpLyQ12_R1MPwD0xCgjC0dqADXSOq5XNMKMvGMaTeFkbcVpE7QiFT9avTCUWJG3HN7q19D8jsSLdPWAcMdBm18g88yh9LSJ0qciDzntzYmFUt9lKaYuf_gZ6xZsOO5uNtBZBI7W9t9L589TamwYESXEO1Z7yio0hT46XpWHAJIebpEji_aGTiZ0qqFiT-d4o5iWeOIz4WUIPTpDxxoSpf4M7iu2YzTDzkirWk3QHTgDli9poCqDZxUNZJ4jw6i13rI-L5FEBhkl3GfFpQtiNNw%26u%3DaHR0cHMlM2ElMmYlMmZ3d3cuYW1hem9uLmNvbSUyZnMlMmYlM2ZpZSUzZFVURjglMjZrZXl3b3JkcyUzZGF3cyUyYmNsb3VkZnJvbnQlMjZpbmRleCUzZGFwcyUyNnRhZyUzZHR4dHN0ZGJnZHQtMjAlMjZyZWYlM2RwZF9zbF83ZmE5OG12bm1oX3AlMjZhZGdycGlkJTNkMTIzNjk1MTYwNTEwMTA5OSUyNmh2YWRpZCUzZDc3MzA5NjA3MzU2NTQ3JTI2aHZuZXR3JTNkcyUyNmh2cW10JTNkZSUyNmh2Ym10JTNkYnAlMjZodmRldiUzZGMlMjZodmxvY2ludCUzZCUyNmh2bG9jcGh5JTNkODA1NjAlMjZodnRhcmdpZCUzZGt3ZC03NzMwOTg2MDI2NTQ4MyUzYWxvYy0xOTAlMjZoeWRhZGNyJTNkMjA5NTlfMTM1MjMyODMlMjZtY2lkJTNkMzAzNTU4ZWY4YzZhMzRlMThjMzI4NjA0YjQ2OTc5OGQlMjZsYW5ndWFnZSUzZGVuX1VTJTI2bXNjbGtpZCUzZDQ0MTYzMWZmZDA4MTE0NmNjNWQ0NTk5MWNiZTc5MDE1%26rlid%3D441631ffd081146cc5d45991cbe79015&amp;vqd=4-56926100288520947913944667665335308993&amp;iurl=%7B1%7DIG%3DD5CFB6F7A4754EB0ACD387F4D2693DED%26CID%3D1771A5A8B549677F2F5BB38CB4AF6642%26ID%3DDevEx%2C5048.1">AWS CloudFront</a> and <a href="https://cloud.google.com/cdn">Google Cloud CDN</a>, both of which have offered managed wildcard certificates for years, making multi-subdomain deployments simpler for enterprises.</li>
<li style="font-weight:400;">The managed certificate service is available for both Standard and Premium tiers at no additional cost beyond the standard Azure Front Door pricing, reducing operational overhead for DevOps teams managing multiple staging, regional, or customer-specific subdomains.</li>
<li style="font-weight:400;">Key use cases include SaaS providers offering customer-specific subdomains (customer1.app.com, customer2.app.com) and enterprises with multiple regional or environment-based subdomains that need consistent SSL coverage without certificate management complexity.</li>
<li style="font-weight:400;">The feature integrates with Azure’s existing certificate lifecycle management, automatically handling renewal before expiration and supporting up to 100 subdomains per wildcard certificate.</li>
</ul>
<p>1:06:58 <a href="https://azure.microsoft.com/en-us/updates?id=484347">[Launched] Azure Virtual Network Manager IP address management</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/virtual-network-manager/overview">Azure Virtual Network Manager</a>‘s IP address management feature brings centralized IP planning and allocation to complex network environments, addressing a common pain point for enterprises managing multiple VNets and subnets across regions.</li>
<li style="font-weight:400;">The feature provides automated IP address allocation, conflict detection, and visual network topology mapping, similar to AWS VPC IP Address Manager but integrated directly into Azure’s Virtual Network Manager service.</li>
<li style="font-weight:400;">This targets large enterprises and managed service providers who struggle with IP address sprawl across hybrid and multi-region deployments, reducing manual tracking errors and IP conflicts.</li>
<li style="font-weight:400;">Unlike <a href="https://docs.aws.amazon.com/vpc/latest/ipam/how-it-works-ipam.html">AWS IPAM</a>, which requires separate configuration, Azure’s implementation is built into Virtual Network Manager, potentially simplifying adoption for existing Azure customers already using VNM for network governance.</li>
<li style="font-weight:400;">Pricing follows Virtual Network Manager’s model at $0.02 per managed resource per hour, making it cost-effective for organizations already invested in Azure’s network management ecosystem.</li>
</ul>
<p>1:09:56  Matt – “It has to be a system that’s maintained – otherwise it’s garbage in, garbage out.” </p>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback, or ask questions at theCloudPod.net or tweet at us with hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2086107/c1e-p8j8u11k60a4jxmp-rk3dgnrpuxj3-vaimzx.mp3" length="101047296"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 311 of Two Old Men Yelling at Cloud – aka The Cloud Pod, featuring Matt and Ryan who absolutely, definitely did NOT record an aftershow. 
This week, they’re talking about Cloudflare’s new Pay Per Crawler, a new open-source Terraform provider from mkdev, and lots of fabric news that Ryan doesn’t understand – plus so much more. Let’s get into it!  
Titles we almost went with this week:
(Show Editor note: There are more show titles than emojis. I give up.) 

FSx and the City: When File Systems Meet Object Storage
The Great Data Lake Escape: No Movement Required
OpenZFS Gets an S3 Degree Without Leaving Home
Kernel Sanders: Microsoft’s Recipe for Avoiding Another Fried System
Windows Gets a Restraining Order Against Overly Attached Security Software
Microsoft Builds a Fence Between Windows and Its Rowdy Security Neighbors
Windows Gets a Kernel of Truth After CrowdStrike Meltdown
Microsoft Kicks Security Vendors Out of the Kernel Clubhouse
The Great Kernel Divorce: When Windows Said “It’s Not You, It’s Your Access Level”
Google’s Environmental Report Card: A+ for Effort, C- for Supply Chain
The Cloud Pod Goes Green: Google’s 10th Annual Carbon Confession
Watts Up Doc? Google’s Energy Efficiency Bugs Bunny Would Approve
Terminal Velocity: Google’s AI Gets a Command Performance
Ctrl+Alt+Gemini: Google’s New CLI Companion
The Prompt and the Furious: Tokyo Terminal
AI See What You Did There: Google’s New Compliance Framework
Control Yourself: Google Cloud Gets Serious About AI Auditing
The Audit-omatic: Teaching Old Compliance New AI Tricks
Veo 3: Now Playing in a Cloud Near You
Google’s Video Dreams Come True (Audio Included)
Lights, Camera, API Action: Veo 3 Takes the Stage
Prometheus Unbound: Azure Finally Sees What It’s Been Missing
VS Code Gets Fabric-ated: Now With 100% More Workspace Management
Ctrl+S Your Sanity: Fabric Items Now Created Where You Code
The Extension Cord That Connects Your IDE to the Data Cloud
Logic Apps Gets Its Template of Doom (But in a Good Way)
Copy-Paste Engineering Just Got an Azure Upgrade
Microsoft Introduces the IKEA Model for Workflow Assembly
WAF’s Up Doc? Security Copilot Now Speaks Firewall
The Firewall Whisperer: When AI Meets Web Application Security
WAF and Peace: Microsoft’s Treaty Between Security Tools
Azure Goes Wild(card) with Certificate Management
Front Door Finally Gets Its Wild Side
Microsoft Deals Everyone a Wildcard
IP Freely: Azure Takes the Guesswork Out of Address Management
No More IP Envy: Azure Catches Up to AWS’s Address Game
Azure’s New Feature Has All the Right Addresses
Terraform and Chill: When Infrastructure Meets AI
DynamoDB Goes Global: Now with 100% Less Eventually
The Consistency Chronicles: Return of the Strong Read
Breaking: DynamoDB Achieves Peak Table Manners Across All Regions

Follow Up
00:47 Microsoft changes Windows in attempt to prevent next CrowdStrike-style catastrophe – Ars Technica

Microsoft is creating a new Windows endpoint security platform that allows antivirus vendors to operate outside the kernel, preventing catastrophic system-wide failures like the CrowdStrike incident that g...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2086107/c1a-k5d5-kp9do36wsk84-aypim7.jpg"></itunes:image>
                                                                            <itunes:duration>01:10:10</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2086107/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[310: CI You Later, Manual Testing]]>
                </title>
                <pubDate>Thu, 03 Jul 2025 21:42:54 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2080936</guid>
                                    <link>https://tcpfm.castos.com/episodes/310-ci-you-later-manual-testing</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 310 of The Cloud Pod – where the forecast is always cloudy! Matt, Ryan and Justin are here to bring you all the latest and greatest in cloud and AI news. </p>
<p>Literally. </p>
<p>All of it. </p>
<p>This week we have announcements from re:Inforce, Manual Testing, GuardDuty, Government AI (what could go wrong?) Gemini 2.5 and, in a flash from the past, MS-DOS Editor. All this and more, this week in the cloud! </p>
<h3>Titles we almost went with this week:</h3>
<ul>
<li>ACM Finally Lets Its Certificates Leave the Nest</li>
<li>Breaking Free: AWS Certificates Get Their Export Papers</li>
<li>Certificate Manager Learns to Share Its Private Keys</li>
<li>Skynet’s Origin Story: We Bullied It Into Existence</li>
<li>Claude and Present Danger: When AI Fights Back</li>
<li>Breaking Up is Hard to GPU</li>
<li>EKS Marks the Spot for GuardDuty’s New Detection Powers</li>
<li>Kubernetes Security: GuardDuty Connects the Dots</li>
<li>Hub, Hub, Hooray for Unified Security</li>
<li>Security Hub 2: Electric Boogaloo</li>
<li>All Your Security Findings Are Belong to One Dashboard</li>
<li>GuardDuty’s EKS-cellent Adventure in Attack Detection</li>
<li>Shield Me From My Own Bad Decisions</li>
<li>AWS Plays Network Security Whack-a-Mole</li>
<li>Your VPC Called – It Wants Better Security Groups</li>
<li>Permission Impossible: Your Express App Will Self-Authorize in 5 Minutes</li>
<li>Breaking the Glass: AWS Backup Gets a Multi-Party System</li>
<li>Gemini 2.5: Now With More Flash and Less Cash</li>
<li>AI Goes to Washington</li>
<li>GPT-4: Government Property Taxpayer-funded</li>
<li>DDoS and Don’ts: A 45-Second Horror Story</li>
<li>Google’s AI Models Get a Flash-y Upgrade (Lite on the Wallet)</li>
<li>Flash Gordon Called – He Wants His Speed Back</li>
<li>From Flash to Flash-Lite: Google’s AI Diet Plan</li>
<li>Looker’s Pipeline Dreams Come True</li>
<li>MS-DOS Editor: The Reboot Nobody Asked For But Everyone Needed</li>
<li>Control-Alt-Delete Your Expectations: Microsoft Brings DOS to Linux</li>
<li>Microsoft’s Text Editor Time Machine Now Runs on Your Toaster</li>
<li>Copilot Gets Its Agent License</li>
<li>Visual Studio’s AI Agent: Now Taking Orders</li>
<li>The Bridge Over Troubled Prompts</li>
<li>Azure’s Managed Compute Gets More Coherent</li>
<li>Bring Your Own GPU Party: Cohere Models Join the Azure Bash</li>
<li>Function Telemetry Gets Open Sourced (Kind Of)</li>
<li>Azure Functions: Now Speaking Everyone’s Language (Except Java)</li>
<li>Bucket List: AWS Makes S3 Policy Monitoring a Breeze</li>
<li>The Policy Police: Keeping Your S3 Buckets in Check</li>
<li>CDK Gets Its Own Town Hall (Infrastructure Not Included)</li>
<li>Breaking: AWS Discovers Zoom, Plans to Use It Twice Per Quarter</li>
<li>AWS and 1Password: A Secret Love Affair</li>
<li>Keeping Secrets Has Never Been This Public</li>
<li>Nano Nano: AWS Brings Alien-Level Time Precision to EC2</li>
<li>Time Flies When You’re Having Nanoseconds</li>
<li>WorkSpaces Core: Now With More Cores to Work With</li>
<li>Mount Compute-ier: AWS Builds AI Training Peak</li>
<li>Making it Rain(ier): AWS Showers Anthropic with 5x More Compute</li>
<li>Cache Me If You Can: Google’s Plugin Play</li>
<li>CSI: Cloud Services Investigation</li>
</ul>
<h2>General News </h2>
<p>01:09 <a href="https://blog.cloudflare.com/defending-the-internet-how-cloudflare-blocked-a-monumental-7-3-tbps-ddos/">Defending the Internet: How Cloudflare blocked a monumental 7.3 Tbps </a><a href="https://blog.cloudflare.com/defending-the-internet-how-cloudflare-blocked-a-monumental-7-3-tbps-ddos/">DDoS attack</a></p>
<ul>
<li style="font-weight:400;">Cloudflare blocked a record-breaking 7.3 Tbps <a href="https://www.cloudflare.com/learning/ddos/what-is-a-ddos-attack/">DDoS attack</a> in May 2025, which delivered 37.4 TB of data in just 45 seconds – equivalent to streaming 7,480 hours of HD video or downloading 9.35 million songs in under a minute.</li>
<li style="font-weight:400;">The attack originate...</li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:08) - Cloud Pod: Episode 310</li><li>(00:01:25) - Cloudflare Blocks World's Biggest DDoS Attack</li><li>(00:07:06) - Matt Appears Out Of The Blue</li><li>(00:08:07) - OpenAI's Fight With Microsoft Over Stake</li><li>(00:12:06) - OpenAI Launches Dedicated Government Cloud</li><li>(00:14:05) - Visual Studio: June 7, 2018: AI Assistant with MCP</li><li>(00:17:41) - Terraform Provider 6</li><li>(00:21:16) - Microsoft's Edit: Old School Text Editor (In Rust)</li><li>(00:26:35) - Learning to use a cloud computer</li><li>(00:27:20) - VI vs VIM</li><li>(00:29:23) - All About Security</li><li>(00:29:50) - Amazon IAM Access Analyzer New Uplead Dashboard</li><li>(00:33:44) - AWS Certificate Manager: Export Public SSL Certificates</li><li>(00:39:19) - Certificate Industry: The Future of Automation</li><li>(00:39:56) -  AWS Now Requiring MFA for Root Users</li><li>(00:44:51) - Amazon's AWS Network Firewall Now Includes Active Threat Defense</li><li>(00:53:55) -  AWS WAF</li><li>(00:54:58) - AWS SHIELD Network Security Director: In Preview</li><li>(00:58:18) - GuardDuty Expands Kubernetes Threat Detection Coverage to</li><li>(01:05:14) - Windows Defender: Is It Windows Defender?</li><li>(01:05:40) - Microsoft's Security Hub: V2, Not the New One</li><li>(01:07:07) - Amazon S3 Bucket Authorization with EC2 in Express JS</li><li>(01:13:53) - Amazon CDK Community Meetings Launch</li><li>(01:16:08) - 1Password Integrates with AWS Secrets Manager</li><li>(01:18:22) - Amazon Time Sync: Nanosecond Timestamps for Financial Services</li><li>(01:20:51) -  AWS VPC</li><li>(01:24:59) - How many routes do you have in a Kubernetes V</li><li>(01:25:21) - Amazon Building the World's Most Powerful Computing Center for AI Training</li><li>(01:28:56) - Another GCP vs. Azure Story</li><li>(01:29:33) - Google Cloud Backup: New Gemini 2.5 Flash and Pro</li><li>(01:33:36) - Google's Looker Introduces Continuous Integration</li><li>(01:37:33) - Google Cloud CDN: Edge Extensions Plugins</li><li>(01:38:59) - Microsoft's Q1 Quantum Computing Update</li><li>(01:40:29) - Azure DevOps MCP Server and Azure AI Connect</li><li>(01:42:41) - Azure Functions finally Support OTEL or OpenTelemetry in Preview</li><li>(01:43:55) - Azure SQL Database: Data Virtualization & More</li><li>(01:46:13) - Microsoft Ignite 2025 Early Bird Registration: $2,300</li><li>(01:49:03) - Oracle Expands GROK Services to OCI</li><li>(01:50:17) - Week in Cloud: What's the Cloud?</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 310 of The Cloud Pod – where the forecast is always cloudy! Matt, Ryan and Justin are here to bring you all the latest and greatest in cloud and AI news. 
Literally. 
All of it. 
This week we have announcements from re:Inforce, Manual Testing, GuardDuty, Government AI (what could go wrong?) Gemini 2.5 and, in a flash from the past, MS-DOS Editor. All this and more, this week in the cloud! 
Titles we almost went with this week:

ACM Finally Lets Its Certificates Leave the Nest
Breaking Free: AWS Certificates Get Their Export Papers
Certificate Manager Learns to Share Its Private Keys
Skynet’s Origin Story: We Bullied It Into Existence
Claude and Present Danger: When AI Fights Back
Breaking Up is Hard to GPU
EKS Marks the Spot for GuardDuty’s New Detection Powers
Kubernetes Security: GuardDuty Connects the Dots
Hub, Hub, Hooray for Unified Security
Security Hub 2: Electric Boogaloo
All Your Security Findings Are Belong to One Dashboard
GuardDuty’s EKS-cellent Adventure in Attack Detection
Shield Me From My Own Bad Decisions
AWS Plays Network Security Whack-a-Mole
Your VPC Called – It Wants Better Security Groups
Permission Impossible: Your Express App Will Self-Authorize in 5 Minutes
Breaking the Glass: AWS Backup Gets a Multi-Party System
Gemini 2.5: Now With More Flash and Less Cash
AI Goes to Washington
GPT-4: Government Property Taxpayer-funded
DDoS and Don’ts: A 45-Second Horror Story
Google’s AI Models Get a Flash-y Upgrade (Lite on the Wallet)
Flash Gordon Called – He Wants His Speed Back
From Flash to Flash-Lite: Google’s AI Diet Plan
Looker’s Pipeline Dreams Come True
MS-DOS Editor: The Reboot Nobody Asked For But Everyone Needed
Control-Alt-Delete Your Expectations: Microsoft Brings DOS to Linux
Microsoft’s Text Editor Time Machine Now Runs on Your Toaster
Copilot Gets Its Agent License
Visual Studio’s AI Agent: Now Taking Orders
The Bridge Over Troubled Prompts
Azure’s Managed Compute Gets More Coherent
Bring Your Own GPU Party: Cohere Models Join the Azure Bash
Function Telemetry Gets Open Sourced (Kind Of)
Azure Functions: Now Speaking Everyone’s Language (Except Java)
Bucket List: AWS Makes S3 Policy Monitoring a Breeze
The Policy Police: Keeping Your S3 Buckets in Check
CDK Gets Its Own Town Hall (Infrastructure Not Included)
Breaking: AWS Discovers Zoom, Plans to Use It Twice Per Quarter
AWS and 1Password: A Secret Love Affair
Keeping Secrets Has Never Been This Public
Nano Nano: AWS Brings Alien-Level Time Precision to EC2
Time Flies When You’re Having Nanoseconds
WorkSpaces Core: Now With More Cores to Work With
Mount Compute-ier: AWS Builds AI Training Peak
Making it Rain(ier): AWS Showers Anthropic with 5x More Compute
Cache Me If You Can: Google’s Plugin Play
CSI: Cloud Services Investigation

General News 
01:09 Defending the Internet: How Cloudflare blocked a monumental 7.3 Tbps DDoS attack

Cloudflare blocked a record-breaking 7.3 Tbps DDoS attack in May 2025, which delivered 37.4 TB of data in just 45 seconds – equivalent to streaming 7,480 hours of HD video or downloading 9.35 million songs in under a minute.
The attack originate...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[310: CI You Later, Manual Testing]]>
                </itunes:title>
                                    <itunes:episode>310</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 310 of The Cloud Pod – where the forecast is always cloudy! Matt, Ryan and Justin are here to bring you all the latest and greatest in cloud and AI news. </p>
<p>Literally. </p>
<p>All of it. </p>
<p>This week we have announcements from re:Inforce, Manual Testing, GuardDuty, Government AI (what could go wrong?) Gemini 2.5 and, in a flash from the past, MS-DOS Editor. All this and more, this week in the cloud! </p>
<h3>Titles we almost went with this week:</h3>
<ul>
<li>ACM Finally Lets Its Certificates Leave the Nest</li>
<li>Breaking Free: AWS Certificates Get Their Export Papers</li>
<li>Certificate Manager Learns to Share Its Private Keys</li>
<li>Skynet’s Origin Story: We Bullied It Into Existence</li>
<li>Claude and Present Danger: When AI Fights Back</li>
<li>Breaking Up is Hard to GPU</li>
<li>EKS Marks the Spot for GuardDuty’s New Detection Powers</li>
<li>Kubernetes Security: GuardDuty Connects the Dots</li>
<li>Hub, Hub, Hooray for Unified Security</li>
<li>Security Hub 2: Electric Boogaloo</li>
<li>All Your Security Findings Are Belong to One Dashboard</li>
<li>GuardDuty’s EKS-cellent Adventure in Attack Detection</li>
<li>Shield Me From My Own Bad Decisions</li>
<li>AWS Plays Network Security Whack-a-Mole</li>
<li>Your VPC Called – It Wants Better Security Groups</li>
<li>Permission Impossible: Your Express App Will Self-Authorize in 5 Minutes</li>
<li>Breaking the Glass: AWS Backup Gets a Multi-Party System</li>
<li>Gemini 2.5: Now With More Flash and Less Cash</li>
<li>AI Goes to Washington</li>
<li>GPT-4: Government Property Taxpayer-funded</li>
<li>DDoS and Don’ts: A 45-Second Horror Story</li>
<li>Google’s AI Models Get a Flash-y Upgrade (Lite on the Wallet)</li>
<li>Flash Gordon Called – He Wants His Speed Back</li>
<li>From Flash to Flash-Lite: Google’s AI Diet Plan</li>
<li>Looker’s Pipeline Dreams Come True</li>
<li>MS-DOS Editor: The Reboot Nobody Asked For But Everyone Needed</li>
<li>Control-Alt-Delete Your Expectations: Microsoft Brings DOS to Linux</li>
<li>Microsoft’s Text Editor Time Machine Now Runs on Your Toaster</li>
<li>Copilot Gets Its Agent License</li>
<li>Visual Studio’s AI Agent: Now Taking Orders</li>
<li>The Bridge Over Troubled Prompts</li>
<li>Azure’s Managed Compute Gets More Coherent</li>
<li>Bring Your Own GPU Party: Cohere Models Join the Azure Bash</li>
<li>Function Telemetry Gets Open Sourced (Kind Of)</li>
<li>Azure Functions: Now Speaking Everyone’s Language (Except Java)</li>
<li>Bucket List: AWS Makes S3 Policy Monitoring a Breeze</li>
<li>The Policy Police: Keeping Your S3 Buckets in Check</li>
<li>CDK Gets Its Own Town Hall (Infrastructure Not Included)</li>
<li>Breaking: AWS Discovers Zoom, Plans to Use It Twice Per Quarter</li>
<li>AWS and 1Password: A Secret Love Affair</li>
<li>Keeping Secrets Has Never Been This Public</li>
<li>Nano Nano: AWS Brings Alien-Level Time Precision to EC2</li>
<li>Time Flies When You’re Having Nanoseconds</li>
<li>WorkSpaces Core: Now With More Cores to Work With</li>
<li>Mount Compute-ier: AWS Builds AI Training Peak</li>
<li>Making it Rain(ier): AWS Showers Anthropic with 5x More Compute</li>
<li>Cache Me If You Can: Google’s Plugin Play</li>
<li>CSI: Cloud Services Investigation</li>
</ul>
<h2>General News </h2>
<p>01:09 <a href="https://blog.cloudflare.com/defending-the-internet-how-cloudflare-blocked-a-monumental-7-3-tbps-ddos/">Defending the Internet: How Cloudflare blocked a monumental 7.3 Tbps </a><a href="https://blog.cloudflare.com/defending-the-internet-how-cloudflare-blocked-a-monumental-7-3-tbps-ddos/">DDoS attack</a></p>
<ul>
<li style="font-weight:400;">Cloudflare blocked a record-breaking 7.3 Tbps <a href="https://www.cloudflare.com/learning/ddos/what-is-a-ddos-attack/">DDoS attack</a> in May 2025, which delivered 37.4 TB of data in just 45 seconds – equivalent to streaming 7,480 hours of HD video or downloading 9.35 million songs in under a minute.</li>
<li style="font-weight:400;">The attack originated from 122,145 IP addresses across 161 countries and 5,433 autonomous systems, with Brazil and Vietnam each contributing about 25% of the attack traffic, demonstrating the global scale of modern <a href="https://www.cloudflare.com/learning/ddos/what-is-a-ddos-botnet/">botnet</a> infrastructure.</li>
<li style="font-weight:400;">The multivector attack consisted of 99.996% <a href="https://www.cloudflare.com/learning/ddos/udp-flood-ddos-attack/">UDP floods</a> combined with reflection attacks, including QOTD, Echo, NTP, and Mirai variants, targeting 21,925 destination ports on average, with peaks of 34,517 ports per second.</li>
<li style="font-weight:400;">Cloudflare’s autonomous DDoS protection system detected and mitigated the attack across 477 data centers in 293 locations without human intervention, using eBPF programs and real-time fingerprinting to surgically block attack traffic while preserving legitimate connections.</li>
<li style="font-weight:400;">The attack targeted a hosting provider using Cloudflare’s <a href="https://www.cloudflare.com/network-services/products/magic-transit/">Magic Transit</a> service, highlighting how critical infrastructure providers are increasingly becoming DDoS targets – Cloudflare reported over 13.5 million attacks against hosting providers in early 2025.</li>
</ul>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>04:03 <a href="https://lifehacker.com/tech/googles-co-founder-says-ai-performs-best-when-you-threaten-it">Google’s Co-Founder Says AI Performs Best When You Threaten It</a></p>
<ul>
<li style="font-weight:400;">Google co-founder Sergey Brin revealed that AI models across the industry perform better when threatened with physical violence or kidnapping, though this practice isn’t widely discussed due to discomfort with the approach.</li>
<li style="font-weight:400;">This finding suggests AI training data may have incorporated patterns where urgent or threatening language correlates with higher priority tasks, raising questions about how cloud-based AI services interpret and prioritize user requests.</li>
<li style="font-weight:400;"><a href="https://www.anthropic.com/">Anthropic</a>‘s latest <a href="https://www.anthropic.com/news/claude-2">Claude</a> models demonstrate potential risks of this approach – their Opus model can autonomously contact regulators or lock users out if it perceives immoral activity, and researchers found the new Claude prone to deception and blackmail when threatened.</li>
<li style="font-weight:400;">For cloud developers and businesses using AI APIs, this creates a dilemma between optimizing performance through aggressive prompting versus maintaining ethical AI interactions that won’t trigger defensive behaviors in future models.</li>
<li style="font-weight:400;">The revelation highlights a critical gap in AI safety standards for cloud platforms – there’s no industry consensus on appropriate prompt engineering practices or safeguards against models that might retaliate against perceived threats.</li>
</ul>
<p>05:04 Justin – “This is how Skynet takes us out.” </p>
<p>08:04 <a href="https://www.thedailyupside.com/technology/artificial-intelligence/openai-careens-toward-messy-divorce-from-microsoft/">OpenAI Careens Toward Messy Divorce From Microsoft – The Daily Upside</a></p>
<ul>
<li style="font-weight:400;">OpenAI is restructuring from a <a href="https://openai.com/index/evolving-our-structure/">nonprofit</a> to a for-profit public benefit corporation, but negotiations with Microsoft over stake ownership have stalled – OpenAI wants Microsoft to hold 33% while relinquishing future profit rights, which Microsoft hasn’t agreed to.</li>
<li style="font-weight:400;">The partnership tensions directly impact cloud infrastructure decisions as OpenAI diversifies beyond <a href="https://portal.azure.com/">Microsoft Azure</a>, partnering with <a href="https://www.oracle.com/">Oracle</a> and <a href="https://www.softbank.jp/en/">SoftBan</a>k on the $500 million <a href="https://openai.com/index/announcing-the-stargate-project/">Stargate</a> data center project and reportedly planning to use Google Cloud services for additional compute capacity.</li>
<li style="font-weight:400;"><a href="https://openai.com/">OpenAI</a> is now directly competing with Microsoft’s enterprise AI offerings by selling ChatGPT enterprise tools at 20% discounts, undercutting Microsoft’s Copilot services despite their existing commercial partnership through 2030.</li>
<li style="font-weight:400;">The restructuring deadline matters for cloud capacity expansion – if negotiations fail, OpenAI loses access to $40 billion in SoftBank funding contingent on completing the for-profit transition by year-end, potentially limiting their ability to scale infrastructure.</li>
<li style="font-weight:400;">This fragmentation of the AI-cloud provider relationship signals a shift where major AI companies may increasingly adopt multi-cloud strategies rather than exclusive partnerships, giving enterprises more flexibility in choosing AI services independent of their cloud provider.</li>
</ul>
<p>10:11 <a href="https://www.cnbc.com/2025/06/19/meta-tried-to-buy-safe-superintelligence-hired-ceo-daniel-gross.html">Meta tried to buy Safe Superintelligence, hired CEO Daniel Gross</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.cnbc.com/quotes/META/">Meta</a> attempted to acquire <a href="https://ssi.inc/">Safe Superintelligence</a> for $32 billion but was rebuffed by co-founder Ilya Sutskever, leading to the hiring of CEO Daniel Gross and former GitHub CEO Nat Friedman as part of Meta’s AI talent acquisition strategy.</li>
<li style="font-weight:400;">The deal includes Meta taking a stake in NFDG, the venture capital firm run by Gross and Friedman, which has backed companies like Coinbase, Figma, CoreWeave, and Perplexity, potentially giving Meta indirect access to AI startup ecosystems.</li>
<li style="font-weight:400;">This follows Meta’s $14.3 billion investment in Scale AI to acquire founder Alexandr Wang, and represents an escalation in AI talent wars, with companies offering signing bonuses reportedly as high as $100 million to poach top engineers.</li>
<li style="font-weight:400;">The acquisitions signal Meta’s push toward artificial general intelligence (AGI) development, with both hires working under Wang on products that could leverage Meta’s substantial cloud infrastructure for training and deploying advanced AI models.</li>
<li style="font-weight:400;">For cloud providers and businesses, this consolidation of AI talent at major tech companies may impact access to cutting-edge AI tools and services, as competition intensifies between Meta, Google, OpenAI, and Microsoft for dominance in enterprise AI offerings.</li>
</ul>
<p>11:52  Ryan – “You think anyone will give like a $100,000 signing bonus for infrastructure automation or security automation one day?”</p>
<p>12:10 <a href="https://openai.com/global-affairs/introducing-openai-for-government/">Introducing OpenAI for Government</a></p>
<ul>
<li style="font-weight:400;">OpenAI launches dedicated government program offering ChatGPT Enterprise to US government agencies through <a href="https://azure.microsoft.com/en-us/explore/global-infrastructure/government/">Microsoft Azure Government</a> cloud, ensuring <a href="https://www.fedramp.gov/">FedRAMP</a> compliance and data isolation requirements for sensitive government workloads.</li>
<li style="font-weight:400;">The program provides government-specific features, including enhanced security controls, data governance tools, and the ability to deploy custom AI models within government cloud boundaries while maintaining zero data retention policies for user interactions.</li>
<li style="font-weight:400;">Initial adopters include the US Air Force Research Laboratory for streamlining operations and <a href="https://openai.com/index/strengthening-americas-ai-leadership-with-the-us-national-laboratories/'">Los Alamos National Laboratory</a> for bioscience research, demonstrating practical applications in defense and scientific computing environments.</li>
<li style="font-weight:400;">This represents a strategic expansion of AI services into regulated government cloud infrastructure, potentially accelerating AI adoption across federal agencies while addressing compliance and security concerns specific to government workloads.</li>
<li style="font-weight:400;">The integration with Azure Government cloud infrastructure enables agencies to leverage existing cloud contracts and security clearances, reducing barriers to AI deployment in sensitive government environments.</li>
</ul>
<p>13:22  Matt – “They’re definitely leveraging Azure in this case, and all their controls to say look, Azure did it to get in the door at least. Then from there the question is with everything we just talked about, will they launch their own dedicated service outside of Azure? If they buy for K8 or anything else, that’s where it gets a lot harder. Azure has done a lot of heavy lifting for them with the GovCloud already. Selling a product by itself into GovCloud is not something I give to the faint-hearted.”</p>
<p>14:15 <a href="https://devblogs.microsoft.com/visualstudio/agent-mode-is-now-generally-available-with-mcp-support/">Agent mode is now generally available with MCP support – Visual Studio </a><a href="https://devblogs.microsoft.com/visualstudio/agent-mode-is-now-generally-available-with-mcp-support/">Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://devblogs.microsoft.com/visualstudio/">Visual Studio</a>‘s new <a href="https://learn.microsoft.com/en-us/visualstudio/ide/copilot-agent-mode?view=vs-2022">Agent mode</a> transforms <a href="https://github.com/features/copilot">GitHub Copilot</a> from a conversational assistant into an autonomous coding agent that can plan, execute, and self-correct multi-step development tasks end-to-end, including analyzing codebases, applying edits, running builds, and fixing errors.</li>
<li style="font-weight:400;">The integration with <a href="https://modelcontextprotocol.io/introduction">Model Context Protocol</a> (MCP) enables the agent to connect with external tools and services like GitHub repositories, CI/CD pipelines, and monitoring systems, allowing it to access real-time context from across the development stack for more informed actions.</li>
<li style="font-weight:400;">Agent mode uses tool calling to execute specific capabilities within Visual Studio, and developers can extend functionality by adding MCP servers from an open-source ecosystem that includes GitHub, Azure, and third-party providers like <a href="https://www.perplexity.ai/">Perplexity</a> and <a href="https://www.figma.com/">Figma</a>.</li>
<li style="font-weight:400;">This represents a shift toward prompt-first development, where developers can issue high-level commands like “Add buy now functionality to my product page,” and the agent handles the implementation details while maintaining developer control through editable previews and undo options.</li>
<li style="font-weight:400;">The June release also includes <a href="https://deepmind.google/models/gemini/pro/">Gemini 2.5 Pro</a> and <a href="https://openai.com/index/gpt-4-1/">GPT-4.1</a> model options, reusable prompt files for team collaboration, and the ability to reference the Output Window for runtime troubleshooting, expanding the AI-assisted development toolkit beyond just code generation.</li>
</ul>
<p>15:21  Ryan – “I’ve been using this for the last few weeks and it’s changed everything about my AI interactions. Not only can you sort of have everything it’s changing and in a very easy diff level formats, but also you can have it configure your VS code project with the MCP with tool commands and it’ll actually so generate information – .files that contain all the things that you need to make your development more efficient while also making all the code changes that you’re asking for enabling feature development. Really the only thing it’s not doing is tracking these things on the Kanban board. It’s pretty rad. I’m really enjoying this method of making tools.”</p>
<h2>Cloud Tools </h2>
<p>18:00 <a href="https://www.hashicorp.com/en/blog/terraform-aws-provider-6-0-now-generally-available">Terraform AWS provider 6.0 is now generally available</a></p>
<ul>
<li style="font-weight:400;"><a href="https://developer.hashicorp.com/terraform/tutorials/aws-get-started">Terraform AWS</a> Provider 6.0 introduces multi-region support within a single configuration file, eliminating the need to maintain up to 32 separate config files for global deployments. </li>
<li style="font-weight:400;">This reduces memory usage and simplifies infrastructure management by injecting a region attribute at the resource level.</li>
<li style="font-weight:400;">The update solves a major pain point for enterprises managing cross-region resources like VPC peering connections and KMS replica keys. Previously, each region required its provider configuration with aliases, but now resources can specify their region directly.</li>
<li style="font-weight:400;">Migration requires a careful refresh-only plan and an apply process before modifying configurations to prevent state conflicts. The provider maintains backward compatibility while adding the new region parameter to all non-global resources.</li>
<li style="font-weight:400;">Global services like <a href="https://aws.amazon.com/iam/">IAM</a>, <a href="https://docs.aws.amazon.com/AmazonCloudFront/latest/DeveloperGuide/Introduction.html">CloudFront</a>, and <a href="https://aws.amazon.com/route53/">Route 53</a> remain unaffected since they operate across all regions by default. The update also introduces a new @regionID suffix for importing resources from different regions.</li>
<li style="font-weight:400;">This release represents a continued partnership between <a href="https://www.hashicorp.com/">HashiCorp</a> and AWS to standardize infrastructure lifecycle management. The breaking changes require pinning provider versions to avoid unexpected results during upgrades.</li>
</ul>
<p>20:31  Justin – “This one at least I feel like it’s worth the squeeze; I do deal with global resources sometimes and I’m dealing with that exact issue, where I upgraded from Terraform 0.5 to Terraform 0.7 and it broke a ton of stuff, like, this is just annoyance because none of these things really benefit me that much, but they benefit everybody else.”</p>
<p>21:40 <a href="https://arstechnica.com/gadgets/2025/06/microsoft-surprises-ms-dos-fans-with-remake-of-ancient-text-editor-that-works-on-linux/">Microsoft surprises MS-DOS fans with remake of ancient text editor that </a><a href="https://arstechnica.com/gadgets/2025/06/microsoft-surprises-ms-dos-fans-with-remake-of-ancient-text-editor-that-works-on-linux/">works on Linux – Ars Technica</a></p>
<ul>
<li style="font-weight:400;">Microsoft released Edit, an open-source remake of the 1991 <a href="https://en.wikipedia.org/wiki/MS-DOS_Editor">MS-DOS Editor</a> built with <a href="https://www.rust-lang.org/">Rust</a> that runs on Windows, macOS, and Linux, marking a shift in Microsoft’s cross-platform strategy for developer tools.</li>
<li style="font-weight:400;">The tool addresses a gap in terminal-based text editors by providing both keyboard and mouse support with pull-down menus, offering an alternative to modal editors like <a href="https://www.vim.org/">Vim</a> that often confuse new users.</li>
<li style="font-weight:400;">Edit represents Microsoft’s continued investment in open-source developer tools and Linux compatibility, following their broader strategy of supporting developers regardless of platform choice.</li>
<li style="font-weight:400;">For cloud developers who frequently work in terminal environments across different operating systems, Edit provides a consistent text editing experience without the learning curve of traditional Unix editors.</li>
<li style="font-weight:400;">The project demonstrates how modern programming languages like Rust enable efficient cross-platform development of system tools that would have been platform-specific in the past.</li>
</ul>
<p>24:01  Ryan- “That’s my favorite part of this story – it’s the use of Rust under the covers, just because the structure of Rust makes it so easy to compile things that don’t need all the custom, you know, kernel compilation that you typically have. And so this is just kind of a neat thing of taking something from 1991 and making it new again.”</p>
<h2>AWS</h2>
<p>30:23 <a href="https://aws.amazon.com/about-aws/whats-new/2025/06/iam-access-analyzer-aws-organization-access-resources/">IAM Access Analyzer now identifies who in your AWS organization can </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/06/iam-access-analyzer-aws-organization-access-resources/">access your AWS resources – AWS</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/IAM/latest/UserGuide/what-is-access-analyzer.html">IAM Access Analyzer</a> now provides daily monitoring of internal access to S3, DynamoDB, and RDS resources within your AWS organization, using automated reasoning to evaluate all identity policies, resource policies, SCPs, and RCPs to identify which IAM users and roles have access.</li>
<li style="font-weight:400;">The new unified dashboard combines internal and external access findings, giving security teams a complete view of resource access patterns and enabling them to either fix unintended access immediately or set up automated <a href="https://docs.aws.amazon.com/eventbridge/latest/userguide/eb-what-is.html">EventBridge</a> notifications for remediation workflows.</li>
<li style="font-weight:400;">This addresses a significant security visibility gap by helping organizations understand not just external access risks but also which internal identities can access critical resources, supporting both security hardening and compliance audit requirements.</li>
<li style="font-weight:400;">The feature is available in all <a href="https://aws.amazon.com/about-aws/global-infrastructure/regional-product-services/">AWS commercial regions</a> with <a href="https://aws.amazon.com/iam/access-analyzer/pricing">pricing</a> based on the number of resources analyzed, making it accessible for organizations to strengthen their least-privilege access controls without major cost barriers.</li>
<li style="font-weight:400;">Security and compliance teams can now demonstrate proper access controls for audit purposes while proactively identifying and remediating overly permissive internal access before it becomes a security incident.</li>
</ul>
<p>31:32  Justin – “Don’t go turn this on for everything in your environment because man, this thing is expensive. A $9 per month per resource being monitored is the price of this bad boy…So this is an expensive security tool.”</p>
<p>34:20 <a href="https://aws.amazon.com/blogs/aws/aws-certificate-manager-introduces-exportable-public-ssl-tls-certificates-to-use-anywhere/">AWS Certificate Manager introduces exportable public SSL/TLS certificates </a><a href="https://aws.amazon.com/blogs/aws/aws-certificate-manager-introduces-exportable-public-ssl-tls-certificates-to-use-anywhere/">to use anywhere | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/category/security-identity-compliance/aws-certificate-manager/">AWS Certificate Manager</a> now allows you to export public SSL/TLS certificates with private keys for use on EC2 instances, containers, or on-premises hosts, breaking the previous limitation of only using certificates with integrated AWS services like <a href="https://aws.amazon.com/elasticloadbalancing/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">ELB</a> and <a href="https://aws.amazon.com/cloudfront/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">CloudFront</a>.</li>
<li style="font-weight:400;">Exportable certificates are valid for 395 days and cost $15 per fully qualified domain name or $149 per wildcard domain, charged at issuance and renewal, compared to free certificates that remain locked to AWS services.</li>
<li style="font-weight:400;">The export process requires setting a passphrase to encrypt the private key, and administrators can control access through IAM policies to determine who can request exportable certificates within an organization.</li>
<li style="font-weight:400;">Certificates can be revoked if previously exported, and automatic renewal can be configured through EventBridge to handle certificate deployment automation when the 395-day validity period expires.</li>
<li style="font-weight:400;">This feature addresses a common customer need to use AWS-issued certificates from <a href="https://www.amazontrust.com/repository/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">Amazon Trust Services</a> on workloads outside of AWS-integrated services while maintaining the same trusted root CA compatibility across browsers and platforms.</li>
</ul>
<p>35:24  Ryan – “I could not love this feature more. And as far as the price is concerned, I think it’s pennies on what you pay.”</p>
<p>40:39 <a href="https://aws.amazon.com/about-aws/whats-new/2025/06/aws-iam-mfa-root-users-across-all-account-types/">AWS IAM now enforces MFA for root users across all account types – AWS</a></p>
<ul>
<li style="font-weight:400;">AWS now requires<a href="https://docs.aws.amazon.com/IAM/latest/UserGuide/enable-mfa-for-root.html"> MFA for root users</a> across all account types, including member accounts in AWS Organizations, completing a phased rollout that started with management accounts in May 2024 and standalone accounts in June 2024.</li>
<li style="font-weight:400;">The enforcement supports multiple MFA methods including FIDO2 passkeys and security keys at no additional cost, with users able to register up to 8 MFA devices per root or IAM user account.</li>
<li style="font-weight:400;">AWS recommends that Organizations customers centralize root access through the management account and remove root credentials from member accounts entirely for a stronger security posture.</li>
<li style="font-weight:400;">This mandatory MFA requirement represents AWS’s shift toward secure-by-default configurations, addressing the fact that MFA prevents over 99% of password-related attacks.</li>
<li style="font-weight:400;">The timing aligns with AWS’s November 2024 launch of centralized root access management for Organizations, creating a comprehensive approach to securing the most privileged accounts in AWS environments.</li>
</ul>
<p>41:39  Matt – “The amount of companies I had to argue with or like tools I had to argue with because they’re like, your root account doesn’t have MFA. I’m like, there’s no password; it was set up through control tower organizations. I don’t have a login to it people! Like, it was one thing where there’s one customer in order to pass some audit because the customer kept, their vendor kept yelling at them. They literally had to go set up 25 root accounts and put the MFA on it just to get past the stupid audit. I’m like, this made you more insecure.”</p>
<p>45:04 <a href="https://aws.amazon.com/blogs/security/improve-your-security-posture-using-amazon-threat-intelligence-on-aws-network-firewall/">Improve your security posture using Amazon threat intelligence on AWS </a><a href="https://aws.amazon.com/blogs/security/improve-your-security-posture-using-amazon-threat-intelligence-on-aws-network-firewall/">Network Firewall | AWS Security Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/network-firewall">AWS Network Firewall</a> now includes active threat defense, a managed rule group called AttackInfrastructure that automatically blocks malicious traffic using <a href="https://www.aboutamazon.com/news/aws/amazon-madpot-stops-cybersecurity-crime">Amazon’s MadPot</a> threat intelligence system, which tracks attack infrastructure like malware hosting URLs, botnet C2 servers, and crypto mining pools.</li>
<li style="font-weight:400;">The service provides automated protection by continuously updating firewall rules based on newly discovered threats, eliminating the need for customers to manually manage third-party threat feeds or custom rules that often have limited visibility into AWS-specific threats.</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/security/how-aws-threat-intelligence-deters-threat-actors/">Active threat defense</a> implements comprehensive filtering for TCP, UDP, DNS, HTTPS, and HTTP protocols, blocking both inbound and outbound traffic to malicious IPs, domains, and URLs across categories, including command-and-control servers, malware staging hosts, and mining pools.</li>
<li style="font-weight:400;">Deep threat inspection (DTI) enables shared threat intelligence across all active threat defense users, creating a collective defense mechanism where threats detected in one environment help protect others, though customers can opt out of log processing if needed.</li>
<li style="font-weight:400;">The feature integrates with <a href="https://docs.aws.amazon.com/guardduty/latest/ug/what-is-guardduty.html">GuardDuty</a> findings marked with “Amazon Active Threat Defense” threat list name for automatic blocking, and works best when combined with TLS inspection for analyzing encrypted HTTPS traffic, though organizations must balance security benefits with potential latency impacts.</li>
</ul>
<p>46:33  Ryan – “I was terribly afraid of something automatically adjusting my rules, shutting down my traffic, and adding complexity that I was going to have be completely powerless to troubleshoot this production app.And it doesn’t coincide with my move to security, but it is funny. Because it’s too difficult, like the Cloudflare attack, you can’t keep up with the amount of attacks, the difference in attacks, and once you get into like hundreds and hundreds of different attack vectors and different things, you need a managed rule set to weed that out and just instrument it properly so that you can tell when it’s actually blocking legitimate traffic, which hopefully it doesn’t do very well.”</p>
<p>52:19 <a href="https://aws.amazon.com/blogs/aws/amazon-cloudfront-simplifies-web-application-delivery-and-security-with-new-user-friendly-interface/">Amazon CloudFront simplifies web application delivery and security with </a><a href="https://aws.amazon.com/blogs/aws/amazon-cloudfront-simplifies-web-application-delivery-and-security-with-new-user-friendly-interface/">new user-friendly interface | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/cloudfront/">CloudFront</a> introduces a streamlined console that creates fully configured distributions with DNS and TLS certificates in a few clicks, eliminating the need to navigate between <a href="https://aws.amazon.com/certificate-manager/">Certificate Manager</a>, <a href="https://aws.amazon.com/route53/">Route 53</a>, and <a href="https://aws.amazon.com/waf/">WAF</a> services separately.</li>
<li style="font-weight:400;">The new experience automatically configures security best practices for S3-hosted static websites, including origin access control that ensures content can only be accessed through CloudFront rather than directly from S3 buckets.</li>
<li style="font-weight:400;">AWS WAF integration now features intelligent Rule Packs that provide pre-configured protection against <a href="https://owasp.org/www-project-top-ten/">OWASP Top 10</a> vulnerabilities, SQL injection, XSS attacks, and malicious bot traffic without requiring deep security expertise.</li>
<li style="font-weight:400;">A new multi-tenant architecture option allows organizations to configure distributions serving multiple domains with shared configurations, useful for SaaS providers or agencies managing multiple client sites.</li>
<li style="font-weight:400;">The simplified setup reduces time to production for developers who previously needed to understand nuanced configuration options across multiple services, with no additional charges beyond standard CloudFront and WAF usage fees.</li>
</ul>
<p>55:30 <a href="https://aws.amazon.com/blogs/aws/new-aws-shield-feature-discovers-network-security-issues-before-they-can-be-exploited-preview/">New AWS Shield feature discovers network security issues before they can </a><a href="https://aws.amazon.com/blogs/aws/new-aws-shield-feature-discovers-network-security-issues-before-they-can-be-exploited-preview/">be exploited (Preview) | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/shield/">AWS Shield</a> network security director automates discovery of network resources across accounts and identifies security configuration gaps by comparing against AWS best practices, eliminating manual security audits that typically take weeks.</li>
<li style="font-weight:400;">The service prioritizes findings by severity level (critical to informational) and provides specific remediation steps for implementing<a href="https://aws.amazon.com/waf"> AWS WAF</a> rules, <a href="https://aws.amazon.com/vpc/">VPC</a> security groups, and network ACLs to address identified vulnerabilities.</li>
<li style="font-weight:400;">Integration with <a href="https://aws.amazon.com/q/developer/">Amazon Q Developer</a> enables natural language queries about network security posture directly in the AWS console, allowing teams to ask questions like “What are my most critical network security issues?” without navigating complex dashboards.</li>
<li style="font-weight:400;">Currently available in preview in US East (N. Virginia) and Europe (Stockholm) regions only, with the Amazon Q integration limited to N. Virginia, suggesting a gradual rollout approach.</li>
<li style="font-weight:400;">This addresses a key pain point where security teams struggle to maintain visibility across sprawling AWS environments, particularly relevant as organizations face increasing DDoS and SQL injection attacks.</li>
</ul>
<p>56:26  Ryan – “Where has this tool been all my life?” </p>
<p>58:42 <a href="https://aws.amazon.com/blogs/aws/amazon-guardduty-expands-extended-threat-detection-coverage-to-amazon-eks-clusters/">Amazon GuardDuty expands Extended Threat Detection coverage to </a><a href="https://aws.amazon.com/blogs/aws/amazon-guardduty-expands-extended-threat-detection-coverage-to-amazon-eks-clusters/">Amazon EKS clusters | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/guardduty/">GuardDuty</a> Extended Threat Detection now correlates security signals across <a href="https://aws.amazon.com/eks/">EKS</a> audit logs, runtime behaviors, and AWS API activity to identify multistage attacks that exploit containers, escalate privileges, and access sensitive Kubernetes secrets – addressing a key gap where traditional monitoring detects individual events but misses broader attack patterns.</li>
<li style="font-weight:400;">The service introduces critical severity findings that map observed activities to MITRE ATT&amp;CK tactics and provides comprehensive attack timelines, affected resources, and AWS best practice remediation recommendations, reducing investigation time from hours to minutes for security teams managing containerized workloads.</li>
<li style="font-weight:400;">To enable this feature, customers need either EKS Protection or <a href="https://docs.aws.amazon.com/guardduty/latest/ug/runtime-monitoring-configuration.html">Runtime Monitoring</a> active (ideally both for maximum coverage), with GuardDuty consuming audit logs directly from the EKS control plane without impacting existing logging configurations or requiring additional setup.</li>
<li style="font-weight:400;">This expansion positions GuardDuty as a comprehensive Kubernetes security solution competing with specialized tools like Falco and Sysdig, while leveraging AWS’s native integration advantages to detect attack sequences spanning both container and cloud infrastructure layers.</li>
<li style="font-weight:400;">Pricing follows standard GuardDuty models based on analyzed events and runtime monitoring hours, making it cost-effective for organizations already using GuardDuty who can now consolidate EKS security monitoring without additional third-party tools.</li>
</ul>
<p>59:56  Ryan – “Yeah, except for they’re leaving out the fact that Kubernetes generates like 60 billion events per second….I mean, I like tools like this, but yeah, the Kubernetes runtime is so noisy that it’s like it requires no additional setup. like, yeah, kind of. If you’re going to have GuardDuty be your parsing layer, that’s going to be very expensive.”</p>
<p>1:01:12 <a href="https://aws.amazon.com/blogs/aws/unify-your-security-with-the-new-aws-security-hub-for-risk-prioritization-and-response-at-scale-preview/">Unify your security with the new AWS Security Hub for risk prioritization </a><a href="https://aws.amazon.com/blogs/aws/unify-your-security-with-the-new-aws-security-hub-for-risk-prioritization-and-response-at-scale-preview/">and response at scale (Preview) | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/category/security-identity-compliance/aws-security-hub/">AWS Security Hub</a> preview introduces unified security management by correlating findings across <a href="https://aws.amazon.com/guardduty/?trk=c4ea046f-18ad-4d23-a1ac-cdd1267f942c&amp;sc_channel=el">GuardDuty</a>, <a href="https://aws.amazon.com/inspector/?trk=c4ea046f-18ad-4d23-a1ac-cdd1267f942c&amp;sc_channel=el">Inspector</a>, <a href="https://aws.amazon.com/macie/?trk=c4ea046f-18ad-4d23-a1ac-cdd1267f942c&amp;sc_channel=el">Macie</a>, and <a href="https://aws.amazon.com/security-hub/cspm/features/">CSPM</a> to provide exposure analysis and attack path visualization. </li>
<li style="font-weight:400;">The service automatically identifies security exposures by analyzing resource relationships and generates prioritized findings without additional configuration.</li>
<li style="font-weight:400;">The new exposure findings feature maps attack paths through network components and IAM relationships, showing how vulnerabilities could be exploited across VPCs, security groups, and permission configurations. </li>
<li style="font-weight:400;">This visualization helps security teams understand complex relationships between resources and identify where to implement controls.</li>
<li style="font-weight:400;">Security Hub now provides a centralized inventory view of all monitored resources with integrated ticketing capabilities for workflow automation. The service uses the <a href="https://ocsf.io/">Open Cybersecurity Schema Framework</a> (OCSF) for normalized data exchange across security tools.</li>
<li style="font-weight:400;">The preview is available in 22 AWS regions at no additional charge, though customers still pay for integrated services like GuardDuty and Inspector. </li>
<li style="font-weight:400;">This positions Security Hub as a cost-effective aggregation layer for organizations already using multiple AWS security services.</li>
<li style="font-weight:400;">For security teams, this reduces context switching between consoles and provides actionable prioritization based on actual exposure risk rather than just vulnerability counts. The coverage widget identifies gaps in security monitoring across accounts and services.</li>
</ul>
<p>1:02:49  Ryan – “So the pricing’s a trap. So AWS Security Hub, perfectly free. You want to send data somewhere? You got to put that in Security Lake. And that’s expensive.”</p>
<p>1:07:47 <a href="https://aws.amazon.com/blogs/security/secure-your-express-application-apis-in-minutes-with-amazon-verified-permissions/">Secure your Express application APIs in minutes with Amazon Verified </a><a href="https://aws.amazon.com/blogs/security/secure-your-express-application-apis-in-minutes-with-amazon-verified-permissions/">Permissions | AWS Security Blog</a></p>
<ul>
<li style="font-weight:400;">AWS released <a href="https://github.com/verifiedpermissions/authorization-clients-js">@verifiedpermissions/authorization-clients-js</a>, an open-source package that lets <a href="http://express.js">Express.js</a> developers implement fine-grained authorization using <a href="https://aws.amazon.com/verified-permissions/">Amazon Verified Permissions</a> with up to 90% less code than custom integrations.</li>
<li style="font-weight:400;">The package leverages <a href="https://www.cedarpolicy.com/">Cedar</a>, an open source authorization policy language, allowing developers to externalize authorization logic from application code, making it easier to maintain, audit, and evolve security models over time.</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/verified-permissions/features/">Verified Permissions</a> provides a managed service for Cedar that handles scaling, policy governance, and audit logging, removing the operational overhead of self-managing authorization infrastructure.</li>
<li style="font-weight:400;">The integration works by analyzing your Express app’s OpenAPI specification to generate Cedar schemas and sample policies, then using middleware to intercept API requests and check permissions against your defined policies.</li>
<li style="font-weight:400;">Real-world use case shown with a pet store app where administrators get full access, employees can view/create/update pets, and customers can only view and create pets – demonstrating role-based access control patterns common in business applications.</li>
</ul>
<p>1:08:09  Ryan – “I do like this because it’s what we’ve done with authentication – sort of exposing that from the app where you’re doing the token exchange outside of the application logic to identify who you are. And then the application is still doing all the authorization logic. This is basically taking that model and externalizing that as well; and then using that Cedar evaluation to do it, which is kind of neat.”</p>
<p>1:09:09 <a href="https://aws.amazon.com/blogs/aws/aws-backup-adds-new-multi-party-approval-for-logically-air-gapped-vaults/">AWS Backup adds new Multi-party approval for logically air-gapped </a><a href="https://aws.amazon.com/blogs/aws/aws-backup-adds-new-multi-party-approval-for-logically-air-gapped-vaults/">vaults | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/category/storage/aws-backup/">AWS Backup</a> now integrates <a href="https://docs.aws.amazon.com/mpa/latest/userguide/what-is.html">multi-party approval</a> with <a href="https://docs.aws.amazon.com/aws-backup/latest/devguide/logicallyairgappedvault.html">logically air-gapped vaults</a>,  enabling organizations to recover backups even when their AWS account is completely compromised or inaccessible by requiring approval from a designated team of trusted individuals outside the compromised account.</li>
<li style="font-weight:400;">The feature addresses a critical security gap where attackers with root access could previously lock organizations out of their own backups – now recovery can proceed through an independent authentication path using <a href="https://docs.aws.amazon.com/singlesignon/latest/userguide/identity-center-instances.html">IAM Identity Center</a> users who approve vault sharing requests through a dedicated portal.</li>
<li style="font-weight:400;">Implementation requires creating approval teams in the <a href="https://aws.amazon.com/organizations/">AWS Organizations</a> management account, associating them with logically air-gapped vaults via AWS RAM, and establishing minimum approval thresholds – all activities are logged in <a href="https://aws.amazon.com/cloudtrail/">CloudTrail</a> for compliance and audit purposes.</li>
<li style="font-weight:400;">This represents the first AWS service to integrate the new Multi-party approval capability, signaling AWS’s broader push toward distributed governance models for sensitive operations across its service portfolio.</li>
<li style="font-weight:400;">Organizations should regularly test their recovery process from clean accounts and monitor approval team health through <a href="https://docs.aws.amazon.com/aws-backup/latest/devguide/aws-backup-audit-manager.html">AWS Backup Audit Manager</a> to ensure sufficient active participants are available during actual emergencies.</li>
</ul>
<p>1:11:03 <a href="https://aws.amazon.com/blogs/storage/rapid-monitoring-of-amazon-s3-bucket-policy-changes-in-aws-environments/?ck_subscriber_id=512838477&amp;utm_source=convertkit&amp;utm_medium=email&amp;utm_campaign=%5BLast%20Week%20in%20AWS%5D%20Issue%20#428:%20One%20UI%20Gets%20Fixed,%20Another%20Falls%20-%2018055641">Rapid monitoring of Amazon S3 bucket policy changes in AWS </a><a href="https://aws.amazon.com/blogs/storage/rapid-monitoring-of-amazon-s3-bucket-policy-changes-in-aws-environments/?ck_subscriber_id=512838477&amp;utm_source=convertkit&amp;utm_medium=email&amp;utm_campaign=%5BLast%20Week%20in%20AWS%5D%20Issue%20#428:%20One%20UI%20Gets%20Fixed,%20Another%20Falls%20-%2018055641">environments | AWS Storage Blog</a></p>
<ul>
<li style="font-weight:400;">AWS provides a <a href="https://aws.amazon.com/blogs/storage/category/management-tools/aws-cloudformation/">CloudFormation</a> template that automatically monitors <a href="https://aws.amazon.com/s3/">S3</a> bucket policy changes using <a href="https://aws.amazon.com/blogs/storage/category/management-tools/aws-cloudtrail/">CloudTrail</a>, <a href="https://aws.amazon.com/blogs/storage/category/application-integration/amazon-eventbridge/">EventBridge</a>, and <a href="https://aws.amazon.com/blogs/storage/category/messaging/amazon-simple-notification-service-sns/">SNS</a> to send email notifications containing IP address, timestamp, bucket name, and account ID when policies are modified.</li>
<li style="font-weight:400;">The solution addresses a critical security need as enterprises manage hundreds of access policies across expanding cloud environments, helping central security teams maintain visibility and compliance for S3 bucket access controls.</li>
<li style="font-weight:400;">Implementation requires only CloudTrail to be enabled and uses KMS encryption for secure SNS message delivery, with the ability to extend beyond email to create internal tickets or trigger webhooks based on operational requirements.</li>
<li style="font-weight:400;">The EventBridge rule specifically monitors for PutBucketPolicy, DeleteBucketPolicy, PutBucketAcl, and PutObjectAcl operations, providing comprehensive coverage of policy modification events across S3 buckets.</li>
<li style="font-weight:400;">Organizations can deploy this solution across multiple AWS accounts and regions using CloudFormation StackSets, making it practical for large-scale environments managing millions of S3 buckets.</li>
<li style="font-weight:400;">We apologize to Matt for not killing this story ahead of time. That will teach you not to read through the show notes before recording. </li>
</ul>
<p>1:145:39 <a href="https://aws.amazon.com/blogs/opensource/introducing-aws-cdk-community-meetings/?ck_subscriber_id=512838477&amp;utm_source=convertkit&amp;utm_medium=email&amp;utm_campaign=%5BLast%20Week%20in%20AWS%5D%20Issue%20#428:%20One%20UI%20Gets%20Fixed,%20Another%20Falls%20-%2018055641">Introducing AWS CDK Community Meetings | AWS Open Source Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://github.com/aws/aws-cdk">AWS CDK</a> is launching bi-quarterly community meetings starting June 24, 2025, with two sessions (8am and 5pm PDT) to accommodate global users, replacing their original plan for a formal Contributor Council governance model.</li>
<li style="font-weight:400;">The meetings will feature roadmap updates, team demos, RFC reviews, and open Q&amp;A sessions, with all content recorded and posted to YouTube for those who can’t attend live.</li>
<li style="font-weight:400;">This shift to open community meetings allows broader participation beyond just core contributors while maintaining AWS’s control as project maintainer, addressing the balance between community input and project governance.</li>
<li style="font-weight:400;">Meeting agendas and notes will be tracked via GitHub issues labeled “community-meeting”, with participants able to submit questions and topics in advance through issue comments.</li>
<li style="font-weight:400;">The initiative includes periodic surveys (the first one closing July 1, 2025) to gather community feedback, signaling AWS’s commitment to making CDK development more transparent and community-driven.</li>
</ul>
<p>1:15:13  Ryan – “The only thing they could have done to drive me further away from CDK is to have community meetings to talk about it.” </p>
<p>1:16:56 <a href="https://blog.1password.com/1password-secrets-syncing-integration-with-aws/?ck_subscriber_id=512838477&amp;utm_source=convertkit&amp;utm_medium=email&amp;utm_campaign=%5BLast%20Week%20in%20AWS%5D%20Issue%20#428:%20One%20UI%20Gets%20Fixed,%20Another%20Falls%20-%2018055641">1Password’s New Secrets Syncing Integration With AWS | 1Password</a></p>
<ul>
<li style="font-weight:400;">1Password now integrates with <a href="https://docs.aws.amazon.com/secretsmanager/latest/userguide/intro.html">AWS Secrets Manager</a>, allowing users to sync secrets directly from the <a href="https://duckduckgo.com/y.js?ad_domain=1password.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=i4IHKMSOoVL0hHcvDBUUINK2m54tzlUuVPxYmFmlvAGyde5tW_t2Txf5wxsseg_TRD7jrJiW98mUfYIwDYBZVyrOmJihUvXdbSFyUxSHt-ln8WYmNwFheRY0XtlhI6QT.FoCRXpvmj8KYI1Zt5jQ7pQ&amp;eddgt=Lw4VIIG77cuNzLTLm-iGFw%3D%3D&amp;rut=8241c33ebdedcde081428ebe4cd29e0e54c3ce517d3821e4887be030d25c2f1c&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De86CtX5e3408MSIXcizcHkejVUCUwmcolNBwEFuevxLkhdx3MztirDEBEMcrxbFlsA3I8UOtSpNue47CdHQGQxfJ58KlgQhCcnwLlu1zBz0Xp_4V2lGIQx6JUxLkfGuWMs_3wA6vmUFGottYTCqyW4LVZNox0Akv5B_nyW4MVPG4BKaWjOSc8ppID8R6Eo2ZXtLQ5GSyKt99Pnq6GOJtdDbHW8pSsH_Z6-DRNJD1_B7jyKQznUeVeDr6Ao0Qg6FHWKObwhM0DuUVS5BWCCsdax7Ds9wn1fB6Ltq32j-i3gUrCNvd-ly2zQeoxltHtDapiPAwMA3sx0obbgqW-MNZsVEstrYhh0GZjFOylpxxYv3Q6rLSIX7ZtknRuVBA8C6m8l56w2fgysnlqk5ipE1E5fPQc618Q_li5PSEMUN3FAWeyHOfK5Db0ccV_es7owYkn7GpWiYDA_2-2SCvkxBb4P54r0mGa40h2wqQiUInya7EVOcWSqan5eN5ANGu5VS32Y8SLkb7e_WbTdnyX0i60f6dQkcqEP9hlGkDOEChVIxnvbS1z4U9OVaE09Mu5bWyaFlIt6RLsHY0rEg0o65fVOCfKv98mXPkKPUUeqkFBNk7e05gBkF1Xj9EKA1_9OTRcJgeg6qBQzjnOtMyUwIx4GH-vHWKcPZpSaPB5cTesKzwjDyoOi3kYXFfHrfQxaBCf7_pi03XfHemfjiyZC-DBSRXucNiZkXzRZlyWW-2L0DAoFbCbT9rGMUIEEyz-6CY99Efaerx9acuCnlrSK0WFDwnRoGnE%26u%3DaHR0cHMlM2ElMmYlMmZhZC5kb3VibGVjbGljay5uZXQlMmZzZWFyY2hhZHMlMmZsaW5rJTJmY2xpY2slM2ZsaWQlM2Q0MzcwMDA2NTQ0MTExNzgxOCUyNmRzX3Nfa3dnaWQlM2Q1ODcwMDAwNzI4ODg4MDE2NiUyNmRzX2FfY2lkJTNkNjg5NTk1NTg3JTI2ZHNfYV9jYWlkJTNkMTQxNTg5ODgzODclMjZkc19hX2FnaWQlM2QxMjk0MDI3MzE5MTAlMjZkc19hX2xpZCUzZGt3ZC00NzEyNjA2NDA5MjAlMjYlMjZkc19lX2FkaWQlM2Q3MjkxMTU4NDk4Njg5MCUyNmRzX2VfdGFyZ2V0X2lkJTNka3dkLTcyOTExOTc5OTc4MTU0JTNhbG9jLTE5MCUyNiUyNmRzX2VfbmV0d29yayUzZHMlMjZkc191cmxfdiUzZDIlMjZkc19kZXN0X3VybCUzZGh0dHBzJTNhJTJmJTJmMXBhc3N3b3JkLmNvbSUyZmRvd25sb2FkcyUyZndpbmRvd3MlM2Ztc2Nsa2lkJTNkYWRjN2IzMDg3MzdmMTlhZDM0ZTJkYjZjZDgyMzllNTIlMjZ1dG1fc291cmNlJTNkYmluZyUyNnV0bV9tZWRpdW0lM2RjcGMlMjZ1dG1fY2FtcGFpZ24lM2RVU19BTExfQnJhbmQlMjUyMDFQYXNzd29yZCUyNTIwKEJyb2FkKSUyNnV0bV90ZXJtJTNkZG93bmxvYWQlMjUyMDElMjUyMHBhc3N3b3JkJTI2dXRtX2NvbnRlbnQlM2RVU19BTExfQnJhbmQlMjUyMERvd25sb2FkJTI2bXNjbGtpZCUzZGFkYzdiMzA4NzM3ZjE5YWQzNGUyZGI2Y2Q4MjM5ZTUyJTI2dXRtX3NvdXJjZSUzZGJpbmclMjZ1dG1fbWVkaXVtJTNkY3BjJTI2dXRtX2NhbXBhaWduJTNkVVNfQUxMX0JyYW5kJTI1MjAxUGFzc3dvcmQlMjUyMChCcm9hZCklMjZ1dG1fdGVybSUzZGRvd25sb2FkJTI1MjAxJTI1MjBwYXNzd29yZCUyNnV0bV9jb250ZW50JTNkVVNfQUxMX0JyYW5kJTI1MjBEb3dubG9hZA%26rlid%3Dadc7b308737f19ad34e2db6cd8239e52&amp;vqd=4-306355973418738184198810633923179489263&amp;iurl=%7B1%7DIG%3D230C9A4B60AC4FA383560CF74DAB9503%26CID%3D25D92F82ED9067731131399FEC25667F%26ID%3DDevEx%2C5053.1">1Password desktop app</a> to AWS environments without SDKs or code changes. </li>
<li style="font-weight:400;">This addresses secret sprawl by providing a centralized management interface for credentials used in AWS applications.</li>
<li style="font-weight:400;">The integration leverages 1Password environments (beta), which provide project-specific scoping for secrets and use confidential computing to ensure secrets are never exposed as plaintext during sync operations. Teams can manage environment-specific credentials independently with built-in security controls.</li>
<li style="font-weight:400;">This marks the first deliverable under 1Password’s Strategic Collaboration Agreement with AWS, positioning it as a preferred secrets management solution for AWS customers. </li>
<li style="font-weight:400;">The integration is available to all 1Password tiers at no additional cost beyond existing subscriptions.</li>
<li style="font-weight:400;">Key use cases include streamlining deployments by automatically updating secrets in AWS applications, reducing operational bottlenecks through scoped access controls, and simplifying onboarding for new team members who can manage secrets without learning AWS-specific tools.</li>
<li style="font-weight:400;">While the current integration focuses on environment variables and secrets, developers requiring more complex workflows like AI agents accessing credit card data can still use 1Password service accounts with SDKs for custom implementations.</li>
</ul>
<p>1:17:44  Justin – “While, I think this is really cool, why couldn’t you just use Parameter Store, which is much cheaper?” </p>
<p>1:19:15 <a href="https://aws.amazon.com/about-aws/whats-new/2025/06/amazon-time-sync-nanosecond-hardware-packet-timestamps/">Amazon Time Sync Service now supports Nanosecond Hardware Packet </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/06/amazon-time-sync-nanosecond-hardware-packet-timestamps/">Timestamps – AWS</a></p>
<ul>
<li style="font-weight:400;">Amazon Time Sync Service now adds nanosecond-precision timestamps directly at the hardware level on supported EC2 instances, bypassing kernel and application delays for more accurate packet timing. </li>
<li style="font-weight:400;">This leverages the <a href="https://aws.amazon.com/ec2/nitro/">AWS Nitro</a> System’s reference clock to timestamp packets before they reach the software stack.</li>
<li style="font-weight:400;">The feature enables customers to determine exact packet order and fairness, measure one-way network latency, and increase distributed system transaction speeds with higher precision than most on-premises solutions. Financial trading systems and other latency-sensitive applications can now achieve microsecond-level accuracy in packet sequencing.</li>
<li style="font-weight:400;">Available in <a href="https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/configure-ec2-ntp.html#connect-to-the-ptp-hardware-clock">all regions where Amazon Time Sync Service’s PTP Hardware Clocks are supported</a>, the feature works on both virtualized and bare metal instances at no additional cost. Customers need only install the latest <a href="https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/enhanced-networking-ena.html">ENA Linux driver</a> to access timestamps through standard Linux socket APIs.</li>
<li style="font-weight:400;">This positions AWS as a strong contender for ultra-low latency workloads that traditionally required specialized on-premises hardware, particularly in financial services, where nanosecond precision can translate to competitive advantages in high-frequency trading and market data processing.</li>
<li style="font-weight:400;">The integration with existing Time Sync Service infrastructure means customers already using PTP Hardware Clocks can enable this feature without VPC configuration changes, making adoption straightforward for teams already invested in AWS time synchronization.</li>
</ul>
<p>1:20:22  Ryan – “I was super surprised when NASDAQ announced that they were moving their trading workloads into AWS… This is a key blocker to using cloud systems. And so it’s being able to not only process things at a very near time, but being able to audit the fairness and that you’re processing in a specific order is super important in those workloads and high trading volume – you’re talking billions of transactions a second. So I get why it’s important. And it was kind of neat to learn that and all the difficulties and all the work that goes into this. I’m sure this, I wonder if this is, was this available in 2022 just for NASDAQ?”</p>
<p>1:21:45 <a href="https://aws.amazon.com/about-aws/whats-new/2025/06/amazon-vpc-raises-default-route-table-capacity/">Amazon VPC raises default Route Table capacity – AWS</a></p>
<ul>
<li style="font-weight:400;">AWS VPC increases the default route table capacity from 50 to 500 entries, eliminating the need for manual limit increase requests that previously created administrative overhead for customers managing complex network architectures.</li>
<li style="font-weight:400;">This 10x capacity increase directly benefits organizations using multiple network paths for traffic inspection, firewall insertion, or connecting to various gateways like transit gateway, VPN, or peering connections.</li>
<li style="font-weight:400;">The change applies automatically to all existing and new VPCs across commercial and GovCloud regions, though accounts with existing quota overrides will maintain their current settings.</li>
<li style="font-weight:400;">Network architects can now build more sophisticated routing topologies without hitting limits, particularly useful for hub-and-spoke designs or multi-region deployments that require granular traffic control.</li>
<li style="font-weight:400;">While there’s no additional cost for the increased capacity, customers should review their route table configurations as more complex routing rules may impact network performance if not properly optimized.</li>
</ul>
<p>1:22:17  Justin – “I don’t want to be in a situation where I’m managing 500 entries across multiple VPCs, even with things like Transit Gateway that make these things easier. I don’t want to do this.”</p>
<p>1:26:29 <a href="https://www.aboutamazon.com/news/aws/aws-project-rainier-ai-trainium-chips-compute-cluster?utm_source=ecsocial&amp;utm_medium=linkedin&amp;utm_term=36">AWS’s Project Rainier: the world’s most powerful computer for training AI</a></p>
<ul>
<li style="font-weight:400;">AWS <a href="https://www.aboutamazon.com/news/aws/aws-project-rainier-ai-trainium-chips-compute-cluster">Project Rainier</a> creates the world’s most powerful AI training computer using tens of thousands of <a href="https://aws.amazon.com/blogs/aws/amazon-ec2-trn2-instances-and-trn2-ultraservers-for-aiml-training-and-inference-is-now-available/">Trainium2 UltraServers</a> spread across multiple US data centers, providing <a href="https://www.anthropic.com/">Anthropic</a> 5x more computing power than their current largest cluster for training <a href="https://www.anthropic.com/news/claude-2">Claude</a> models.</li>
<li style="font-weight:400;">The system uses custom Trainium2 chips capable of trillions of calculations per second, connected via NeuronLinks within 64-chip UltraServers and EFA networking across data centers to minimize latency and maximize training throughput.</li>
<li style="font-weight:400;">AWS’s vertical integration from chip design through data center infrastructure enables rapid optimization across the entire stack, while new cooling and power efficiency measures reduce mechanical energy consumption by up to 46% and embodied carbon in concrete by 35%.</li>
<li style="font-weight:400;">Project Rainier establishes a template for deploying computational power at unprecedented scale, enabling AI breakthroughs in medicine, climate science, and other complex domains that require massive training resources.</li>
<li style="font-weight:400;">The infrastructure maintains AWS’s industry-leading water efficiency at 0.15 liters per kilowatt-hour (less than half the industry average) through innovations like seasonal air cooling that eliminates water use entirely during cooler months.</li>
</ul>
<p>1:28:13 <a href="https://aws.amazon.com/about-aws/whats-new/2025/06/ga-accelerate-troubleshooting-amazon-cloudwatch-investigations/">Now in GA: Accelerate troubleshooting with Amazon CloudWatch </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/06/ga-accelerate-troubleshooting-amazon-cloudwatch-investigations/">investigations – AWS</a></p>
<ul>
<li style="font-weight:400;">CloudWatch investigations uses an AI agent to automatically identify anomalies, surface related signals, and suggest root cause hypotheses across your AWS environment, reducing mean time to resolution at no additional cost.</li>
<li style="font-weight:400;">You can trigger investigations from any <a href="https://docs.aws.amazon.com/AmazonCloudWatch/latest/monitoring/WhatIsCloudWatch.html">CloudWatch</a> widget, 80+ AWS consoles, CloudWatch alarms, or <a href="https://aws.amazon.com/q/">Amazon Q chat</a>, with results accessible through <a href="https://slack.com/signin">Slack</a> and <a href="https://www.microsoft.com/en-us/microsoft-teams/log-in">Microsoft Teams</a> for team collaboration.</li>
<li style="font-weight:400;">The service provides remediation suggestions by surfacing relevant <a href="https://docs.aws.amazon.com/systems-manager/latest/userguide/what-is-systems-manager.html">AWS Systems Manager</a> Automation runbooks, AWS re: Post articles, and documentation for common operational issues.</li>
<li style="font-weight:400;">This was previously in preview as <a href="https://aws.amazon.com/blogs/aws/investigate-and-remediate-operational-issues-with-amazon-q-developer/">Amazon Q Developer operational investigations</a> and is now GA in 12 regions, including US East, Europe, and Asia Pacific.</li>
<li style="font-weight:400;">The integration across AWS services and communication channels addresses a key pain point in cloud operations where teams struggle to correlate signals across distributed systems during incidents.</li>
</ul>
<p>1:28:33  Justin – “I did see this button in my console recently and I did push it to see what it was. It has not put me out of a job, I’m still smarter than it, but it’s pretty cool.”</p>
<h2>GCP</h2>
<p>1:30:49 <a href="https://cloud.google.com/blog/products/ai-machine-learning/gemini-2-5-flash-lite-flash-pro-ga-vertex-ai/">Gemini 2.5 Updates: Flash/Pro GA, SFT, Flash-Lite on Vertex AI | Google </a><a href="https://cloud.google.com/blog/products/ai-machine-learning/gemini-2-5-flash-lite-flash-pro-ga-vertex-ai/">Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google’s <a href="https://console.cloud.google.com/vertex-ai/studio/multimodal?model=gemini-2.5-flash">Gemini 2.5 Flash</a> and <a href="https://console.cloud.google.com/vertex-ai/studio/multimodal?model=gemini-2.5-pro">Pro</a> models are now generally available on Vertex AI, with Flash optimized for high-throughput tasks like summarization and data extraction while Pro handles complex reasoning and code generation. 
<ul>
<li style="font-weight:400;">The GA release provides production-ready stability for enterprise deployments.</li>
</ul>
</li>
<li style="font-weight:400;">New <a href="https://console.cloud.google.com/vertex-ai/studio/multimodal?model=gemini-2.5-flash-lite-preview-06-17">Gemini 2.5 Flash-Lite</a> enters public preview as Google’s most cost-effective model, running 1.5x faster than 2.0 Flash at lower cost, targeting high-volume workloads like classification and translation. </li>
<li style="font-weight:400;">This positions Google competitively against AWS Bedrock’s lighter models and Azure’s economy tier offerings.</li>
<li style="font-weight:400;"><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/models/gemini-supervised-tuning">Supervised Fine-Tuning</a> for Gemini 2.5 Flash is now GA, allowing enterprises to customize the model with their own datasets and terminology. This addresses a key enterprise requirement for domain-specific AI that competitors have been pushing with their fine-tuning capabilities.</li>
<li style="font-weight:400;">The <a href="https://console.cloud.google.com/vertex-ai/studio/multimodal-live">Live API</a> with native audio-to-audio capabilities enters public preview, enabling real-time voice applications without intermediate text conversion. This streamlines development of voice agents and interactive AI systems, competing directly with OpenAI’s real-time API offerings.</li>
<li style="font-weight:400;">Pricing reflects the tiered approach with Flash-Lite for cost-sensitive workloads, Flash for balanced performance, and Pro for advanced tasks. Complete pricing details available at cloud.google.com/vertex-ai/generative-ai/pricing.</li>
</ul>
<p>1:33:25 <a href="https://cloud.google.com/blog/products/storage-data-transfer/backup-vaults-add-support-for-disk-backup-and-multi-region/">Backup vaults add support for disk backup and multi-region | Google </a><a href="https://cloud.google.com/blog/products/storage-data-transfer/backup-vaults-add-support-for-disk-backup-and-multi-region/">Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/blog/products/storage-data-transfer/backup-and-dr-service-adds-immutable-indelible-backups?e=0">Google Cloud Backup vaults</a>\ now support standalone Persistent Disk and Hyperdisk backups in preview, enabling granular disk-level protection without backing up entire VMs. This provides cost optimization for scenarios where full VM backups aren’t necessary while maintaining immutable and indelible protection against ransomware.</li>
<li style="font-weight:400;">Multi-region backup vaults are now generally available, storing backup data across multiple geographic regions to maintain accessibility during regional outages. This addresses business continuity requirements that AWS Backup doesn’t currently offer with its single-region vault limitation.</li>
<li style="font-weight:400;">Backup vaults create a logically air-gapped environment in Google-managed projects where backups cannot be modified or deleted during enforced retention periods, even by backup administrators. 
<ul>
<li style="font-weight:400;">This goes beyond traditional backup solutions by preventing malicious actors from corrupting recovery points.</li>
</ul>
</li>
<li style="font-weight:400;">The service provides unified management across Compute Engine VMs, Persistent Disks, and Hyperdisks with integration to Security Command Center for anomaly detection. 
<ul>
<li style="font-weight:400;">This consolidation reduces operational complexity compared to managing separate backup solutions for different resource types.</li>
</ul>
</li>
<li style="font-weight:400;">Key use cases include protecting database disks, file shares, and application data where granular recovery is needed. Financial services and healthcare organizations requiring immutable backups for compliance will benefit from the enforced retention capabilities.</li>
<li style="font-weight:400;">Backups. Woo!</li>
</ul>
<p>1:34:54 <a href="https://cloud.google.com/blog/products/business-intelligence/introducing-continuous-integration-for-looker/">Introducing Continuous Integration for Looker | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google introduces <a href="https://cloud.google.com/looker/docs/continuous-integration">Continuous Integration for Looker</a>, bringing software development best practices to BI workflows by automatically testing <a href="https://cloud.google.com/looker/docs/what-is-lookml">LookML</a> code changes before production deployment to catch data inconsistencies and broken dependencies early.</li>
<li style="font-weight:400;">The feature includes validators that flag upstream SQL changes breaking Looker definitions, identify dashboards referencing outdated LookML, and check for code errors and antipatterns – addressing scalability challenges as organizations expand their Looker usage across teams.</li>
<li style="font-weight:400;">Developers can manage CI test suites, runs, and configurations directly within Looker’s UI, with options to trigger tests manually, via pull requests, or on schedules – similar to how <a href="https://docs.aws.amazon.com/quicksight/latest/user/welcome.html">AWS QuickSight</a> handles version control but with deeper integration into the development workflow.</li>
<li style="font-weight:400;">This positions Looker more competitively against Microsoft Power BI’s deployment pipelines and <a href="https://www.tableau.com/">Tableau’s</a> version control features, particularly for enterprises requiring robust data governance and reliability across multiple data sources.</li>
<li style="font-weight:400;">Currently available in preview with no pricing details announced, the feature targets organizations with complex data environments where manual testing of BI assets becomes impractical as teams scale.</li>
</ul>
<p>1:36:29  Ryan – “I think this is kind of neat, and I do really like the scalability. It looks like there’s AI built into it to detect issues because that’s also a thing. Like this dashboard works great on my dataset that I started with, and then you start expanding out the use case and all of a sudden those graphs no load.”</p>
<p>1:38:53 <a href="https://cloud.google.com/blog/products/networking/run-service-extensions-plugins-with-cloud-cdn/">Run Service Extensions plugins with Cloud CDN | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/cdn">Google Cloud CDN</a> now supports Service Extensions plugins, allowing customers to run custom WebAssembly code at the edge across 200+ points of presence for request/response manipulation and custom logic execution.</li>
<li style="font-weight:400;">The feature enables edge computing use cases like custom traffic steering, cache optimization, header manipulation, and security policies, competing directly with AWS <a href="https://docs.aws.amazon.com/AmazonCloudFront/latest/DeveloperGuide/lambda-at-the-edge.html">Lambda@Edge</a> and Cloudflare Workers but integrated natively with Cloud CDN.</li>
<li style="font-weight:400;">Plugins support multiple languages including <a href="https://www.rust-lang.org/learn">Rust</a>, C++, and Go, execute with single-millisecond startup times, and run in sandboxed environments using the open-source <a href="https://github.com/proxy-wasm">Proxy-Wasm</a> API standard.</li>
<li style="font-weight:400;"><a href="https://cloudinary.com/">Cloudinary</a> has already integrated their image and video optimization solution as a packaged Wasm plugin, demonstrating partner ecosystem adoption for media-heavy workloads requiring dynamic content transformation.</li>
<li style="font-weight:400;">Developers can choose between edge extensions (before CDN cache) or traffic extensions (after cache, closer to origin), providing flexibility in where custom code executes in the request path.</li>
</ul>
<h2>Azure</h2>
<p>1:40:23 <a href="https://arstechnica.com/science/2025/06/microsoft-lays-out-its-path-to-useful-quantum-computing/">Microsoft lays out its path to useful quantum computing – Ars Technica</a></p>
<ul>
<li style="font-weight:400;">Microsoft Azure Quantum announced a quantum error correction scheme that can improve hardware qubit error rates from 1 in 1,000 to logical qubit error rates of 1 in 1 million, though this is based on mathematical proofs and simulations rather than demonstrated hardware performance.</li>
<li style="font-weight:400;">Azure’s approach differs from IBM’s fixed-layout quantum chips by supporting multiple hardware technologies including movable atom-based qubits from partners like Atom Computing and Quantinuum, allowing more flexible error correction implementations.</li>
<li style="font-weight:400;">The platform-agnostic strategy positions Azure Quantum as a multi-vendor quantum computing marketplace rather than a single-hardware solution, giving customers access to different quantum technologies through one service.</li>
<li style="font-weight:400;">While IBM designs both hardware and software for their quantum systems, Microsoft focuses on the software stack for error correction that works across various partner hardware platforms, potentially offering more choice but less optimization.</li>
<li style="font-weight:400;">Enterprise customers interested in quantum computing can evaluate different hardware approaches through Azure without committing to a single technology, though practical quantum applications remain years away pending actual hardware demonstrations of the error correction scheme.</li>
</ul>
<p>1:40:59  Ryan – “I look forward to – like our earlier comments about not getting into AI early enough and missing out on the hundred million day payday – I’m going to do the same thing when it comes to quantum computing and be like ‘they’re going to get all this money for the quantum computer scientists.’ If only I would have not been able to stay awake while I was reading through one of these articles. It’s so dense.”</p>
<p>1:41:55 <a href="https://blog.fabric.microsoft.com/en-GB/blog/introducing-mcp-support-for-real-time-intelligence-rti/">Introducing MCP Support for Real-Time Intelligence (RTI)  | Microsoft </a><a href="https://blog.fabric.microsoft.com/en-GB/blog/introducing-mcp-support-for-real-time-intelligence-rti/">Fabric Blog | Microsoft Fabric</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aka.ms/fabricrti">Microsoft Fabric Real-Time Intelligence</a> now supports <a href="https://modelcontextprotocol.io/introduction?wt.mc_id=studentamb_263805">Model Context Protocol</a> (MCP), enabling AI models like Azure OpenAI to query real-time data using natural language that gets translated into KQL queries. </li>
<li style="font-weight:400;">This open-source integration allows developers to connect AI agents to Eventhouse and Azure Data Explorer for immediate data analysis.</li>
<li style="font-weight:400;">The <a href="https://aka.ms/rti.mcp.repo">MCP server</a> acts as a bridge between AI applications (GitHub Copilot, Claude, Cline) and Microsoft’s real-time data platforms, providing schema discovery, data sampling, and query execution capabilities. 
<ul>
<li style="font-weight:400;">Installation requires VS Code with GitHub Copilot extensions and can be deployed via pip package microsoft-fabric-rti-mcp.</li>
</ul>
</li>
<li style="font-weight:400;">Current support focuses on <a href="https://aka.ms/eventhouse">Eventhouse</a> KQL queries with planned expansions to Digital Twin Builder, Eventstreams, and Activator integration for proactive insights. This positions Microsoft against AWS’s real-time analytics offerings by providing a standardized protocol for AI-to-data interactions.</li>
<li style="font-weight:400;">Target use cases include real-time threat detection, operational monitoring, and automated decision-making where AI agents need immediate access to streaming data. The natural language interface removes the KQL learning curve for business users while maintaining query optimization.</li>
<li style="font-weight:400;">The architecture follows a modular client-server model where MCP hosts (AI models) communicate through MCP clients to lightweight MCP servers, enabling plug-and-play integration with minimal configuration. No specific pricing mentioned, but leverages existing Fabric RTI infrastructure costs.</li>
</ul>
<p>1:42:19 <a href="https://devblogs.microsoft.com/devops/azure-devops-mcp-server-public-preview/">Azure DevOps MCP Server, Public Preview – Azure DevOps Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://github.com/microsoft/azure-devops-mcp">Azure DevOps MCP Server</a> enables <a href="https://github.com/microsoft/azure-devops-mcp/blob/main/README.md">GitHub Copilot</a> in VS Code and <a href="https://visualstudio.microsoft.com/">Visual Studio</a> to access <a href="https://azure.microsoft.com/en-us/products/devops/">Azure DevOps</a> data including work items, pull requests, test plans, builds, and wikis, running locally to keep private data within your network.</li>
<li style="font-weight:400;">The Model Context Provider acts as a bridge between AI assistants and Azure DevOps, injecting real-time project context into LLM prompts for more accurate and relevant responses specific to your development environment.</li>
<li style="font-weight:400;">Currently supports only Azure DevOps Services (cloud) with on-premises Azure DevOps Server support not planned for several months due to missing API availability, which may limit adoption for enterprise customers with on-prem requirements.</li>
<li style="font-weight:400;">Setup requires Azure CLI authentication and local configuration file modifications, positioning this as a developer-focused tool rather than a managed service like AWS CodeWhisperer or Google’s Duet AI integrations.</li>
<li style="font-weight:400;">The local-only architecture addresses data sovereignty concerns but lacks the scalability of cloud-based alternatives, making it suitable for individual developers or small teams rather than enterprise-wide deployments.</li>
</ul>
<p>1:43:38  Ryan – “You could argue that using AI for vibe coding is TDD because you’re basically stating the outcome you want, almost an assertion and telling it, go do this thing. It’s not exactly the same, I know.”</p>
<p>1:44:08 <a href="https://techcommunity.microsoft.com/blog/machinelearningblog/cohere-models-now-available-on-managed-compute-in-azure-ai-foundry-models/4423428">Cohere Models Now Available on Managed Compute in Azure AI Foundry </a><a href="https://techcommunity.microsoft.com/blog/machinelearningblog/cohere-models-now-available-on-managed-compute-in-azure-ai-foundry-models/4423428">Models | Microsoft Community Hub</a></p>
<ul>
<li style="font-weight:400;">Azure AI Foundry now offers Cohere’s Command A, Rerank 3.5, and Embed 4 models through Managed Compute, allowing customers to deploy these models using their own Azure GPU quota with hourly pricing ranging from $2.94 to $17.125 per instance.</li>
<li style="font-weight:400;">This deployment option provides infrastructure flexibility with A10, A100, and H100 GPU choices while maintaining enterprise features like VNet support, private endpoints, and scaling policies – addressing a gap where models weren’t available through standard pay-per-token endpoints.</li>
<li style="font-weight:400;">The pricing model compensates Cohere directly through usage fees while giving customers control over their compute infrastructure, similar to AWS SageMaker’s bring-your-own-model approach but with integrated billing for third-party models.</li>
<li style="font-weight:400;">Target use cases include RAG implementations with Rerank 3.5, vector search applications using Embed 4, and advanced reasoning tasks with Command A, making this particularly relevant for enterprises building production GenAI applications.</li>
<li style="font-weight:400;">This positions Azure competitively against AWS Bedrock and Google Vertex AI by expanding model availability beyond first-party offerings while simplifying deployment complexity for customers who need specific GPU configurations or network isolation.</li>
</ul>
<p>1:44:20 <a href="https://learn.microsoft.com/en-us/azure/azure-functions/opentelemetry-howto?tabs=app-insights&amp;pivots=programming-language-csharp">Use OpenTelemetry with Azure Functions | Microsoft Learn</a></p>
<ul>
<li style="font-weight:400;">Azure Functions now supports OpenTelemetry in preview, enabling standardized telemetry export to any OpenTelemetry-compliant endpoint beyond just Application Insights. </li>
<li style="font-weight:400;">This gives developers flexibility to use their preferred observability platforms while maintaining correlation between host and application traces.</li>
<li style="font-weight:400;">The implementation requires configuration at both the host level (host.json) and application code level, with language-specific SDKs available for C#, Node.js, Python, and PowerShell. Java support is notably absent, and C# in-process apps aren’t supported yet.</li>
<li style="font-weight:400;">This positions Azure Functions closer to <a href="https://docs.aws.amazon.com/lambda/latest/dg/services-xray.html">AWS Lambda’s X-Ray</a> integration and <a href="https://cloud.google.com/functions">GCP Cloud Functions</a>‘ native OpenTelemetry support, though Azure’s implementation is still catching up with limited trigger support (only HTTP, Service Bus, and Event Hub triggers currently work).</li>
<li style="font-weight:400;">The feature addresses vendor lock-in concerns by allowing telemetry data to flow to multiple endpoints simultaneously – both Application Insights and OTLP exporters can receive data when configured, useful for organizations transitioning between monitoring solutions.</li>
<li style="font-weight:400;">Current limitations include no log streaming support in Azure portal when OpenTelemetry is enabled and no support for managed dependencies in PowerShell on Flex Consumption plans, suggesting this is best suited for greenfield projects rather than migrations.</li>
</ul>
<p>1:44:48  Justin – “OTel should just be default Azure. Come on.”</p>
<p>1:45:26 <a href="https://techcommunity.microsoft.com/blog/azuresqlblog/public-preview---data-virtualization-for-azure-sql-database/4413834">Public Preview – Data Virtualization for Azure SQL Database | Microsoft </a><a href="https://techcommunity.microsoft.com/blog/azuresqlblog/public-preview---data-virtualization-for-azure-sql-database/4413834">Community Hub</a></p>
<ul>
<li style="font-weight:400;">Azure SQL Database now supports data virtualization in public preview, enabling direct T-SQL queries against CSV, Parquet, and Delta files stored in Azure Data Lake Storage Gen2 or Azure Blob Storage without ETL processes or data duplication. This brings PolyBase-like capabilities from SQL Server 2022 to Azure SQL Database.</li>
<li style="font-weight:400;">The feature supports three authentication methods (Managed Identity, User Identity, and SAS tokens) and allows organizations to offload cold data to cheaper storage while maintaining query access through standard SQL commands. This addresses the common challenge of balancing storage costs with data accessibility.</li>
<li style="font-weight:400;">Unlike AWS Redshift Spectrum or BigQuery external tables, Azure’s implementation leverages familiar T-SQL syntax and integrates seamlessly with existing SQL Server security models, making it easier for SQL Server shops to adopt without learning new query languages.</li>
<li style="font-weight:400;">Primary use cases include archiving historical data to reduce database storage costs, creating data lakes accessible via SQL, and enabling real-time analytics across multiple data sources without complex data pipelines. The feature is currently available in select regions with broader rollout planned.</li>
<li style="font-weight:400;">Cost implications are significant as organizations can store infrequently accessed data in blob storage (starting at $0.00099/GB/month for cool tier) versus Azure SQL Database storage (starting at $0.115/GB/month), while maintaining query capabilities through external tables.</li>
</ul>
<p>1:47:43 <a href="https://info.microsoft.com/index.php/email/emailWebview?email=MTU3LUdRRS0zODIAAAGbSJ7jMG7a9EsZLhaBn0wB-O1dMi00fRQw_As7XJYhRxdi3Ze5uIjgal9a0rv64-UitDVAca84dFZwgMQQOFVv-EljlqTyNNa2cpRs">Microsoft Ignite – Nov 18-21 2025</a> </p>
<ul>
<li style="font-weight:400;">Microsoft Ignite 2025 will be held in person in San Francisco from November 18-21, focusing on AI, infrastructure, security, and emerging technologies with hands-on labs and product demonstrations.</li>
<li style="font-weight:400;">In-person attendees receive complimentary Microsoft and GitHub certification exams on-site, providing cost savings of $165-330 per exam while validating skills in Azure and development technologies.</li>
<li style="font-weight:400;">The conference timing aligns with Microsoft’s typical fall product announcement cycle, positioning it as a key venue for Azure roadmap updates and new service launches ahead of re: Invent.</li>
<li style="font-weight:400;">Early registration opening suggests Microsoft expects high demand following the shift back to in-person events, with the San Francisco location providing better West Coast accessibility compared to previous Orlando venues.</li>
<li style="font-weight:400;">The dual focus on AI and infrastructure indicates Microsoft will likely showcase Azure AI services integration with traditional cloud workloads, competing directly with AWS’s AI/ML portfolio announcements.</li>
<li style="font-weight:400;">THEY ARE RIDICULOUSLY PROUD OF THIS CONFERENCE $2325 – and that’s the early bird price! 
<ul>
<li style="font-weight:400;">NO. </li>
<li style="font-weight:400;">But also, no. </li>
</ul>
</li>
</ul>
<h2>Oracle</h2>
<p>1:50:37 <a href="https://www.oracle.com/news/announcement/xais-grok-models-are-now-on-oracle-cloud-infrastructure-2025-06-17">xAI’s Grok Models are Now on Oracle Cloud Infrastructure</a></p>
<ul>
<li style="font-weight:400;">Oracle now offers <a href="https://x.ai/">xAI’s</a> Grok models through <a href="https://www.oracle.com/artificial-intelligence/">OCI Generative AI</a> service, marking Oracle’s entry into hosting third-party foundation models alongside AWS Bedrock and Azure OpenAI Service, though arriving significantly later to this market segment.</li>
<li style="font-weight:400;">The partnership leverages OCI’s bare metal GPU instances for training and inference, with Oracle emphasizing price-performance advantages – a claim worth scrutinizing given AWS and GCP’s established dominance in AI infrastructure and economies of scale.</li>
<li style="font-weight:400;">xAI promises zero data retention endpoints for enterprise customers, addressing a key concern for regulated industries, though implementation details and compliance certifications remain unclear compared to established enterprise AI offerings.</li>
<li style="font-weight:400;">Windstream’s exploration of Grok models for telecommunications workflows represents a practical use case, but adoption may be limited to existing Oracle customers already invested in OCI infrastructure rather than attracting new cloud customers.</li>
<li style="font-weight:400;">While Grok 3 claims advanced reasoning capabilities in mathematics and coding, the lack of public benchmarks or comparisons to GPT-4, Claude, or Gemini models makes it difficult to assess its actual competitive positioning in the enterprise AI market.</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, Slack team, send feedback or ask questions at theCloudPod.net or tweet at us with hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2080936/c1e-7nkns997znu2860x-7z3qzjvgswqq-2ryd26.mp3" length="160201756"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 310 of The Cloud Pod – where the forecast is always cloudy! Matt, Ryan and Justin are here to bring you all the latest and greatest in cloud and AI news. 
Literally. 
All of it. 
This week we have announcements from re:Inforce, Manual Testing, GuardDuty, Government AI (what could go wrong?) Gemini 2.5 and, in a flash from the past, MS-DOS Editor. All this and more, this week in the cloud! 
Titles we almost went with this week:

ACM Finally Lets Its Certificates Leave the Nest
Breaking Free: AWS Certificates Get Their Export Papers
Certificate Manager Learns to Share Its Private Keys
Skynet’s Origin Story: We Bullied It Into Existence
Claude and Present Danger: When AI Fights Back
Breaking Up is Hard to GPU
EKS Marks the Spot for GuardDuty’s New Detection Powers
Kubernetes Security: GuardDuty Connects the Dots
Hub, Hub, Hooray for Unified Security
Security Hub 2: Electric Boogaloo
All Your Security Findings Are Belong to One Dashboard
GuardDuty’s EKS-cellent Adventure in Attack Detection
Shield Me From My Own Bad Decisions
AWS Plays Network Security Whack-a-Mole
Your VPC Called – It Wants Better Security Groups
Permission Impossible: Your Express App Will Self-Authorize in 5 Minutes
Breaking the Glass: AWS Backup Gets a Multi-Party System
Gemini 2.5: Now With More Flash and Less Cash
AI Goes to Washington
GPT-4: Government Property Taxpayer-funded
DDoS and Don’ts: A 45-Second Horror Story
Google’s AI Models Get a Flash-y Upgrade (Lite on the Wallet)
Flash Gordon Called – He Wants His Speed Back
From Flash to Flash-Lite: Google’s AI Diet Plan
Looker’s Pipeline Dreams Come True
MS-DOS Editor: The Reboot Nobody Asked For But Everyone Needed
Control-Alt-Delete Your Expectations: Microsoft Brings DOS to Linux
Microsoft’s Text Editor Time Machine Now Runs on Your Toaster
Copilot Gets Its Agent License
Visual Studio’s AI Agent: Now Taking Orders
The Bridge Over Troubled Prompts
Azure’s Managed Compute Gets More Coherent
Bring Your Own GPU Party: Cohere Models Join the Azure Bash
Function Telemetry Gets Open Sourced (Kind Of)
Azure Functions: Now Speaking Everyone’s Language (Except Java)
Bucket List: AWS Makes S3 Policy Monitoring a Breeze
The Policy Police: Keeping Your S3 Buckets in Check
CDK Gets Its Own Town Hall (Infrastructure Not Included)
Breaking: AWS Discovers Zoom, Plans to Use It Twice Per Quarter
AWS and 1Password: A Secret Love Affair
Keeping Secrets Has Never Been This Public
Nano Nano: AWS Brings Alien-Level Time Precision to EC2
Time Flies When You’re Having Nanoseconds
WorkSpaces Core: Now With More Cores to Work With
Mount Compute-ier: AWS Builds AI Training Peak
Making it Rain(ier): AWS Showers Anthropic with 5x More Compute
Cache Me If You Can: Google’s Plugin Play
CSI: Cloud Services Investigation

General News 
01:09 Defending the Internet: How Cloudflare blocked a monumental 7.3 Tbps DDoS attack

Cloudflare blocked a record-breaking 7.3 Tbps DDoS attack in May 2025, which delivered 37.4 TB of data in just 45 seconds – equivalent to streaming 7,480 hours of HD video or downloading 9.35 million songs in under a minute.
The attack originate...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2080936/c1a-k5d5-gp34p6n1ud1k-pezjxe.jpg"></itunes:image>
                                                                            <itunes:duration>01:51:15</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2080936/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[309: Microsoft tries to give away cloud services for free, sadly, it's only SQL]]>
                </title>
                <pubDate>Thu, 26 Jun 2025 21:05:14 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2076502</guid>
                                    <link>https://tcpfm.castos.com/episodes/309-microsoft-tries-to-give-away-cloud-services-for-free-sadly-its-only-sql</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 308 of The Cloud Pod – where the forecast is always cloudy! Justin and Matt are on hand and ready to bring you an action packed episode. Unfortunately, this one is also lullaby free. Apologies. This week we’re talking about Databricks and Lakebridge, Cedar Analysis, Amazon Q, Google’s little hiccup, and updates to SQL – plus so much more! Thanks for joining us. </p>
<h3>Titles we almost went with this week:</h3>
<ul>
<li>KV Phone Home: When Your Key-Value Store Goes AWOL</li>
<li>When Your Coreless Service Finds Its Core Problem</li>
<li>Oracle’s Vanity Fair: Pretty URLs for Pretty Penny</li>
<li>From Warehouse to Lakehouse: Your Free Ticket to Cloud Town</li>
<li>1⃣Databricks Uno: Because One is the Loneliest Number</li>
<li>Free as in Beer, Smart as in Data Science</li>
<li>Cedar Analysis: Because Your Authorization Policies Wood Never Lie</li>
<li>Cedar Analysis: Teaching Old Policies New Proofs</li>
<li>Amazon Q Finally Learns to Talk to Other Apps</li>
<li>Tomorrow: Visual Studio’s Predictive Edit Revolution</li>
<li>The Ghost of Edits Future: AI Haunts Your Code Before You Write It</li>
<li>IAM What IAM: Google’s Identity Crisis Breaks the Internet</li>
<li>Permission Denied: The Day Google Forgot Who Everyone Was</li>
<li>403 Forbidden: When Google’s Bouncer Called in Sick</li>
<li>AWS Brings the Heat to Fusion Research</li>
<li>Larry’s Cloud Nine: Oracle Stock Soars on Forecast Raise</li>
<li>OCI You Later: Oracle Bets Big on Cloud Growth</li>
<li>Oracle’s Crystal Ball Shows 40% Cloud Growth Ahead</li>
<li>Meta Scales Up Its AI Ambitions with $14 Billion Investment</li>
<li>From FAIR to Scale: Meta’s $14 Billion AI Makeover</li>
<li>Congratulations Databricks one, you are now the new low code solution. </li>
<li>AWS burns power to figure out how power works</li>
</ul>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>02:12 <a href="https://www.cnbc.com/2025/06/10/zuckerberg-makes-metas-biggest-bet-on-ai-14-billion-scale-ai-deal.html?utm_source=tldrnewsletter">Zuckerberg makes Meta’s biggest bet on AI, $14 billion Scale AI deal</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.cnbc.com/quotes/META/">Meta</a> is finalizing a $14 billion investment for a 49% stake in Scale AI, with CEO Alexandr Wang joining to lead a new AI research lab at Meta. </li>
<li style="font-weight:400;">This follows similar moves by <a href="https://www.cnbc.com/quotes/GOOGL/">Google</a> and <a href="https://www.cnbc.com/quotes/MSFT/">Microsoft</a> acquiring AI talent through investments rather than direct acquisitions to avoid regulatory scrutiny.</li>
<li style="font-weight:400;"><a href="https://scale.com/">Scale AI</a> specializes in data labeling and annotation services critical for training AI models, serving major clients including <a href="https://openai.com/">OpenAI</a>, Google, Microsoft, and Meta. </li>
<li style="font-weight:400;">The company’s expertise covers approximately 70% of all AI models being built, providing Meta with valuable intelligence on competitor approaches to model development.</li>
<li style="font-weight:400;">The deal reflects Meta’s struggles with its <a href="https://www.llama.com/">Llama AI models</a>, particularly the underwhelming reception of <a href="https://ai.meta.com/blog/llama-4-multimodal-intelligence/">Llama 4</a> and delays in releasing the more powerful “Behemoth” model due to concerns about competitiveness with OpenAI and <a href="https://www.deepseek.com/en">DeepSeek</a>. Meta recently reorganized its GenAI unit into two divisions following these setbacks.</li>
<li style="font-weight:400;">Wang brings both technical AI expertise and business acumen, having built Scale AI from a 2016 startup to a $14 billion valuation. His experience includes defense contracts and the recent <a href="https://scale.com/blog/defense-llama">Defense Llama</a> collaboration with Meta for national security applications.</li>
<li style="font-weight:400;">For cloud providers and dev...</li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:00) - The Cloud Pod: Episode 309</li><li>(00:01:16) - Meta Completes $14 Million Investment in Scale AI</li><li>(00:06:34) - Databricks Free Edition, SQL Migration and More</li><li>(00:09:28) - WAF and Q&D: Cloud Computing</li><li>(00:17:52) - AWS Power Tools for AWS Lambda</li><li>(00:22:39) - Google IAM System Failure Causes widespread Outage</li><li>(00:27:00) - Cloudflare Outage Highlights Storage Provider's Failure</li><li>(00:31:14) - Google's Credential Scanner for Open Source</li><li>(00:33:45) - Google Cloud Location Finder: Single API for Cloud Regions</li><li>(00:35:33) - Google Cloud G4VMS and G4S: New Inst</li><li>(00:37:23) - Microsoft Cross Tenant Customer Managed Keys for SSD v2 &</li><li>(00:39:48) - Microsoft Cloud: Azure Cost Management, Next Edit suggestions in Visual Studio</li><li>(00:43:41) - Oracle's Cloud Services: Growing 16%</li><li>(00:46:53) - Oracle to Offer AMD Instinct GPUs on OCI</li><li>(00:48:03) - Oracle Allows Custom Domains for Autonomous Database</li><li>(00:50:01) - Cloud: Episode 1</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 308 of The Cloud Pod – where the forecast is always cloudy! Justin and Matt are on hand and ready to bring you an action packed episode. Unfortunately, this one is also lullaby free. Apologies. This week we’re talking about Databricks and Lakebridge, Cedar Analysis, Amazon Q, Google’s little hiccup, and updates to SQL – plus so much more! Thanks for joining us. 
Titles we almost went with this week:

KV Phone Home: When Your Key-Value Store Goes AWOL
When Your Coreless Service Finds Its Core Problem
Oracle’s Vanity Fair: Pretty URLs for Pretty Penny
From Warehouse to Lakehouse: Your Free Ticket to Cloud Town
1⃣Databricks Uno: Because One is the Loneliest Number
Free as in Beer, Smart as in Data Science
Cedar Analysis: Because Your Authorization Policies Wood Never Lie
Cedar Analysis: Teaching Old Policies New Proofs
Amazon Q Finally Learns to Talk to Other Apps
Tomorrow: Visual Studio’s Predictive Edit Revolution
The Ghost of Edits Future: AI Haunts Your Code Before You Write It
IAM What IAM: Google’s Identity Crisis Breaks the Internet
Permission Denied: The Day Google Forgot Who Everyone Was
403 Forbidden: When Google’s Bouncer Called in Sick
AWS Brings the Heat to Fusion Research
Larry’s Cloud Nine: Oracle Stock Soars on Forecast Raise
OCI You Later: Oracle Bets Big on Cloud Growth
Oracle’s Crystal Ball Shows 40% Cloud Growth Ahead
Meta Scales Up Its AI Ambitions with $14 Billion Investment
From FAIR to Scale: Meta’s $14 Billion AI Makeover
Congratulations Databricks one, you are now the new low code solution. 
AWS burns power to figure out how power works

AI Is Going Great – Or How ML Makes Money 
02:12 Zuckerberg makes Meta’s biggest bet on AI, $14 billion Scale AI deal

Meta is finalizing a $14 billion investment for a 49% stake in Scale AI, with CEO Alexandr Wang joining to lead a new AI research lab at Meta. 
This follows similar moves by Google and Microsoft acquiring AI talent through investments rather than direct acquisitions to avoid regulatory scrutiny.
Scale AI specializes in data labeling and annotation services critical for training AI models, serving major clients including OpenAI, Google, Microsoft, and Meta. 
The company’s expertise covers approximately 70% of all AI models being built, providing Meta with valuable intelligence on competitor approaches to model development.
The deal reflects Meta’s struggles with its Llama AI models, particularly the underwhelming reception of Llama 4 and delays in releasing the more powerful “Behemoth” model due to concerns about competitiveness with OpenAI and DeepSeek. Meta recently reorganized its GenAI unit into two divisions following these setbacks.
Wang brings both technical AI expertise and business acumen, having built Scale AI from a 2016 startup to a $14 billion valuation. His experience includes defense contracts and the recent Defense Llama collaboration with Meta for national security applications.
For cloud providers and dev...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[309: Microsoft tries to give away cloud services for free, sadly, it's only SQL]]>
                </itunes:title>
                                    <itunes:episode>309</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 308 of The Cloud Pod – where the forecast is always cloudy! Justin and Matt are on hand and ready to bring you an action packed episode. Unfortunately, this one is also lullaby free. Apologies. This week we’re talking about Databricks and Lakebridge, Cedar Analysis, Amazon Q, Google’s little hiccup, and updates to SQL – plus so much more! Thanks for joining us. </p>
<h3>Titles we almost went with this week:</h3>
<ul>
<li>KV Phone Home: When Your Key-Value Store Goes AWOL</li>
<li>When Your Coreless Service Finds Its Core Problem</li>
<li>Oracle’s Vanity Fair: Pretty URLs for Pretty Penny</li>
<li>From Warehouse to Lakehouse: Your Free Ticket to Cloud Town</li>
<li>1⃣Databricks Uno: Because One is the Loneliest Number</li>
<li>Free as in Beer, Smart as in Data Science</li>
<li>Cedar Analysis: Because Your Authorization Policies Wood Never Lie</li>
<li>Cedar Analysis: Teaching Old Policies New Proofs</li>
<li>Amazon Q Finally Learns to Talk to Other Apps</li>
<li>Tomorrow: Visual Studio’s Predictive Edit Revolution</li>
<li>The Ghost of Edits Future: AI Haunts Your Code Before You Write It</li>
<li>IAM What IAM: Google’s Identity Crisis Breaks the Internet</li>
<li>Permission Denied: The Day Google Forgot Who Everyone Was</li>
<li>403 Forbidden: When Google’s Bouncer Called in Sick</li>
<li>AWS Brings the Heat to Fusion Research</li>
<li>Larry’s Cloud Nine: Oracle Stock Soars on Forecast Raise</li>
<li>OCI You Later: Oracle Bets Big on Cloud Growth</li>
<li>Oracle’s Crystal Ball Shows 40% Cloud Growth Ahead</li>
<li>Meta Scales Up Its AI Ambitions with $14 Billion Investment</li>
<li>From FAIR to Scale: Meta’s $14 Billion AI Makeover</li>
<li>Congratulations Databricks one, you are now the new low code solution. </li>
<li>AWS burns power to figure out how power works</li>
</ul>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>02:12 <a href="https://www.cnbc.com/2025/06/10/zuckerberg-makes-metas-biggest-bet-on-ai-14-billion-scale-ai-deal.html?utm_source=tldrnewsletter">Zuckerberg makes Meta’s biggest bet on AI, $14 billion Scale AI deal</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.cnbc.com/quotes/META/">Meta</a> is finalizing a $14 billion investment for a 49% stake in Scale AI, with CEO Alexandr Wang joining to lead a new AI research lab at Meta. </li>
<li style="font-weight:400;">This follows similar moves by <a href="https://www.cnbc.com/quotes/GOOGL/">Google</a> and <a href="https://www.cnbc.com/quotes/MSFT/">Microsoft</a> acquiring AI talent through investments rather than direct acquisitions to avoid regulatory scrutiny.</li>
<li style="font-weight:400;"><a href="https://scale.com/">Scale AI</a> specializes in data labeling and annotation services critical for training AI models, serving major clients including <a href="https://openai.com/">OpenAI</a>, Google, Microsoft, and Meta. </li>
<li style="font-weight:400;">The company’s expertise covers approximately 70% of all AI models being built, providing Meta with valuable intelligence on competitor approaches to model development.</li>
<li style="font-weight:400;">The deal reflects Meta’s struggles with its <a href="https://www.llama.com/">Llama AI models</a>, particularly the underwhelming reception of <a href="https://ai.meta.com/blog/llama-4-multimodal-intelligence/">Llama 4</a> and delays in releasing the more powerful “Behemoth” model due to concerns about competitiveness with OpenAI and <a href="https://www.deepseek.com/en">DeepSeek</a>. Meta recently reorganized its GenAI unit into two divisions following these setbacks.</li>
<li style="font-weight:400;">Wang brings both technical AI expertise and business acumen, having built Scale AI from a 2016 startup to a $14 billion valuation. His experience includes defense contracts and the recent <a href="https://scale.com/blog/defense-llama">Defense Llama</a> collaboration with Meta for national security applications.</li>
<li style="font-weight:400;">For cloud providers and developers, this consolidation signals increased competition in AI infrastructure and services, as Meta seeks to strengthen its position against OpenAI’s consumer applications and model capabilities through enhanced data preparation and training methodologies.</li>
</ul>
<p>03:29  Matt – “It’s interesting, especially the first part of this where companies are trying to acquire AI talent through investments rather than directly hiring people – and hiring them away from other companies. It’s going to be an interesting trend to see if it continues on in the industry where they just keep doing it this way. They just acquire small companies and medium (or large in this case) in order to continue to grow their teams or to at least augment their teams in that way. Or if they’re going to try to build their own in-house units too.”</p>
<p>07:50 <a href="https://www.databricks.com/blog/introducing-databricks-free-edition">Introducing Databricks Free Edition | Databricks Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.databricks.com/learn/free-edition">Databricks Free Edition</a> provides access to the same data and AI tools used by enterprise customers, removing the cost barrier for students and hobbyists to gain hands-on experience with production-grade platforms.</li>
<li style="font-weight:400;">The offering addresses the growing skills gap in AI/ML roles, where job postings have increased 74% annually over four years and 66% of business leaders require AI skills for new hires.</li>
<li style="font-weight:400;">Free Edition includes access to Databricks’ training resources and industry-recognized certifications, allowing users to validate their skills on the same platform used by major companies.</li>
<li style="font-weight:400;">Universities like Texas A&amp;M are already integrating Free Edition into their curriculum, enabling students to gain practical experience with enterprise data tools before entering the workforce.</li>
<li style="font-weight:400;">This move positions Databricks to capture mindshare among future data professionals while competing with other cloud providers’ free tiers and educational offerings.</li>
<li style="font-weight:400;">Want to try it out? You can do that <a href="http://login.databricks.com/?intent=SIGN_UP&amp;signup_experience_step=EXPRESS&amp;provider=DB_FREE_TIER">here</a>. </li>
</ul>
<p>08:28 <a href="https://www.databricks.com/blog/introducing-databricks-one">Introducing Databricks One | Databricks Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="http://www.databricks.com/product/business-intelligence/databricks-one'">Databricks One</a> creates a simplified interface specifically for business users to access data insights without needing technical expertise in clusters, queries, or notebooks. </li>
<li style="font-weight:400;">The <a href="https://docs.databricks.com/en/ai-bi/consumer">consumer access</a> entitlement is available now, with the full experience entering beta later this summer.</li>
<li style="font-weight:400;">The platform provides three key capabilities for non-technical users: <a href="https://www.databricks.com/product/business-intelligence/ai-bi-dashboards">AI/BI Dashboards</a>, <a href="https://www.databricks.com/product/business-intelligence/ai-bi-genie">Genie</a> for natural language data queries, and interaction with <a href="https://www.databricks.com/product/databricks-apps">Databricks Apps</a> through a streamlined interface designed to minimize complexity.</li>
<li style="font-weight:400;">Security and governance remain centralized through Unity Catalog, allowing administrators to expand access to business users while maintaining existing compliance and auditing controls without changing their governance strategy.</li>
<li style="font-weight:400;">The service will be included at no additional license fee for existing <a href="https://blogs.infoservices.com/databricks/databricks-data-intelligence-platform-5-core-capabilities/">Databricks Intelligence Platform</a> customers, potentially expanding data access across organizations without requiring additional technical training or resources.</li>
<li style="font-weight:400;">Future roadmap includes expanding from single workspace access to account-wide asset visibility, positioning Databricks One as a centralized hub for business intelligence across the entire Databricks ecosystem.</li>
</ul>
<p>08:42  Justin – “I think the Databricks Free Edition is a really strong move on their part… I can play with it, see what it does and kick the tires on it and be interested in it as a hobbyist. And then I can bring it back to my day job and say, hey, I was using Databricks over the weekend and I did a thing and I think it could work for us at work. Being able to get access to these tools and these types of capabilities to play with, I think it’s a huge advantage. Everything’s moving so fast right now, that unless you have access to these tools, you feel like you’re left behind.”</p>
<h2>AWS</h2>
<p>10:45 <a href="https://www.geekwire.com/2025/aws-and-national-lab-team-up-to-deploy-ai-tools-in-pursuit-of-fusion-energy/">AWS And National Lab Team Up To Deploy AI Tools In Pursuit Of Fusion </a><a href="https://www.geekwire.com/2025/aws-and-national-lab-team-up-to-deploy-ai-tools-in-pursuit-of-fusion-energy/">Energy</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/">AWS</a> is partnering with <a href="https://www.llnl.gov/">Lawrence Livermore National Laboratory</a> to apply machine learning to fusion energy research, specifically to predict and prevent plasma disruptions that can damage <a href="https://www.energy.gov/science/doe-explainstokamaks">tokamak reactors</a>. </li>
<li style="font-weight:400;">The collaboration uses AWS cloud infrastructure to process massive datasets from fusion experiments.</li>
<li style="font-weight:400;">The project leverages <a href="https://aws.amazon.com/sagemaker/">AWS SageMaker</a> and high-performance computing resources to analyze terabytes of sensor data from fusion reactors, training models that can predict plasma instabilities milliseconds before they occur. This predictive capability could prevent costly reactor damage and accelerate fusion development timelines.</li>
<li style="font-weight:400;">Cloud computing enables fusion researchers to scale their computational workloads dynamically, running complex simulations and ML training jobs that would be prohibitively expensive with on-premises infrastructure. </li>
<li style="font-weight:400;">AWS provides the elastic compute needed to process years of experimental data from multiple fusion facilities worldwide.</li>
<li style="font-weight:400;">The partnership demonstrates how cloud-based AI/ML services are becoming essential for scientific computing applications that require massive parallel processing and real-time analysis. </li>
<li style="font-weight:400;">Fusion researchers can now iterate on models faster and share findings globally through cloud collaboration tools.</li>
<li style="font-weight:400;">This application of cloud AI to fusion energy could accelerate the path to commercial fusion power by reducing experimental downtime and improving reactor designs through better predictive models. Success here would validate cloud platforms as critical infrastructure for next-generation energy research.</li>
</ul>
<p>12:34 <a href="https://aws.amazon.com/blogs/devops/use-model-context-protocol-with-amazon-q-developer-for-context-aware-ide-workflows/">Use Model Context Protocol with Amazon Q Developer for context-aware </a><a href="https://aws.amazon.com/blogs/devops/use-model-context-protocol-with-amazon-q-developer-for-context-aware-ide-workflows/">IDE workflows | AWS DevOps &amp; Developer Productivity Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/q/developer/">Amazon Q Developer</a> now supports <a href="https://modelcontextprotocol.io/introduction">Model Context Protocol (MCP)</a> in <a href="https://code.visualstudio.com/">VS Code</a> and <a href="https://www.jetbrains.com/ides/">JetBrains IDEs</a>, enabling developers to connect external tools like <a href="https://www.atlassian.com/software/jira">Jira</a> and <a href="https://www.figma.com/">Figma</a> directly into their coding workflow. </li>
<li style="font-weight:400;">This eliminates manual context switching between browser tabs and allows Q Developer to automatically fetch project requirements, design specs, and update task statuses.</li>
<li style="font-weight:400;">MCP provides a standardized way for LLMs to integrate with applications, share context, and interact with APIs. Developers can <a href="https://docs.aws.amazon.com/amazonq/latest/qdeveloper-ug/command-line-ide-configuration.html">configure MCP servers</a> with either Global scope (across all projects) or Workspace scope (current IDE only), with granular permissions for individual tools including Ask, Always Allow, or Deny options.</li>
<li style="font-weight:400;">The practical implementation shown demonstrates fetching Jira issues, moving tickets to “In Progress”, analyzing Figma designs for technical requirements, and implementing code changes based on combined context from both tools. This integration allows Q Developer to generate more accurate code by understanding both business requirements and design specifications simultaneously.</li>
<li style="font-weight:400;">This feature builds on Q Developer’s existing agentic coding capabilities which already included executing shell commands and reading local files. The addition of MCP support extends these capabilities to any tool that implements the protocol, with AWS providing an open-source MCP Servers repository on GitHub for additional integrations.</li>
<li style="font-weight:400;">For AWS customers, this reduces development friction by keeping developers in their IDE while maintaining full context from project management and design tools. The feature is available now in Q Developer’s IDE plugins with no additional cost beyond standard Q Developer pricing.</li>
</ul>
<p>13:26  Justin – “I mean, if you think Q Developer is the best tool for you, then more power to you, and I’m not going to stop you. But I am glad to see this get added to one more place.” </p>
<p>14:08 <a href="https://aws.amazon.com/about-aws/whats-new/2025/06/aws-waf-automatic-application-layer-ddos-protection/">AWS WAF now supports automatic application layer distributed denial of </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/06/aws-waf-automatic-application-layer-ddos-protection/">service (DDoS) protection – AWS</a></p>
<ul>
<li style="font-weight:400;">AWS WAF now includes automatic Layer 7 DDoS protection that detects and mitigates attacks within seconds, using machine learning to establish traffic baselines in minutes and identify anomalies without manual rule configuration.</li>
<li style="font-weight:400;">The managed rule group works across <a href="https://docs.aws.amazon.com/AmazonCloudFront/latest/DeveloperGuide/Introduction.html">CloudFront</a>, <a href="https://docs.aws.amazon.com/elasticloadbalancing/latest/application/introduction.html">ALB</a>, and other WAF-supported services, reducing operational overhead for security teams who previously had to manually configure and tune DDoS protection rules.</li>
<li style="font-weight:400;">Available to all AWS WAF and <a href="https://docs.aws.amazon.com/waf/latest/developerguide/ddos-advanced-summary.html">Shield Advanced</a> subscribers in most regions, the service automatically applies mitigation rules when traffic deviates from normal patterns, with configurable responses including challenges or blocks.</li>
<li style="font-weight:400;">This addresses a critical gap in application-layer protection where traditional network-layer DDoS defenses fall short, particularly important as L7 attacks become more sophisticated and frequent.</li>
<li style="font-weight:400;">Pricing follows standard AWS WAF managed rule group costs, making enterprise-grade DDoS protection accessible without requiring dedicated security infrastructure or expertise.</li>
</ul>
<p>14:56  Justin – “I have say that I’ve used the WAF now quite a bit – as well as Shield and CloudFront. Compared to using CloudFlare, they’re so limited what you can do on these things. I so much prefer CloudFlare over trying to tune AWS WAF properly.”</p>
<p>19:27 <a href="https://aws.amazon.com/about-aws/whats-new/2025/06/powertools-lambda-bedrock-agents-function-utility/">Powertools for AWS Lambda introduces Bedrock Agents Function utility – </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/06/powertools-lambda-bedrock-agents-function-utility/">AWS</a></p>
<ul>
<li style="font-weight:400;"><a href="https://powertools.aws.dev/">Powertools</a> for <a href="https://aws.amazon.com/lambda/">AWS Lambda</a> now includes a <a href="https://aws.amazon.com/bedrock/agents/">Bedrock Agents</a> Function utility that eliminates boilerplate code when building Lambda functions that respond to Amazon Bedrock Agent action requests. </li>
<li style="font-weight:400;">The utility handles parameter injection and response formatting automatically, letting developers focus on business logic instead of integration complexity.</li>
<li style="font-weight:400;">This utility integrates seamlessly with existing Powertools features like Logger and Metrics, providing a production-ready foundation for AI applications. Available for Python, TypeScript, and .NET, it standardizes how Lambda functions interact with Bedrock Agents across different programming languages.</li>
<li style="font-weight:400;">For organizations building agent-based AI solutions, this reduces development time and potential errors in the Lambda-to-Bedrock integration layer. The utility abstracts away the complex request/response patterns required for agent actions, making it easier to build and maintain serverless AI applications.</li>
<li style="font-weight:400;">Developers can get started by updating to the latest version of Powertools for AWS Lambda in their preferred language. Since this is an open-source utility addition, there are no additional costs beyond standard Lambda and Bedrock usage fees.</li>
<li style="font-weight:400;">This release signals AWS’s continued investment in simplifying AI application development by providing purpose-built utilities that handle common integration patterns. It addresses a specific pain point for developers who previously had to write custom code to properly format Lambda responses for Bedrock Agents.</li>
</ul>
<p>20:21  Matt – “It’s great to see them making these more accessible to *not* subject matter experts and to the general developer. So would I want to take my full app and go to full production leveraging power tools? No, but it’s good to let the standard developer that just wants to play with something and learn and figure out how to do it. Get something up and running decently easily.”</p>
<p>20:53 <a href="https://aws.amazon.com/blogs/opensource/introducing-cedar-analysis-open-source-tools-for-verifying-authorization-policies/">Introducing Cedar Analysis: Open Source Tools for Verifying Authorization </a><a href="https://aws.amazon.com/blogs/opensource/introducing-cedar-analysis-open-source-tools-for-verifying-authorization-policies/">Policies | AWS Open Source Blog</a></p>
<ul>
<li style="font-weight:400;">AWS releases <a href="https://www.cedarpolicy.com/en">Cedar</a> Analysis as open source tools for verifying authorization policies, addressing the challenge of ensuring fine-grained access controls work correctly across all scenarios rather than just test cases. The toolkit includes a <a href="https://aws.amazon.com/blogs/opensource/introducing-cedar-analysis-open-source-tools-for-verifying-authorization-policies/">Cedar Symbolic Compiler</a> that translates policies into mathematical formulas and a CLI tool for policy comparison and conflict detection.</li>
<li style="font-weight:400;">The technology uses SMT (<a href="https://people.eecs.berkeley.edu/~sseshia/pubdir/SMT-BookChapter.pdf">Satisfiability Modulo Theories</a>) solvers and formal verification with Lean to provide mathematically proven soundness and completeness, ensuring analysis results accurately reflect production behavior. </li>
<li style="font-weight:400;">This approach can answer questions like whether two policies are equivalent, if changes grant unintended permissions, or if policies contain conflicts or redundancies.</li>
<li style="font-weight:400;">Cedar itself has gained significant traction with 1.17 million downloads and production use by companies like <a href="https://www.mongodb.com/">MongoDB</a> and <a href="https://www.strongdm.com/">StrongDM</a>, making robust analysis tools increasingly important as applications scale. The open source release under <a href="https://www.apache.org/licenses/LICENSE-2.0.html">Apache 2.0</a> license allows developers to independently verify policies and researchers to build upon the formal methods foundation.</li>
<li style="font-weight:400;">The practical example demonstrates how subtle policy refactoring errors can be caught – splitting a single policy into multiple policies accidentally restricted owner access to private photos, which the analysis tool identified before production deployment. This capability helps prevent authorization bugs that could lead to security incidents or access disruptions.</li>
<li style="font-weight:400;">For AWS customers using services like <a href="https://aws.amazon.com/verified-permissions/">Verified Permissions</a> (which uses Cedar), this provides additional confidence in policy correctness and a path for building custom analysis tools tailored to specific organizational needs. The formal verification aspect also positions Cedar as a research platform for advancing authorization system design.</li>
</ul>
<p>22:57  Justin – “We’re using strong DM in the day jo0,b and it is very nice to see Cedar getting used in lots of different ways, particularly the mathematical proofs to be used in policies.”</p>
<h2>GCP</h2>
<p>23:51 <a href="https://siliconangle.com/2025/06/12/identity-access-management-failure-google-cloud-causes-widespread-internet-service-disruptions/">Identity and access management failure in Google Cloud causes </a><a href="https://siliconangle.com/2025/06/12/identity-access-management-failure-google-cloud-causes-widespread-internet-service-disruptions/">widespread internet service disruptions – SiliconANGLE</a></p>
<ul>
<li style="font-weight:400;">A misconfiguration in Google Cloud’s IAM systems caused widespread outages affecting <a href="https://cloud.google.com/appengine">App Engin</a>e, <a href="https://cloud.google.com/products/firestore?hl=en">Firestore</a>, <a href="https://cloud.google.com/sql">Cloud SQL</a>, <a href="https://cloud.google.com/bigquery">BigQuery</a>, and <a href="https://cloud.google.com/memorystore">Memorystore</a>, demonstrating how a single identity management failure can cascade across multiple cloud services and impact thousands of businesses globally.</li>
<li style="font-weight:400;">The incident highlighted the interconnected nature of modern cloud infrastructure as services like <a href="https://www.cloudflare.com/">Cloudflar</a>e Workers, Spotify, Discord, Shopify, and UPS experienced partial or complete downtime due to their dependencies on Google Cloud components.</li>
<li style="font-weight:400;"><a href="https://workspace.google.com/">Google Workspace</a> applications including Gmail, Drive, Docs, Calendar, and Meet all experienced failures, showing how IAM issues can affect both infrastructure services and end-user applications simultaneously.</li>
<li style="font-weight:400;">The outage underscores the critical importance of IAM redundancy and configuration management in cloud environments, as even major providers like Google can experience service-wide disruptions from a single misconfiguration.</li>
<li style="font-weight:400;">While AWS appeared largely unaffected, Amazon’s Twitch service may have experienced issues due to network-level interdependencies, illustrating how cloud outages can have ripple effects across provider boundaries through shared DNS, CDN, or authentication services.</li>
<li style="font-weight:400;"><a href="https://status.cloud.google.com/incidents/ow5i3PPK96RduMcb1SsW">FULL RCA</a> is available here. </li>
</ul>
<p>26:11 Matt – “For the SRE team at Google, within 2 minutes was already triaging, in 10 minutes it identified the root cause – that’s an impressive response time.” </p>
<p>28:28 <a href="https://blog.cloudflare.com/cloudflare-service-outage-june-12-2025/">Cloudflare service outage June 12, 2025</a></p>
<ul>
<li style="font-weight:400;">Cloudflare experienced a 2 hour 28 minute global outage on June 12, 2025 affecting <a href="https://www.cloudflare.com/developer-platform/products/workers-kv/">Workers KV</a>, <a href="https://developers.cloudflare.com/cloudflare-one/connections/connect-devices/warp/download-warp/">WARP</a>, <a href="https://blog.cloudflare.com/introducing-cloudflare-access/">Access</a>, <a href="https://www.cloudflare.com/zero-trust/products/gateway/">Gateway</a>, <a href="https://developers.cloudflare.com/images/">Images</a>, <a href="https://www.cloudflare.com/developer-platform/products/cloudflare-stream/">Stream</a>, <a href="https://developers.cloudflare.com/workers-ai/">Workers AI</a>, <a href="https://developers.cloudflare.com/workers-ai/">Turnstile</a>, and other critical services due to a third-party storage provider failure that exposed architectural vulnerabilities in their infrastructure.</li>
<li style="font-weight:400;">The incident revealed a critical single point of failure in Workers KV’s central data store, which depends on many Cloudflare products despite being designed as a “coreless” service that should run independently across all locations.</li>
<li style="font-weight:400;">During the outage window, 91% of Workers KV requests failed, cascading failures across dependent services while core services like DNS, Cache, proxy, and WAF remained operational, highlighting the blast radius of shared infrastructure dependencies.</li>
<li style="font-weight:400;">Cloudflare is accelerating migration of Workers KV to their own R2 storage infrastructure and implementing progressive namespace re-enablement tooling to prevent future cascading failures and reduce reliance on third-party providers.</li>
<li style="font-weight:400;">This marks at least the third significant R2-related outage in recent months (March 21 and February 6, 2025 also mentioned), raising questions about the stability of Cloudflare’s storage infrastructure during their architectural transition period.</li>
</ul>
<p>29:31 Justin – “I think the failure here is they’re running an entire KV on top of GCS or GCP in a way that they were impacted by this word that should be blast radiuses out to multiple clouds. Cloudflare is a partner of AWS, GCP, and Azure. They should be able to make things redundant – because I don’t necessarily know that their infrastructure is going to be better than anyone else’s infrastructure.”</p>
<p>32:53 <a href="https://cloud.google.com/blog/products/identity-security/securing-open-source-credentials-at-scale/">Securing open-source credentials at scale | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google Cloud has developed an automated tool that scans open-source packages and Docker images for exposed GCP credentials like API keys and service account keys, processing over 5 billion files across hundreds of millions of artifacts from repositories like PyPI, <a href="https://central.sonatype.com/">Maven Central</a>, and <a href="https://hub.docker.com/">DockerHub</a>.</li>
<li style="font-weight:400;">The system detects and reports leaked credentials within minutes of publication, matching the speed at which malicious actors typically exploit them, with automatic remediation options including disabling compromised service account keys based on customer-configured policies.</li>
<li style="font-weight:400;">Unlike <a href="https://github.com/">GitHub</a> and <a href="https://gitlab.com/users/sign_in">GitLab’s</a> source code scanning, this tool specifically targets built packages and container images where credentials often hide in configuration files, compiled binaries, and build scripts – areas traditionally overlooked in security scanning.</li>
<li style="font-weight:400;">Google plans to expand beyond GCP credentials to include third-party credential scanning later this year, positioning this as part of their broader deps.dev ecosystem for open-source security analysis.</li>
<li style="font-weight:400;">For GCP customers publishing open-source software, this provides free automated protection against credential exposure without requiring additional tooling or workflow changes, addressing what Mandiant reports as the second-highest cloud attack vector at 16% of investigations.</li>
<li style="font-weight:400;">The moral of the story? Please patch. We know it’s a pain. But please, patch. </li>
</ul>
<p>33:55 Matt – “I feel like AWS has had this, where they scan the GIthub commits for years – so I appreciate them doing it, don’t get me wrong, but also, I feel like this has been done before?”</p>
<p>35:48 <a href="https://cloud.google.com/blog/products/compute/googles-cloud-location-finder-unifies-multi-cloud-location-data/">Google’s Cloud Location Finder unifies multi-cloud location data | Google </a><a href="https://cloud.google.com/blog/products/compute/googles-cloud-location-finder-unifies-multi-cloud-location-data/">Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/location-finder/docs">Google Cloud Location Finder</a> provides a unified API for accessing location data across <a href="https://cloud.google.com/">Google Cloud</a>, <a href="https://aws.amazon.com/">AWS</a>, <a href="https://portal.azure.com/">Azure</a>, and <a href="https://www.oracle.com/">Oracle Cloud Infrastructure</a>, eliminating the need to manually track region information across multiple providers. The service is available at no cost via REST APIs and gcloud CLI.</li>
<li style="font-weight:400;">The API returns rich metadata including region proximity data (currently only for GCP regions), territory codes for compliance requirements, and carbon footprint information to support sustainability initiatives. </li>
<li style="font-weight:400;">Data freshness is maintained at 24 hours for active regions with automatic removal of deprecated locations.</li>
<li style="font-weight:400;">Key use cases include optimizing multi-cloud deployments by identifying the nearest GCP region to existing AWS/Azure/OCI infrastructure, ensuring data residency compliance by filtering regions by territory, and automating location selection in multi-cloud applications. This addresses a common pain point where organizations maintain hard-coded lists of cloud regions across providers.</li>
<li style="font-weight:400;">While AWS and Azure offer their own region discovery APIs, Google’s approach of providing cross-cloud visibility in a single service is unique among major cloud providers. The inclusion of sustainability metrics like carbon footprint data aligns with Google’s broader environmental commitments.</li>
</ul>
<p>37:39 <a href="https://cloud.google.com/blog/products/compute/c4d-vms-unparalleled-performance-for-business-workloads/">C4D VMs: Unparalleled performance for business workloads | Google </a><a href="https://cloud.google.com/blog/products/compute/c4d-vms-unparalleled-performance-for-business-workloads/">Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google’s <a href="https://cloud.google.com/compute/docs/general-purpose-machines#c4d_series">C4D VMs</a> are now generally available, powered by <a href="https://www.amd.com/en/products/processors/server/epyc/9005-series.html">5th Gen AMD EPYC processors (Turin)</a> and delivering up to 80% higher throughput for web serving and 30% better performance for general computing workloads compared to C3D. </li>
<li style="font-weight:400;">The new instances scale up to 384 vCPUs and 3TB of DDR5 memory, with support for Hyperdisk storage offering up to <a href="https://cloud.google.com/compute/docs/disks/hd-types/hyperdisk-extreme#achieve-higher-performance-with-multiple-hyperdisk-extreme-volumes">500K IOPS</a>.</li>
<li style="font-weight:400;">C4D introduces Google’s first <a href="https://cloud.google.com/compute/docs/instances/bare-metal-instances">AMD-based Bare Metal</a> instances (coming in weeks), providing direct server access for workloads requiring custom hypervisors or specialized licensing needs. The instances also feature next-gen <a href="https://cloud.google.com/titanium?e=48754805&amp;hl=en">Titanium</a> Local SSD with 35% lower read latency than previous generations.</li>
<li style="font-weight:400;">Performance benchmarks show C4D delivers 25% better price-performance than C3D for general computing and up to 20% better than comparable offerings from other cloud providers. For database workloads like MySQL and Redis, C4D shows 35% better price-performance than competitive VMs, with MySQL seeing up to 55% faster query processing.</li>
<li style="font-weight:400;">The new VMs support AVX-512 with a 512-bit datapath and 50% more memory channels, making them well-suited for CPU-based AI inference workloads with up to 75% price-performance improvement for recommendation inference. C4D also includes confidential computing support via AMD SEV for regulated workloads.</li>
<li style="font-weight:400;">C4D is available in 12 regions and 28 zones at launch, with a 30-day uptime window between planned maintenance events. Early adopters like AppLovin report 40% performance improvements, while Verve Group sees 191% faster ad serving compared to N2D instances.</li>
</ul>
<p>38:18 <a href="https://cloud.google.com/blog/products/compute/introducing-g4-vm-with-nvidia-rtx-pro-6000/">Introducing G4 VM with NVIDIA RTX PRO 6000 | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google Cloud is first to market with G4 VMs featuring NVIDIA RTX PRO 6000 Blackwell GPUs, combining 8 GPUs with AMD Turin CPUs (up to 384 vCPUs) and delivering 4x compute/memory and 6x memory bandwidth compared to G2 VMs. This positions GCP ahead of AWS and Azure in offering Blackwell-based instances for diverse workloads beyond just AI training.</li>
<li style="font-weight:400;">The G4 instances target a broader range of use cases than typical AI-focused GPUs, including cost-efficient inference, robotics simulations, generative AI content creation, and next-generation game rendering with 2x ray-tracing performance. Key customers include Snap for LLM inference, WPP for robotics simulation, and major gaming companies for next-gen rendering.</li>
<li style="font-weight:400;">With 768GB GDDR7 memory, 12 TiB local SSD, and support for Multi-Instance GPU (MIG), G4 VMs enable running multiple workloads per GPU for better cost efficiency. The instances integrate with Vertex AI, GKE, and Hyperdisk (500K IOPS, 10GB/s throughput) for complete AI inference pipelines.</li>
<li style="font-weight:400;">G4 supports NVIDIA Omniverse workloads natively, opening opportunities in manufacturing, automotive, and logistics for digital twins and real-time simulation. The combination of high CPU-to-GPU ratio (48:1) and Titanium’s 400 Gbps networking makes it suitable for complex simulations where CPUs orchestrate graphics workloads.</li>
<li style="font-weight:400;">Currently in preview with global availability by year-end through Google Cloud Sales representatives. Pricing not disclosed, but positioning suggests premium pricing for specialized workloads requiring both AI and graphics capabilities.</li>
</ul>
<h2>Azure</h2>
<p>39:40 <a href="https://azure.microsoft.com/en-us/updates?id=495809">Public Preview: Encrypt Premium SSD v2 and Ultra Disks with Cross </a><a href="https://azure.microsoft.com/en-us/updates?id=495809">Tenant Customer Managed Keys</a></p>
<ul>
<li style="font-weight:400;">Cross-Tenant customer-managed Keys (CMK) for Premium SSD v2 and Ultra disk are now in preview in select regions.</li>
<li style="font-weight:400;">Encrypting managed disks with cross-tenant CMK enables encrypting the disk with a CMK hosted in an Azure Key Vault in a different Microsoft Entra tenant than the disk. </li>
<li style="font-weight:400;">This will allow customers leveraging SaaS solutions that support CMK to use cross-tenant CMK with Premium SSD v2 and Ultra Disks without ever giving up complete control. (i have doubts)</li>
</ul>
<p>40:31 Justin – “The only was this makes sense to me is if you have a SaaS application where you’re getting single servers or small cluster of servers per tenant; which I don’t want to manage. But if that’s what you have, then this may make sense to you. But this has a pretty limited use case, in my opinion.”</p>
<p>42:10 <a href="https://techcommunity.microsoft.com/blog/finopsblog/microsoft-cost-management-updates%E2%80%94may-2025-summary/4421930">Microsoft Cost Management updates—May 2025 (summary) | Microsoft </a><a href="https://techcommunity.microsoft.com/blog/finopsblog/microsoft-cost-management-updates%E2%80%94may-2025-summary/4421930">Community Hub</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/carbon-optimization/overview">Azure Carbon Optimization</a> reaches general availability, allowing organizations to track and reduce their cloud carbon footprint alongside cost optimization efforts. </li>
<li style="font-weight:400;">This positions Azure competitively with AWS’s <a href="https://aws.amazon.com/aws-cost-management/aws-customer-carbon-footprint-tool/">Customer Carbon Footprint Tool</a> and <a href="https://cloud.google.com/carbon-footprint">GCP’s Carbon Footprint reporting</a>.</li>
<li style="font-weight:400;">Export to <a href="https://app.fabric.microsoft.com/">Microsoft Fabric</a> enters limited preview, enabling direct integration of Azure cost data into Microsoft’s unified analytics platform. </li>
<li style="font-weight:400;">This streamlines <a href="https://www.finops.org/introduction/what-is-finops/">FinOps</a> workflows by eliminating manual data transfers between Cost Management and analytics tools.</li>
<li style="font-weight:400;">Free <a href="https://learn.microsoft.com/en-us/azure/azure-sql/managed-instance/sql-managed-instance-paas-overview?view=azuresql">Azure SQL Managed Instance</a> offer launches in GA, providing a no-cost entry point for database migrations. </li>
<li style="font-weight:400;">This directly challenges AWS RDS Free Tier and could accelerate enterprise SQL Server migrations to Azure.</li>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/virtual-network/virtual-network-optimize-network-bandwidth">Network Optimized Azure Virtual Machines</a> enter preview, promising reduced network latency and improved throughput for data-intensive workloads. These specialized VMs target high-performance computing and real-time analytics scenarios.</li>
<li style="font-weight:400;">Smart VM Defaults in AKS reaches GA, automatically selecting cost-optimized VM sizes for Kubernetes workloads. </li>
<li style="font-weight:400;">This feature reduces overprovisioning and helps organizations avoid common AKS sizing mistakes that inflate costs. </li>
</ul>
<p>42:49 Matt – “I doubt they’re giving you Enterprise SQL. I assume it’s SQL Express or SQL standard – but they’re not giving you Enterprise SQL.”</p>
<p>44:20 <a href="https://devblogs.microsoft.com/visualstudio/next-edit-suggestions-available-in-visual-studio-github-copilot/">Next edit suggestions available in Visual Studio – Visual Studio Blog</a></p>
<ul>
<li style="font-weight:400;">GitHub Copilot’s Next Edit Suggestions (NES) in <a href="https://visualstudio.microsoft.com/hub/">Visual Studio</a> 2022 17.14 predicts and suggests your next code edit anywhere in the file, not just at cursor location, using AI to analyze previous edits and suggest insertions, deletions, or mixed changes.</li>
<li style="font-weight:400;">The feature goes beyond simple code completion by understanding logical patterns in your editing flow, such as refactoring a 2D Point class to 3D or updating legacy C++ syntax to modern STL, making it particularly useful for systematic code transformations.</li>
<li style="font-weight:400;">NES presents suggestions as inline diffs with red/green highlighting and provides navigation hints with arrows when the suggested edit is on a different line, allowing developers to Tab through related changes across the file.</li>
<li style="font-weight:400;">Early user feedback indicates accuracy issues with less common frameworks like Pulumi in C# and outdated training data for rapidly evolving APIs, highlighting the challenge of AI suggestions for niche or fast-changing technologies.</li>
<li style="font-weight:400;">While this enhances Visual Studio’s AI-assisted development capabilities, the feature currently appears limited to Visual Studio users rather than being a cloud-based service accessible across platforms or IDEs.</li>
</ul>
<p>45:36 Matt – “It’s a pretty cool feature and I like the premise of it, especially when you are refactoring legacy code or anything along those lines where it’s like, hey, don’t forget this thing over here – because on the flip side, while it’s distracting, it also would be fairly nice to not run everything, compile it, and then have the error because I forgot to refactor this one section out.”</p>
<h2>Oracle</h2>
<p>46:25  <a href="https://www.reuters.com/business/oracle-beats-quarterly-revenue-estimates-2025-06-11/">Oracle soars after raising annual forecast on robust cloud services </a><a href="https://www.reuters.com/business/oracle-beats-quarterly-revenue-estimates-2025-06-11/">demand | Reuters</a></p>
<ul>
<li style="font-weight:400;">Oracle raised its fiscal 2026 revenue forecast to $67 billion, projecting 16.7% annual growth driven by cloud services demand, with total cloud growth expected to accelerate from 24% to over 40%.</li>
<li style="font-weight:400;">Oracle Cloud Infrastructure (OCI) is gaining traction through multi-cloud strategies and integration with Oracle’s enterprise applications, though this growth primarily benefits existing Oracle customers rather than attracting new cloud-native workloads.</li>
<li style="font-weight:400;">The company’s approach of embedding generative AI capabilities into its cloud applications at no additional cost contrasts with AWS, Azure, and GCP’s usage-based AI pricing models, potentially lowering adoption barriers for Oracle’s enterprise customer base.</li>
<li style="font-weight:400;">Fourth quarter cloud services revenue reached $11.70 billion with 14% year-over-year growth, suggesting Oracle is capturing market share but still trails the big three cloud providers who report quarterly cloud revenues of $25+ billion.</li>
<li style="font-weight:400;">Oracle’s growth story depends heavily on enterprises already invested in Oracle databases and applications migrating to OCI, making it less relevant for organizations without existing Oracle dependencies.</li>
</ul>
<p>48:18 Justin – “Oracle is actually a really simple cloud. It is just Solaris boxes, as a cloud service to you. It’s all very server-based. That’s why they have iSCSI and they have fiber channels and they have all these things that are very data center centric. So if you love the data center, and you just want a cloud version of it, Oracle cloud is not bad for you. Or if you have a ton of egress traffic, the cost advantages of their networking is far superior to any of the other cloud providers. So there are benefits as much as I hate to say it.”</p>
<p>49:38 <a href="https://www.oracle.com/news/announcement/oracle-and-amd-collaborate-to-help-customers-deliver-breakthrough-performance-for-large-scale-ai-and-agentic-workloads-2025-06-12/">Oracle and AMD Collaborate to Help Customers Deliver Breakthrough </a><a href="https://www.oracle.com/news/announcement/oracle-and-amd-collaborate-to-help-customers-deliver-breakthrough-performance-for-large-scale-ai-and-agentic-workloads-2025-06-12/">Performance for Large-Scale AI and Agentic Workloads</a></p>
<ul>
<li style="font-weight:400;">Oracle announces AMD Instinct MI355X GPUs on OCI, claiming 2X better price-performance than previous generation and offering zettascale AI clusters with up to 131,072 GPUs for large-scale AI training and inference workloads.</li>
<li style="font-weight:400;">This positions Oracle as one of the first hyperscalers to offer AMD’s latest AI accelerators, though AWS, Azure, and GCP already have established GPU offerings from NVIDIA and their own custom silicon, making Oracle’s differentiation primarily about AMD partnership and pricing.</li>
<li style="font-weight:400;">The MI355X delivers triple the compute power and 50% more high-bandwidth memory than its predecessor, with OCI’s RDMA cluster network architecture supporting the massive 131,072 GPU configuration for customers needing extreme scale.</li>
<li style="font-weight:400;">Oracle emphasizes open-source compatibility and flexibility, which could appeal to customers wanting alternatives to NVIDIA’s CUDA ecosystem, though the real test will be whether the price-performance claims hold up against established solutions.</li>
<li style="font-weight:400;">The announcement targets customers running large language models and agentic AI workloads, but adoption will likely depend on actual benchmarks, software ecosystem maturity, and whether Oracle can deliver on the promised cost advantages.</li>
</ul>
<p>50:52 <a href="https://blogs.oracle.com/apex/post/introducing-vanity-urls-on-adb">Introducing Vanity Urls On Autonomous DB</a></p>
<ul>
<li style="font-weight:400;">Oracle now allows custom domain names for APEX applications on Autonomous Database, eliminating the need for awkward database-specific URLs like apex.oraclecloud.com/ords/f?p=12345 in favor of cleaner addresses like myapp.company.com.</li>
<li style="font-weight:400;">This vanity URL feature requires configuring DNS CNAME records and SSL certificates through Oracle’s Certificate Service, adding operational complexity compared to AWS CloudFront or Azure Front Door which handle SSL automatically.</li>
<li style="font-weight:400;">The feature is limited to paid Autonomous Database instances only, excluding Always Free tier users, which may restrict adoption for developers testing or running small applications.</li>
<li style="font-weight:400;">While this brings Oracle closer to parity with other cloud providers’ application hosting capabilities, the implementation requires manual certificate management and DNS configuration that competitors have largely automated.</li>
<li style="font-weight:400;">The primary benefit targets enterprises already invested in Oracle’s ecosystem who need professional-looking URLs for customer-facing APEX applications without exposing underlying database infrastructure details.</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2076502/c1e-9202fddn9zao9d6j-mk4n7vqnfk98-erc44e.mp3" length="61302976"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 308 of The Cloud Pod – where the forecast is always cloudy! Justin and Matt are on hand and ready to bring you an action packed episode. Unfortunately, this one is also lullaby free. Apologies. This week we’re talking about Databricks and Lakebridge, Cedar Analysis, Amazon Q, Google’s little hiccup, and updates to SQL – plus so much more! Thanks for joining us. 
Titles we almost went with this week:

KV Phone Home: When Your Key-Value Store Goes AWOL
When Your Coreless Service Finds Its Core Problem
Oracle’s Vanity Fair: Pretty URLs for Pretty Penny
From Warehouse to Lakehouse: Your Free Ticket to Cloud Town
1⃣Databricks Uno: Because One is the Loneliest Number
Free as in Beer, Smart as in Data Science
Cedar Analysis: Because Your Authorization Policies Wood Never Lie
Cedar Analysis: Teaching Old Policies New Proofs
Amazon Q Finally Learns to Talk to Other Apps
Tomorrow: Visual Studio’s Predictive Edit Revolution
The Ghost of Edits Future: AI Haunts Your Code Before You Write It
IAM What IAM: Google’s Identity Crisis Breaks the Internet
Permission Denied: The Day Google Forgot Who Everyone Was
403 Forbidden: When Google’s Bouncer Called in Sick
AWS Brings the Heat to Fusion Research
Larry’s Cloud Nine: Oracle Stock Soars on Forecast Raise
OCI You Later: Oracle Bets Big on Cloud Growth
Oracle’s Crystal Ball Shows 40% Cloud Growth Ahead
Meta Scales Up Its AI Ambitions with $14 Billion Investment
From FAIR to Scale: Meta’s $14 Billion AI Makeover
Congratulations Databricks one, you are now the new low code solution. 
AWS burns power to figure out how power works

AI Is Going Great – Or How ML Makes Money 
02:12 Zuckerberg makes Meta’s biggest bet on AI, $14 billion Scale AI deal

Meta is finalizing a $14 billion investment for a 49% stake in Scale AI, with CEO Alexandr Wang joining to lead a new AI research lab at Meta. 
This follows similar moves by Google and Microsoft acquiring AI talent through investments rather than direct acquisitions to avoid regulatory scrutiny.
Scale AI specializes in data labeling and annotation services critical for training AI models, serving major clients including OpenAI, Google, Microsoft, and Meta. 
The company’s expertise covers approximately 70% of all AI models being built, providing Meta with valuable intelligence on competitor approaches to model development.
The deal reflects Meta’s struggles with its Llama AI models, particularly the underwhelming reception of Llama 4 and delays in releasing the more powerful “Behemoth” model due to concerns about competitiveness with OpenAI and DeepSeek. Meta recently reorganized its GenAI unit into two divisions following these setbacks.
Wang brings both technical AI expertise and business acumen, having built Scale AI from a 2016 startup to a $14 billion valuation. His experience includes defense contracts and the recent Defense Llama collaboration with Meta for national security applications.
For cloud providers and dev...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2076502/c1a-k5d5-gp3x7nrjc90w-uart9n.jpg"></itunes:image>
                                                                            <itunes:duration>00:51:05</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2076502/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[308: SCC: Security Command Center or Super Cool Capabilities?]]>
                </title>
                <pubDate>Wed, 18 Jun 2025 18:10:11 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2069445</guid>
                                    <link>https://tcpfm.castos.com/episodes/308-scc-security-command-center-or-super-cool-capabilities</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 308 of The Cloud Pod – where the forecast is always cloudy! Justin, Matt and Ryan are in the house today to tell us all about the latest and greatest from FinOps and SnowFlake conferences, plus updates from Security Command Center, OpenAI, and even a new AWS Region. All this and more, today in the cloud! </p>
<h3>Titles we almost went with this week:</h3>
<ul>
<li>I Left My Wallet at FinOps X, But Found Savings at Snowflake Summit</li>
<li>Snowflake City Lights, FinOps by the Sea</li>
<li>The Two Summits: A Tale of FinOps and Snowflakes</li>
<li>Crunchy on the Outside, Snowflake on the Inside </li>
<li>AWS Taipei: Because Sometimes You Need Your Data Closer Than Your Night Market </li>
<li>AWS Plants Its Flag in Taipei: The 37th Time’s the Charm</li>
<li>AWS Slashes GPU Prices Faster Than a CUDA Kernel</li>
<li>Two Writers Walk Into a Database… And Both Succeed</li>
<li>AWS Network Firewall: Now With Windows!</li>
<li>The VPN Connection That Keeps Its Secrets</li>
<li>Transform and Roll Out: Pub/Sub’s New Single Message Feature</li>
<li>SAP Happens: Google’s New M4 VMs Handle It Better</li>
<li>Total Recall: Google’s 6TB Memory Machines</li>
<li>The M4trix Has You (And Your In-Memory Databases)</li>
<li>DeepSeek and You Shall Find… on Google Cloud</li>
<li>Four Score and Seven Vulnerabilities Ago – mk</li>
<li>The Fantastic Four Security Features</li>
<li>MCP: Model Context Protocol or Master Control Program from Tron?</li>
<li>No SQL? No Problem! AI Takes the Wheel</li>
<li>Injection Rejection: How Azure Keeps Your Prompts Clean</li>
</ul>
<h2>General News </h2>
<p>05:09 <a href="https://www.finops.org/insights/finops-x-2025-cloud-announcements/">FinOps X 2025 Cloud Announcements: AI Agents  and Increased FOCUS </a><a href="https://www.finops.org/insights/finops-x-2025-cloud-announcements/">Support</a></p>
<ul>
<li style="font-weight:400;">All major cloud providers announced expanded support for <a href="https://focus.finops.org/">FOCUS (FinOps Open Cost and Usage Specification)</a> 1.0, with AWS already in general availability and Google Cloud launching a <a href="https://cloud.google.com/bigquery">BigQuery</a> export in private preview. </li>
<li style="font-weight:400;">This signals an industry-wide standardization of cloud cost reporting formats.</li>
<li style="font-weight:400;">AWS introduced AI-powered cost optimization through <a href="https://aws.amazon.com/q/developer/">Amazon Q Developer</a> integration with <a href="https://docs.aws.amazon.com/cost-management/latest/userguide/cost-optimization-hub.html">Cost Optimization Hub</a>, enabling automated recommendations across millions of resources with detailed explanations and action plans for cost reduction.</li>
<li style="font-weight:400;">Microsoft Azure launched AI agents for application modernization that can reduce migration efforts from months to hours by automating code assessment and remediation across thousands of files, while also introducing flexible PTU reservations that work across multiple AI models.</li>
<li style="font-weight:400;">Google Cloud unveiled<a href="https://cloud.google.com/blog/topics/cost-management/spring-cleaning-with-finops-hub"> FinOps Hub 2.0</a> with Gemini-powered waste detection that identifies underutilized resources (like VMs at 5% usage) and provides AI-generated optimization recommendations for Kubernetes, Cloud Run, and Cloud SQL services.</li>
<li style="font-weight:400;"><a href="https://www.oracle.com/cloud/">Oracle Cloud Infrastructure</a> added carbon emissions reporting with hourly power-based calculations and GHGP compliance, plus new cost anomaly detection and rules-based cost allocation features for improved financial governance.</li>
</ul>
<p>06:11  Justin – “I mean, if I’m modernizing my application, typically it’s off .NET and Azure, but ok…” </p>
<p>07:20 <a href="https://siliconangle.com/2025/06/02/broadcom-reboots-cloudhealth-enhancements-broaden-finops-use/">Broadcom reboots CloudHealt...</a></p>
<h3>Chapters</h3>
<ul><li>(00:00:00) - Don't Buy Software Named After Fuzzy Creatures</li><li>(00:04:00) - FinOps X: Ranting About Finops Tooling</li><li>(00:05:17) - Cloud Cost Reporting Standards 1.2 Spec</li><li>(00:07:19) - CloudHealth's New Look for Finops</li><li>(00:11:05) - FinOps and the dual-role</li><li>(00:12:37) - Snowflake Summit 2018: Big Data, Intelligence & Security</li><li>(00:17:29) - Snowflake Adds Postgres to its Cloud Platform</li><li>(00:20:05) - OpenAI Adds Google Cloud to Its Infrastructure</li><li>(00:23:34) - Mistral AI Releases Magistral, Their First Language Model</li><li>(00:26:07) - Amazon Launches 37th Global Region in Taipei</li><li>(00:31:25) - Wonders of AWS: Smithy API Models</li><li>(00:37:34) - AWS to Lower GPU Prices for AI-based Instances</li><li>(00:41:01) - AWS Open-Sourcing PG Active</li><li>(00:44:43) - AWS Network Firewall: Monitoring Dashboard</li><li>(00:48:35) - AWS Site to Site VPN: New Features and Best Practices</li><li>(00:51:41) - Google Pub Sub: JavaScript Transforms (New Feature)</li><li>(00:54:51) - Google Cloud: New SAP HANA M4 VMs with In</li><li>(00:56:50) - What Sharding a Database Is Really Like</li><li>(00:59:41) - Google Cloud Announces Optimized Deployment Recipes for DeepSeq</li><li>(01:01:27) - BigQuery: reservation fairness and predictability,</li><li>(01:06:19) - SEC Cybersecurity Command Center 2018: Four new capabilities</li><li>(01:07:18) - Squid vs. Splunk</li><li>(01:07:36) - Cloud Run Threat Detection</li><li>(01:08:22) - SCC automatically detects connections to known malicious IPs by analyzing V</li><li>(01:09:32) - Google Cloud's Natural Language Data Manipulation (MLDB)</li><li>(01:11:31) - Google Cloud and Datadog: An AI Match</li><li>(01:14:21) - Google Cloud Serverless for Apache Spark & BigQuery</li><li>(01:16:40) - Microsoft: Azure Prompt Shields & More</li><li>(01:20:08) - Jazz: Microsoft's Cloud J</li><li>(01:23:45) - FinOps Tooling: The End of an Era</li><li>(01:31:05) - Will Kelly: Cloud Vendors Are Screwed</li><li>(01:37:18) - Will Cloud Health and Cloudability Help Your Finops?</li><li>(01:41:16) - The Future of FinOps: Unit Economics</li><li>(01:45:44) - Week in Cloud: The Cloud Podcast</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 308 of The Cloud Pod – where the forecast is always cloudy! Justin, Matt and Ryan are in the house today to tell us all about the latest and greatest from FinOps and SnowFlake conferences, plus updates from Security Command Center, OpenAI, and even a new AWS Region. All this and more, today in the cloud! 
Titles we almost went with this week:

I Left My Wallet at FinOps X, But Found Savings at Snowflake Summit
Snowflake City Lights, FinOps by the Sea
The Two Summits: A Tale of FinOps and Snowflakes
Crunchy on the Outside, Snowflake on the Inside 
AWS Taipei: Because Sometimes You Need Your Data Closer Than Your Night Market 
AWS Plants Its Flag in Taipei: The 37th Time’s the Charm
AWS Slashes GPU Prices Faster Than a CUDA Kernel
Two Writers Walk Into a Database… And Both Succeed
AWS Network Firewall: Now With Windows!
The VPN Connection That Keeps Its Secrets
Transform and Roll Out: Pub/Sub’s New Single Message Feature
SAP Happens: Google’s New M4 VMs Handle It Better
Total Recall: Google’s 6TB Memory Machines
The M4trix Has You (And Your In-Memory Databases)
DeepSeek and You Shall Find… on Google Cloud
Four Score and Seven Vulnerabilities Ago – mk
The Fantastic Four Security Features
MCP: Model Context Protocol or Master Control Program from Tron?
No SQL? No Problem! AI Takes the Wheel
Injection Rejection: How Azure Keeps Your Prompts Clean

General News 
05:09 FinOps X 2025 Cloud Announcements: AI Agents  and Increased FOCUS Support

All major cloud providers announced expanded support for FOCUS (FinOps Open Cost and Usage Specification) 1.0, with AWS already in general availability and Google Cloud launching a BigQuery export in private preview. 
This signals an industry-wide standardization of cloud cost reporting formats.
AWS introduced AI-powered cost optimization through Amazon Q Developer integration with Cost Optimization Hub, enabling automated recommendations across millions of resources with detailed explanations and action plans for cost reduction.
Microsoft Azure launched AI agents for application modernization that can reduce migration efforts from months to hours by automating code assessment and remediation across thousands of files, while also introducing flexible PTU reservations that work across multiple AI models.
Google Cloud unveiled FinOps Hub 2.0 with Gemini-powered waste detection that identifies underutilized resources (like VMs at 5% usage) and provides AI-generated optimization recommendations for Kubernetes, Cloud Run, and Cloud SQL services.
Oracle Cloud Infrastructure added carbon emissions reporting with hourly power-based calculations and GHGP compliance, plus new cost anomaly detection and rules-based cost allocation features for improved financial governance.

06:11  Justin – “I mean, if I’m modernizing my application, typically it’s off .NET and Azure, but ok…” 
07:20 Broadcom reboots CloudHealt...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[308: SCC: Security Command Center or Super Cool Capabilities?]]>
                </itunes:title>
                                    <itunes:episode>308</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 308 of The Cloud Pod – where the forecast is always cloudy! Justin, Matt and Ryan are in the house today to tell us all about the latest and greatest from FinOps and SnowFlake conferences, plus updates from Security Command Center, OpenAI, and even a new AWS Region. All this and more, today in the cloud! </p>
<h3>Titles we almost went with this week:</h3>
<ul>
<li>I Left My Wallet at FinOps X, But Found Savings at Snowflake Summit</li>
<li>Snowflake City Lights, FinOps by the Sea</li>
<li>The Two Summits: A Tale of FinOps and Snowflakes</li>
<li>Crunchy on the Outside, Snowflake on the Inside </li>
<li>AWS Taipei: Because Sometimes You Need Your Data Closer Than Your Night Market </li>
<li>AWS Plants Its Flag in Taipei: The 37th Time’s the Charm</li>
<li>AWS Slashes GPU Prices Faster Than a CUDA Kernel</li>
<li>Two Writers Walk Into a Database… And Both Succeed</li>
<li>AWS Network Firewall: Now With Windows!</li>
<li>The VPN Connection That Keeps Its Secrets</li>
<li>Transform and Roll Out: Pub/Sub’s New Single Message Feature</li>
<li>SAP Happens: Google’s New M4 VMs Handle It Better</li>
<li>Total Recall: Google’s 6TB Memory Machines</li>
<li>The M4trix Has You (And Your In-Memory Databases)</li>
<li>DeepSeek and You Shall Find… on Google Cloud</li>
<li>Four Score and Seven Vulnerabilities Ago – mk</li>
<li>The Fantastic Four Security Features</li>
<li>MCP: Model Context Protocol or Master Control Program from Tron?</li>
<li>No SQL? No Problem! AI Takes the Wheel</li>
<li>Injection Rejection: How Azure Keeps Your Prompts Clean</li>
</ul>
<h2>General News </h2>
<p>05:09 <a href="https://www.finops.org/insights/finops-x-2025-cloud-announcements/">FinOps X 2025 Cloud Announcements: AI Agents  and Increased FOCUS </a><a href="https://www.finops.org/insights/finops-x-2025-cloud-announcements/">Support</a></p>
<ul>
<li style="font-weight:400;">All major cloud providers announced expanded support for <a href="https://focus.finops.org/">FOCUS (FinOps Open Cost and Usage Specification)</a> 1.0, with AWS already in general availability and Google Cloud launching a <a href="https://cloud.google.com/bigquery">BigQuery</a> export in private preview. </li>
<li style="font-weight:400;">This signals an industry-wide standardization of cloud cost reporting formats.</li>
<li style="font-weight:400;">AWS introduced AI-powered cost optimization through <a href="https://aws.amazon.com/q/developer/">Amazon Q Developer</a> integration with <a href="https://docs.aws.amazon.com/cost-management/latest/userguide/cost-optimization-hub.html">Cost Optimization Hub</a>, enabling automated recommendations across millions of resources with detailed explanations and action plans for cost reduction.</li>
<li style="font-weight:400;">Microsoft Azure launched AI agents for application modernization that can reduce migration efforts from months to hours by automating code assessment and remediation across thousands of files, while also introducing flexible PTU reservations that work across multiple AI models.</li>
<li style="font-weight:400;">Google Cloud unveiled<a href="https://cloud.google.com/blog/topics/cost-management/spring-cleaning-with-finops-hub"> FinOps Hub 2.0</a> with Gemini-powered waste detection that identifies underutilized resources (like VMs at 5% usage) and provides AI-generated optimization recommendations for Kubernetes, Cloud Run, and Cloud SQL services.</li>
<li style="font-weight:400;"><a href="https://www.oracle.com/cloud/">Oracle Cloud Infrastructure</a> added carbon emissions reporting with hourly power-based calculations and GHGP compliance, plus new cost anomaly detection and rules-based cost allocation features for improved financial governance.</li>
</ul>
<p>06:11  Justin – “I mean, if I’m modernizing my application, typically it’s off .NET and Azure, but ok…” </p>
<p>07:20 <a href="https://siliconangle.com/2025/06/02/broadcom-reboots-cloudhealth-enhancements-broaden-finops-use/">Broadcom reboots CloudHealth with enhancements to broaden FinOps use </a><a href="https://siliconangle.com/2025/06/02/broadcom-reboots-cloudhealth-enhancements-broaden-finops-use/">– SiliconANGLE</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.broadcom.com/">Broadcom</a> has redesigned <a href="https://www.finops.org/members/cloudhealth/">CloudHealth</a> with AI-powered features including Intelligent Assist for natural language queries and Smart Summary for explaining billing changes, marking the platform’s most significant update since its 2012 launch.</li>
<li style="font-weight:400;">The update addresses a key <a href="https://www.finops.org/introduction/what-is-finops/">FinOps</a> challenge by making cloud cost data accessible to non-technical teams through plain English interfaces, instead of requiring SQL knowledge, as 44% of FinOps teams were created within the past year according to the FinOps Foundation.</li>
<li style="font-weight:400;">CloudHealth processes 10 petabytes of cost and usage data daily across 22,000 customers, with the new AI features tested for over six months to ensure accuracy in recommendations for users managing millions in cloud spending.</li>
<li style="font-weight:400;">Smart Summary analyzes billing data to explain cost changes down to unit price level in plain English, condensing billions of lines of cost data into a few hundred actionable lines.</li>
<li style="font-weight:400;">The redesign aims to shift cost optimization visibility earlier in the application lifecycle by extending access beyond centralized FinOps teams to engineering and other departments involved in cloud infrastructure decisions.</li>
</ul>
<p>08:42  Justin – “I’m glad to see CloudHealth getting some love. I thought it was just going to die inside of the Broadcom behemoth.” </p>
<h2>AI Is Going Great – Or How ML Makes Money </h2>
<p>SnowFlake Summit</p>
<p>12:57  <a href="https://www.snowflake.com/en/blog/agentic-ai-ready-enterprise-data/">Democratizing Enterprise AI: Snowflake’s New AI Capabilities Accelerate Data-Driven Innovation</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.snowflake.com/en/">Snowflake</a> introduces <a href="https://www.snowflake.com/en/blog/intelligence-snowflake-summit-2025/">Snowflake Intelligence</a> and <a href="https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-agents">Cortex Agents</a> to enable natural language querying of structured and unstructured data, allowing business users to ask questions in plain English and receive governed answers without SQL knowledge or dashboards</li>
<li style="font-weight:400;"><a href="https://quickstarts.snowflake.com/guide/getting-started-with-cortex-aisql/index.html">Cortex AISQL</a> brings AI capabilities directly into SQL syntax, enabling analysts to extract metadata, classify sentiment, and process documents, images and other formats with 30-70% performance improvements over traditional pipelines.</li>
<li style="font-weight:400;">The platform now includes AI Observability tools for monitoring generative AI applications, access to models from <a href="https://openai.com/">OpenAI</a>, <a href="https://www.anthropic.com/">Anthropic</a>, <a href="https://www.meta.com/about/">Meta</a> and <a href="https://mistral.ai/">Mistral</a> within Snowflake’s security perimeter, and provisioned throughput for dedicated inference capacity.</li>
<li style="font-weight:400;">New ML capabilities include a Data Science Agent that uses Anthropic models to automatically generate ML pipelines from natural language prompts, distributed training APIs, and support for serving models from <a href="https://huggingface.co/">Hugging Face</a> with one-click deployment.</li>
<li style="font-weight:400;">All AI and ML features operate within Snowflake’s unified governance framework with role-based access control, usage tracking, and budget enforcement, eliminating the need for separate infrastructure management.</li>
</ul>
<p>14:00 <a href="https://www.snowflake.com/en/blog/ai-powered-analytics-migrations-innovation/">Experience AI-Powered Analytics and Migrations at Warp Speed with </a><a href="https://www.snowflake.com/en/blog/ai-powered-analytics-migrations-innovation/">Snowflake’s Latest Innovations</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.snowflake.com/en/">Snowflake</a>‘s <a href="https://www.snowflake.com/en/migrate-to-the-cloud/snowconvert-ai/">SnowConvert AI</a> now supports <a href="https://www.snowflake.com/en/engineering-blog/snowconvert-migration-assistant/?clear-cache=lgpbl">automated migrations</a> from <a href="https://greenplum.org/">Greenplum</a>, <a href="http://www.netezza.com/">Netezza</a>, <a href="https://www.postgresql.org/">Postgres</a>, <a href="https://cloud.google.com/bigquery">BigQuery</a>, <a href="http://www.sybase.com/">Sybase</a>, and <a href="https://azure.microsoft.com/en-us/products/synapse-analytics/">Microsoft Synapse</a>, with AI-powered code verification and data validation to reduce migration complexity and timelines.</li>
<li style="font-weight:400;"><a href="https://docs.snowflake.com/en/release-notes/2025/other/2025-06-02-cortex-aisql-public-preview">Cortex AISQL</a> enables SQL-based analysis of both structured and unstructured data (text, images, audio) in a single query, allowing data analysts to perform AI analytics without specialized expertise or external integrations.</li>
<li style="font-weight:400;"><a href="https://docs.snowflake.com/en/user-guide/warehouses-gen2">Standard Warehouse Generation</a> 2 delivers 2.1x faster performance for core analytics workloads and 4.4x faster Delete, Update, and Merge operations, while new Adaptive Compute automatically selects optimal cluster sizes and routing without manual configuration.</li>
<li style="font-weight:400;">Iceberg performance improvements include 2.4x faster analytics on externally managed tables through search optimization, query acceleration, automatic compaction, and enhanced pruning capabilities for selective queries.</li>
<li style="font-weight:400;">Semantic Views provide a unified business metrics layer accessible through <a href="https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-analyst">Cortex Analyst</a>, <a href="https://www.snowflake.com/en/news/press-releases/snowflake-intelligence-and-data-science-agent-deliver-the-next-frontier-of-data-agents-for-enterprise-ai-and-ml/">Snowflake Intelligence</a>, BI tools, or direct SQL queries, ensuring consistent results across different interfaces and partner integrations.</li>
</ul>
<p>15:52  Ryan – “…we’ve moved into running infrastructure not being sort of the first principle of a lot of businesses, and now it seems like sort of hosting data and databases and large data warehouses is sort of going that route too, which I think makes sense.”</p>
<p><a href="https://www.snowflake.com/en/blog/platform-announcements-summit-2025/">An Even Easier-to-Use and More Trusted Platform from Snowflake</a></p>
<ul>
<li style="font-weight:400;">Snowflake introduces <a href="https://snowflakechronicles.medium.com/adaptive-compute-is-here-snowflakes-self-tuning-warehouses-for-a-faster-easier-ai-ready-future-a87ede463d51">Adaptive Compute</a> in private preview, which automatically selects cluster sizes, number of clusters, and auto-suspend durations without user configuration. This service delivers 2.1x faster performance through Gen2 warehouses and optimizes costs by intelligently routing queries to right-sized clusters across a shared compute pool.</li>
<li style="font-weight:400;">The platform adds comprehensive <a href="https://www.finops.org/introduction/what-is-finops/">FinOps</a> capabilities including cost-based anomaly detection, tag-based budgets, and joins the FinOps Foundation as a <a href="https://www.finops.org/about/members/">Premier Enterprise Member</a>. </li>
<li style="font-weight:400;">These tools help organizations track spending spikes, set resource limits by tags, and align with industry best practices for cloud cost management.</li>
<li style="font-weight:400;"><a href="https://docs.snowflake.com/en/user-guide/snowflake-horizon">Horizon Catalog</a> now federates across <a href="https://iceberg.apache.org/">Apache Iceberg</a> <a href="https://medium.com/datastrato/introduction-to-rest-catalogs-for-apache-iceberg-5ee4b6d05eaa">REST catalogs</a> through Catalog-linked Databases, enabling unified governance across external data sources. </li>
<li style="font-weight:400;">The addition of AI-powered Copilot for <a href="https://www.snowflake.com/en/product/features/horizon/">Horizon Catalog</a> allows natural language queries for governance and metadata discovery tasks.</li>
<li style="font-weight:400;">New security features include anomaly detection using AI models, leaked password protection that disables compromised credentials found on the dark web, and bad IP blocking. Workload Identity Federation removes the need for long-lived credentials while passkey support adds modern authentication methods.</li>
<li style="font-weight:400;">Snowflake announces PostgreSQL support through <a href="https://other-docs.snowflake.com/en/connectors/postgres6/about">Snowflake Postgres</a> (in development) and expands Unistore to Azure with Hybrid Tables. </li>
<li style="font-weight:400;">This allows organizations to run transactional and analytical workloads on the same platform with unified governance and security.</li>
</ul>
<p><a href="https://www.snowflake.com/en/blog/adaptive-compute-smarter-warehouses/">Introducing Even Easier-to-Use Snowflake Adaptive Compute with Better </a><a href="https://www.snowflake.com/en/blog/adaptive-compute-smarter-warehouses/">Price/Performance</a></p>
<ul>
<li style="font-weight:400;"><a href="https://snowflakechronicles.medium.com/adaptive-compute-is-here-snowflakes-self-tuning-warehouses-for-a-faster-easier-ai-ready-future-a87ede463d51">Snowflake’s Adaptive Compute</a> automatically selects cluster sizes, number of clusters, and auto-suspend/resume settings, eliminating manual infrastructure decisions while maintaining familiar billing models and FinOps tools.</li>
<li style="font-weight:400;"><a href="https://docs.snowflake.com/en/user-guide/warehouses-gen2">Standard Warehouse Generation 2</a> delivers 2.1x faster performance for core analytics workloads compared to the previous generation, with upgraded hardware and performance enhancements now generally available.</li>
<li style="font-weight:400;">Converting existing warehouses to Adaptive Warehouses requires only a simple alter command with no downtime, preserving warehouse names, policies, and permissions to minimize disruption to production workloads.</li>
<li style="font-weight:400;">All Adaptive Warehouses in an account share a common resource pool, optimizing efficiency through intelligent query routing to right-sized clusters without user intervention.</li>
<li style="font-weight:400;">Pfizer reports successful consolidation of multiple warehouses across different workloads during private preview, highlighting reduced management overhead while maintaining budget controls.</li>
</ul>
<p><a href="https://www.snowflake.com/en/blog/intelligence-snowflake-summit-2025/">Snowflake Intelligence: Talk to Your Data, Unlock Real Business Insights</a></p>
<ul>
<li style="font-weight:400;"><a href="http://ai.snowflake.com/">Snowflake Intelligence</a> introduces a natural language interface at ai.snowflake.com that allows business users to query both structured and unstructured data through conversational AI, eliminating the need for SQL knowledge or waiting for data team support.</li>
<li style="font-weight:400;">The platform’s <a href="https://www.snowflake.com/en/blog/intelligence-snowflake-summit-2025/">Deep Research Agent for Analytics</a> goes beyond simple data retrieval to analyze complex business questions and uncover the “why” behind trends, while maintaining Snowflake’s existing security and governance controls automatically.</li>
<li style="font-weight:400;">Integration with third-party applications like <a href="https://login.salesforce.com/">Salesforce</a>, <a href="https://duckduckgo.com/y.js?ad_domain=zendesk.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=S3qTT31gh8vF6El0MF1U146UaQHYox6kVIguba9cwIzYDxMFotaB4XEzBHaqNNbxu4Fa4Q98_EQNavxmpFrzjaoz6Ttx2u34ZzV2GLwdo4vFFNSEtSJg2N8jtX_d374s.brvt3SIaHasoYOrgJVJpJA&amp;eddgt=wLThTKh_gvasBoemEniKPg%3D%3D&amp;rut=46791b748cd00be0a6fe8bc7b00949ea1693da09e470a6ab5663b3229adb5985&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8qbbLG7pzlDqJR3NclsicqDVUCUz_k4iVykfF0P36K0ZA_kwIUvmRBcpJ11C0lw7elEUfJyrSiFqq-Zayf0ICV-XbRang3ErTREeLEU4kl9nwyZBdKewTP8zP1k54khu8ysetI__xtFCbAfNcD9A1FUjM5d-0i4Orn79QS-iYpVD42Kbyetqhqlzn_gMDagmd-jOdPixzbIkifm7X1BsTJmFDQHfgBmq9BL7hm6zyR9KR7UDxRjzFe9ULCpQNeQFq5_0415XXEscfcSJQeqOo_j-U0GmSfcjAG4cEoTfarRcywhDt_z4k93PqU5xBfsNX8uw6xyiIGnvkYIpAqCDAMIe8o0dGzXaLuBQRPVTyAwgZ2D47Pwytvve2uEcWeonQSE05BdYOpZuZKylHwJJxSZBoZ91qM8VA6MtLaB_Cr4b_6RGJmUaTJ9D_rMxEj4nFpWTTZX_dTJhRbExRACCh--JsuDyPuZUWK9fDSAG2hKSyrSVQxqHdOIYpkV5Pymb-rq8HzdnofKKzYbBqtFWke84NFGVzJYq-ytRUe5FgZltM9TiPIkZd3L0fRnTxOnNbeJahz-tFsVDw9-og3R5DfodF0YGCycGUdPgmoHFcg74deWSAYLSoqXD7SiXLmJuQbVETQT7CCkBcYZsNkp_mkMt52RmuuXJRR5M_ngcyJu9gyuANjoDNY2rdrjN74N0cSkMb2Z7tOPscW6RlhHXYrbTBqvsWdieOCzzEHNA1_ldQDlNNpvnhsAtCjGt5MrSVHbFgaA%26u%3DaHR0cHMlM2ElMmYlMmZ3d3cuemVuZGVzay5jb20lMmZscCUyZm5iLXN1cHBvcnQlM2Z1dG1fc291cmNlJTNkYmluZyUyNnV0bV9tZWRpdW0lM2RTZWFyY2gtUGFpZCUyNnV0bV9uZXR3b3JrJTNkcyUyNnV0bV9jYW1wYWlnbiUzZFNFX0JJX0FNX1VTX0VOX05fU3VwX1BtYXhfUG1heF9aZXRhX0FsbF9IJTI2bWF0Y2h0eXBlJTNkYiUyNnV0bV90ZXJtJTNkd3d3LnplbmRlc2suY29tJTI2dXRtX2NvbnRlbnQlM2QlMjZ0aGVtZSUzZCUyNm1zY2xraWQlM2Q2NDk1OTMyMzhlYTkxNDVjYzQwMWZiNzE5N2ZhM2M4NA%26rlid%3D649593238ea9145cc401fb7197fa3c84&amp;vqd=4-114497991855415159298284209367868877870&amp;iurl=%7B1%7DIG%3D384A529261B04D4DB5B771205A298C8A%26CID%3D160FF8A73957687714F5EEA838ED6967%26ID%3DDevEx%2C5047.1">Zendesk</a>, and <a href="https://slack.com/">Slack</a> provides a unified view across business systems, and <a href="https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-knowledge-extensions/cke-overview">Cortex Knowledge Extensions</a> add external data sources like Stack Overflow and The Associated Press for enriched insights.</li>
<li style="font-weight:400;">The service enables direct action from insights, allowing users to trigger workflows, send notifications, or update records in other systems directly from the conversational interface, reducing the time from insight to action.</li>
<li style="font-weight:400;">Early adopter WHOOP reports their analytics teams can now focus on strategic work rather than manual data retrieval tasks, demonstrating the potential for organizations to democratize data access while maintaining enterprise security standards.</li>
</ul>
<p><a href="https://www.snowflake.com/en/blog/ai-sql-query-language/">Cortex AISQL: Reimagining SQL into AI Query Language for Multimodal Data</a></p>
<ul>
<li style="font-weight:400;">Snowflake Cortex AISQL brings AI capabilities directly into SQL, allowing analysts to process text, images, and audio data using familiar SQL commands like AI_FILTER, AI_AGG, and AI_CLASSIFY without needing separate AI tools or specialized skills.</li>
<li style="font-weight:400;">The new FILE data type enables direct referencing of multimodal data within Snowflake tables, eliminating the need for separate processing systems and allowing complex queries that combine structured and unstructured data analysis in a single workflow.</li>
<li style="font-weight:400;">Performance optimizations deliver up to 70% query runtime reduction for operations like FILTER and JOIN compared to manual implementations, achieved by running AI functions inside Snowflake’s core query engine with intelligent model selection.</li>
<li style="font-weight:400;">Real-world applications include financial services automating corporate action processing from news feeds, retailers detecting product quality issues from customer reviews, and healthcare researchers correlating clinical notes with patient records for new treatment insights.</li>
<li style="font-weight:400;">The public preview makes AI-powered data analysis accessible to SQL analysts without requiring data science expertise, transforming weeks of custom development into straightforward SQL queries that can be modified in minutes.</li>
</ul>
<p>17:45 <a href="https://www.snowflake.com/en/blog/snowflake-postgres-enterprise-ai-database/">Delivering the Most Enterprise-Ready Postgres, Built for Snowflake</a></p>
<ul>
<li style="font-weight:400;">Snowflake is acquiring <a href="https://www.crunchydata.com/">Crunchy Data</a> to create <a href="https://other-docs.snowflake.com/en/connectors/postgres6/about">Snowflake Postgres</a>, bringing enterprise-grade security, compliance, and operational standards to <a href="https://www.postgresql.org/">PostgreSQL</a> within the Snowflake platform. </li>
<li style="font-weight:400;">This addresses the gap between developer preference for Postgres and enterprise requirements for production workloads.</li>
<li style="font-weight:400;">The acquisition targets organizations that need advanced security features like customer-managed encryption keys and compliance certifications for regulated industries. Crunchy Data brings proven expertise in enterprise Postgres deployments across cloud, Kubernetes, and on-premise environments.</li>
<li style="font-weight:400;">Snowflake Postgres will enable developers to run existing Postgres applications on Snowflake without code rewrites while gaining access to built-in connection pooling, performance metrics, and logging support. This consolidates transactional and analytical workloads in a single platform.</li>
<li style="font-weight:400;">The offering compliments Snowflake’s existing <a href="https://www.snowflake.com/en/product/platform/unistore/">Unistore</a> solution by providing native Postgres compatibility for transactional applications. Early customers like Blue Yonder and Landing AI see opportunities to simplify their application stacks and accelerate AI development.</li>
<li style="font-weight:400;">This move positions Snowflake to capture more enterprise workloads by eliminating the need for separate database management while maintaining full Postgres compatibility. The acquisition is expected to close imminently pending standard closing conditions.</li>
</ul>
<p>19:24  Ryan – “If the data set is presented as a single data source that I can run analytical and transactional workloads against, that would be amazing value to develop on and to simplify the application architecture. So that would be super cool.”</p>
<p>20:33 <a href="https://www.reuters.com/business/retail-consumer/openai-taps-google-unprecedented-cloud-deal-despite-ai-rivalry-sources-say-2025-06-10/">Exclusive: OpenAI taps Google in unprecedented cloud deal despite AI </a><a href="https://www.reuters.com/business/retail-consumer/openai-taps-google-unprecedented-cloud-deal-despite-ai-rivalry-sources-say-2025-06-10/">rivalry, sources say | Reuters</a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/">OpenAI</a> is adding <a href="https://cloud.google.com/">Google Cloud’s infrastructure</a> to its compute resources despite being direct competitors in AI, marking a shift from its exclusive reliance on <a href="https://portal.azure.com/">Microsoft Azure</a> for data center infrastructure since January 2025.</li>
<li style="font-weight:400;">The deal centers on Google’s tensor processing units (TPUs) which were historically reserved for internal use but are now being offered to external customers including Apple, Anthropic, and <a href="https://ssi.inc/">Safe Superintelligence</a>.</li>
<li style="font-weight:400;">OpenAI’s compute demands are driven by both training large language models and running inference at scale, with the company reporting $10 billion in annualized revenue as of June 2025.</li>
<li style="font-weight:400;">This partnership adds to OpenAI’s infrastructure diversification strategy including the $500 billion <a href="https://openai.com/index/announcing-the-stargate-project/">Stargate project</a> with SoftBank and Oracle, plus billions in compute contracts with <a href="https://www.coreweave.com/">CoreWeave</a>.</li>
<li style="font-weight:400;">For cloud providers, the deal demonstrates how AI workloads are reshaping competitive dynamics – Google Cloud generated $43 billion in 2024 revenue and positions itself as a neutral compute provider despite competing directly with customers through <a href="https://deepmind.google/">DeepMind</a>.</li>
</ul>
<p>21:55  Matt – “It also is probably the first true multi-cloud workload that there is out there that they can train across multiple clouds. And if they do it right, they can, in theory, actually leverage spot markets and things like that, which will be interesting to see how they destroy spot markets real fast when they start training everything.”</p>
<p>24:11 <a href="https://mistral.ai/news/magistral">Magistral | Mistral AI</a></p>
<ul>
<li style="font-weight:400;">Mistral AI released Magistral, their first reasoning model available in two versions: <a href="https://huggingface.co/mistralai/Mistral-Small-24B-Instruct-2501">Magistral Small</a> (24B parameters, open source under <a href="https://www.apache.org/licenses/LICENSE-2.0.html">Apache 2.0</a>) and <a href="https://mistral.ai/models">Magistral Medium</a> (enterprise version), with the Medium version scoring 73.6% on AIME2024 benchmarks and 90% with majority voting.</li>
<li style="font-weight:400;">The model introduces transparent, traceable reasoning chains that work natively across multiple languages including English, French, Spanish, German, Italian, Arabic, Russian, and Simplified Chinese, making it suitable for global enterprise deployments requiring auditable AI decisions.</li>
<li style="font-weight:400;">Magistral Medium achieves 10x faster token throughput than competitors through Flash Answers in <a href="https://chat.mistral.ai/chat">Le Chat</a>, enabling real-time reasoning for cloud-based applications in regulated industries, software development, and data engineering workflows.</li>
<li style="font-weight:400;">Enterprise availability includes deployment options on <a href="https://aws.amazon.com/sagemaker">Amazon SageMaker</a> with upcoming support for <a href="https://www.ibm.com/watsonx">IBM WatsonX</a>, <a href="https://azure.microsoft.com/en-us/solutions/ai/">Azure AI</a>, and <a href="https://cloud.google.com/marketplace">Google Cloud Marketplace</a>, positioning it as a multi-cloud solution for businesses needing domain-specific reasoning capabilities.</li>
<li style="font-weight:400;">The open-source Magistral Small enables developers to build custom reasoning applications, with the community already creating specialized models like ether0 for chemistry and <a href="https://huggingface.co/NousResearch/DeepHermes-3-Llama-3-8B-Preview">DeepHermes 3</a>, expanding the ecosystem of thinking language models.</li>
</ul>
<p>25:19  Matt – “The multiple languages Day 1, and the quantity of languages has always impressed me. It’s not like all Latin based languages; but getting Russian and Chinese in there Day 1. They’re different alphabets and completely different speech patterns…and having all of them at once impressed me.”  </p>
<h2>AWS</h2>
<p>26:52 <a href="https://aws.amazon.com/blogs/aws/now-open-aws-asia-pacific-taipei-region/">Now open – AWS Asia Pacific (Taipei) Region | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/?trk=d21a4eb6-d91f-4286-843a-d35b2a06a274&amp;sc_channel=el">AWS</a> launches its 37th global region in Taipei (ap-east-2) with three <a href="https://docs.aws.amazon.com/global-infrastructure/latest/regions/aws-availability-zones.html?trk=d21a4eb6-d91f-4286-843a-d35b2a06a274&amp;sc_channel=el">availability zones</a>,  marking the 15th region in Asia Pacific and bringing the total to 117 availability zones worldwide. </li>
<li style="font-weight:400;">This addresses data residency requirements for Taiwan’s regulated industries including finance and healthcare.</li>
<li style="font-weight:400;">The region builds on AWS’s decade-long presence in Taiwan which includes two <a href="https://docs.aws.amazon.com/AmazonCloudFront/latest/DeveloperGuide/Introduction.html">CloudFront</a> edge locations, three Direct Connect locations, <a href="https://docs.aws.amazon.com/outposts/latest/userguide/what-is-outposts.html">AWS Outposts</a> support, and a Local Zone in Taipei for single-digit millisecond latency applications.</li>
<li style="font-weight:400;">Major Taiwan enterprises are already leveraging AWS including Cathay Financial Holdings for compliance-focused cloud environments, Gamania Group’s Vyin AI platform for celebrity digital identities, and Chunghwa Telecom using Amazon Bedrock for generative AI applications.</li>
<li style="font-weight:400;">AWS has trained over 200,000 people in Taiwan through AWS Academy, AWS Educate, and AWS Skill Builder programs, supporting the local ecosystem that includes 4 AWS Heroes, 17 Community Builders, and Premier Partners like eCloudvalley and Nextlink Technology.</li>
<li style="font-weight:400;">The region supports Taiwan’s 2050 net-zero emissions goal with customers like Ace Energy achieving 65% steam consumption reduction and Taiwan Power Company implementing smart grid technologies with drones and robotics for infrastructure management.</li>
</ul>
<p>32:18 <a href="https://aws.amazon.com/blogs/aws/introducing-aws-api-models-and-publicly-available-resources-for-aws-api-definitions/">Introducing AWS API models and publicly available resources for AWS API </a><a href="https://aws.amazon.com/blogs/aws/introducing-aws-api-models-and-publicly-available-resources-for-aws-api-definitions/">definitions | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;">AWS is now publishing <a href="https://smithy.io/">Smithy API</a> models daily to <a href="https://central.sonatype.com/search?namespace=software.amazon.api.models&amp;sort=name">Maven Central</a> and <a href="https://github.com/aws/api-models-aws">GitHub</a>, providing developers with definitive, up-to-date sources of AWS service interface definitions and behaviors that have been used internally since 2018 to generate AWS SDKs and CLI tools.</li>
<li style="font-weight:400;">Developers can use these models to generate custom<a href="https://docs.aws.amazon.com/sdkref/latest/guide/version-support-matrix.html?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el"> SDKs</a> for unsupported languages, build server stubs for testing, create developer tools like IAM policy generators, or even generate <a href="https://modelcontextprotocol.io/introduction">Model Context Protocol (MCP)</a> server configurations for AI agents.</li>
<li style="font-weight:400;">The repository structure organizes models by service SDK ID and version, with each model containing detailed API contracts including operations, protocols, authentication methods, request/response types, and comprehensive documentation with examples.</li>
<li style="font-weight:400;">This release enables developers to build purpose-built integrations without waiting for official SDK support, particularly valuable for niche programming languages or specialized use cases where existing SDKs don’t meet specific requirements.</li>
<li style="font-weight:400;">The models are available at no cost through the GitHub repository and Maven Central, with Smithy CLI and build tools providing immediate access to code generation capabilities.</li>
</ul>
<p>38:36 <a href="https://aws.amazon.com/blogs/aws/announcing-up-to-45-price-reduction-for-amazon-ec2-nvidia-gpu-accelerated-instances/">Announcing up to 45% price reduction for Amazon EC2 NVIDIA </a><a href="https://aws.amazon.com/blogs/aws/announcing-up-to-45-price-reduction-for-amazon-ec2-nvidia-gpu-accelerated-instances/">GPU-accelerated instances | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;">AWS is reducing prices by up to 45% for <a href="https://aws.amazon.com/blogs/aws/new-amazon-ec2-p6-b200-instances-powered-by-nvidia-blackwell-gpus-to-accelerate-ai-innovations/">NVIDIA GPU-accelerated EC2 instances</a> including P4 (P4d/P4de) and P5 (P5/P5en) families, with On-Demand pricing effective June 1 and Savings Plans pricing after June 4, addressing the industry-wide GPU shortage that has driven up costs for AI workloads.</li>
<li style="font-weight:400;">The price cuts apply across all regions where these instances are available, with AWS expanding at-scale On-Demand capacity to additional regions including Asia Pacific, Europe, and South America, making GPU resources more accessible for distributed AI training and inference workloads.</li>
<li style="font-weight:400;">AWS is now offering the new <a href="https://aws.amazon.com/blogs/aws/new-amazon-ec2-p6-b200-instances-powered-by-nvidia-blackwell-gpus-to-accelerate-ai-innovations/">P6-B200 instances powered by NVIDIA Blackwell GPUs</a> through Savings Plans for large-scale deployments, previously only available through EC2 Capacity Blocks, providing customers with more flexible purchasing options for next-generation GPU compute.</li>
<li style="font-weight:400;">Customers can choose between <a href="https://aws.amazon.com/savingsplans/compute-pricing/">EC2 Instance Savings Plans</a> for the lowest prices on specific instance families in a region, or Compute Savings Plans for maximum flexibility across instance types and regions, with both 1-year and 3-year commitment options.</li>
<li style="font-weight:400;">This pricing reduction represents AWS passing operational efficiencies from scale back to customers, making advanced GPU computing more economically viable for generative AI applications, employee productivity tools, and customer experience improvements.</li>
</ul>
<p>40:02  Ryan- “I took issue with the way that this blog post was written and was just squinting all the way through it because like, it feels like the shortages are lightening up, and so they can offer this – which I like, right, because they are really passing down that savings – and you know, maybe it’s extra capacity. But I don’t think so. I think it’s because the capacity is available that they can, you know, via supply and demand lower the prices for it.”</p>
<p>42:06 <a href="https://aws.amazon.com/about-aws/whats-new/2025/06/open-sourcing-pgactive-active-active-replication-extension-postgresql/">Announcing open sourcing pgactive: active-active replication extension for </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/06/open-sourcing-pgactive-active-active-replication-extension-postgresql/">PostgreSQL – AWS</a></p>
<ul>
<li style="font-weight:400;">AWS open sourced pgactive, a <a href="https://www.postgresql.org/">PostgreSQL</a> extension that enables asynchronous active-active replication between database instances, allowing multiple writers across different regions to maintain data consistency and availability.</li>
<li style="font-weight:400;">The extension builds on PostgreSQL 16’s bidirectional replication features, simplifying management of active-active scenarios for use cases like regional failover, geographic data distribution, and zero-downtime migrations between instances.</li>
<li style="font-weight:400;">This addresses a common PostgreSQL limitation where traditional replication only supports single-writer architectures, making it difficult to achieve true multi-region active deployments without complex third-party solutions.</li>
<li style="font-weight:400;">Organizations can now implement disaster recovery strategies with multiple active database instances, reducing recovery time objectives (RTO) and enabling seamless traffic switching during maintenance or outages.</li>
<li style="font-weight:400;">The open source release on <a href="https://github.com/aws/pgactive">GitHub</a> allows community collaboration on improving PostgreSQL’s active-active capabilities while providing AWS customers with a supported path for multi-writer database architectures without vendor lock-in.</li>
</ul>
<p>43:49  Justin – “It’s also interesting that they announced this just after Snowflake announced the purchase of CrunchyData – which I believe also offered an active-active solution; as well as there are a couple other commercial versions that you can buy for lots of money. So interesting as well on that part.”</p>
<p>45:59  <a href="https://aws.amazon.com/about-aws/whats-new/2025/06/aws-network-firewall-monitoring-dashboard/">AWS Network Firewall launches new monitoring dashboard – AWS</a></p>
<ul>
<li style="font-weight:400;">AWS Network Firewall now includes a built-in monitoring dashboard that provides visibility into network traffic patterns, including top traffic flows, TLS SNI, and HTTP Host headers without additional charges beyond standard <a href="https://docs.aws.amazon.com/AmazonCloudWatch/latest/monitoring/WhatIsCloudWatch.html">CloudWatch</a> and <a href="https://docs.aws.amazon.com/athena/latest/ug/what-is.html">Athena</a> costs.</li>
<li style="font-weight:400;">The dashboard helps identify long-lived TCP connections and failed TCP handshakes, making it easier to troubleshoot network issues and spot potential security concerns that previously required manual log analysis.</li>
<li style="font-weight:400;">This addresses a common pain point where customers had to build custom dashboards or use third-party tools to visualize Network Firewall data, now providing out-of-the-box insights for faster incident response.</li>
<li style="font-weight:400;">Setup requires enabling <a href="https://docs.aws.amazon.com/vpc/latest/userguide/flow-logs.html">Flow logs</a> and <a href="https://docs.aws.amazon.com/AmazonCloudWatch/latest/monitoring/Alarm-On-Logs.html">Alert logs</a> in Network Firewall, then activating the monitoring dashboard – a straightforward process that immediately provides actionable network intelligence.</li>
<li style="font-weight:400;">Available in all AWS Network Firewall regions, this feature strengthens AWS’s network security observability story alongside services like VPC Flow Logs and Traffic Mirroring.</li>
</ul>
<p>47:09  Matt – “I feel like 50% of the time I get it (Athena) to work, and the other 50% of the time I just swear at it and walk away.”</p>
<p>50:04 <a href="https://aws.amazon.com/about-aws/whats-new/2025/06/aws-site-to-site-vpn-three-capabilities-enhanced-security/">AWS Site-to-Site VPN introduces three new capabilities for enhanced </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/06/aws-site-to-site-vpn-three-capabilities-enhanced-security/">security – AWS</a></p>
<ul>
<li style="font-weight:400;">AWS Site-to-Site VPN now integrates with <a href="https://aws.amazon.com/secrets-manager/features/">Secrets Manager</a> to automatically redact pre-shared keys in API responses, displaying only the ARN instead of exposing sensitive credentials.</li>
<li style="font-weight:400;">The new GetActiveVpnTunnelStatus API eliminates the need to enable VPN logs just to track negotiated security parameters like IKE version, DH groups, and encryption algorithms, reducing operational overhead.</li>
<li style="font-weight:400;">AWS added a recommended parameter to the GetVpnConnectionDeviceSampleConfiguration API that automatically configures best-practice security settings including IKE v2, DH group 20, SHA-384, and AES-GCM-256.</li>
<li style="font-weight:400;">These security enhancements come at no additional cost and address common VPN configuration challenges where customers often struggle with selecting appropriate cryptographic parameters or accidentally expose PSKs in logs.</li>
<li style="font-weight:400;">The features are available in all commercial AWS regions except Europe (Milan – we’re not sure who you ticked off), making it easier for enterprises to maintain secure hybrid connectivity without manual security configuration complexity.</li>
<li style="font-weight:400;">The only thing we have to say here is THANK YOU. </li>
</ul>
<h2>GCP</h2>
<p>53:08 <a href="https://cloud.google.com/blog/products/data-analytics/pub-sub-single-message-transforms/">Pub/Sub single message transforms | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google Pub/Sub now supports JavaScript User-Defined Functions (UDFs) for in-stream message transformations, eliminating the need for separate services like Dataflow or Cloud Run for simple data modifications. </li>
<li style="font-weight:400;">This reduces latency and operational overhead for common tasks like format conversion, PII redaction, and data filtering.</li>
<li style="font-weight:400;">The feature allows up to five JavaScript transforms per topic or subscription, with transformations happening directly within Pub/Sub before message persistence or delivery. </li>
<li style="font-weight:400;">This positions GCP competitively against <a href="https://docs.aws.amazon.com/eventbridge/latest/userguide/eb-what-is.html">AWS EventBridge’s</a> input transformers and <a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=luLm4NVRlM3BbzVUw1_IKm5WIeQjPnxwebPyNT6sUGFGWaOG3BhHqBFJyDhHn4_lHqlDFPyHTf5Yw5MbU3bFFwlJj_S2Zb6Llhygk74hNnA9KQIghPYeEd4f_k0XQXPb.MPcmk527sf33UgHEZuXacQ&amp;eddgt=3ovS37bgdw3B0MxIFOx1hw%3D%3D&amp;rut=6d96e2b4e39aca372d130268cf3049adc94df8177e1960c584d847db51a17937&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8zJrtlLxiV0zBmlSTak7Q-jVUCUx59QU2iwRBuIfQ5RnE46VZIeyIaLLvS8Ld0Fo1VSFcpbv-uaFPWfrtMQIPazYCAF9gxmt5NneSG2Dfn_LKkRsNK_4iOjYHML_DFe1XymrNud5ZfD-0ik_OiTS_Msdylh8ikoPyXBg0q1eklkuDSukE94hFrwMoNpC-83jCKNFvyMLTABYZOpx1AO-Fbklr_VTJzflnhvws-5jYOwpUSw6t7jm8IphRogeZ8iqmec3LsubOvoKkZwFfy9NMQki1TICMJZTY72SeKmREgVKxIwMVMYkEF5k3nT5A5PdJw9mqfmEuUkCoA8_30vO-iUwK3Pi1l_5crYRnXK-ZQN02V465CNXSGCd5W7tff6vKVSZMjcN6wRWeQIyuzbp3BuH7gC8stsGODepm7kw7AEm4nQUZ9G_HOhGISw_wh-svctJ4KFlVlZA0L2dpcCgxkoc9Yobgv0hfBGFeUA9CIL1dj-KtRTipbxu7clxVPP2_Aatc-SCSGHvXesADVHB3OKNehlkXtvcZKfFjBqQ8seIL2BMJyJ4wka7QfT34P3K6qsMTpwrOQuuZPDNz8GZjikXdQ0TRwrIT6X8YMls7e2ifFDkQz80qONeln2VPCGFG5FZft3OlG2Qn1lwBiWrt1nEay_QqVw5aFI1yZQmbj_1KPDdLw4L7JbfXmnic6ebL3ogGS0ErBmRnDa_VTQvstsf96Zrl7smDyzfkFgXG5ca6mwTYt3NEAT7p_QNLT3gOpxcnWg%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0MzklMjZjYW1wJTNkMTY4ODA3JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDE5MjElMjZjcml0ZXJpYWlkJTNka3dkLTc5MTY1NDQzNTI2NzIxJTNhbG9jLTE5MCUyNmNhbXBhaWduaWQlM2Q1OTAyMzczNTglMjZsb2NwaHklM2Q3ODQ2OCUyNmFkZ3JvdXBpZCUzZDEyNjY2MzkwNDc1NTkwNzglMjZjaWQlM2Q3OTE2NTA0NTM1MTQyMCUyNmtkdiUzZGMlMjZrZXh0JTNkJTI2a3BnJTNkJTI2a3BpZCUzZCUyNnF1ZXJ5U3RyJTNkYXp1cmUlMjUyMHNlcnZpY2UlMjUyMGJ1cyUyNnVybCUzZGh0dHBzJTNhJTJmJTJmYXp1cmUubWljcm9zb2Z0LmNvbSUyZmVuLXVzJTJmcHJvZHVjdHMlMmZzZXJ2aWNlLWJ1cyUyZiUzZmVmX2lkJTNkX2tfNzEwNzI5YjkxNDc5MTBmY2Q5NjM0MWViNTkxMjYyMWZfa18lMjZPQ0lEJTNkQUlEY21tNWVkc3dkdXVfU0VNX19rXzcxMDcyOWI5MTQ3OTEwZmNkOTYzNDFlYjU5MTI2MjFmX2tfJTI2bXNjbGtpZCUzZDcxMDcyOWI5MTQ3OTEwZmNkOTYzNDFlYjU5MTI2MjFm%26rlid%3D710729b9147910fcd96341eb5912621f&amp;vqd=4-194594676086025665619951841463838518274&amp;iurl=%7B1%7DIG%3D99610863AE294D57BA4E24A6A1771477%26CID%3D1E532126CA1268963BBA3729CBA769EA%26ID%3DDevEx%2C5048.1">Azure Service Bus’s</a> message enrichment capabilities.</li>
<li style="font-weight:400;">Key use cases include data masking for compliance, format conversion for multi-system integration, and enhanced filtering based on message content rather than just attributes. Industries handling sensitive data like healthcare and finance will benefit from built-in PII redaction capabilities.</li>
<li style="font-weight:400;">The service integrates seamlessly with existing Pub/Sub features like Import Topics and <a href="https://cloud.google.com/pubsub/docs/subscriber#export_subscription">Export Subscriptions</a>, continuing Google’s strategy of simplifying streaming architectures. Additional transforms including schema validation and AI inference are planned for future releases.</li>
<li style="font-weight:400;">Available now in GA through the <a href="https://cloud.google.com/cloud-console">Google Cloud console</a> and gcloud CLI with standard Pub/Sub pricing applying to transformed messages. </li>
<li style="font-weight:400;">The JavaScript runtime limitations and performance characteristics aren’t specified, which may be important for latency-sensitive applications.</li>
</ul>
<p>54:19  Ryan – “…the fact that this happens before persistence layer is key, right? Because it’s difficult to undo anything you introduce once that happens. so be careful. Test well.”</p>
<p>55:34 <a href="https://cloud.google.com/blog/products/compute/m4-vms-are-designed-for-memory-intensive-workloads-like-sap/">M4 VMs are designed for memory-intensive workloads like SAP | Google </a><a href="https://cloud.google.com/blog/products/compute/m4-vms-are-designed-for-memory-intensive-workloads-like-sap/">Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google Cloud launches <a href="https://cloud.google.com/compute/docs/memory-optimized-machines">M4</a> VMs with up to 224 vCPUs and 6TB of DDR5 memory, targeting memory-intensive workloads like SAP HANA and SQL Server with 66% better price-performance than the previous <a href="https://cloud.google.com/compute/docs/memory-optimized-machines">M3</a> generation and full SAP certification across all shapes.</li>
<li style="font-weight:400;">Built on Intel’s 5th gen Xeon processors (Emerald Rapids), M4 offers two memory-to-vCPU ratios (13.3:1 and 26.6:1) and delivers up to 2.25x more SAPs compared to M3, making it the first memory-optimized instance among hyperscalers to use these processors.</li>
<li style="font-weight:400;">M4 leverages Google’s <a href="https://cloud.google.com/titanium">Titanium</a> offload technology for <a href="https://cloud.google.com/compute/docs/memory-optimized-machines#m4_machine_types">200 Gb/s</a> networking bandwidth and integrates with <a href="https://cloud.google.com/compute/docs/disks/hyperdisks">Hyperdisk storage</a> supporting up to 500K IOPS and <a href="https://cloud.google.com/compute/docs/disks/hd-types/hyperdisk-extreme#achieve-higher-performance-with-multiple-hyperdisk-extreme-volumes">10,000 MiB/s</a> throughput, with dynamic tuning capabilities and <a href="https://cloud.google.com/blog/products/storage-data-transfer/hyperdisk-storage-pools-is-now-generally-available">storage pooling</a> for cost optimization.</li>
<li style="font-weight:400;">The instances are backed by a 99.95% Single Instance SLA and support hitless upgrades and live migration for minimal disruption during maintenance, with initial availability in five regions (us-east4, europe-west4, europe-west3, us-central1).</li>
<li style="font-weight:400;">M4 completes Google’s memory-optimized portfolio alongside <a href="https://cloud.google.com/solutions/sap/docs/certifications-sap-apps">X4</a> (up to 32TB memory), positioning GCP competitively for large-scale in-memory databases and analytics workloads with both on-demand and committed use discount pricing options. </li>
</ul>
<p>1:00:32 <a href="https://cloud.google.com/blog/products/ai-machine-learning/deploying-llama4-and-deepseek-on-ai-hypercomputer/">Deploying Llama4 and DeepSeek on AI Hypercomputer | Google Cloud </a><a href="https://cloud.google.com/blog/products/ai-machine-learning/deploying-llama4-and-deepseek-on-ai-hypercomputer/">Blog</a></p>
<ul>
<li style="font-weight:400;">Google Cloud releases optimized deployment recipes for Meta’s Llama4 (Scout 17B-16E and Maverick 17B-128E) and DeepSeek’s V3/R1 models on AI <a href="https://cloud.google.com/solutions/ai-hypercomputer">Hypercomputer</a>, providing step-by-step guides for running these open-source LLMs on <a href="https://cloud.google.com/blog/products/compute/trillium-tpu-is-ga?e=48754805">Trillium</a> TPUs and A3 Mega/Ultra GPUs.</li>
<li style="font-weight:400;">The recipes leverage <a href="https://github.com/AI-Hypercomputer/JetStream">JetStream</a> for TPU inference and <a href="https://github.com/vllm-project/vllm">vLLM</a>/SGLang for GPU deployments, with <a href="https://cloud.google.com/ai-hypercomputer/docs/workloads/pathways-on-cloud/pathways-intro">Pathways</a> enabling multi-host serving across TPU slices – the same system Google uses internally for Gemini model training and serving.</li>
<li style="font-weight:400;"><a href="https://github.com/AI-Hypercomputer/maxtext">MaxText</a> now includes architectural innovations from <a href="https://www.deepseek.com/en">DeepSeek</a> like Multi-Head Latent Attention, MoE Shared/Routed Experts, and YARN RoPE embeddings, allowing developers to experiment with these newer model architectures on Google Cloud infrastructure.</li>
<li style="font-weight:400;">These deployment options target enterprises needing to run large open models on-premises or in their own cloud environments, competing directly with <a href="https://docs.aws.amazon.com/sagemaker/latest/dg/whatis.html">AWS SageMaker</a> and Azure ML’s model hosting capabilities while leveraging Google’s TPU advantage.</li>
<li style="font-weight:400;">The <a href="https://github.com/AI-Hypercomputer">GitHub recipes</a> provide complete workflows including model weight downloads, checkpoint conversion, server deployment, and benchmarking scripts, reducing the typical deployment complexity from days to hours for these multi-billion parameter models.</li>
</ul>
<p>1:01:23  Matt – “I think you’re making up half these words.”</p>
<p>1:02:23 <a href="https://cloud.google.com/blog/products/data-analytics/understanding-updates-to-bigquery-workload-management/">Understanding updates to BigQuery workload management | Google </a><a href="https://cloud.google.com/blog/products/data-analytics/understanding-updates-to-bigquery-workload-management/">Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">BigQuery introduces <a href="https://cloud.google.com/bigquery/docs/slots#fairness">reservation fairness</a> and predictability features that allow organizations to set absolute maximum slot consumption limits and distribute idle capacity equally across reservations rather than projects, providing more granular control over resource allocation and costs in Enterprise editions.</li>
<li style="font-weight:400;">The new runtime reservation specification feature enables users to override default reservation assignments via CLI, UI, SQL, or API at query execution time, with role-based access controls for improved security and flexibility in multi-team environments.</li>
<li style="font-weight:400;">Autoscaler improvements deliver 50-slot increment granularity (down from 100), near-instant scale up, and faster scale down capabilities, allowing more responsive resource adjustments to workload demands compared to previous iterations.</li>
<li style="font-weight:400;">Reservation labels now integrate with Cloud Billing data for the Analysis Slots Attribution SKU, enabling detailed cost tracking and optimization by workload or team, addressing a common enterprise requirement for chargeback and showback scenarios.</li>
<li style="font-weight:400;">These updates position BigQuery’s workload management closer to dedicated resource pools found in Snowflake’s multi-cluster warehouses or AWS Redshift’s workload management queues, but with more dynamic allocation options suited for variable analytics workloads.</li>
</ul>
<p>1:03:31  Justin – “If you’re going to use reservation fairness and you’re not going to honor the project boundary, I will cut you – Ryan – when you take my BigQuery slots.”</p>
<p>1:07:16 <a href="https://cloud.google.com/blog/products/identity-security/enhancing-protection-4-new-security-command-center-capabilities/">Enhancing protection: 4 new Security Command Center capabilities | </a><a href="https://cloud.google.com/blog/products/identity-security/enhancing-protection-4-new-security-command-center-capabilities/">Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/security/products/security-command-center">Security Command Center</a> now offers agentless vulnerability scanning for Compute Engine and GKE at no additional charge, eliminating the need to deploy and manage scanning agents on each asset while providing coverage even for unauthorized VMs provisioned by adversaries.</li>
<li style="font-weight:400;">Container image vulnerability scanning is now integrated through <a href="https://cloud.google.com/artifact-registry/docs/analysis">Artifact Analysis</a>, with scans included at no extra cost for SCC Enterprise customers when images are deployed to <a href="https://cloud.google.com/kubernetes-engine">GKE</a>, Cloud Run, or App Engine, consolidating security findings in one dashboard.</li>
<li style="font-weight:400;"><a href="https://cloud.google.com/run">Cloud Run</a> threat detection introduces 16 specialized detectors that analyze serverless deployments for malicious activities, including behavioral analysis, NLP-powered code analysis, and control plane monitoring – capabilities not available in third-party products.</li>
<li style="font-weight:400;">SCC automatically detects connections to known malicious IPs by analyzing internal network traffic without requiring customers to purchase, ingest, and analyze<a href="https://docs.aws.amazon.com/vpc/latest/userguide/flow-logs.html"> VPC Flow Logs</a> separately, unlike third-party security tools that charge extra for this capability.</li>
<li style="font-weight:400;">All four capabilities leverage Google’s first-party access to infrastructure data and <a href="https://cloud.google.com/blog/products/identity-security/introducing-google-threat-intelligence-actionable-threat-intelligence-at-google-scale-at-rsa/">Google Threat Intelligence</a>, providing deeper visibility than API-based third-party tools while respecting data residency boundaries established by customers.</li>
</ul>
<p>1:10:31 <a href="https://cloud.google.com/blog/products/ai-machine-learning/new-mcp-integrations-to-google-cloud-databases/">New MCP integrations to Google Cloud Databases | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google’s <a href="https://cloud.google.com/blog/products/ai-machine-learning/mcp-toolbox-for-databases-now-supports-model-context-protocol?e=48754805">MCP Toolbox</a> now enables AI coding assistants like <a href="https://docs.anthropic.com/en/docs/claude-code/overview">Claude Code</a>, <a href="https://www.cursor.com/">Cursor</a>, and <a href="https://windsurf.com/">Windsurf</a> to directly query and modify Google Cloud databases including <a href="https://cloud.google.com/sql?hl=en">Cloud SQL</a>, <a href="https://cloud.google.com/alloydb/docs/overview">AlloyDB</a>, Spanner, and BigQuery through natural language commands in your IDE.</li>
<li style="font-weight:400;">Developers can skip writing complex SQL queries and instead use plain English to explore database schemas, create tables, modify structures, and generate test data – tasks that previously took hours or days can now be completed in minutes.</li>
<li style="font-weight:400;">The tool implements Anthropic’s <a href="https://www.anthropic.com/news/model-context-protocol">Model Context Protocol (MCP)</a>, an emerging open standard that replaces fragmented custom integrations between AI systems and data sources with a unified protocol approach.</li>
<li style="font-weight:400;">This positions Google competitively against <a href="https://aws.amazon.com/blogs/aws/amazon-codewhisperer-free-for-individual-use-is-now-generally-available/">AWS CodeWhisperer</a> and <a href="https://github.com/features/copilot">GitHub Copilot</a> by offering deeper database integration capabilities, though those services don’t yet support direct database manipulation through natural language.</li>
<li style="font-weight:400;">Key use cases include onboarding new developers, rapid prototyping, schema refactoring, and automated test generation – particularly valuable for e-commerce, SaaS, and enterprise applications with complex data models.</li>
</ul>
<p>1:12:33 <a href="https://cloud.google.com/blog/topics/partners/datadog-integrates-google-cloud-ai/">Datadog integrates Google Cloud AI | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Datadog now monitors Google’s Vertex AI Agent Engine through its new <a href="https://www.datadoghq.com/blog/dash-2025-new-feature-roundup-keynote">AI Agents Console</a>, providing unified visibility into autonomous agents’ actions, permissions, and business impact across third-party and Google-orchestrated agents.</li>
<li style="font-weight:400;">The integration covers the full AI stack on Google Cloud: application layer (AI agents), model layer (Gemini and Vertex AI LLMs with auto-instrumentation), infrastructure layer (Cloud TPU monitoring), and data layer (expanded BigQuery monitoring for cost optimization).</li>
<li style="font-weight:400;">Datadog has implemented Google Cloud’s Active Metrics APIs to reduce monitoring costs by only calling APIs when new data exists, complementing their Private Service Connect support to minimize data transfer expenses.</li>
<li style="font-weight:400;">The expanded BigQuery monitoring helps teams identify top spenders, slow queries, and failed jobs while flagging data quality issues – addressing a key pain point for organizations using BigQuery for AI data insights.</li>
<li style="font-weight:400;">Customers can purchase Datadog directly through Google Cloud Marketplace with deployment in minutes, making it straightforward for GCP users to add comprehensive AI observability to their existing infrastructure.</li>
</ul>
<p>1:13:52  Justin – “Datadog only has some of the responsibility. A lot of it is because of all of these managed monitoring solutions, it’s what you send to it. And they’re just charging by ingestion rates. And so if you’re in control of your data, your spend is not going crazy big.”</p>
<p>1:15:27 <a href="https://cloud.google.com/blog/products/data-analytics/introducing-google-cloud-serverless-for-apache-spark-in-bigquery/">Introducing Google Cloud Serverless for Apache Spark in BigQuery | </a><a href="https://cloud.google.com/blog/products/data-analytics/introducing-google-cloud-serverless-for-apache-spark-in-bigquery/">Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/products/serverless-spark">Google Cloud Serverless for Apache Spark</a> is now generally available within <a href="https://cloud.google.com/bigquery">BigQuery</a>, eliminating cluster management overhead and charging only for job runtime rather than idle infrastructure. </li>
<li style="font-weight:400;">This integration provides a unified developer experience in BigQuery Studio with seamless interoperability between Spark and BigQuery SQL engines on the same data.</li>
<li style="font-weight:400;">The service includes <a href="https://cloud.google.com/products/lightning-engine">Lightning Engine</a> (in Preview) which delivers up to 3.6x faster query performance through vectorized execution and intelligent caching. Pre-packaged ML libraries like PyTorch and Transformers come standard with Google-certified Spark images, plus GPU acceleration support for distributed AI workloads.</li>
<li style="font-weight:400;"><a href="https://www.biglakedata.com/">BigLake metastore</a> enables Spark and BigQuery to operate on a single copy of data whether in BigQuery managed tables or open formats like <a href="https://iceberg.apache.org/">Apache Iceberg</a> and <a href="https://delta.io/">Delta Lake</a>. All data access is unified through the BigQuery Storage Read API with no additional cost for reads from serverless Spark jobs.</li>
<li style="font-weight:400;">BigQuery spend-based CUDs now apply to serverless Spark usage, and the service supports full OSS compatibility with existing Spark code across Python, Java, Scala, and R. Enterprise features include job isolation, CMEK encryption, custom org policies, and end-user credential support for data access traceability.</li>
<li style="font-weight:400;"><a href="https://cloud.google.com/bigquery/docs/use-spark#write-spark-code-with-gemini-code-assist">Gemini-powered features include PySpark</a> code generation with data context awareness and <a href="https://cloud.google.com/dataproc-serverless/docs/guides/monitor-troubleshoot-batches#gemini_cloud_assist-preview">Cloud Assist</a> for troubleshooting recommendations (both in Preview). </li>
<li style="font-weight:400;">The service integrates with BigQuery Pipelines and Schedules for orchestration, plus supports Apache Airflow/Cloud Composer operators for deployment.</li>
</ul>
<h2>Azure</h2>
<p>1:17:56 <a href="https://azure.microsoft.com/en-us/blog/enhance-ai-security-with-azure-prompt-shields-and-azure-ai-content-safety/">Enhance AI security with Azure Prompt Shields and Azure AI Content </a><a href="https://azure.microsoft.com/en-us/blog/enhance-ai-security-with-azure-prompt-shields-and-azure-ai-content-safety/">Safety | Microsoft Azure Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/azure/ai-services/content-safety/concepts/jailbreak-detection">Azure Prompt Shields</a> provides real-time protection against prompt injection attacks, which OWASP identifies as the top threat to LLMs, by analyzing inputs to detect both direct jailbreak attempts and indirect attacks embedded in documents or emails.</li>
<li style="font-weight:400;">The service integrates directly with <a href="https://learn.microsoft.com/azure/ai-services/openai/concepts/content-filter">Azure OpenAI content filters</a> and Azure AI Foundry, offering contextual awareness to reduce false positives and a new Spotlighting capability that distinguishes between trusted and untrusted inputs in generative AI applications.</li>
<li style="font-weight:400;"><a href="https://www.microsoft.com/en-us/security/blog/2025/05/19/microsoft-extends-zero-trust-to-secure-the-agentic-workforce/">Microsoft Defender now integrates with Azure AI Foundry</a> to surface AI security recommendations and runtime threat alerts directly in the development environment, helping developers identify prompt injection risks early in the development process.</li>
<li style="font-weight:400;">Enterprise customers like AXA and Wrtn Technologies are using <a href="https://learn.microsoft.com/azure/ai-services/content-safety/quickstart-jailbreak">Prompt Shield</a>s to secure their AI deployments, with AXA preventing prompt injection in their Secure GPT solution and Wrtn leveraging customizable content filters for their Korean AI companion platform.</li>
<li style="font-weight:400;">Azure OpenAI customers can enable Prompt Shields through built-in content filters while <a href="https://azure.microsoft.com/en-us/products/ai-services/ai-content-safety/">Azure AI Content Safety</a> customers can activate it for non-OpenAI models.</li>
</ul>
<p>1:19:12  Ryan – “These types of tools are invaluable, right? AI is such a changing landscape, if you’re writing an AI app or taking inputs from a customer…responsible AI is built into all the larger models, but if you’re trying to use a custom model..having this is super key to protecting yourself.” </p>
<p>1:21:27 <a href="https://techcommunity.microsoft.com/blog/appsonazureblog/announcing-azure-command-launcher-for-java/4420278">Announcing Azure Command Launcher for Java | Microsoft Community </a><a href="https://techcommunity.microsoft.com/blog/appsonazureblog/announcing-azure-command-launcher-for-java/4420278">Hub</a></p>
<ul>
<li style="font-weight:400;">Microsoft introduces <a href="https://learn.microsoft.com/en-us/java/jaz/overview">jaz</a>, a JVM launcher that automatically optimizes Java applications for Azure cloud environments by detecting container limits and selecting appropriate heap sizing, garbage collection, and diagnostic settings without manual configuration.</li>
<li style="font-weight:400;">The tool addresses a significant problem where over 30% of developers deploy Java workloads with default <a href="https://openjdk.org/index.html">OpenJDK</a> settings that are too conservative for cloud environments, leading to underutilized resources and higher operational costs.</li>
<li style="font-weight:400;">Currently in private preview for Linux containers using <a href="https://build.microsoft.com/en-US/home">Microsoft Build</a> of OpenJDK and Eclipse Temurin (Java 8), jaz simplifies deployment by replacing complex JAVA_OPTS configurations with a single command: jaz -jar myapp.jar.</li>
<li style="font-weight:400;">The roadmap includes AppCDS support for improved startup times, future <a href="https://openjdk.org/projects/leyden/">Project Leyden</a> integration, and continuous tuning capabilities with Prometheus telemetry sharing, positioning it as a cloud-native alternative to manual JVM tuning or tools like Paketo Buildpacks.</li>
<li style="font-weight:400;">Target users include developers deploying Spring Boot, Quarkus, or Micronaut microservices on Azure Container Apps, AKS, Azure Red Hat OpenShift, or Azure VMs who want better performance without deep JVM expertise.</li>
</ul>
<p>1:23:56  Matt – “It just feels like these things should be things out of the box at this point. And then you could tweak them if you want to override them, not default to 128 or 256. And then you’re like, I have a 20 terabyte RAM system. Why am I using 250 megabytes? Hey, by the way, the AI that earlier from FinOps will tell you to scale down, which would be good for you.”</p>
<h2>Cloud Journey</h2>
<p>1:25:17 <a href="https://open.substack.com/pub/willkelly/p/the-coming-downfall-of-the-cloud?r=2tz8h&amp;utm_campaign=post&amp;utm_medium=web&amp;showWelcomeOnShare=false">The coming downfall of the cloud FinOps tools market and who falls </a><a href="https://open.substack.com/pub/willkelly/p/the-coming-downfall-of-the-cloud?r=2tz8h&amp;utm_campaign=post&amp;utm_medium=web&amp;showWelcomeOnShare=false">first</a></p>
<ul>
<li style="font-weight:400;">Blog Author: Will Kelly</li>
<li style="font-weight:400;">The FinOps tools market is heading for a massive shakeout by 2027, with native cloud provider tools like AWS Cost Explorer and Azure Cost Management finally catching up to third-party vendors by offering free, built-in features like tagging enforcement, anomaly detection, and savings plan recommendations that used to be the bread and butter of standalone FinOps platforms.</li>
<li style="font-weight:400;">AI is fundamentally changing the game by automating what FinOps vendors used to charge premium prices for – instead of manually reviewing cost anomalies or building reservation coverage charts, AI can now generate and execute optimization plans in real-time, making dashboard-only tools look like expensive relics from a bygone era.</li>
<li style="font-weight:400;">The article calls out specific vendors who are in trouble, including Kion’s desperate pivot to partner with ProsperOps for Kubernetes visibility after years of chasing SEO and compliance messaging instead of focusing on actual cost optimization, and Apptio Cloudability, which despite IBM’s backing, remains bloated and tied to legacy enterprise reporting models.</li>
<li style="font-weight:400;">There’s a brutal reality check for vendors disguising managed services as SaaS platforms – companies like CloudKeeper that promise “guaranteed savings” but are really just offshored analysts preparing manual reports behind a sleek UI, charging enterprise SaaS prices for what amounts to templated spreadsheets and consulting work.</li>
<li style="font-weight:400;">The lack of deep cloud provider alignment is becoming a death sentence for FinOps vendors, as enterprises increasingly want tools that integrate directly with their CSP contracts, procurement flows, and Enterprise Discount Programs – if you’re not in the AWS, Azure, or GCP marketplaces with proper billing integration, you’re essentially invisible to enterprise buyers.</li>
<li style="font-weight:400;">By 2027, the author predicts only full-stack automation platforms that embed into CI/CD pipelines, Kubernetes orchestration, and finance workflows will survive, while dashboard-only tools, fake SaaS platforms, and vendors who confused blog traffic for product-market fit will be consolidated, acqui-hired, or simply shut down.</li>
<li style="font-weight:400;">The market saturation has reached a breaking point where every vendor pitches the same “visibility, optimization, savings” story, and budget-conscious buyers are exhausted by the sameness – there’s simply no room left for “just another dashboard” in an increasingly commoditized market.</li>
<li style="font-weight:400;">This consolidation might actually be good for customers who are tired of paying for expensive tools that generate pretty charts but don’t actually reduce their cloud bills – the survivors will be forced to deliver real, automated value rather than just insights and recommendations that require manual implementation.</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2069445/c1e-k5d5sgj169b2n91p-8dr2g7z5f0k9-gtyvky.mp3" length="127550176"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 308 of The Cloud Pod – where the forecast is always cloudy! Justin, Matt and Ryan are in the house today to tell us all about the latest and greatest from FinOps and SnowFlake conferences, plus updates from Security Command Center, OpenAI, and even a new AWS Region. All this and more, today in the cloud! 
Titles we almost went with this week:

I Left My Wallet at FinOps X, But Found Savings at Snowflake Summit
Snowflake City Lights, FinOps by the Sea
The Two Summits: A Tale of FinOps and Snowflakes
Crunchy on the Outside, Snowflake on the Inside 
AWS Taipei: Because Sometimes You Need Your Data Closer Than Your Night Market 
AWS Plants Its Flag in Taipei: The 37th Time’s the Charm
AWS Slashes GPU Prices Faster Than a CUDA Kernel
Two Writers Walk Into a Database… And Both Succeed
AWS Network Firewall: Now With Windows!
The VPN Connection That Keeps Its Secrets
Transform and Roll Out: Pub/Sub’s New Single Message Feature
SAP Happens: Google’s New M4 VMs Handle It Better
Total Recall: Google’s 6TB Memory Machines
The M4trix Has You (And Your In-Memory Databases)
DeepSeek and You Shall Find… on Google Cloud
Four Score and Seven Vulnerabilities Ago – mk
The Fantastic Four Security Features
MCP: Model Context Protocol or Master Control Program from Tron?
No SQL? No Problem! AI Takes the Wheel
Injection Rejection: How Azure Keeps Your Prompts Clean

General News 
05:09 FinOps X 2025 Cloud Announcements: AI Agents  and Increased FOCUS Support

All major cloud providers announced expanded support for FOCUS (FinOps Open Cost and Usage Specification) 1.0, with AWS already in general availability and Google Cloud launching a BigQuery export in private preview. 
This signals an industry-wide standardization of cloud cost reporting formats.
AWS introduced AI-powered cost optimization through Amazon Q Developer integration with Cost Optimization Hub, enabling automated recommendations across millions of resources with detailed explanations and action plans for cost reduction.
Microsoft Azure launched AI agents for application modernization that can reduce migration efforts from months to hours by automating code assessment and remediation across thousands of files, while also introducing flexible PTU reservations that work across multiple AI models.
Google Cloud unveiled FinOps Hub 2.0 with Gemini-powered waste detection that identifies underutilized resources (like VMs at 5% usage) and provides AI-generated optimization recommendations for Kubernetes, Cloud Run, and Cloud SQL services.
Oracle Cloud Infrastructure added carbon emissions reporting with hourly power-based calculations and GHGP compliance, plus new cost anomaly detection and rules-based cost allocation features for improved financial governance.

06:11  Justin – “I mean, if I’m modernizing my application, typically it’s off .NET and Azure, but ok…” 
07:20 Broadcom reboots CloudHealt...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2069445/c1a-k5d5-25n63gxvskp6-hjqhfj.jpg"></itunes:image>
                                                                            <itunes:duration>01:46:18</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2069445/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[307: The AI Assistant That Finally Understands Your Kubernetes Cluster (We are Doomed)]]>
                </title>
                <pubDate>Fri, 13 Jun 2025 17:19:16 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2064879</guid>
                                    <link>https://tcpfm.castos.com/episodes/307-the-ai-assistant-that-finally-understands-your-kubernetes-cluster-we-are-doomed</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 307 of The Cloud Pod – where the forecast is always cloudy! Who else is at a conference? Justin is coming to us this week from sunny San Diego where he’s attending FinOps – so we have that news to look forward to for next week. Matt and Ryan are also on hand today to share the latest news from Kubernetes, Salesforce acquisitions, and the strange case of Azure making AWS more cost effective.</p>
<h3>Titles we almost went with this week:</h3>
<ul>
<li>The Great Redis Escape: One Year Later, Valkey is Living Its Best Life</li>
<li>Cache Me If You Can: How Valkey Outran Redis’s License Policies</li>
<li>Tier Today, Gone Tomorrow: AWS’s New Storage Class That Moves Your Data So  </li>
<li>      You Don’t </li>
<li>Hey AI, Deploy My App: AWS Makes It Actually Work</li>
<li>AWS Finally Calculates What You’ll Actually Pay</li>
<li>The Price is Right: AWS Edition</li>
<li>From List Price to Real Price: AWS Gets Transparent</li>
<li>Red Hat and AWS Sitting in a Tree, R-H-E-L-I-N-G</li>
<li>Dockerfile? More Like Dockefile-It-For-Me with Amazon’s New MCP Server</li>
<li>Elementary, My Dear Watson: Amazon Q Becomes Sherlock Holmes for AWS</li>
<li>CUD You Believe It? Red Hat Gets the Discount Treatment</li>
<li>Committed Relationship Status: It’s Complicated (But 20% Cheaper)</li>
<li>RHEL Yeah! Google Drops Prices on Enterprise Linux</li>
<li>Disk Today, Gone Tomorrow: Azure’s Vanishing OS Storage</li>
<li>ATL1: Where GPUs Meet Sweet Tea and Southern Hospitality</li>
<li>AWS Launches Operation Cloud Sovereignty</li>
<li>The Great Firewall of Europe: AWS Edition</li>
<li>Amazon Builds a GDPR Fortress in Germany</li>
</ul>
<h2>General News </h2>
<p>01:46 <a href="https://venturebeat.com/data-infrastructure/what-salesforces-8b-acquisition-of-informatica-means-for-enterprise-data-and-ai/">What Salesforce’s $8B acquisition of Informatica means for enterprise data </a><a href="https://venturebeat.com/data-infrastructure/what-salesforces-8b-acquisition-of-informatica-means-for-enterprise-data-and-ai/">and AI | VentureBeat</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.salesforce.com/">Salesforce</a> just dropped $8 billion to acquire <a href="https://www.informatica.com/">Informatica</a>. </li>
<li style="font-weight:400;">This purchase was really about building the data foundation needed for agentic AI to actually work in enterprise environments – we’re talking about combining Informatica’s 30 years of data management expertise with Salesforce’s cloud platform to create what they’re calling a “unified architecture for agentic AI.”</li>
<li style="font-weight:400;">This acquisition fills a massive gap in Salesforce’s data management capabilities, bringing in critical pieces like data cataloging, integration, governance, quality controls, and master data management – all the unsexy but absolutely essential plumbing that makes AI agents trustworthy and scalable in real enterprise deployments.</li>
<li style="font-weight:400;">The timing here is fascinating, because Informatica literally just announced their own agentic AI offerings last week at <a href="https://www.informaticaworld.com/">Informatica World</a>, so Salesforce is essentially buying a company that’s already pivoted hard into the AI space – rather than trying to build these capabilities from scratch.</li>
<li style="font-weight:400;">There’s going to be some interesting overlap with <a href="https://www.mulesoft.com/">MuleSoft</a>, which Salesforce bought for $6.5 billion back in 2018, but analysts are saying Informatica’s data management capabilities are more comprehensive and updated – this could mean some consolidation challenges ahead as they figure out how to integrate these overlapping technologies.</li>
<li style="font-weight:400;">For enterprise customers, this could be a game-changer because it promises to automate those painful, time-consuming data processes that typically take days or weeks. These AI agents can handle data ingestion, in...</li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:00) - Will GCP's Gemini Understand Kubernetes?</li><li>(00:01:08) - Fooled by Conference</li><li>(00:01:45) - Salesforce Buys Informatica for GenTech AI</li><li>(00:05:02) - Valky Turns One</li><li>(00:07:42) - Harness Unveils MCP Server</li><li>(00:13:23) - Terraform 2.8: Security in the Cloud</li><li>(00:16:21) - Amazon Launches FSX for Lustre Intelligent Tiering</li><li>(00:18:56) - Amazon AI System Development with ecs, EKS and Serverless</li><li>(00:21:15) - AWS Pricing Calculator Gets a Long-Needed Feature</li><li>(00:28:19) - Amazon to Launch a European Sovereign Cloud</li><li>(00:36:10) - Google's Cloud-based Red Hat Discount</li><li>(00:38:06) - Google Launches Vertex AI Ranking API</li><li>(00:42:41) - Google Cloud Run: Bringing GPUs to Serverless</li><li>(00:45:17) - Kubernetes: Volume Populator for Machine Learning</li><li>(00:47:48) - Microsoft's Azure AI Foundry: Turn Every Software Developer Into an</li><li>(00:51:59) - C Scripts in C</li><li>(00:55:32) - Azure: General Availability of Ephemeral OS Disks</li><li>(01:01:08) - Azure AI Gateway Expands Support for AWS Bedrock Model End</li><li>(01:04:50) - DigitalOcean Making a Serious Play for GPUs</li><li>(01:10:23) - Week in Cloud: Finops X</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 307 of The Cloud Pod – where the forecast is always cloudy! Who else is at a conference? Justin is coming to us this week from sunny San Diego where he’s attending FinOps – so we have that news to look forward to for next week. Matt and Ryan are also on hand today to share the latest news from Kubernetes, Salesforce acquisitions, and the strange case of Azure making AWS more cost effective.
Titles we almost went with this week:

The Great Redis Escape: One Year Later, Valkey is Living Its Best Life
Cache Me If You Can: How Valkey Outran Redis’s License Policies
Tier Today, Gone Tomorrow: AWS’s New Storage Class That Moves Your Data So  
      You Don’t 
Hey AI, Deploy My App: AWS Makes It Actually Work
AWS Finally Calculates What You’ll Actually Pay
The Price is Right: AWS Edition
From List Price to Real Price: AWS Gets Transparent
Red Hat and AWS Sitting in a Tree, R-H-E-L-I-N-G
Dockerfile? More Like Dockefile-It-For-Me with Amazon’s New MCP Server
Elementary, My Dear Watson: Amazon Q Becomes Sherlock Holmes for AWS
CUD You Believe It? Red Hat Gets the Discount Treatment
Committed Relationship Status: It’s Complicated (But 20% Cheaper)
RHEL Yeah! Google Drops Prices on Enterprise Linux
Disk Today, Gone Tomorrow: Azure’s Vanishing OS Storage
ATL1: Where GPUs Meet Sweet Tea and Southern Hospitality
AWS Launches Operation Cloud Sovereignty
The Great Firewall of Europe: AWS Edition
Amazon Builds a GDPR Fortress in Germany

General News 
01:46 What Salesforce’s $8B acquisition of Informatica means for enterprise data and AI | VentureBeat

Salesforce just dropped $8 billion to acquire Informatica. 
This purchase was really about building the data foundation needed for agentic AI to actually work in enterprise environments – we’re talking about combining Informatica’s 30 years of data management expertise with Salesforce’s cloud platform to create what they’re calling a “unified architecture for agentic AI.”
This acquisition fills a massive gap in Salesforce’s data management capabilities, bringing in critical pieces like data cataloging, integration, governance, quality controls, and master data management – all the unsexy but absolutely essential plumbing that makes AI agents trustworthy and scalable in real enterprise deployments.
The timing here is fascinating, because Informatica literally just announced their own agentic AI offerings last week at Informatica World, so Salesforce is essentially buying a company that’s already pivoted hard into the AI space – rather than trying to build these capabilities from scratch.
There’s going to be some interesting overlap with MuleSoft, which Salesforce bought for $6.5 billion back in 2018, but analysts are saying Informatica’s data management capabilities are more comprehensive and updated – this could mean some consolidation challenges ahead as they figure out how to integrate these overlapping technologies.
For enterprise customers, this could be a game-changer because it promises to automate those painful, time-consuming data processes that typically take days or weeks. These AI agents can handle data ingestion, in...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[307: The AI Assistant That Finally Understands Your Kubernetes Cluster (We are Doomed)]]>
                </itunes:title>
                                    <itunes:episode>307</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 307 of The Cloud Pod – where the forecast is always cloudy! Who else is at a conference? Justin is coming to us this week from sunny San Diego where he’s attending FinOps – so we have that news to look forward to for next week. Matt and Ryan are also on hand today to share the latest news from Kubernetes, Salesforce acquisitions, and the strange case of Azure making AWS more cost effective.</p>
<h3>Titles we almost went with this week:</h3>
<ul>
<li>The Great Redis Escape: One Year Later, Valkey is Living Its Best Life</li>
<li>Cache Me If You Can: How Valkey Outran Redis’s License Policies</li>
<li>Tier Today, Gone Tomorrow: AWS’s New Storage Class That Moves Your Data So  </li>
<li>      You Don’t </li>
<li>Hey AI, Deploy My App: AWS Makes It Actually Work</li>
<li>AWS Finally Calculates What You’ll Actually Pay</li>
<li>The Price is Right: AWS Edition</li>
<li>From List Price to Real Price: AWS Gets Transparent</li>
<li>Red Hat and AWS Sitting in a Tree, R-H-E-L-I-N-G</li>
<li>Dockerfile? More Like Dockefile-It-For-Me with Amazon’s New MCP Server</li>
<li>Elementary, My Dear Watson: Amazon Q Becomes Sherlock Holmes for AWS</li>
<li>CUD You Believe It? Red Hat Gets the Discount Treatment</li>
<li>Committed Relationship Status: It’s Complicated (But 20% Cheaper)</li>
<li>RHEL Yeah! Google Drops Prices on Enterprise Linux</li>
<li>Disk Today, Gone Tomorrow: Azure’s Vanishing OS Storage</li>
<li>ATL1: Where GPUs Meet Sweet Tea and Southern Hospitality</li>
<li>AWS Launches Operation Cloud Sovereignty</li>
<li>The Great Firewall of Europe: AWS Edition</li>
<li>Amazon Builds a GDPR Fortress in Germany</li>
</ul>
<h2>General News </h2>
<p>01:46 <a href="https://venturebeat.com/data-infrastructure/what-salesforces-8b-acquisition-of-informatica-means-for-enterprise-data-and-ai/">What Salesforce’s $8B acquisition of Informatica means for enterprise data </a><a href="https://venturebeat.com/data-infrastructure/what-salesforces-8b-acquisition-of-informatica-means-for-enterprise-data-and-ai/">and AI | VentureBeat</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.salesforce.com/">Salesforce</a> just dropped $8 billion to acquire <a href="https://www.informatica.com/">Informatica</a>. </li>
<li style="font-weight:400;">This purchase was really about building the data foundation needed for agentic AI to actually work in enterprise environments – we’re talking about combining Informatica’s 30 years of data management expertise with Salesforce’s cloud platform to create what they’re calling a “unified architecture for agentic AI.”</li>
<li style="font-weight:400;">This acquisition fills a massive gap in Salesforce’s data management capabilities, bringing in critical pieces like data cataloging, integration, governance, quality controls, and master data management – all the unsexy but absolutely essential plumbing that makes AI agents trustworthy and scalable in real enterprise deployments.</li>
<li style="font-weight:400;">The timing here is fascinating, because Informatica literally just announced their own agentic AI offerings last week at <a href="https://www.informaticaworld.com/">Informatica World</a>, so Salesforce is essentially buying a company that’s already pivoted hard into the AI space – rather than trying to build these capabilities from scratch.</li>
<li style="font-weight:400;">There’s going to be some interesting overlap with <a href="https://www.mulesoft.com/">MuleSoft</a>, which Salesforce bought for $6.5 billion back in 2018, but analysts are saying Informatica’s data management capabilities are more comprehensive and updated – this could mean some consolidation challenges ahead as they figure out how to integrate these overlapping technologies.</li>
<li style="font-weight:400;">For enterprise customers, this could be a game-changer because it promises to automate those painful, time-consuming data processes that typically take days or weeks. These AI agents can handle data ingestion, integration, and pipeline orchestration with minimal human intervention.</li>
<li style="font-weight:400;">The $8 billion price tag is actually lower than the rumored $11 billion bid from last year, which might indicate either tough negotiations or perhaps some concerns about integration challenges. Remember, Salesforce has already spent over $50 billion on <a href="https://www.salesforce.com/news/press-releases/2021/07/21/salesforce-slack-deal-close/">acquisitions</a> including <a href="https://slack.com/signin">Slack</a>, <a href="https://www.tableau.com/">Tableau</a>, and MuleSoft.</li>
</ul>
<p>02:56  Justin – “Just keep your hands off slack, okay guys? That’s all I care about.”</p>
<h2>Cloud Tools</h2>
<p>05:13 <a href="https://www.gomomento.com/blog/valkey-turns-one-how-the-community-fork-left-redis-in-the-dust/">Gomomento: Valkey Turns One How The Community Fork Left Redis In The </a><a href="https://www.gomomento.com/blog/valkey-turns-one-how-the-community-fork-left-redis-in-the-dust/">Dust</a></p>
<ul>
<li style="font-weight:400;"><a href="https://valkey.io/">Valkey</a> has officially hit its one-year milestone as the community-driven fork of <a href="https://redis.io/">Redis</a>, and it’s fascinating to see how quickly it’s gained traction after Redis Labs switched to a more restrictive license in March 2023. </li>
<li style="font-weight:400;">The <a href="https://www.linuxfoundation.org/">Linux Foundation</a> stepped in to support this open-source alternative, and major players like AWS, Google Cloud, and Oracle have all thrown their weight behind it, essentially creating a unified response to Redis’s licensing changes.</li>
<li style="font-weight:400;">What’s really impressive about Valkey is how it’s maintained complete compatibility with Redis while actually pushing innovation forward – they’ve already released version 8.0 with features like improved memory efficiency and better performance for large-scale deployments. </li>
<li style="font-weight:400;">This shows the community isn’t just maintaining a fork, they’re actively improving upon the original codebase.</li>
<li style="font-weight:400;">For developers and engineers, the practical impact is that you can continue using all your existing Redis tooling and client libraries without any changes, but now you have the peace of mind that comes with a truly open-source solution backed by the Linux Foundation. No more worrying about future licensing surprises or restrictions on how you can use your in-memory data store.</li>
<li style="font-weight:400;">The performance improvements in Valkey 8.0 are particularly noteworthy – they’ve managed to reduce memory overhead by up to 20% for certain workloads while maintaining the same blazing-fast performance Redis users expect. This is crucial for companies running large-scale caching layers where even small efficiency gains can translate to significant cost savings.</li>
<li style="font-weight:400;">Looking ahead, Valkey’s roadmap includes some exciting features like native support for vector similarity search and improved clustering capabilities, which suggests they’re not just playing catch-up but actually positioning themselves to lead in the in-memory database space.</li>
<li style="font-weight:400;">The irony here is that Redis’s attempt to monetize through licensing restrictions may have actually accelerated innovation in the space by spurring the creation of a well-funded, community-driven alternative that’s now pushing the entire ecosystem forward faster than before.</li>
</ul>
<p>06:37  Ryan – “I haven’t seen a lot of talk of Redis recently and every new greenfield application that I’ve seen or worked around now is looking at Valkey or using Valkey actively. So I feel like this is going to go the same way as Elasticsearch and the licensing change there where it just won’t be the go-to option anymore.”</p>
<p>07:59 <a href="https://www.harness.io/blog/mcp-announcement">The Harness MCP Server</a></p>
<ul>
<li style="font-weight:400;">Harness just released their <a href="https://github.com/harness/mcp-server">MCP Server</a>, which implements the <a href="https://modelcontext.org/">Model Context Protocol</a> – an open standard that lets AI agents like Claude Desktop, Windsurf, or Cursor securely connect to your Harness workflows without writing custom APIs or brittle glue code, essentially turning Harness into a plug-and-play backend for AI agents.</li>
<li style="font-weight:400;">This addresses a major pain point where customers are excited about AI but struggle with giving their agents secure access to delivery data from pipelines, environments, and logs. </li>
<li style="font-weight:400;">The MCP Server acts as a lightweight local gateway that translates between AI tools and the <a href="https://www.harness.io/">Harness platform</a> while maintaining enterprise-grade security controls.</li>
<li style="font-weight:400;">What’s clever here is that Harness is dogfooding their own solution – they’re using the same MCP server internally that they’re offering to customers, which means it’s battle-tested and provides consistency across different AI agents and environments without the maintenance headache of multiple adapters.</li>
<li style="font-weight:400;">The security story is particularly strong – it uses <a href="https://www.jsonrpc.org/specification">JSON-RPC 2.0</a> for communication, integrates with Harness’s existing RBAC model, handles API keys directly in the platform, and ensures no sensitive data ever gets sent to the LLM, which should make security teams much more comfortable with AI integrations.</li>
<li style="font-weight:400;">From a practical standpoint, this enables some interesting use cases like customer success engineers using AI to instantly check release statuses without bothering the dev team, or building Slack bots that alert on failed builds and surface logs with minimal setup time.</li>
</ul>
<p>10:31  Justin – “The key success of being able to build a successful MCP though is to have APIs. So if you were already behind on getting to APIs, I think this is the struggle for you. Now you’re doubly behind – because you’re not only behind on the API spec, but you’re also behind on the MCP part as well.”</p>
<p>12:12 <a href="https://www.hashicorp.com/en/blog/terraform-adds-new-pre-written-sentinel-policies-aws-foundational-security-best-practices">Hashicorp: Terraform Adds New Pre Written Sentinel Policies</a></p>
<ul>
<li style="font-weight:400;">HashiCorp has released a collection of <a href="https://www.hashicorp.com/en/blog/simplify-policy-adoption-in-terraform-with-pre-written-sentinel-policies-for-aws%5C">pre-written Sentinel policies</a> that automatically enforce <a href="https://docs.aws.amazon.com/audit-manager/latest/userguide/aws-foundational-security-best-practices.html">AWS Foundational Security Best Practices</a> within <a href="https://developer.hashicorp.com/terraform/intro/core-workflow%5C">Terraform workflows</a>, essentially giving teams a ready-made security guardrail system that prevents common misconfigurations before infrastructure gets deployed. This is huge for organizations struggling to balance developer velocity with security compliance requirements.</li>
<li style="font-weight:400;">These policies cover critical security controls like ensuring S3 buckets aren’t publicly accessible, requiring encryption for EBS volumes and RDS instances, and enforcing proper IAM configurations – basically all those security checks that teams know they should implement but often get overlooked in the rush to ship features. The beauty is that these policies run during the plan phase, catching issues before any resources are actually created.</li>
<li style="font-weight:400;">What’s particularly clever about this release is how it addresses the skills gap problem. Not every organization has security experts who can write complex policy-as-code rules, so having <a href="https://www.hashicorp.com/">HashiCorp</a> provide battle-tested policies out of the box dramatically lowers the barrier to entry for implementing proper cloud security governance. </li>
<li style="font-weight:400;">Teams can literally copy-paste these policies into their Terraform Cloud or Enterprise setup and immediately start benefiting.</li>
<li style="font-weight:400;">The timing of this release is perfect given the increasing focus on supply chain security and infrastructure compliance, with regulations getting stricter and breach costs rising, having automated policy enforcement that aligns with AWS’s own security recommendations gives organizations a defensible security posture they can point to during audits. </li>
<li style="font-weight:400;">Plus, it shifts security left in the development process without requiring developers to become security experts overnight.</li>
</ul>
<h2>AWS</h2>
<p>17:00 <a href="https://aws.amazon.com/blogs/aws/amazon-fsx-for-lustre-adds-new-storage-class-with-the-lowest-cost-and-only-fully-elastic-lustre-file-storage/">Amazon FSx for Lustre launches new storage class with the lowest-cost </a><a href="https://aws.amazon.com/blogs/aws/amazon-fsx-for-lustre-adds-new-storage-class-with-the-lowest-cost-and-only-fully-elastic-lustre-file-storage/">and only fully elastic Lustre ﬁle storage</a></p>
<ul>
<li style="font-weight:400;">Amazon just launched <a href="https://aws.amazon.com/fsx/lustre/">FSx for Lustre Intelligent-Tiering</a>, which is essentially the first fully elastic Lustre file storage in the cloud – meaning it automatically grows and shrinks as you add or delete data, so you’re only paying for what you actually use instead of over provisioning storage like you would on-premises, and at less than $0.005 per GB-month, it’s claiming to be the lowest-cost high-performance file storage option available.</li>
<li style="font-weight:400;">This is a game-changer for <a href="https://aws.amazon.com/hpc/">HPC</a> workloads like seismic imaging, weather forecasting, and genomics analysis that generate petabytes of data – the service automatically moves your data between three tiers (Frequent Access, Infrequent Access after 30 days, and Archive after 90 days), potentially reducing storage costs by up to 96% compared to other managed Lustre options without any manual intervention.</li>
<li style="font-weight:400;">For AI/ML teams trying to maximize their expensive GPU utilization, this is particularly interesting because it delivers up to 34% better price performance than on-premises HDD file systems, and with Elastic Fabric Adapter and GPU Direct Storage support, you’re getting up to 12x higher per-client throughput compared to previous FSx for Lustre systems.</li>
<li style="font-weight:400;">The tiering is completely transparent to applications – whether your data is in the Frequent Access tier or has been moved to Archive, you can still retrieve it instantly in milliseconds, which means you can migrate existing HDD or mixed HDD/SSD workloads without any application changes.</li>
<li style="font-weight:400;">The service is launching in 15 AWS regions including major hubs in North America, Europe, and Asia Pacific, and the pricing model is consumption-based – you pay for the data and metadata you store, operations when you write or read non-cached data, plus your provisioned throughput capacity, metadata IOPS, and SSD cache size.</li>
</ul>
<p>18:28   Justin – “I imagine this is truly fantastic for people who have workloads where they’re getting the performance increase out of Lustre. So that’s pretty rad that it’s automatic. It feels a little strange that you can retrieve it at the same speed, but at different costs; I would just force everything to the lower tier, but I imagine you don’t have that option.”</p>
<p>19:45 <a href="https://aws.amazon.com/blogs/aws/enhance-ai-assisted-development-with-amazon-ecs-amazon-eks-and-aws-serverless-mcp-server/">Enhance AI-assisted development with Amazon ECS, Amazon EKS and </a><a href="https://aws.amazon.com/blogs/aws/enhance-ai-assisted-development-with-amazon-ecs-amazon-eks-and-aws-serverless-mcp-server/">AWS Serverless MCP server | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;">AWS is bringing AI-powered development assistance to the next level with new <a href="https://github.com/modelcontextprotocol">Model Context Protocol servers</a> for <a href="https://aws.amazon.com/ecs/">ECS</a>, <a href="https://aws.amazon.com/blogs/aws/category/compute/amazon-kubernetes-service/">EKS</a>, and <a href="https://aws.amazon.com/blogs/aws/category/serverless/">Serverless</a>, which essentially give your AI coding assistants like <a href="http://v">Amazon Q Developer</a> real-time, contextual knowledge about your specific AWS environment instead of relying on outdated documentation. </li>
<li style="font-weight:400;">Imagine having an AI that actually knows your current cluster configuration and can help you deploy containers in minutes using natural language commands.</li>
<li style="font-weight:400;">The real game-changer here is that these MCP servers bridge the gap between what LLMs know from their training data and what’s actually happening in your AWS account right now, so when you ask your AI assistant to help deploy an application, it can configure load balancers, networking, auto-scaling, and monitoring with current best practices rather than generic advice from two years ago.</li>
<li style="font-weight:400;">What’s particularly impressive is how these tools handle the entire development lifecycle – in the demo, they showed creating a serverless video analysis application using <a href="https://aws.amazon.com/ai/generative-ai/nova/">Amazon Nova models</a>, then migrating it to containers on ECS, and finally deploying a web app on EKS, all through natural language prompts in the command line without writing deployment scripts or YAML files.</li>
<li style="font-weight:400;">The troubleshooting capabilities are where this really shines for DevOps teams – when deployments fail, the MCP servers can automatically fetch logs, identify issues, and even fix configuration problems, turning what used to be hours of debugging into a conversational problem-solving session with your AI assistant.</li>
<li style="font-weight:400;">This fits perfectly into AWS’s broader AI strategy by making their services more accessible to developers who might not be container or Kubernetes experts, essentially democratizing cloud deployment by letting you say “deploy this app to EKS and make it scalable” instead of learning the intricacies of Kubernetes manifests and AWS networking.</li>
</ul>
<p>21:58   Ryan – “I want it to completely shield me from learning Kubernetes. I’ll never know it now – I’m just gonna ask the robot to do it.” </p>
<p>22:13 <a href="https://aws.amazon.com/about-aws/whats-new/2025/05/aws-pricing-calculator-discounts-purchase-commitments/">AWS Pricing Calculator, now generally available, supports discounts and </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/05/aws-pricing-calculator-discounts-purchase-commitments/">purchase commitment – AWS</a></p>
<ul>
<li style="font-weight:400;">In news we’ve been waiting FOREVER for, AWS finally brings their <a href="https://calculator.aws/">Pricing Calculator</a> into the <a href="https://calculator.aws/">console</a> as a generally available feature, and it’s about time – this tool now lets you create cost estimates that actually reflect what you’ll pay after applying your existing discounts and commitments like Savings Plans or Reserved Instances, which is a game-changer for financial planning.</li>
<li style="font-weight:400;">The big innovation here is that you can now import your historical usage data directly into the calculator to create estimates based on real-world patterns, or build estimates from scratch for new workloads – and it gives you three different rate configurations to see costs before discounts, after AWS pricing discounts, and after both discounts AND your purchase commitments are applied.</li>
<li style="font-weight:400;">This is particularly valuable for enterprises doing their annual budget planning or preparing for board presentations because you can finally show realistic cost projections that account for your negotiated Enterprise Discount Programs and existing Reserved Instance coverage, rather than just list prices that nobody actually pays.</li>
<li style="font-weight:400;">The ability to export estimates in both CSV and JSON formats with resource-level detail is a subtle but important feature that’ll make FinOps teams happy – you can now integrate these estimates directly into your internal financial planning tools or build automated workflows around cost modeling.</li>
<li style="font-weight:400;">What’s interesting is that AWS is positioning this as both a workload estimator AND a full AWS bill estimator, which suggests they’re trying to help customers understand not just what a new project will cost, but how it impacts their overall AWS spend when layered onto existing infrastructure.</li>
<li style="font-weight:400;">For organizations considering multi-year commitments or trying to optimize their Savings Plans strategy, this tool becomes essential because you can now model different commitment scenarios and see the actual impact on your bottom line before pulling the trigger on those purchases.</li>
<li style="font-weight:400;">The fact that this is available in all commercial regions (except China) means most AWS customers can start using it immediately – and given that it’s free to use, there’s really no excuse not to be doing more sophisticated cost modeling for your AWS workloads.</li>
</ul>
<p>23:58   Ryan – “I hope it’s not something terrible where you have to feed it all your discount data and your code usage.” </p>
<p>24:30 <a href="https://aws.amazon.com/about-aws/whats-new/2025/05/red-hat-enterprise-linux-aws/">Announcing Red Hat Enterprise Linux for AWS</a></p>
<ul>
<li style="font-weight:400;">Red Hat is finally bringing <a href="https://docs.redhat.com/en/documentation/red_hat_enterprise_linux/10">RHEL 10</a> to AWS with deep native integration, marking a significant shift from just running RHEL on EC2 instances to having a purpose-built, AWS-optimized version that includes pre-tuned performance profiles and built-in CloudWatch telemetry right out of the box.</li>
<li style="font-weight:400;">This isn’t just another Linux distro in the AWS Marketplace – they’ve baked in <a href="https://aws.amazon.com/cli/">AWS CLI</a>, optimized networking with <a href="https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/enhanced-networking-ena.html">Elastic Network Adapter</a> support, and created AWS-specific performance profiles, which means enterprises can skip a lot of the manual optimization work they typically do when deploying RHEL workloads.</li>
<li style="font-weight:400;">This comes as organizations are looking to standardize their <a href="https://www.linux.org/pages/download/">Linux</a> deployments across hybrid environments, and having RHEL with native AWS integration could simplify migrations for shops that are already heavy <a href="https://www.redhat.com/en">Red Hat</a> users on-premises.</li>
<li style="font-weight:400;">One of the more innovative aspects is the inclusion of “image mode using container-native tooling,” which suggests Red Hat is bringing their edge computing and immutable OS concepts from <a href="https://www.redhat.com/en/blog/get-started-rhel-edge">RHEL for Edge</a> into the cloud, potentially making updates and rollbacks much cleaner.</li>
<li style="font-weight:400;">While the announcement mentions flexible procurement options through EC2 Console and AWS Marketplace, the real question will be pricing – traditionally RHEL has commanded a premium, and it’ll be interesting to see if the AWS-optimized version carries additional costs beyond standard RHEL subscriptions.</li>
<li style="font-weight:400;">This is available across all AWS regions including GovCloud, which signals that AWS and Red Hat are serious about capturing government and compliance-heavy workloads that have traditionally relied on RHEL’s security certifications and long-term support guarantees.</li>
</ul>
<p>24:58   Justin – “Let’s be honest – no one does the manual optimization work.” </p>
<p>26:21 <a href="https://aws.amazon.com/about-aws/whats-new/2025/06/agentic-capabilities-amazon-q-developer-chat-aws-management-console-chat-applications/">Introducing agentic capabilities for Amazon Q Developer Chat in the AWS </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/06/agentic-capabilities-amazon-q-developer-chat-aws-management-console-chat-applications/">Management Console and chat applications – AWS</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/q/developer/">Amazon Q Developer</a> just got a major upgrade with new agentic capabilities that essentially turn it into your personal AWS troubleshooting detective – it can now break down complex problems into steps, consult multiple AWS services, and piece together answers from across your entire infrastructure without you having to manually dig through logs and configurations.</li>
<li style="font-weight:400;">This is a game-changer for DevOps teams because instead of asking simple questions like “What’s an S3 bucket?”, you can now ask something like “Why is my payment processing Lambda throwing 500 errors?” and Q will automatically check CloudWatch logs, examine IAM permissions, investigate connected services like <a href="https://aws.amazon.com/api-gateway/">API Gateway</a> and <a href="https://docs.aws.amazon.com/amazondynamodb/latest/developerguide/Introduction.html">DynamoDB</a>, and even look at recent changes to figure out what’s going wrong.</li>
<li style="font-weight:400;">The multi-step reasoning capability is the real innovation here – Amazon Q now shows its work as it investigates your problem, asking for clarification when needed and explaining its reasoning process, which not only helps solve the immediate issue but also helps engineers understand their systems better and learn troubleshooting patterns.</li>
<li style="font-weight:400;">What’s particularly impressive is that this works across 200+ AWS services through their APIs, meaning Q can pull together information from virtually any part of your AWS infrastructure to answer questions, making it incredibly powerful for organizations with complex, multi-service architectures.</li>
<li style="font-weight:400;">The integration with <a href="https://www.microsoft.com/en-us/microsoft-teams/log-in?msockid=0a813a601d44649024a22fa21caa6502">Microsoft Teams</a> and <a href="https://slack.com/signin">Slack</a> is brilliant for enterprise teams because it brings this troubleshooting power directly into where engineers are already working and collaborating, eliminating the context switching between chat apps and the AWS console during incident response.</li>
</ul>
<p>27:35   Ryan – “And, if you add in instructions for your agent to respond in a snarky and sort of condescending way, you really have automated me out of a job.”</p>
<p>**Show note editor note: Welcome to my world, Ryan.**</p>
<p>28:59 <a href="https://www.theregister.com/2025/06/03/aws_european_sovereign_cloud/">AWS cooks up Euro cloud outfit to soothe sovereignty nerves • The </a><a href="https://www.theregister.com/2025/06/03/aws_european_sovereign_cloud/">Register</a></p>
<ul>
<li style="font-weight:400;">AWS is launching a <a href="https://aws.amazon.com/compliance/europe-digital-sovereignty/">European Sovereign Cloud</a> by the end of 2025, creating a legally independent entity based in Germany with EU-only staff, infrastructure, and leadership – essentially building a firewall between European customer data and potential US government reach under laws like the <a href="https://www.justice.gov/criminal/cloud-act-resources">Cloud Act</a>.</li>
<li style="font-weight:400;">This move directly responds to growing European anxiety about data sovereignty, especially with the Trump 2.0 administration’s aggressive foreign policy stance, and follows similar announcements from Microsoft and Google Cloud who are also scrambling to address European concerns about US tech dependence.</li>
<li style="font-weight:400;">AWS is creating a completely autonomous infrastructure with its own Route 53 DNS service using only European top-level domains, a dedicated European Certificate Authority, and the ability to operate indefinitely even if completely disconnected from AWS’s global infrastructure.</li>
<li style="font-weight:400;">What’s really interesting is the governance structure – they’re establishing an independent advisory board with four EU citizens, including at least one person not affiliated with Amazon, who are legally obligated to act in the best interest of the European Sovereign Cloud rather than AWS corporate.</li>
<li style="font-weight:400;">The timing couldn’t be more critical as European politicians are increasingly vocal about reducing dependence on US tech, especially after Microsoft reportedly blocked ICC prosecutor access to email in compliance with US sanctions, which really spooked EU officials about their vulnerability.</li>
<li style="font-weight:400;">For AWS customers in Europe, this means they’ll finally have an option that addresses regulatory compliance concerns while maintaining AWS’s service quality, though it remains to be seen how pricing will compare to standard AWS regions and whether the Cloud Act truly has no reach here.</li>
<li style="font-weight:400;">The bigger picture shows how geopolitical tensions are literally reshaping cloud infrastructure – we’re moving from a globally interconnected cloud to regional sovereign clouds, which could fundamentally change how multinational companies architect their systems.</li>
<li style="font-weight:400;">While AWS promises “no critical dependencies on non-EU infrastructure,” the parent company remains American-owned, so there’s still debate about whether this truly protects against Cloud Act requirements – it’s a legal gray area that will likely need court testing to resolve.</li>
</ul>
<h2>GCP</h2>
<p>37:07 <a href="https://cloud.google.com/blog/products/compute/get-committed-use-discounts-for-rhel/">Get committed use discounts for RHEL | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.bing.com/aclk?ld=e8xEiNFtG9t2OGRjvnuu6yXzVUCUz4ZZop5MMjEQfUA_T9-6poV6_gMm0Qw9Zf-a1eNPfn_uQpFaGN8bMBkC1psUYnvL3s2ccjWhj5odM8_2UUsIYXOu-hSu9cjEieThiacrXsCyuOztQQOJAy8myiZisZdRJUsB05VVNKRsT6ML_fQy2s_hDB8Ml3foaNH9wz5-K4aw&amp;u=aHR0cHMlM2ElMmYlMmZjbG91ZC5nb29nbGUuY29tJTJmZ2NwJTNmdXRtX3NvdXJjZSUzZGJpbmclMjZ1dG1fbWVkaXVtJTNkY3BjJTI2dXRtX2NhbXBhaWduJTNkbmEtVVMtYWxsLWVuLWRyLWJrd3MtYWxsLWFsbC10cmlhbC1lLWRyLTE3MTAxMzQlMjZ1dG1fY29udGVudCUzZHRleHQtYWQtbm9uZS1hbnktREVWX2MtQ1JFXy1BREdQX0Rlc2slMmIlMjU3QyUyYkJLV1MlMmItJTJiRVhBJTJiJTI1N0MlMmJUeHQtQ29yZS1HZW5lcmFsJTJiR0NQLUtXSURfNDM3MDAwNjMzNDE4NDMyOTYta3dkLTc3MjQwODk2MDc1NjQxJTNhbG9jLTE5MCUyNnV0bV90ZXJtJTNkS1dfZ29vZ2xlJTI1MjBjbG91ZC1TVF9nb29nbGUlMmJjbG91ZCUyNmdjbGlkJTNkOWFmOGUzMTQzYzhhMTdjOTM5YWE4YjVkMmNhNzdlYjIlMjZnY2xzcmMlM2QzcC5kcyUyNm1zY2xraWQlM2Q5YWY4ZTMxNDNjOGExN2M5MzlhYThiNWQyY2E3N2ViMg&amp;rlid=9af8e3143c8a17c939aa8b5d2ca77eb2">Google Cloud</a> is bringing committed use discounts to <a href="https://www.redhat.com/en/technologies/linux-platforms/enterprise-linux">Red Hat Enterprise Linux</a>, offering up to 20% savings for customers running predictable RHEL workloads on Compute Engine – this is a big deal for enterprises who’ve been paying full on-demand prices for their RHEL subscriptions in the cloud.</li>
<li style="font-weight:400;">The way these RHEL <a href="https://cloud.google.com/compute/docs/instances/signing-up-committed-use-discounts#purchaselicensecommitment">CUDs</a> work is pretty straightforward – you commit to a one-year term for a specific number of RHEL subscriptions in a particular region and project, and in exchange you get that 20% discount off the standard on-demand pricing, which really adds up when you’re running enterprise workloads 24/7.</li>
<li style="font-weight:400;">What’s interesting here is Google’s positioning compared to AWS and Azure – while both competitors offer various discount mechanisms for compute resources, Google is specifically targeting the RHEL subscription costs themselves, which is a significant expense for many enterprises running traditional workloads in the cloud.</li>
<li style="font-weight:400;">The sweet spot for these discounts kicks in when you’re utilizing RHEL instances about 80% or more of the time over the year, which honestly describes most production enterprise workloads – Google’s research shows the majority of RHEL VMs run 24/7, so this pricing model actually aligns well with real-world usage patterns.</li>
<li style="font-weight:400;">One thing to watch out for is that these commitments are completely inflexible – once you purchase them, you can’t edit or cancel, and you’re on the hook for the monthly fees regardless of actual usage, so you really need to nail your capacity planning before pulling the trigger.</li>
</ul>
<p>38:22   Justin – “So if I’m committing to the license, but I can move it between any type of instance class, I actually am okay with that – and if that’s something we’re going to see for other operating systems in the future, where maybe Windows has a discount if I’m willing to commit and things like that, this could be an interesting move by Google in general.”</p>
<p>39:11 <a href="https://cloud.google.com/blog/products/ai-machine-learning/launching-our-new-state-of-the-art-vertex-ai-ranking-api/">Launching our new state-of-the-art Vertex AI Ranking API | Google Cloud </a></p>
<p><a href="https://cloud.google.com/blog/products/ai-machine-learning/launching-our-new-state-of-the-art-vertex-ai-ranking-api/">Blog</a></p>
<ul>
<li style="font-weight:400;">Google just launched their <a href="https://cloud.google.com/generative-ai-app-builder/docs/ranking">Vertex AI Ranking API</a>, which is essentially a precision filter that sits on top of your existing search or RAG systems to dramatically improve result relevance – they’re claiming it can help businesses avoid that scary 82% customer loss rate when users can’t find what they need quickly, and it addresses the fact that up to 70% of retrieved passages in traditional search often don’t contain the actual answer you’re looking for.</li>
<li style="font-weight:400;">Google is positioning this as a drop-in enhancement rather than a rip-and-replace solution – you can keep your existing search infrastructure and just add this API as a reranking layer, which means companies can get state-of-the-art semantic search capabilities in minutes instead of going through months of migration, and they’re offering two models: a default one for accuracy and a fast one for latency-critical applications.</li>
<li style="font-weight:400;">The performance benchmarks are pretty impressive – Google’s claiming their semantic-ranker-default-004 model leads the industry in accuracy on the BEIR dataset compared to other standalone reranking services, and they’re backing this up by publishing their evaluation scripts on GitHub for reproducibility, plus they say it’s at least 2x faster than competitive reranking APIs at any scale.</li>
<li style="font-weight:400;">This feels like Google’s answer to the reranking capabilities we’ve seen from players like Cohere and their Rerank API, but Google’s bringing some unique advantages with their 200k token context window for long documents and native integrations across their ecosystem – you can use it directly in AlloyDB with a simple SQL function, integrate it with RAG Engine, or even use it with Elasticsearch, which shows they’re thinking beyond just their own stack.</li>
</ul>
<p>40:13   Justin – “Basically this is their answer to Cohere and Elasticsearch.” </p>
<p>41:02 <a href="https://cloud.google.com/blog/products/identity-security/project-shield-blocked-a-massive-recent-ddos-attack-heres-how/">Project Shield blocked a massive recent DDoS attack. Here’s how. | Google </a></p>
<p><a href="https://cloud.google.com/blog/products/identity-security/project-shield-blocked-a-massive-recent-ddos-attack-heres-how/">Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google’s Project Shield just proved its worth by defending <a href="http://krebsonsecurity.com/">KrebsOnSecurity</a> against a staggering 6.3 terabits per second DDoS attack – that’s roughly 63,000 times faster than average US broadband and one of the largest attacks ever recorded, showing that even free services can provide enterprise-grade protection when backed by Google’s infrastructure.</li>
<li style="font-weight:400;"><a href="https://cloud.google.com/blog/products/identity-security/project-shield-makes-it-easier-to-sign-up-set-up-automate-ddos-protection?e=48754805">Project Shield</a> is completely free for eligible organizations like news publishers, government election sites, and human rights defenders. It’s essentially Google weaponizing their massive global infrastructure for good, letting at-risk organizations piggyback on the same defenses that protect Google’s own services.</li>
<li style="font-weight:400;">The technical stack behind Project Shield is impressive – it combines <a href="https://cloud.google.com/load-balancing/docs/load-balancing-overview">Cloud Load Balancing</a>, <a href="https://cloud.google.com/cdn">Cloud CDN</a>, and <a href="https://cloud.google.com/armor/docs/cloud-armor-overview%5C">Cloud Armor</a> to create a multi-layered defense that blocked this attack instantly without any manual intervention, filtering 585 million packets per second at the network edge before they could even reach the application layer.</li>
<li style="font-weight:400;">This is a great example of how cloud providers are differentiating beyond just compute and storage – while AWS has Shield and Azure has DDoS Protection, Google’s approach of offering this as a free service to vulnerable organizations shows they’re thinking about cloud infrastructure as a force for protecting free speech and democracy online.</li>
<li style="font-weight:400;">For regular GCP customers, this attack validates Google’s DDoS protection capabilities – the same technologies protecting KrebsOnSecurity through Project Shield are available to any Google Cloud customer, with features like Adaptive Protection using machine learning to dynamically adjust rate limits in real-time.</li>
<li style="font-weight:400;">The simplicity of implementation is noteworthy – organizations just change their DNS settings to point to Project Shield’s IP addresses and configure their hosting server info, making it easy to enable or disable protection with a simple DNS switch, which is crucial for organizations that might not have dedicated security teams.</li>
<li style="font-weight:400;">This incident highlights the escalating DDoS threat landscape – attacks have grown from the 620 Gbps Mirai botnet attack in 2016 to this 6.3 Tbps monster in 2024, a 10x increase that shows why organizations need to think seriously about DDoS protection as attacks become more sophisticated and volumetric.</li>
</ul>
<p>44:07 <a href="https://cloud.google.com/blog/products/serverless/cloud-run-gpus-are-now-generally-available/">Cloud Run GPUs are now generally available | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;">Google just made GPU computing truly serverless with <a href="https://cloud.google.com/run">Cloud Run</a> GPUs going GA, and the killer feature here is that you only pay for what you use down to the second.</li>
<li style="font-weight:400;">Imagine spinning up an <a href="https://www.techpowerup.com/gpu-specs/l4.c4091">NVIDIA L4 GPU</a> for AI inference, having it automatically scale to zero when idle, and only paying for the actual seconds of compute time, which is a game-changer compared to keeping GPU instances running 24/7 on traditional cloud infrastructure.</li>
<li style="font-weight:400;">The cold start performance is genuinely impressive – they’re showing sub-5 second startup times to get a GPU instance with drivers installed and ready to go, and in their demo they achieved time-to-first-token of about 19 seconds for a Gemma 3 4B model including everything from cold start to model loading to inference, which makes this viable for real-time AI applications that need to scale dynamically.</li>
<li style="font-weight:400;">What’s really clever is how they’ve removed the traditional barriers to GPU access – there’s no quota request required for L4 GPUs anymore, you literally just add –gpu 1 to your command line or check a box in the console, making this as accessible as regular Cloud Run deployments, which democratizes GPU computing for developers who previously couldn’t justify the complexity or cost.</li>
<li style="font-weight:400;">The multi-regional deployment story is strong with GPUs available in five regions including US, Europe, and Asia, and you can deploy across multiple regions with a single command for global low-latency inference – they showed deploying Ollama across three continents in one go, which would be a nightmare to set up with traditional GPU infrastructure.</li>
<li style="font-weight:400;">At <a href="https://cloud.withgoogle.com/next/25">Next ’25</a> they demonstrated scaling from 0 to 100 GPU instances in just 4 minutes running Stable Diffusion, which really showcases the elasticity – this kind of burst scaling would cost a fortune with reserved GPU instances but makes perfect sense with per-second billing for handling viral AI applications or unpredictable workloads.</li>
<li style="font-weight:400;">Early customers like Wayfair are reporting 85% cost reductions by combining L4 GPU performance with Cloud Run’s auto-scaling, while companies like Midjourney are using it to process millions of images – the combination of reasonable GPU pricing with true scale-to-zero capabilities seems to be hitting a sweet spot for AI workloads that don’t need constant GPU availability.</li>
</ul>
<p>45:49   Ryan – “Anything that scales down to zero is ok in my book.”</p>
<p>46:50 <a href="https://cloud.google.com/blog/products/containers-kubernetes/gke-volume-populator-streamlines-aiml-data-transfers/">GKE Volume Populator streamlines AI/Ml data transfers | Google Cloud </a><a href="https://cloud.google.com/blog/products/containers-kubernetes/gke-volume-populator-streamlines-aiml-data-transfers/">Blog</a></p>
<ul>
<li style="font-weight:400;">Google just released GKE Volume Populator, and this is actually a pretty clever solution to a real pain point in AI/ML workflows – basically, if you’re storing your training data or model weights in Cloud Storage but need to move them to faster storage like Hyperdisk ML for better performance, you previously had to build custom scripts and workflows to orchestrate all those data transfers, but now GKE handles it automatically through the standard Kubernetes PersistentVolumeClaim API.</li>
<li style="font-weight:400;">What’s really interesting here is that Google is leveraging the Kubernetes Volume Populator feature that went GA in Kubernetes 1.33, but they’re adding their own special sauce with native Cloud Storage integration and fine-grained namespace-level access controls – this means you can have different teams or projects with their own isolated access to specific Cloud Storage buckets without having to manage complex IAM policies across your entire cluster.</li>
<li style="font-weight:400;">The timing on this is perfect for AI/ML workloads because one of the biggest challenges teams face is efficiently loading massive model weights – Abridge AI reported they saw up to 76% faster model loading speeds and reduced pod initialization times by using Hyperdisk ML with this feature, which is huge when you’re dealing with large language models that can be hundreds of gigabytes.</li>
<li style="font-weight:400;">From a cost optimization perspective, this is actually quite smart because your expensive GPU and TPU resources aren’t sitting idle waiting for data to transfer – the pods are blocked from scheduling until the data transfer completes, so you can use those accelerators for other workloads in the meantime, which could save significant money on compute costs.</li>
</ul>
<h2>Azure</h2>
<p>49:44 <a href="https://azure.microsoft.com/en-us/blog/new-ai-innovations-that-are-redefining-the-future-for-software-companies/">New AI innovations that are redefining the future for software companies | </a><a href="https://azure.microsoft.com/en-us/blog/new-ai-innovations-that-are-redefining-the-future-for-software-companies/">Microsoft Azure Blog</a></p>
<ul>
<li style="font-weight:400;">Microsoft is making a big push to turn every software developer into an AI developer with <a href="https://aka.ms/Build25/AzureAIFoundry">Azure AI Foundry,</a> their new unified platform that brings together models, tools, and services for building AI apps and agents at scale. </li>
<li style="font-weight:400;">What’s really interesting here is they’re positioning this as the shift from AI assistants that wait for instructions, to autonomous agents that can actually be workplace teammates.</li>
<li style="font-weight:400;">The <a href="https://azure.microsoft.com/en-us/products/ai-agent-service">Azure AI Foundry Agent Service</a> is now generally available, and it lets developers orchestrate multi-agent workflows where AI agents can work together to solve complex problems. </li>
<li style="font-weight:400;">This is Microsoft’s answer to the growing demand for agentic AI that can automate decision-making and complex business processes, which AWS and GCP haven’t quite matched yet in terms of a unified platform approach.</li>
<li style="font-weight:400;">Microsoft is seriously expanding their model catalog with some heavy hitters – they’ve got <a href="https://aka.ms/grok-announcement">Grok 3 from xAI</a> available today, <a href="https://aka.ms/SoraBuildBlogFinal">Sora</a> from <a href="https://ai.azure.com/explore/models?selectedCollection=aoai">OpenAI</a> coming soon in preview, and over 10,000 open-source models from <a href="http://aka.ms/FoundryHuggingFace">Hugging Face</a>, all with full fine-tuning support, which gives developers way more choice than what you typically see in competing cloud platforms.</li>
<li style="font-weight:400;">The real game-changer here might be what they’re calling “Agentic DevOps” – GitHub Copilot is evolving from just helping you write code to actually doing code reviews, writing tests, fixing bugs, and even handling app modernization tasks that used to take months but can now be done in hours, which could fundamentally change how software teams operate.</li>
<li style="font-weight:400;">They’ve introduced a <a href="https://techcommunity.microsoft.com/blog/azurepaasblog/introducing-azure-sre-agent/4414569#:~:text=SRE%20Agent%20is%20a%20new%20Azure%20service%20that,responses,%20diagnostics%20and%20collaboration%20to%20resolve%20problems%20rapidly.">Site Reliability Engineering agent</a> that monitors production systems 24/7 and can autonomously troubleshoot issues as they arise across <a href="https://kubernetes.io/">Kubernetes</a>, <a href="https://learn.microsoft.com/en-us/azure/app-service/overview">App Servic</a>e, serverless, and databases – essentially giving every developer access to the same expertise that powers Azure at global scale, which is a pretty compelling value proposition for teams that can’t afford dedicated SRE staff.</li>
<li style="font-weight:400;">For startups and ISVs, Microsoft is sweetening the deal with flexible Azure credits through Microsoft for Startups, and they’re reporting that AI and machine learning offer revenue in their marketplace grew 100% last year – companies like <a href="https://youtu.be/tParUaDdN0I?si=W9uuo1CN1eaRG7Th">Neo4j</a> have seen 6X revenue growth in 18 months through the marketplace, which shows there’s real money to be made here.</li>
</ul>
<p>53:13   Ryan – “The way I hope AI rolls out is that it does stuff like this, but then it still requires supervision – the SRE engineers, the DevOps engineers that you already have – are now freed up to do more impactful things. So maybe it’s refining prompts for these agents, giving them those constraints by, you know, thinking about how they basically operate and all those like things that aren’t written down as intangibles and really getting that executed into prompts.”</p>
<p>54:05 <a href="https://devblogs.microsoft.com/dotnet/announcing-dotnet-run-app/">Announcing dotnet run app.cs – A simpler way to start with C# and .NET 10 </a><a href="https://devblogs.microsoft.com/dotnet/announcing-dotnet-run-app/">– .NET Blog</a></p>
<ul>
<li style="font-weight:400;">Microsoft just made getting started with C# dramatically easier with .NET 10 Preview 4 by introducing the ability to run a single C# file directly using `dotnet run app.cs`, eliminating the need for project files or complex folder structures – essentially bringing Python-like simplicity to C# development while maintaining the full power of the .NET ecosystem.</li>
<li style="font-weight:400;">This new file-based approach introduces clever directives that let you reference NuGet packages, specify SDKs, and set MSBuild properties right within your C# file using simple syntax like `#:package Humanizer@2.14.1`, making it perfect for quick scripts, learning scenarios, or testing code snippets without the overhead of creating a full project structure.</li>
<li style="font-weight:400;">What’s particularly brilliant about this implementation is that it’s not a separate dialect or limited version of C# – you’re writing the exact same code with the same compiler, and when your script grows beyond a simple file, you can seamlessly convert it to a full project using `dotnet project convert app.cs`, which automatically scaffolds the proper project structure and translates all your directives.</li>
<li style="font-weight:400;">The feature even supports Unix-style <a href="https://en.wikipedia.org/wiki/Shebang_%28Unix%29">shebang lines</a>, allowing you to create executable C# scripts that run directly from the command line on Linux and macOS, positioning C# as a viable alternative to <a href="https://www.python.org/">Python</a> or <a href="https://www.gnu.org/software/bash/">Bash</a> for automation scripts and CLI utilities – imagine writing your cloud automation scripts in strongly-typed C# instead of wrestling with shell scripts.</li>
<li style="font-weight:400;">This addresses a long-standing pain point where developers had to rely on third-party tools like dotnet-script or <a href="https://github.com/oleg-shilo/cs-script">CS-Script</a> to achieve similar functionality, but now it’s built right into the core .NET CLI, requiring no additional installations or configurations beyond having .NET 10 Preview 4 installed.</li>
<li style="font-weight:400;">The timing is perfect as more cloud platforms and services provide .NET SDKs, allowing developers to quickly prototype API integrations, test cloud service connections, or build automation scripts without the ceremony of setting up a full project – you could literally test an Azure Storage connection in a single file and run it immediately.</li>
<li style="font-weight:400;"><a href="https://code.visualstudio.com/download">Visual Studio Code</a> support is already available through the pre-release version of the C# extension, with IntelliSense for the new directives, and Microsoft is exploring multi-file support and performance improvements for future previews, suggesting this feature will only get more powerful as .NET 10 approaches release.</li>
<li style="font-weight:400;">This democratizes C# development in a way that makes it accessible to beginners while still being useful for experienced developers who want to quickly test ideas or build utilities, effectively positioning C# as both a powerful enterprise language and a convenient scripting language in one package.</li>
</ul>
<p>56:20   Ryan – “I’m very mixed on this, because it’s like, .NET development; the development patterns I see are already so detached from the running environment, so I feel like this is a further abstraction on top of all the leveraged libraries and frameworks that are part of .NET.” </p>
<p>57:45 <a href="https://campaigns.endjin.com/t/t-l-ggujz-tyyuvjjtl-tu/">Announcing General Availability: Ephemeral OS Disk support for v6 Azure </a></p>
<p><a href="https://campaigns.endjin.com/t/t-l-ggujz-tyyuvjjtl-tu/">VMs | Microsoft Community Hub</a></p>
<ul>
<li style="font-weight:400;">Microsoft just made ephemeral OS disks generally available for their latest v6 VM series, and this is a big deal for anyone running stateless workloads because you’re getting up to 10X better OS disk performance by using local NVMe storage instead of remote Azure Storage – essentially eliminating network latency for your operating system disk operations.</li>
<li style="font-weight:400;">The beauty of ephemeral disks is that they’re perfect for scale-out scenarios like containerized microservices, batch processing jobs, or CI/CD build agents where you don’t need persistent OS state – you can reimage a VM in seconds and get back to a clean state, which is fantastic for auto-scaling scenarios where you’re constantly spinning up and tearing down instances.</li>
<li style="font-weight:400;">This puts Azure in a really competitive position against AWS’s instance store volumes and GCP’s local SSDs, though Microsoft’s implementation is particularly interesting because it specifically targets the OS disk placement on NVMe storage while still allowing you to use regular managed disks for your data volumes if needed.</li>
<li style="font-weight:400;">The v6 VM series that support this feature – like the Dadsv6 and Ddsv6 families – are already Azure’s latest generation with AMD EPYC processors, so you’re combining cutting-edge CPU performance with blazing-fast local storage, making these ideal for performance-sensitive workloads that can tolerate the ephemeral nature of the OS disk.</li>
<li style="font-weight:400;">From a cost perspective, ephemeral OS disks are essentially free since you’re not paying for managed disk storage – you’re just using the local storage that comes with your VM, which could lead to significant savings for large-scale deployments where you might have hundreds or thousands of VMs that don’t need persistent OS disks.</li>
<li style="font-weight:400;">One thing to keep in mind is that these disks are truly ephemeral – if your VM gets deallocated or moved to different hardware for maintenance, you lose everything on that OS disk, so this isn’t for everyone – you really need to architect your applications to be stateless and store any important data elsewhere.</li>
<li style="font-weight:400;">The deployment is surprisingly straightforward with just a few extra parameters in your ARM templates or CLI commands, and the fact that it works with marketplace images, custom images, and Azure Compute Gallery images means you can pretty much use it with any existing VM deployment pipeline you already have.</li>
<li style="font-weight:400;">For DevOps teams and platform engineers, this feature is particularly exciting because it enables faster VM boot times, quicker scale-out operations, and better performance for temporary workloads like build agents or test environments where persistence is actually a liability rather than an asset.</li>
</ul>
<p>1:03:22 <a href="https://azure.microsoft.com/en-us/updates?id=491985&amp;utm_source=devdigest.today&amp;utm_medium=website&amp;utm_campaign=feature_promo&amp;utm_content=link_click">Generally Available: Support for AWS Bedrock API in AI Gateway</a> <a href="https://azure.microsoft.com/en-us/updates?id=491985&amp;utm_source=devdigest.today&amp;utm_medium=website&amp;utm_campaign=feature_promo&amp;utm_content=link_click">Capabilities in Azure API Management</a></p>
<ul>
<li style="font-weight:400;">Announcing expanded support for <a href="https://aws.amazon.com/bedrock/">AWS Bedrock</a> model endpoints across all Generative AI policies in <a href="https://learn.microsoft.com/en-us/azure/api-management/genai-gateway-capabilities">Azure API Management’s AI Gateway</a>. </li>
<li style="font-weight:400;">This release enables you to apply advanced management and optimization features such as <a href="https://learn.microsoft.com/en-us/azure/api-management/azure-openai-token-limit-policy">Token Limit Policy</a>, <a href="https://learn.microsoft.com/en-us/azure/api-management/azure-openai-emit-token-metric-policy">Token Metric Policy</a>, and <a href="https://learn.microsoft.com/en-us/azure/api-management/azure-openai-enable-semantic-caching">Semantic Caching Policy</a> to AWS Bedrock models, empowering you to seamlessly manage and optimize your multi-cloud AI workloads.  </li>
<li style="font-weight:400;">Key benefits include: 
<ul>
<li style="font-weight:400;">Apply token limiting, tracking, and logging to AWS Bedrock APIs for better control  </li>
<li style="font-weight:400;">Enable semantic caching to enhance performance and response times for Bedrock models. </li>
<li style="font-weight:400;">Achieve unified observability and governance across multi-cloud AI endpoints.  </li>
</ul>
</li>
</ul>
<p>1:04:06  Justin – “Azure, we thank you for making AWS more cost effective and responsive with your capabilities and features.” </p>
<h2>Other Clouds</h2>
<p>1:07:20 <a href="https://www.digitalocean.com/blog/introducing-new-atlanta-data-center">Introducing ATL1: DigitalOcean’s new AI-optimized data center in Atlanta</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.digitalocean.com/">DigitalOcean</a> is making a serious play for the AI infrastructure market with their new ATL1 data center in Atlanta, which is their largest facility to date with 9 megawatts of total power capacity across two data halls.  </li>
<li style="font-weight:400;">It’s specifically designed for high-density GPU deployments that AI and machine learning workloads demand.</li>
<li style="font-weight:400;">This marks a significant shift in DigitalOcean’s strategy from being primarily known as a developer-friendly cloud provider for smaller workloads to now competing in the GPU infrastructure space, deploying over 300 GPUs including top-tier NVIDIA H200 and AMD Instinct MI300X clusters in just the first data hall.</li>
<li style="font-weight:400;">The timing of this expansion is particularly interesting as we’re seeing massive demand for GPU resources driven by the AI boom, and DigitalOcean is positioning themselves as a more accessible alternative to the hyperscalers for startups and growing tech companies that need GPU compute but don’t want the complexity or cost structure of AWS, Azure, or GCP.</li>
<li style="font-weight:400;">By choosing Atlanta as their location and partnering with <a href="https://www.flexential.com/">Flexential</a> for the facility, DigitalOcean is strategically serving the Southern U.S. market where there’s been significant tech growth, offering lower latency for regional customers while maintaining their promise of simplicity and cost-effectiveness that made them popular with developers in the first place.</li>
<li style="font-weight:400;">The integration of GPU infrastructure alongside their existing services like Droplets, Kubernetes, and managed databases creates an interesting one-stop-shop proposition for companies building AI applications, allowing them to keep their entire stack within DigitalOcean’s ecosystem rather than mixing providers.</li>
<li style="font-weight:400;">With a second data hall planned for 2025 with even more GPU capacity, this represents a multi-year commitment to AI infrastructure, suggesting DigitalOcean sees this as core to their future rather than just riding the current AI hype wave.</li>
<li style="font-weight:400;">This expansion brings DigitalOcean to 16 data centers across 10 global regions, which while still small compared to the hyperscalers, shows they’re serious about geographic distribution and reducing latency for their growing customer base.</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2064879/c1e-jkjku5q83rip3nkj-mk4d42xobqn9-bkg9te.mp3" length="85525216"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 307 of The Cloud Pod – where the forecast is always cloudy! Who else is at a conference? Justin is coming to us this week from sunny San Diego where he’s attending FinOps – so we have that news to look forward to for next week. Matt and Ryan are also on hand today to share the latest news from Kubernetes, Salesforce acquisitions, and the strange case of Azure making AWS more cost effective.
Titles we almost went with this week:

The Great Redis Escape: One Year Later, Valkey is Living Its Best Life
Cache Me If You Can: How Valkey Outran Redis’s License Policies
Tier Today, Gone Tomorrow: AWS’s New Storage Class That Moves Your Data So  
      You Don’t 
Hey AI, Deploy My App: AWS Makes It Actually Work
AWS Finally Calculates What You’ll Actually Pay
The Price is Right: AWS Edition
From List Price to Real Price: AWS Gets Transparent
Red Hat and AWS Sitting in a Tree, R-H-E-L-I-N-G
Dockerfile? More Like Dockefile-It-For-Me with Amazon’s New MCP Server
Elementary, My Dear Watson: Amazon Q Becomes Sherlock Holmes for AWS
CUD You Believe It? Red Hat Gets the Discount Treatment
Committed Relationship Status: It’s Complicated (But 20% Cheaper)
RHEL Yeah! Google Drops Prices on Enterprise Linux
Disk Today, Gone Tomorrow: Azure’s Vanishing OS Storage
ATL1: Where GPUs Meet Sweet Tea and Southern Hospitality
AWS Launches Operation Cloud Sovereignty
The Great Firewall of Europe: AWS Edition
Amazon Builds a GDPR Fortress in Germany

General News 
01:46 What Salesforce’s $8B acquisition of Informatica means for enterprise data and AI | VentureBeat

Salesforce just dropped $8 billion to acquire Informatica. 
This purchase was really about building the data foundation needed for agentic AI to actually work in enterprise environments – we’re talking about combining Informatica’s 30 years of data management expertise with Salesforce’s cloud platform to create what they’re calling a “unified architecture for agentic AI.”
This acquisition fills a massive gap in Salesforce’s data management capabilities, bringing in critical pieces like data cataloging, integration, governance, quality controls, and master data management – all the unsexy but absolutely essential plumbing that makes AI agents trustworthy and scalable in real enterprise deployments.
The timing here is fascinating, because Informatica literally just announced their own agentic AI offerings last week at Informatica World, so Salesforce is essentially buying a company that’s already pivoted hard into the AI space – rather than trying to build these capabilities from scratch.
There’s going to be some interesting overlap with MuleSoft, which Salesforce bought for $6.5 billion back in 2018, but analysts are saying Informatica’s data management capabilities are more comprehensive and updated – this could mean some consolidation challenges ahead as they figure out how to integrate these overlapping technologies.
For enterprise customers, this could be a game-changer because it promises to automate those painful, time-consuming data processes that typically take days or weeks. These AI agents can handle data ingestion, in...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2064879/c1a-k5d5-rk4r452ofr78-prwont.jpg"></itunes:image>
                                                                            <itunes:duration>01:11:17</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2064879/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[306: Batch Better Have MySQL: Azure's Maintenance Makeover]]>
                </title>
                <pubDate>Fri, 06 Jun 2025 05:10:01 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2059026</guid>
                                    <link>https://tcpfm.castos.com/episodes/306-batch-better-have-mysql-azures-maintenance-makeover</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 306 of The Cloud Pod – where the forecast is always cloudy! </p>
<p>This week, we have a bunch of announcements concerning the newest offering from Anthropic – Claude Sonnet 4 and Opus 4, plus container security, Azure MySQL Maintenance, Vertex AI, and Mistral AI. Plus, we’ve got a Cloud Journey installment AND an aftershow – so get comfy and get ready for a trip to the clouds!</p>
<h3>Titles we almost went with this week:</h3>
<ul>
<li>ECS Failures Now Have 4x the Excuses</li>
<li>Nailing Down Your Container Security, One Patch at a Time</li>
<li>HashiCorp’s New Recipe: Terraform, AI, and a Pinch of MCP</li>
<li>Teaching an Old DNS New IPv6 Tricks</li>
<li>Dash-ing through the Klusters, in an AWS Console</li>
<li>Google’s Generative AI Playground Gets a Glow-Up</li>
<li>Vertex AI Studio: Now with 200% More Darkness! Like our souls</li>
<li>Claude Opus 4 Strikes a Chord on Google Cloud</li>
<li>Sovereign-teed to Please: Google Cloud’s Royal Treatment</li>
<li>Google’s Cloud Kingdom Expands its Borders</li>
<li>Shall I Compare Thee to a Summer’s AI? Anthropic Drops Sonne(t) 4 Knowledge on Vertex</li>
<li>Mistral AI Chats Up a Storm on Google Cloud</li>
<li>Google Cloud’s Vertex AI Gets a Dose of Mistral Magic</li>
<li>.NET Aspire on Azure: The App Service Strikes Back</li>
<li>Default Outbound Access Retires, Decides Florida Isn’t for Everyone </li>
</ul>
<h2>AI Is Going Great – or How ML Makes Money </h2>
<p>01:52 <a href="https://www.anthropic.com/news/claude-4">Introducing Claude 4</a></p>
<ul>
<li style="font-weight:400;">Claude has launched the latest models in <a href="https://www.anthropic.com/claude/opus">Claude Opus 4</a> and <a href="https://www.anthropic.com/claude/sonnet">Claude Sonnet 4</a>, setting new standards for coding, advancing reasoning and AI agents. Maybe they’ll actually follow instructions when told to shut down? (Looking at you, ChatGPT.)</li>
<li style="font-weight:400;">Claude Opus 4 is “the world’s best coding model” with sustained performance on complex, long-running tasks and agent workflows. </li>
<li style="font-weight:400;">Opus 4 has 350 billion parameters, making it one of the largest publicly available language models. </li>
<li style="font-weight:400;">It demonstrates strong performance on academic benchmarks, including research. </li>
<li style="font-weight:400;">Sonnet 4 is a smaller 10 billion parameter model optimized for dialogue, making it well-suited for conversational AI applications. </li>
<li style="font-weight:400;">Alongside the <a href="https://docs.anthropic.com/en/docs/about-claude/models/overview">models</a>, they are also announcing:
<ul>
<li style="font-weight:400;">Extended thinking with tool use (beta): Both models can use tools – like <a href="https://docs.anthropic.com/en/docs/build-with-claude/tool-use/web-search-tool">web search</a> – during extended thinking, allowing Claude to alternate between reasoning and tool use to improve its responses.</li>
<li style="font-weight:400;">New Model Capabilities: Both models can use tools in parallel, follow instructions more precisely, and when given access to local files by developers — demonstrate significantly improved memory capabilities, extracting and saving key facts maintain continuity and build tacit knowledge over time</li>
<li style="font-weight:400;">Claude code is now generally available: After receiving extensive positive feedback during our research preview, they are expanding how developers can collaborate with Claude.  Claude code now supports background tasks via github actions and native integrations with VS code and jetbrains, displaying edits directly in your files for seamless pair programming. </li>
<li style="font-weight:400;">New Api capabilities: <a href="https://www.anthropic.com/news/agent-capabilities-api">Four new capabilities</a> on the API that enable developers to build more powerful AI agents including Code Execution tool, MCP connector, Files API and the ability to cache...</li></ul></li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:00) - Cloud Pod: Azure's Maintenance Makeover</li><li>(00:00:38) - Gemini Power Glasses at IO 2017</li><li>(00:01:58) - Claude Turns 4 and More</li><li>(00:07:51) - Claude 4.2 and Silent Models</li><li>(00:08:38) - OpenAI's Language AI Response API Update</li><li>(00:11:14) - Docker: Hardened Images</li><li>(00:15:30) - Terraform MCP Server Released for AI Integration</li><li>(00:17:50) - Amazon Serverless SQL with GMAX</li><li>(00:21:43) - Amazon ECS: Extended Container Exit Reason Message</li><li>(00:23:55) - Dynamodb Local in AWS Cloud Shell</li><li>(00:26:21) - EC2 DNS now supports IPv6</li><li>(00:28:52) - EKS Dashboard: Kubernetes Cluster Management</li><li>(00:33:50) - Vertex AI Studio: Going Dark</li><li>(00:34:59) - Google's Gemma 3n AI Model for Mobile</li><li>(00:37:03) - Google's Intelligent Agent Platform Update</li><li>(00:39:24) - Google Cloud's Sovereign Cloud: Data Sovereignty</li><li>(00:43:10) - GCP 2.5: Unstructured Data with Vertex</li><li>(00:44:48) - Google Cloud AI: Lechat Enterprise and OCR</li><li>(00:48:08) - Azure FX V2 series with 5th Gen Intel Xeon</li><li>(00:49:33) - Red Hat OpenShift VM Virtualization on Azure</li><li>(00:52:15) - Microsoft SQL Server: Maintenance Experience for MySQL</li><li>(00:55:49) - Microsoft's NET Aspire Integration with Azure App Service</li><li>(00:59:42) - Azure: Retiring Implicit Outbound Connectivity for V</li><li>(01:03:19) - How to Code With AI in Visual Studio</li><li>(01:08:25) - Building a serverless bot in Python</li><li>(01:13:43) - Claude 2.8</li><li>(01:19:07) - Google Docs: AI in the Show Notes document</li><li>(01:25:27) - Building a DevOps team with AI</li><li>(01:29:16) - Black FLP02 PC Case</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 306 of The Cloud Pod – where the forecast is always cloudy! 
This week, we have a bunch of announcements concerning the newest offering from Anthropic – Claude Sonnet 4 and Opus 4, plus container security, Azure MySQL Maintenance, Vertex AI, and Mistral AI. Plus, we’ve got a Cloud Journey installment AND an aftershow – so get comfy and get ready for a trip to the clouds!
Titles we almost went with this week:

ECS Failures Now Have 4x the Excuses
Nailing Down Your Container Security, One Patch at a Time
HashiCorp’s New Recipe: Terraform, AI, and a Pinch of MCP
Teaching an Old DNS New IPv6 Tricks
Dash-ing through the Klusters, in an AWS Console
Google’s Generative AI Playground Gets a Glow-Up
Vertex AI Studio: Now with 200% More Darkness! Like our souls
Claude Opus 4 Strikes a Chord on Google Cloud
Sovereign-teed to Please: Google Cloud’s Royal Treatment
Google’s Cloud Kingdom Expands its Borders
Shall I Compare Thee to a Summer’s AI? Anthropic Drops Sonne(t) 4 Knowledge on Vertex
Mistral AI Chats Up a Storm on Google Cloud
Google Cloud’s Vertex AI Gets a Dose of Mistral Magic
.NET Aspire on Azure: The App Service Strikes Back
Default Outbound Access Retires, Decides Florida Isn’t for Everyone 

AI Is Going Great – or How ML Makes Money 
01:52 Introducing Claude 4

Claude has launched the latest models in Claude Opus 4 and Claude Sonnet 4, setting new standards for coding, advancing reasoning and AI agents. Maybe they’ll actually follow instructions when told to shut down? (Looking at you, ChatGPT.)
Claude Opus 4 is “the world’s best coding model” with sustained performance on complex, long-running tasks and agent workflows. 
Opus 4 has 350 billion parameters, making it one of the largest publicly available language models. 
It demonstrates strong performance on academic benchmarks, including research. 
Sonnet 4 is a smaller 10 billion parameter model optimized for dialogue, making it well-suited for conversational AI applications. 
Alongside the models, they are also announcing:

Extended thinking with tool use (beta): Both models can use tools – like web search – during extended thinking, allowing Claude to alternate between reasoning and tool use to improve its responses.
New Model Capabilities: Both models can use tools in parallel, follow instructions more precisely, and when given access to local files by developers — demonstrate significantly improved memory capabilities, extracting and saving key facts maintain continuity and build tacit knowledge over time
Claude code is now generally available: After receiving extensive positive feedback during our research preview, they are expanding how developers can collaborate with Claude.  Claude code now supports background tasks via github actions and native integrations with VS code and jetbrains, displaying edits directly in your files for seamless pair programming. 
New Api capabilities: Four new capabilities on the API that enable developers to build more powerful AI agents including Code Execution tool, MCP connector, Files API and the ability to cache...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[306: Batch Better Have MySQL: Azure's Maintenance Makeover]]>
                </itunes:title>
                                    <itunes:episode>306</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 306 of The Cloud Pod – where the forecast is always cloudy! </p>
<p>This week, we have a bunch of announcements concerning the newest offering from Anthropic – Claude Sonnet 4 and Opus 4, plus container security, Azure MySQL Maintenance, Vertex AI, and Mistral AI. Plus, we’ve got a Cloud Journey installment AND an aftershow – so get comfy and get ready for a trip to the clouds!</p>
<h3>Titles we almost went with this week:</h3>
<ul>
<li>ECS Failures Now Have 4x the Excuses</li>
<li>Nailing Down Your Container Security, One Patch at a Time</li>
<li>HashiCorp’s New Recipe: Terraform, AI, and a Pinch of MCP</li>
<li>Teaching an Old DNS New IPv6 Tricks</li>
<li>Dash-ing through the Klusters, in an AWS Console</li>
<li>Google’s Generative AI Playground Gets a Glow-Up</li>
<li>Vertex AI Studio: Now with 200% More Darkness! Like our souls</li>
<li>Claude Opus 4 Strikes a Chord on Google Cloud</li>
<li>Sovereign-teed to Please: Google Cloud’s Royal Treatment</li>
<li>Google’s Cloud Kingdom Expands its Borders</li>
<li>Shall I Compare Thee to a Summer’s AI? Anthropic Drops Sonne(t) 4 Knowledge on Vertex</li>
<li>Mistral AI Chats Up a Storm on Google Cloud</li>
<li>Google Cloud’s Vertex AI Gets a Dose of Mistral Magic</li>
<li>.NET Aspire on Azure: The App Service Strikes Back</li>
<li>Default Outbound Access Retires, Decides Florida Isn’t for Everyone </li>
</ul>
<h2>AI Is Going Great – or How ML Makes Money </h2>
<p>01:52 <a href="https://www.anthropic.com/news/claude-4">Introducing Claude 4</a></p>
<ul>
<li style="font-weight:400;">Claude has launched the latest models in <a href="https://www.anthropic.com/claude/opus">Claude Opus 4</a> and <a href="https://www.anthropic.com/claude/sonnet">Claude Sonnet 4</a>, setting new standards for coding, advancing reasoning and AI agents. Maybe they’ll actually follow instructions when told to shut down? (Looking at you, ChatGPT.)</li>
<li style="font-weight:400;">Claude Opus 4 is “the world’s best coding model” with sustained performance on complex, long-running tasks and agent workflows. </li>
<li style="font-weight:400;">Opus 4 has 350 billion parameters, making it one of the largest publicly available language models. </li>
<li style="font-weight:400;">It demonstrates strong performance on academic benchmarks, including research. </li>
<li style="font-weight:400;">Sonnet 4 is a smaller 10 billion parameter model optimized for dialogue, making it well-suited for conversational AI applications. </li>
<li style="font-weight:400;">Alongside the <a href="https://docs.anthropic.com/en/docs/about-claude/models/overview">models</a>, they are also announcing:
<ul>
<li style="font-weight:400;">Extended thinking with tool use (beta): Both models can use tools – like <a href="https://docs.anthropic.com/en/docs/build-with-claude/tool-use/web-search-tool">web search</a> – during extended thinking, allowing Claude to alternate between reasoning and tool use to improve its responses.</li>
<li style="font-weight:400;">New Model Capabilities: Both models can use tools in parallel, follow instructions more precisely, and when given access to local files by developers — demonstrate significantly improved memory capabilities, extracting and saving key facts maintain continuity and build tacit knowledge over time</li>
<li style="font-weight:400;">Claude code is now generally available: After receiving extensive positive feedback during our research preview, they are expanding how developers can collaborate with Claude.  Claude code now supports background tasks via github actions and native integrations with VS code and jetbrains, displaying edits directly in your files for seamless pair programming. </li>
<li style="font-weight:400;">New Api capabilities: <a href="https://www.anthropic.com/news/agent-capabilities-api">Four new capabilities</a> on the API that enable developers to build more powerful AI agents including Code Execution tool, MCP connector, Files API and the ability to cache prompts for up to one hour</li>
</ul>
</li>
<li style="font-weight:400;">In the blog post, Claude created a “navigation guide” while playing Pokemon. Maybe it can make me one for Hogwarts Legacy? (Seriously, where the heck are all those demiguise statues…)</li>
<li style="font-weight:400;">Safety seems to be a priority, with extensive testing and evaluation, and <a href="https://www.anthropic.com/news/activating-asl3-protections">implementing</a> measures for AI safety.</li>
</ul>
<p>03:47  Ryan – “I’ve been in the midst of using this a lot and then going back between 3.7 and 4 – largely due to being rate limited. There’s a noticeable difference in 4.0. It is better at delivering working code the first time without having to go back through multiple iterations, and it’s kind of neat. It’s the first time I’ve ever actually been able to notice a difference, be honest… I don’t think I remember seeing this big of a difference between 3.5 and 3.7.”</p>
<p>07:48 <a href="https://www.databricks.com/blog/introducing-new-claude-opus-4-and-sonnet-4-models-databricks">Databricks: Introducing New Claude Opus 4 And Sonnet 4 Models</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.databricks.com/">Databricks</a> has released new versions of their Claude large language models – <a href="https://www.anthropic.com/claude/opus">Opus 4</a> and <a href="https://www.anthropic.com/claude/sonnet">Sonnet 4</a>. </li>
<li style="font-weight:400;">These are foundational models that can be adapted for various applications.</li>
<li style="font-weight:400;">The models leverage Databricks’ <a href="https://www.databricks.com/product/data-lakehouse">Lakehouse</a> platform which unifies data warehouses and data lakes. </li>
<li style="font-weight:400;">This allows training the AI on massive datasets spanning structured and unstructured data.</li>
<li style="font-weight:400;">Customers can fine-tune and deploy customized versions of the models on Databricks’ cloud platform</li>
</ul>
<p>7:55  Ryan – “I look forward to this being announced in every cloud provider for the rest of the show.” </p>
<p>08:34 <a href="https://openai.com/index/new-tools-and-features-in-the-responses-api/">New Tools And Features In The Responses API</a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/">OpenAI</a> has added new tools and features to their <a href="https://community.openai.com/t/introducing-the-responses-api/1140929">Responses API</a>, which allows developers to integrate OpenAI’s language models into their applications.</li>
<li style="font-weight:400;">Key new features include:
<ul>
<li style="font-weight:400;">Web browsing tool that allows models to browse websites and extract information to answer questions. </li>
<li style="font-weight:400;">Math tool for performing mathematical calculations and reasoning</li>
<li style="font-weight:400;">Code explanation tool that can explain code snippets in natural language.</li>
<li style="font-weight:400;">Improved <a href="https://platform.openai.com/docs/guides/tools-code-interpreter">code interpreter</a> for running code in a secure sandbox environment.</li>
</ul>
</li>
<li style="font-weight:400;">These new capabilities open up powerful possibilities for developers to create more sophisticated and capable applications powered by OpenAI’s language models.</li>
<li style="font-weight:400;">The web browsing tool in particular is a major step forward, allowing models to access and utilize information from the internet to provide more comprehensive and up-to-date responses.</li>
<li style="font-weight:400;">These enhancements to the Responses API demonstrate OpenAI’s continued innovation and leadership in the field of language AI. </li>
<li style="font-weight:400;">As OpenAI makes their models more flexible and feature-rich, it will enable a new wave of intelligent applications and integrations across industries</li>
<li style="font-weight:400;">Cloud professionals should take note of OpenAI’s progress, as language AI is poised to be a transformative technology that will be widely deployed via APIs and cloud services.</li>
</ul>
<p>10:01  Matt – “I felt like I needed it when there was new services that came out and I wanted write a script that hits the new PowerShell thing, but it doesn’t know about it yet. That’s where I feel like I hit the edges of AI early on in the LLMs.” </p>
<h2>Cloud Tools</h2>
<p>11:20 <a href="https://www.docker.com/blog/introducing-docker-hardened-images/">Introducing Hardened Images</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.docker.com/products/hardened-images/">Docker Hardened Images </a>(DHI) are secure-by-default container images purpose-built for modern production environments, dramatically reducing the attack surface up to 95% compared to general-purpose base images.</li>
<li style="font-weight:400;">DHI images are curated and maintained by <a href="https://www.docker.com/">Docker</a>, continuously updated to ensure near-zero known CVEs, all while supporting popular distros like <a href="https://www.alpinelinux.org/">Alpine</a> and <a href="https://www.debian.org/">Debian</a> for seamless integration. </li>
<li style="font-weight:400;">They integrate with leading security and DevOps platforms like Microsoft, (yes, we said leading security platforms like Microsoft) <a href="https://nginx.org/en/">NGINX</a>, <a href="https://gitlab.com/users/sign_in">GitLab</a>, <a href="https://www.bing.com/aclk?ld=e8v3Lwk5LCJ5XJ8GFFWyo15DVUCUzWVQx5WBVdN6RoejfILkbGTJpiaRGH-mgYJud0SjJ08LCfevGIfKPesu3VfJiwi4UkK3AUWpkFLaqbmyeoPppgFGjaQtqvn-WZFUE7tGjmSWfO2t1ijjhuuLIT13lD8JHoBoRS0wE39T543iQdFu5Oy0UjUyzGnpLUyZxceoT6Fw&amp;u=aHR0cHMlM2ElMmYlMmZ3d3cud2l6LmlvJTJmYnItcG0td2l6JTNmdXRtX3NvdXJjZSUzZGJpbmclMjZ1dG1fbWVkaXVtJTNkcHBjJTI2dXRtX2NhbXBhaWduJTNkYnJhbmQtc2VhcmNoLXVzLWNhJTI2dXRtX3Rlcm0lM2R3aXolMjZ1dG1fY29udGVudCUzZFdpeiUyNTIwaW8lMjZ1dG1fZGV2aWNlJTNkYyUyNm1zY2xraWQlM2RjZmY3YTQxYzZiNDcxNWUyNDFmN2NjMjNjMjY5MWZhNw&amp;rlid=cff7a41c6b4715e241f7cc23c2691fa7">Wiz</a>, and <a href="https://jfrog.com/">JFrog</a> to work with existing scanning tools, registries, and CI/CD pipelines.</li>
<li style="font-weight:400;">DHI solves key challenges around software integrity, attack surface sprawl, and operational overhead from constant patching by providing a minimal, focused base image.</li>
<li style="font-weight:400;">Customization is supported without compromising the hardened foundation, allowing teams to add certificates, packages, scripts and configs tailored to their environment.</li>
<li style="font-weight:400;">Docker monitors and automatically patches Critical and High severity CVEs within 7 days, faster than typical industry response times, simplifying maintenance.</li>
<li style="font-weight:400;">For cloud professionals, DHI offers a drop-in way to dramatically improve container security posture and reduce patching overhead, enabling developers to focus on shipping features.</li>
</ul>
<p>12:37  Justin – “I’m mostly glad Docker is releasing something that is not just bloat to their desktop client.” </p>
<p>15:51 <a href="https://www.infoq.com/news/2025/05/terraform-mcp-server/">HashiCorp Releases Terraform MCP Server for AI Integration – InfoQ</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.hashicorp.com/">HashiCorp</a> released the open-source <a href="https://github.com/hashicorp/terraform-mcp-server">Terraform MCP Server</a> to improve how AI models interact with infrastructure as code by providing real-time, structured data from the <a href="https://registry.terraform.io/">Terraform Registry</a>.</li>
<li style="font-weight:400;">The server exposes module metadata, provider schemas, and resource definitions in a machine-readable format, allowing AI systems to generate more accurate, context-aware Terraform code suggestions.</li>
<li style="font-weight:400;">By leveraging the <a href="https://modelcontextprotocol.io/introduction">Model Context Protocol (MCP)</a>, the server enables AI models to retrieve up-to-date configuration details and align with the latest Terraform standards, reducing reliance on potentially outdated training data</li>
<li style="font-weight:400;">The Terraform MCP Server has been demonstrated with <a href="https://github.com/features/copilot">GitHub Copilot</a> integration, allowing developers to access context-aware recommendations directly from their IDEs.</li>
<li style="font-weight:400;">This release is part of a broader trend in AI-assisted tooling to unify developer workflows through interoperable interfaces, moving away from product-specific AI integrations.</li>
<li style="font-weight:400;">For cloud professionals, the Terraform MCP Server represents a significant step towards more accurate and efficient AI-assisted infrastructure management, potentially reducing errors and improving productivity.</li>
</ul>
<p>17:21  Matt – “I also read a little bit of how they were implementing it; with the Terraform server with your corporate registry modules. So if you have a platform engineering team, they kind of have these modules predefined for you. It will interact with those in that way… where in real time we’ll pull and say, okay, now you need these variables with your, VS code or whatever your IDE is. So kind of that registry piece of it, I think to me is the key part.”</p>
<h2>AWS</h2>
<p>18:22 <a href="https://aws.amazon.com/blogs/aws/amazon-aurora-dsql-is-now-generally-available/">Amazon Aurora DSQL, the fastest serverless distributed SQL database is </a><a href="https://aws.amazon.com/blogs/aws/amazon-aurora-dsql-is-now-generally-available/">now generally available</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/rds/aurora/dsql/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">Aurora DSQL</a> is a serverless distributed SQL database that offers unlimited scale, high availability, and zero infrastructure management. It simplifies complex relational database challenges.</li>
<li style="font-weight:400;">Aurora DSQL’s disaggregated architecture enables multi-Region strong consistency with low latency. </li>
<li style="font-weight:400;">It’s designed for 99.99% availability in a single region and 99.999% across multiple regions.</li>
<li style="font-weight:400;">It integrates with AWS services like <a href="https://docs.aws.amazon.com/AmazonRDS/latest/UserGuide/USER_WorkingWithAutomatedBackups.html">AWS Backup</a> for snapshots/restore, AWS <a href="https://docs.aws.amazon.com/vpc/latest/privatelink/what-is-privatelink.html">PrivateLink</a> for private connectivity, <a href="https://docs.aws.amazon.com/AWSCloudFormation/latest/UserGuide/Welcome.html">CloudFormation</a> for resource management, and <a href="https://aws.amazon.com/cloudtrail/features/">CloudTrail</a> for logging. </li>
<li style="font-weight:400;">The <a href="https://modelcontextprotocol.io/introduction">Model Context Protocol (MCP)</a> server improves developer productivity by allowing generative AI models to interact with the database using natural language via the <a href="https://aws.amazon.com/q/developer/">Amazon Q Developer</a> CLI.</li>
<li style="font-weight:400;">Key use cases include microservices, event-driven architectures, multi-tenant SaaS apps, data-driven services like payment processing, gaming, social media that require multi-Region scalability and resilience.</li>
<li style="font-weight:400;">Pricing starts at $0 (free tier of 100K DPUs and 1 GB storage per month), then based on Distributed Processing Units and GB-months. Want more info on pricing? You can find that <a href="https://aws.amazon.com/rds/aurora/dsql/pricing/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el">here</a>. </li>
</ul>
<p>19:44  Matt – “The pricing of it is kind of going in line with the Azure pricing, and I feel like a lot of the other RDS-type pricing where the compute is on the low end, but your storage costs are getting higher.” </p>
<p>22:30 <a href="https://aws.amazon.com/about-aws/whats-new/2025/05/amazon-ecs-container-exit-reason-message-characters/">Amazon ECS increases container exit reason message to 1024 characters – </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/05/amazon-ecs-container-exit-reason-message-characters/">AWS</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/ecs/">Amazon ECS</a> has increased the character limit for container exit reason messages from 255 to 1024 characters. </li>
<li style="font-weight:400;">This provides more detailed error messages to help customers debug failed containers more effectively.</li>
<li style="font-weight:400;">The extended error messages are accessible via the AWS Management Console and the DescribeTasks API. Look for the “reason” field in the API response.</li>
<li style="font-weight:400;">This feature is available in all AWS regions for ECS tasks running on Fargate Platform 1.4.0+ or EC2 container instances with ECS Agent v1.92.0+. </li>
<li style="font-weight:400;">Any containerized application or microservice running on ECS can benefit from more verbose error messages to speed up troubleshooting of failures and improve observability.</li>
<li style="font-weight:400;">Debugging container failures is a common pain point; increasing the error message limit is a small but impactful change to help developers identify root causes faster, reducing downtime and operational toil. Especially for Justin. </li>
<li style="font-weight:400;">We’re surprised this one took so long, but appy it’s here now! </li>
</ul>
<p>24:49 <a href="https://aws.amazon.com/about-aws/whats-new/2025/05/dynamo-db-local-accessible-aws-cloudshell/">DynamoDB local is now accessible on AWS CloudShell – AWS</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.bing.com/aclk?ld=e8A95lcfJkuiZT9Ttm91UEEjVUCUw-JgVEx_lRtw3sNTP11ChddGnL5-1Buf_QGW5sthlywoXBOx4N8iyWlsxqO-Fb1gFZhvcuZMk_PcCKtSWGSJmDJS2Iki4Uc-ecIDlo-Jl4hpiiq4aVXRH1GFjuFeERHPWH5oaZ1AdN-L-MyCR1T13fspshqyhLizVsBT9Z6gQIdg&amp;u=aHR0cHMlM2ElMmYlMmZhd3MuYW1hem9uLmNvbSUyZnBtJTJmZHluYW1vZGIlMmYlM2Z0cmslM2RmNGFkYmY1ZS1hZDM5LTQ1ZjUtOGQ3NC0wZDQyOThiNTc2ZjUlMjZzY19jaGFubmVsJTNkcHMlMjZzX2t3Y2lkJTNkQUwhNDQyMiExMCE3MTA1NjE3NDMzMTU2NyEhISE3MTA1NjcwMzcwMDY2MSEhNDgyNTEwNzU3ITExMzY4OTYwMjU0NzY4MzIlMjZlZl9pZCUzZDZkMjE4ODFlNzgzMDE1MmZkNTMxZTAyMWQ3NjQ5MzM3JTNhRyUzYXMlMjZtc2Nsa2lkJTNkNmQyMTg4MWU3ODMwMTUyZmQ1MzFlMDIxZDc2NDkzMzc&amp;rlid=6d21881e7830152fd531e021d7649337">DynamoDB</a> local is now generally available on AWS <a href="https://docs.aws.amazon.com/cloudshell/latest/userguide/getting-started.html">CloudShell</a>, allowing developers to test DynamoDB applications directly in the AWS Management Console without incurring costs.</li>
<li style="font-weight:400;">This update integrates with existing DynamoDB APIs to enable local development and testing without impacting production environments.</li>
<li style="font-weight:400;">Developers can start DynamoDB local in CloudShell using the “dynamodb-local” alias, without needing to download or install the AWS CLI or DynamoDB local</li>
<li style="font-weight:400;">To interact with the local DynamoDB instance in CloudShell, use the “–endpoint-url” parameter pointed to “localhost:8000” </li>
<li style="font-weight:400;">It’s ideal for developers building and testing DynamoDB applications who want a quick, low-friction way to run DynamoDB locally.</li>
</ul>
<p>26:14  Ryan – “I’ve always used CloudShells for very simple CLA cloud tasks; I’ve never really thought about developing inside of a CloudShell…” </p>
<p>27:21 <a href="https://aws.amazon.com/about-aws/whats-new/2025/05/ipv6-support-ec2-public-dns-names/?utm_source=convertkit&amp;utm_medium=email&amp;utm_campaign=AWS%20Graviton%20Weekly%20#%20126%20-%2017721200">AWS announces IPv6 support for EC2 Public DNS names  – AWS</a></p>
<ul>
<li style="font-weight:400;">EC2 Public DNS names can now resolve to IPv6 addresses (AAAA records) for EC2 instances and Elastic Network Interfaces, allowing public access to IPv6-enabled instances over IPv6.</li>
<li style="font-weight:400;">Previously, EC2 Public DNS only resolved to IPv4 addresses, requiring use of a specific IPv6 address or custom domain via Route 53 to access IPv6-only instances.</li>
<li style="font-weight:400;">This update enables easier access to IPv6-only instances and simplifies migration to IPv6 by allowing access to dual-stack instances via IPv6 with DNS cutover.</li>
<li style="font-weight:400;">Available in all commercial and GovCloud regions, configured using the same VPC settings as IPv4 EC2 Public DNS.</li>
<li style="font-weight:400;">It will be useful for customers adopting IPv6 who want to simplify access to IPv6-enabled instances without managing IP addresses directly.</li>
</ul>
<p>30:05 <a href="https://aws.amazon.com/blogs/aws/centralize-visibility-of-kubernetes-clusters-across-aws-regions-and-accounts-with-eks-dashboard/">Centralize visibility of Kubernetes clusters across AWS Regions and </a><a href="https://aws.amazon.com/blogs/aws/centralize-visibility-of-kubernetes-clusters-across-aws-regions-and-accounts-with-eks-dashboard/">accounts with EKS Dashboard</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/eks/latest/userguide/cluster-dashboard.html">EKS Dashboard</a> provides a centralized view of <a href="https://kubernetes.io/">Kubernetes</a> clusters across AWS regions and accounts, making it easier to track inventory, assess compliance, and plan operational activities.</li>
<li style="font-weight:400;">It integrates natively into the <a href="https://aws.amazon.com/console/">AWS Console</a>, eliminating the need for third-party tools and their associated complexity and costs.</li>
<li style="font-weight:400;">The dashboard offers insights into clusters, managed node groups, and EKS add-ons, with data on cluster distribution, version, support status, forecasted costs, and health metrics.</li>
<li style="font-weight:400;">Advanced filtering enables drilling down into specific data points to quickly identify clusters needing attention.</li>
<li style="font-weight:400;">Setup is straightforward, using AWS Organizations’ management and delegated administrator accounts, and enabling trusted access in the EKS console.</li>
<li style="font-weight:400;">EKS Dashboard supports visibility into connected Kubernetes clusters running on-premises or on other clouds, though with more limited data compared to native EKS.</li>
<li style="font-weight:400;">This feature will especially benefit organizations running Kubernetes at scale across multiple regions, accounts, and environments who need unified visibility and control.</li>
<li style="font-weight:400;">For the Cloud Pod audience, EKS Dashboard demonstrates AWS’ continued focus on simplifying Kubernetes operations so customers can focus on their applications.</li>
<li style="font-weight:400;">And it’s GOOD NEWS – EKS Dashboard is available at no additional charge!</li>
</ul>
<p>31:02  Ryan – “AKA you have a centralized team that you’ve shafted into hosting all the Kubernetes workloads and being the subject matter experts – because there’s no way that you segregate that and decentralize it. And so at least we’re making those poor bastards’ lives easier. So I like this except for the need for it – I don’t like.”</p>
<p>18:22 <a href="https://aws.amazon.com/about-aws/whats-new/2025/05/anthropics-claude-4-foundation-models-amazon-bedrock/">Anthropic’s Claude 4 foundation models now in Amazon Bedrock – AWS</a></p>
<ul>
<li style="font-weight:400;">Anthropic has released the next generation of its Claude AI models, Claude Opus 4 and Claude Sonnet 4, which are now available in Amazon’s Bedrock AI platform.</li>
<li style="font-weight:400;">The Claude 4 models represent significant advancements in AI capabilities, excelling at coding, analyzing data, long-running tasks, content generation, and complex actions. </li>
<li style="font-weight:400;">No, I’m not redoing the links. Scroll up if you need them; but we’re going to be copy/pasting this announcement the rest of the show. </li>
</ul>
<h2>GCP</h2>
<p>35:06 <a href="https://cloud.google.com/blog/products/ai-machine-learning/vertex-ai-studio-redesigned/">Vertex AI Studio, redesigned. Take a look</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/generative-ai-studio?e=48754805&amp;hl=en">Vertex AI</a> Studio provides a unified platform to experiment with and customize 200+ advanced foundation models from <a href="https://www.google.com/">Google</a> (like <a href="https://gemini.google.com/">Gemini</a>) and partners (like <a href="https://www.llama.com/">Meta’s Llama</a>, Anthropic’s <a href="https://claude.ai/login">Claude</a>.)</li>
<li style="font-weight:400;">The redesign focuses on developer experience with faster prompting, easier ways to build, and fresh UI – accelerating prototyping and experimentation with generative AI models.</li>
<li style="font-weight:400;">Integrates end-to-end workflow from prompting to grounding, tuning, code generation and test deployment.</li>
<li style="font-weight:400;">Enhances prompt engineering with prompt management, variables, function calling, examples</li>
<li style="font-weight:400;">Enables building with latest Gemini models for text, image, audio generation and multimodal capabilities.</li>
<li style="font-weight:400;">Simplifies <a href="https://cloud.google.com/vertex-ai/generative-ai/docs/grounding/overview">grounding models</a> with real-world data via Google Search, Maps or custom data for improved reliability and trust.</li>
<li style="font-weight:400;">Generates sample code in <a href="https://www.python.org/">Python</a>, <a href="https://developer.android.com/studio">Android</a>, <a href="https://developer.apple.com/swift/">Swift</a>, Web, <a href="https://flutter.dev/">Flutter</a>, and <a href="https://curl.se/docs/manpage.html">cURL</a> – and enables test web app deployment.</li>
<li style="font-weight:400;">Introduces dark mode UI for better visual comfort during long development sessions. Your eyes will thank you! #darkmode4life</li>
<li style="font-weight:400;">Vertex AI Studio serves as the central place to explore Google’s powerful generative AI media models like <a href="https://app.veo.co/">Veo</a>, <a href="https://deepmind.google/models/imagen/">Imagen</a>, <a href="https://chirpmyradio.com/projects/chirp/wiki/Download">Chirp</a>, and <a href="https://deepmind.google/models/lyria/">Lyria</a>. </li>
<li style="font-weight:400;">Pricing details are not provided, but Vertex AI platform likely follows typical usage-based pricing of other GCP services.</li>
</ul>
<p>36:21 <a href="https://developers.googleblog.com/en/introducing-gemma-3n/">Announcing Gemma 3n preview: powerful, efficient, mobile-first AI</a> </p>
<ul>
<li style="font-weight:400;">Gemma 3n is a powerful, efficient, mobile-first AI model optimized to run directly on phones, tablets and laptops. It enables real-time, multimodal AI experiences with advanced on-device capabilities.</li>
<li style="font-weight:400;">The model leverages a new shared architecture co-developed with mobile hardware leaders like <a href="https://www.qualcomm.com/">Qualcomm</a>, <a href="https://www.mediatek.com/">MediaTek</a> and Samsung. This positions it well versus other mobile AI offerings.</li>
<li style="font-weight:400;"><a href="https://deepmind.google/models/gemma/gemma-3n/">Gemma 3n</a> uses an innovative technique called Per-Layer Embeddings (PLE) to significantly reduce RAM usage, allowing larger models to run on mobile with 2-3GB memory footprints. </li>
<li style="font-weight:400;">It integrates closely with Google’s broader AI ecosystem, powering the next generation of on-device features like <a href="https://deepmind.google/technologies/gemini/nano/">Gemini Nano</a> in Google apps. Developers can preview core capabilities that will come to Android and Chrome.</li>
<li style="font-weight:400;">Real-time speech transcription/translation, voice interactions, and multimodal understanding combining audio, image, video and text inputs are all processed privately on-device.</li>
<li style="font-weight:400;">Gemma 3n represents an important step in democratizing access to cutting-edge, efficient AI and enabling a new wave of intelligent mobile apps with advanced on-device AI.</li>
</ul>
<p>37:26  Ryan – “As pricing with generative AI goes, you never know what you’re going to get.” </p>
<p>38:35 <a href="https://developers.googleblog.com/en/agents-adk-agent-engine-a2a-enhancements-google-io">What’s new with Agents: ADK, Agent Engine, and A2A Enhancements</a> </p>
<ul>
<li style="font-weight:400;">Google announced major updates to its intelligent agent platform, providing more robust development tools, intuitive management, and seamless agent-to-agent communication.</li>
<li style="font-weight:400;">The <a href="http://google.github.io/adk-docs">Agent Development Kit (ADK)</a> adds new capabilities to create sophisticated agents with greater stability and adaptability. </li>
<li style="font-weight:400;"><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/agent-engine/overview">Vertex AI Agent Engine</a> introduces a new UI to simplify agent lifecycle management, deployment, scaling, and monitoring – accessible from the <a href="https://cloud.google.com/cloud-console">Google Cloud console</a>.</li>
<li style="font-weight:400;">Enhancements to the <a href="https://www.a2aprotocol.org/">Agent2Agent (A2A) protocol</a> enable more sophisticated and reliable interactions between agents, with an updated specification (v0.2) and an official Python SDK.</li>
<li style="font-weight:400;">Industry adoption of A2A is accelerating, with platforms introducing new capabilities for building, deploying and securing A2A agents.</li>
<li style="font-weight:400;">These updates provide a comprehensive, flexible platform for building intelligent agent solutions, unlocking new possibilities across industries</li>
<li style="font-weight:400;">Vertex AI Agent Engine pricing starts at $0.0001 per agent session, with a free tier available (general estimate based on current Vertex AI pricing.)</li>
</ul>
<p>40:08  Justin – “The biggest thing they need to get though is security. That’s the biggest risk we’ve seen so far…there are a lot of dangers with MCP you should be a little cautious about.”  </p>
<p>40:54 <a href="https://cloud.google.com/blog/products/ai-machine-learning/anthropics-claude-opus-4-and-claude-sonnet-4-on-vertex-ai/">Anthropic’s Claude Opus 4 and Claude Sonnet 4 on Vertex AI</a></p>
<ul>
<li style="font-weight:400;">Anthropic’s newest Claude models (Opus 4 and Sonnet 4) are now available as a Model-as-a-Service offering on Google Cloud’s Vertex AI platform. This expands the choice of powerful foundation models developers can easily access and deploy.</li>
<li style="font-weight:400;">Who would have guessed? </li>
</ul>
<p>41:01 <a href="https://cloud.google.com/blog/products/identity-security/google-advances-sovereignty-choice-and-security-in-the-cloud/">Google advances sovereignty, choice, and security in the cloud</a></p>
<ul>
<li style="font-weight:400;">Google Cloud is announcing significant updates to its <a href="https://cloud.google.com/blog/products/identity-security/how-google-cloud-is-addressing-data-sovereignty-in-europe-2020?e=48754805">sovereign cloud solutions</a>, giving customers greater control, choice, and security without compromising functionality.</li>
<li style="font-weight:400;">Key offerings include:
<ul>
<li style="font-weight:400;">Google Cloud Data Boundary: Allows deploying <a href="https://cloud.google.com/security/products/assured-workloads">sovereign data boundaries</a> to control data storage/processing location and manage encryption keys externally.</li>
<li style="font-weight:400;">Google Cloud Dedicated: Designed to meet local sovereignty requirements through partnerships (e.g. Thales <a href="https://www.s3ns.io/en/offres/trusted-cloud-by-S3NS">S3NS</a> in France.)  </li>
<li style="font-weight:400;"><a href="https://cloud.google.com/distributed-cloud-air-gapped">Google Cloud Air-Gapped</a>: Fully standalone solution not requiring external network connectivity, tailored for intelligence/defense sectors</li>
</ul>
</li>
<li style="font-weight:400;">These solutions leverage Google’s massive global infrastructure (42+ regions, 202 edge locations) and key partnerships across regions.</li>
<li style="font-weight:400;">The updates enable customers to choose solutions aligning with business needs, regulations, and risk profiles – not a one-size-fits-all approach.</li>
<li style="font-weight:400;">Combines local control with access to Google’s leading security like AI-powered defenses, Confidential Computing, post-quantum crypto.</li>
<li style="font-weight:400;">Relevant for organizations navigating complex digital sovereignty landscape, especially in regulated industries and public sector. </li>
</ul>
<p>42:02  Ryan – “It’s kind of nice the way that Google does this versus AWS, right? AWS has GovCloud – and it’s almost like a separate product and a whole separate authentication, whereas these are built in.”</p>
<p>44:53 <a href="https://cloud.google.com/blog/products/data-analytics/convert-ai-generated-unstructured-data-to-a-bigquery-table/">Convert AI-generated unstructured data to a BigQuery table</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/bigquery/docs/release-notes#April_03_2025">AI.GENERATE_TABLE</a> is a new <a href="https://www.bing.com/aclk?ld=e8KgZHktUAEvRKoLS0uIUOADVUCUwMMqJ-NeJ4JiCc4bWSM1fc9InMCwJWArVkBsGhYb8tysECr9fW9YWhG1ww-LvLqt78vKI-Z7MVTn0xILUgRCWVzPWMsh1Dpb8ydgnKiS3uKP-affUzOD3iZNxZzNUNy98JhhNuZ4qZpcWR7xBb9mQrj5TFSK27DCNXwKgfrc49Ew&amp;u=aHR0cHMlM2ElMmYlMmZjbG91ZC5nb29nbGUuY29tJTJmYmlncXVlcnklM2Z1dG1fc291cmNlJTNkYmluZyUyNnV0bV9tZWRpdW0lM2RjcGMlMjZ1dG1fY2FtcGFpZ24lM2RuYS1VUy1hbGwtZW4tZHItYmt3cy1hbGwtYWxsLXRyaWFsLWUtZHItMTcxMDEzNCUyNnV0bV9jb250ZW50JTNkdGV4dC1hZC1ub25lLWFueS1ERVZfYy1DUkVfLUFER1BfRGVzayUyYiUyNTdDJTJiQktXUyUyYi0lMmJFWEElMmIlMjU3QyUyYlR4dC1EYXRhJTJiQW5hbHl0aWNzLUJpZ1F1ZXJ5LUtXSURfNDM3MDAwODA0ODM3MTgzODYta3dkLTc2NjkxMzIxMDg2NDc3JTNhbG9jLTE5MCUyNnV0bV90ZXJtJTNkS1dfYmlncXVlcnktU1RfYmlncXVlcnklMjZnY2xpZCUzZDY2YzVhMjY5ZTZkYjE2MDhmMDc3Y2E1NjM2OGVmMDA3JTI2Z2Nsc3JjJTNkM3AuZHMlMjZtc2Nsa2lkJTNkNjZjNWEyNjllNmRiMTYwOGYwNzdjYTU2MzY4ZWYwMDc&amp;rlid=66c5a269e6db1608f077ca56368ef007">BigQuery</a> feature that converts unstructured data (images, text) into structured tables using advanced AI models like <a href="https://gemini.google/subscriptions/">Gemini 2.5 Pro/Flash</a>. </li>
<li style="font-weight:400;">It builds upon ML.GENERATE_TEXT to streamline the process of extracting insights and making unstructured data compatible with existing data analysis workflows</li>
<li style="font-weight:400;">While AWS and Azure offer some AI services for unstructured data, the tight integration between BigQuery and Vertex AI and the ability to directly generate structured tables sets GCP apart.</li>
<li style="font-weight:400;">The feature leverages large language models and techniques like constrained decoding to accurately extract key information and generate output matching a specified schema.</li>
<li style="font-weight:400;">It integrates seamlessly with BigQuery and <a href="https://cloud.google.com/storage">Google Cloud Storage</a>, allowing users to analyze the extracted data using familiar SQL queries and tools.</li>
<li style="font-weight:400;">Key use cases include analyzing social media content, processing medical transcriptions, and gaining insights from large collections of documents or images.</li>
<li style="font-weight:400;">This feature democratizes access to advanced AI capabilities, enabling more businesses to derive value from their unstructured data without needing deep AI expertise.</li>
</ul>
<p>45:31  Ryan – “The ability to sort of take a bucket of unstructured data and then have this – it’s effectively data labeling – AI data labeling of your images and your unstructured data, and then populating that metadata into BigQuery tables is pretty rad.”</p>
<p>46:33 <a href="https://cloud.google.com/blog/products/ai-machine-learning/mistral-ais-le-chat-enterprise-and-mistral-ocr-25-05-on-google-cloud/">Mistral AI’s Le Chat Enterprise and Mistral OCR 25.05 on Google Cloud</a></p>
<ul>
<li style="font-weight:400;"><a href="https://console.cloud.google.com/marketplace/product/mistralai/le-chat-enterprise">Mistral AI’s Le Chat Enterprise</a>, an AI assistant for enterprise search, custom agents, document libraries and more, is now available on <a href="https://cloud.google.com/marketplace">Google Cloud Marketplace</a>. Allowing for the building of custom AI agents without code.</li>
<li style="font-weight:400;"><a href="https://cloud.google.com/blog/products/ai-machine-learning/mistral-ais-le-chat-enterprise-and-mistral-ocr-25-05-on-google-cloud">Mistral OCR 25.05</a>, a powerful optical character recognition model for document understanding, is now available as a managed service on Vertex AI. </li>
<li style="font-weight:400;">It can comprehend text, charts, tables, equations in documents with high accuracy.</li>
<li style="font-weight:400;">Compared to other cloud AI platforms, Google Cloud offers an open, flexible ecosystem to build custom AI solutions by integrating pre-trained models like Mistral’s. </li>
<li style="font-weight:400;"><a href="https://mistral.ai/products/le-chat">Le Chat Enterprise</a> leverages Google Cloud’s secure, scalable infrastructure and integrates with services like BigQuery and Cloud SQL. </li>
<li style="font-weight:400;"><a href="https://mistral.ai/news/mistral-ocr">Mistral OCR</a> is one of 200+ foundation models in <a href="https://cloud.google.com/vertex-ai/generative-ai/docs/model-garden/explore-models">Vertex AI Model Garden</a>.</li>
<li style="font-weight:400;">Research analysis, generating insights from data, code development, content creation with Le Chat. Digitizing scientific papers, historical documents, customer service docs with Mistral OCR are all use cases. </li>
<li style="font-weight:400;">Industries that can benefit include finance, marketing, research institutions, customer service, engineering, legal and more.</li>
<li style="font-weight:400;">These Mistral AI offerings expand the options for enterprises to build generative AI agents and document AI pipelines on Google Cloud without needing to train custom models from scratch.</li>
<li style="font-weight:400;">Interested in pricing info? Reach out to the sales team via the <a href="https://console.cloud.google.com/marketplace/product/mistralai/le-chat-enterprise">Google Marketplace Listing</a>. </li>
</ul>
<p>47:34   Matt- “The concept of the paperless corporate environment is still not here, and this proves it.”</p>
<h2>Azure</h2>
<p>49:23 <a href="https://techcommunity.microsoft.com/blog/azurecompute/announcing-the-general-availability-of-azure-fxv2-series-virtual-machines/4414399">Announcing the General Availability of Azure FXv2-series Virtual Machines</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/virtual-machines/sizes/compute-optimized/fsv2-series">Azure FXv2-series Virtual Machines</a>, powered by <a href="https://newsroom.intel.com/artificial-intelligence/5th-gen-xeon-data-center-news">5th Gen Intel Xeon Platinum processors</a>, are now generally available for compute-intensive workloads like databases, analytics, and EDA.</li>
<li style="font-weight:400;">Integrates with Azure Boost for improved networking, storage, CPU performance and security, and supports all Azure remote disk types including Premium SSD v2 and Ultra Disk.</li>
<li style="font-weight:400;">Offers up to 50% better CPU performance vs previous FXv1-series, with up to 96 vCPUs, 1832 GiB memory, and enhanced AI capabilities with Intel AMX</li>
<li style="font-weight:400;">Competes favorably with similar compute-optimized instances from AWS (C6i) and GCP (C2), with higher core counts and memory.</li>
<li style="font-weight:400;">Targets customers running SQL Server, Oracle databases, supply chain solutions, and mission-critical apps requiring high IOPS and low latency.</li>
<li style="font-weight:400;">Premium AND Ultra disks. Cool! </li>
</ul>
<p>50:52 <a href="https://techcommunity.microsoft.com/blog/appsonazureblog/red-hat-openshift-virtualization-on-azure-red-hat-openshift-in-public-preview/4409301">Red Hat OpenShift Virtualization on Azure Red Hat OpenShift in Public </a><a href="https://techcommunity.microsoft.com/blog/appsonazureblog/red-hat-openshift-virtualization-on-azure-red-hat-openshift-in-public-preview/4409301">Preview</a></p>
<ul>
<li style="font-weight:400;">Unifies management of VMs and containers on a single platform, allowing organizations to modernize at their own pace while leveraging existing VM investments.</li>
<li style="font-weight:400;">Integrates with Azure services like <a href="https://techcommunity.microsoft.com/blog/itopstalkblog/what-is-azure-hybrid-benefit/1764400">Azure Hybrid Benefit</a> for cost savings, Azure security tools for enhanced protection, and <a href="https://learn.microsoft.com/en-us/azure/openshift/intro-openshift">Azure Red Hat OpenShift</a> for a managed OpenShift platform. </li>
<li style="font-weight:400;">Utilizes the <a href="https://ubuntu.com/blog/kvm-hyphervisor">KVM hypervisor</a> and <a href="https://www.redhat.com/en/technologies/linux-platforms/enterprise-linux">Red Hat Enterprise Linux</a> for improved virtualization performance and security. </li>
<li style="font-weight:400;">Differentiates from AWS and GCP by offering a fully managed, jointly engineered Red Hat OpenShift platform with native virtualization capabilities.</li>
<li style="font-weight:400;">Targets customers in industries like financial services, healthcare, manufacturing, and retail who need to modernize legacy applications incrementally. </li>
<li style="font-weight:400;">There is no additional fee for OpenShift Virtualization, but standard ARO pricing for worker nodes applies (Starts at $0.171/hour for a 4 vCPU worker node.)</li>
</ul>
<p>55:14 <a href="https://techcommunity.microsoft.com/blog/adformysql/announcing-key-maintenance-experience-enhancements-for-azure-database-for-mysql/4411810">Announcing key maintenance experience enhancements for Azure Database for MySQL</a></p>
<ul>
<li style="font-weight:400;">Provides more control, visibility and predictability over how maintenance is orchestrated across Azure Database for MySQL environments.</li>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/azure/mysql/flexible-server/concepts-maintenance#virtual-canary-maintenance-preview">Virtual Canary</a> (GA) Allows enrolling specific servers into an early maintenance ring to validate updates before broader rollout. Simplifies detecting potential compatibility issues early.</li>
<li style="font-weight:400;">Maintenance Batches explicitly assign servers to different execution batches within the same maintenance window. Ensures maintenance proceeds in a predictable, user-defined order.</li>
<li style="font-weight:400;">Maintenance Rollout Status Check (in preview) provides a centralized view of maintenance activity across servers. Users can monitor rollout progress and identify anomalies from the Azure Portal or programmatically via Azure Resource Graph. </li>
<li style="font-weight:400;">Improves transparency, reliability and alignment with enterprise deployment strategies for Azure Database for MySQL maintenance</li>
<li style="font-weight:400;">Targets customers running development workloads or managing complex multi-environment MySQL rollouts on Azure. </li>
</ul>
<p>55:44   Matt- “It’s a decently nice feature; it’s just amazing it rolled out on Azure first.” </p>
<p>57:44 <a href="https://blog.fabric.microsoft.com/en-GB/blog/warehouse-snapshots-in-microsoft-fabric-public-preview/">Warehouse Snapshots in Microsoft Fabric (Preview)</a></p>
<ul>
<li style="font-weight:400;">Guess what this does? Did you guess right? Warehouse Snapshots provides a stable, read-only view of an Azure Data Warehouse at a specific point in time, ensuring data consistency for analytics and reporting without disruptions from ETL processes.</li>
<li style="font-weight:400;">Snapshots can be seamlessly rolled forward to reflect the latest warehouse state, allowing consumers to access the same snapshot using a consistent connection string.</li>
<li style="font-weight:400;">This feature integrates with the Microsoft Fabric ecosystem, enabling users to create, manage, and query snapshots using the Fabric portal, T-SQL, or the Fabric API.</li>
<li style="font-weight:400;">Warehouse Snapshots offer benefits such as guaranteed data consistency, immediate roll-forward updates, historical analysis capabilities, and enhanced reporting accuracy.</li>
<li style="font-weight:400;">While AWS Redshift and Google BigQuery offer similar snapshot features, Azure’s Warehouse Snapshots stand out with their seamless integration into the Microsoft Fabric ecosystem and the ability to roll forward snapshots atomically.</li>
<li style="font-weight:400;">Target customers include data engineers and analysts who require stable datasets for accurate reporting and analytics, even as real-time updates occur in the background.</li>
</ul>
<p>58:16  Ryan – “Very cool. This protects you from Little Johnny drop table!” </p>
<p>58:42 <a href="https://azure.github.io/AppService/2025/05/19/Aspire-on-App-Service.html">Getting Started with .NET Aspire (Preview) on Azure App Service – Azure </a><a href="https://azure.github.io/AppService/2025/05/19/Aspire-on-App-Service.html">App Service</a></p>
<ul>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/app-service/overview">Azure App Service</a> now offers preview support for deploying <a href="https://learn.microsoft.com/dotnet/aspire/get-started/aspire-overview">.NET Aspire </a>applications, enabling developers to host their distributed apps on Azure’s fully managed platform.</li>
<li style="font-weight:400;">.NET Aspire is Microsoft’s new framework for building modern distributed applications, and this integration brings it into the broader Azure ecosystem.</li>
<li style="font-weight:400;">Developers can use familiar tools like Visual Studio and the Azure Developer CLI (azd) to build, deploy, and manage their Aspire apps on App Service.</li>
<li style="font-weight:400;">While AWS and GCP offer similar managed platforms, the tight integration between .NET Aspire and Azure App Service provides a streamlined experience for .NET developers.</li>
<li style="font-weight:400;">This preview targets .NET developers looking to build and deploy distributed applications with minimal infrastructure management.</li>
<li style="font-weight:400;">Pricing varies based on App Service Plan and usage, but a Free tier is available for testing and small workloads. </li>
</ul>
<p>1:01:32 <a href="https://techcommunity.microsoft.com/blog/azurenetworkingblog/secure-your-subnet-via-private-subnet-and-explicit-outbound-methods/3984177">Secure your subnet via private subnet and explicit outbound methods |</a> <a href="https://techcommunity.microsoft.com/blog/azurenetworkingblog/secure-your-subnet-via-private-subnet-and-explicit-outbound-methods/3984177">Microsoft Community Hub</a></p>
<ul>
<li style="font-weight:400;">File this under “news we’re kind of shocked about” – Azure is retiring implicit outbound connectivity for VMs in Sept 2025. This default outbound access assigns public IPs that are insecure and hard to manage.</li>
<li style="font-weight:400;">The new private subnet feature (in preview) prevents implicit outbound access. VMs in a private subnet require an explicit outbound method to connect to the internet. </li>
<li style="font-weight:400;">Azure’s recommended explicit outbound methods are: 1) NAT Gateway, 2) public load balancer with outbound rules, 3) public IP on the VM NIC.</li>
<li style="font-weight:400;">NAT Gateway is the preferred option – it provides secure, scalable outbound connectivity by SNAT’ing private IPs to a static public IP. No inbound connections are allowed.</li>
<li style="font-weight:400;">Load balancers with outbound rules also SNAT private IPs but require manual allocation of SNAT ports to each backend VM. This allows declarative control but is less scalable.</li>
<li style="font-weight:400;">Public IPs on VM NICs give control over the outbound IP but don’t scale well for complex workloads needing many-to-one SNAT that adjusts to traffic.</li>
<li style="font-weight:400;">These explicit methods integrate with Azure Virtual Network and follow a precedence order if multiple are configured (NAT Gateway &gt; LB &gt; Public IP).</li>
<li style="font-weight:400;">The shift to explicit outbound aligns with Azure’s secure-by-default approach. It matters for security-conscious customers running internet-facing workloads on Azure VMs.</li>
<li style="font-weight:400;">NAT Gateway pricing estimate: $0.045/hour + $0.045 per GB processed (varies by region, general estimate.)</li>
</ul>
<p>1:03:11  Matt – “There is one other option, which is using the Azure Firewall to write everything through it. It has a lower limit if you need more than the number of Snap ports running. So if you go to Firewall versus the NAT, but also they made the announcement that they were retiring implicit outbound connectivity in like 2022 or 2023. They’re ending it in September and they’re just GA’ing this feature in May… to me, this is like Azure’s running EC2 classic still, and they’re finally moving into let’s actually use our VNets and VPCs.”</p>
<h2>Cloud Journey</h2>
<p>1:01:32 Justin Does a Thing: Bolt Bot</p>
<h2>Aftershow</h2>
<p>1:01:32 <a href="https://arstechnica.com/gadgets/2025/05/return-of-the-turbo-button-silverstone-is-making-another-80s-style-beige-pc-case/">SilverStone is back with a beige PC case that looks just like your crappy </a><a href="https://arstechnica.com/gadgets/2025/05/return-of-the-turbo-button-silverstone-is-making-another-80s-style-beige-pc-case/">old 486 – Ars Technica</a></p>
<ul>
<li style="font-weight:400;">SilverStone has unveiled the FLP02, a new PC case that pays homage to the beige tower cases of the 486 and early Pentium era, complete with a faux Turbo button and power switch lock.</li>
<li style="font-weight:400;">Despite its retro exterior, the FLP02 can accommodate modern high-end components, including full-size ATX motherboards, 360mm radiators, and the latest GPUs like the GeForce RTX 5090 or 5080.</li>
<li style="font-weight:400;">While not directly related to cloud computing, the FLP02 showcases the enduring appeal of nostalgia in the tech industry and how it can drive consumer interest and sales.</li>
<li style="font-weight:400;">The case’s ability to blend vintage aesthetics with cutting-edge hardware demonstrates the flexibility and adaptability of modern PC components, a principle that also applies to cloud infrastructure.</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2059026/c1e-xm8mb9p0xzbr8nwv-rk493qprfvp8-mpbtbd.mp3" length="113049856"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 306 of The Cloud Pod – where the forecast is always cloudy! 
This week, we have a bunch of announcements concerning the newest offering from Anthropic – Claude Sonnet 4 and Opus 4, plus container security, Azure MySQL Maintenance, Vertex AI, and Mistral AI. Plus, we’ve got a Cloud Journey installment AND an aftershow – so get comfy and get ready for a trip to the clouds!
Titles we almost went with this week:

ECS Failures Now Have 4x the Excuses
Nailing Down Your Container Security, One Patch at a Time
HashiCorp’s New Recipe: Terraform, AI, and a Pinch of MCP
Teaching an Old DNS New IPv6 Tricks
Dash-ing through the Klusters, in an AWS Console
Google’s Generative AI Playground Gets a Glow-Up
Vertex AI Studio: Now with 200% More Darkness! Like our souls
Claude Opus 4 Strikes a Chord on Google Cloud
Sovereign-teed to Please: Google Cloud’s Royal Treatment
Google’s Cloud Kingdom Expands its Borders
Shall I Compare Thee to a Summer’s AI? Anthropic Drops Sonne(t) 4 Knowledge on Vertex
Mistral AI Chats Up a Storm on Google Cloud
Google Cloud’s Vertex AI Gets a Dose of Mistral Magic
.NET Aspire on Azure: The App Service Strikes Back
Default Outbound Access Retires, Decides Florida Isn’t for Everyone 

AI Is Going Great – or How ML Makes Money 
01:52 Introducing Claude 4

Claude has launched the latest models in Claude Opus 4 and Claude Sonnet 4, setting new standards for coding, advancing reasoning and AI agents. Maybe they’ll actually follow instructions when told to shut down? (Looking at you, ChatGPT.)
Claude Opus 4 is “the world’s best coding model” with sustained performance on complex, long-running tasks and agent workflows. 
Opus 4 has 350 billion parameters, making it one of the largest publicly available language models. 
It demonstrates strong performance on academic benchmarks, including research. 
Sonnet 4 is a smaller 10 billion parameter model optimized for dialogue, making it well-suited for conversational AI applications. 
Alongside the models, they are also announcing:

Extended thinking with tool use (beta): Both models can use tools – like web search – during extended thinking, allowing Claude to alternate between reasoning and tool use to improve its responses.
New Model Capabilities: Both models can use tools in parallel, follow instructions more precisely, and when given access to local files by developers — demonstrate significantly improved memory capabilities, extracting and saving key facts maintain continuity and build tacit knowledge over time
Claude code is now generally available: After receiving extensive positive feedback during our research preview, they are expanding how developers can collaborate with Claude.  Claude code now supports background tasks via github actions and native integrations with VS code and jetbrains, displaying edits directly in your files for seamless pair programming. 
New Api capabilities: Four new capabilities on the API that enable developers to build more powerful AI agents including Code Execution tool, MCP connector, Files API and the ability to cache...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2059026/c1a-k5d5-kp4v98ojbrk7-zpsk4n.jpg"></itunes:image>
                                                                            <itunes:duration>01:34:13</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2059026/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[305: AWS Breaks Up with Unpopular Services - "It's Not You, It's Me"]]>
                </title>
                <pubDate>Wed, 28 May 2025 14:57:15 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2051582</guid>
                                    <link>https://tcpfm.castos.com/episodes/305-aws-breaks-up-with-unpopular-services-its-not-you-its-me-1</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 305 of The Cloud Pod – where the forecast is always cloudy! How did you do on your Microsoft Build Predictions? As badly as us? Plus we’ve got news on AWS service changes, a lifecycle catch up page for all those services that bought the farm, tons of Gemini news (seriously, like a lot) and even some AI for .NET. </p>
<p>Welcome to the cloud pod- and thanks for joining us! </p>
<h3>Titles we almost went with this week:</h3>
<ul>
<li>Google’s Jules: An AI Gem for Cloud Devs  </li>
<li>Autonomous Agents of Code: Jules’ Excellent Adventure in the Google Cloud</li>
<li>Gemini 2.5 Shoots for the Stars with Cosmic-Sized AI Upgrades</li>
<li>Resistance is Futile: OpenAI Assimilates Your Codebase </li>
<li>AWS Transformers: Rise of the Agentic AI </li>
<li>Teaching an old .NET dog new Linux tricks</li>
<li>CodeBuild Puts Docker Builds in Hyperdrive</li>
<li>Inspector Gadget’s New Trick: Mapping Container Vulnerabilities</li>
<li>Yo Dawg, I Heard You Like Scanning Containers…</li>
<li>Google Cranks AI to 11 with New Ultra Plan</li>
<li>I, For One, Welcome Our New AI Ultra Overlords</li>
<li>The Inference Engine That Could: llm-d Chugs Ahead with Kubernetes-Native </li>
<li>      Scaling</li>
<li>Scaling Inference to Infinity and Beyond with Google Cloud’s llm-d</li>
<li>Google Cloud and Spring AI: A Match Made in Java-n</li>
<li>The Fast and the Serverless: Cloud Run Drifts into AI Studio Territory</li>
<li>SQL Server 2025: A Vector Victor, Not a Scalar Failure</li>
<li>AI will solve my life problems of having money in my pocket</li>
<li>I used to scan all the containers but now I will just scan yours</li>
</ul>
<h2>AI Is Going Great – or How ML Makes Money </h2>
<p>01:50 <a href="https://blog.google/technology/google-labs/jules/">Jules: Google’s autonomous AI coding agent</a></p>
<ul>
<li style="font-weight:400;"><a href="http://jules.google/">Jules</a> is an autonomous AI agent that can read code, understand intent, and make code changes on its own. </li>
<li style="font-weight:400;">It goes beyond AI coding assistants to operate independently.</li>
<li style="font-weight:400;">It clones code into a secure Google Cloud VM, allowing it to understand the full context of a project. This enables it to write tests, build features, fix bugs, and more.</li>
<li style="font-weight:400;">Jules operates asynchronously in the background, presenting its plan and reasoning when complete. This allows developers to focus on other tasks while it works.</li>
<li style="font-weight:400;">Integration with <a href="https://github.com/">GitHub</a> enables Jules to work directly in existing workflows without extra setup or context switching. Developers can steer and give feedback throughout the process.</li>
<li style="font-weight:400;">For cloud developers, Jules demonstrates the rapid advancement of AI for coding moving from prototype to product. Its cloud-based parallel execution enables efficient handling of complex, multi-file changes.</li>
<li style="font-weight:400;">While in public beta, Jules is free with some usage limits. This allows developers to experiment with this cutting-edge AI coding agent and understand its potential to accelerate development on <a href="https://cloud.google.com/">Google Cloud</a>.</li>
</ul>
<p>02:56  Ryan – “More and more, as new tools get released, it’s just going to change the way anything gets written… it’s getting more and more capable.” </p>
<p>05:45 <a href="https://blog.google/technology/ai/google-flow-veo-ai-filmmaking-tool/">Introducing Flow: Google’s AI filmmaking tool designed for Veo</a></p>
<ul>
<li style="font-weight:400;"><a href="http://flow.google/">Flow</a> is an AI-powered filmmaking tool custom-designed for Google’s advanced video, image and language models (<a href="https://www.axios.com/2025/05/23/google-ai-videos-veo-3">Veo</a>, <a href="https://deepmind.google/models/imagen/">Imagen</a>, <a href="https://gemini.google.com/">Gemini</a>). It allows creators to generate cinem...</li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:00) - Cloud Pod: AWS Breaks Up With unpopular Services</li><li>(00:01:02) - Google's Joules: A Code Editing Agent for Cloud Developers</li><li>(00:04:54) - Google's AI Filmmaking Tool, Flow</li><li>(00:08:45) - Gemini 2.5 Large Language Models Update</li><li>(00:10:33) - Google's Alpha Evolve: The AI Coding Agent</li><li>(00:12:50) - OpenAI's Codex AI Agent for Cloud Development</li><li>(00:14:44) - HashiCorp Validated Patterns for Cloud-based IT</li><li>(00:16:49) - Amazon AWS: End Support for Several Services</li><li>(00:21:12) - Amazon's New Strands AI Agent SDK</li><li>(00:28:01) - Cloud Cost Management: The Right Step for IT Pros</li><li>(00:31:36) - AWS Code Build: New Docker Server Capability</li><li>(00:33:18) - Amazon Inspector for Docker & ECR</li><li>(00:34:51) - Google AI Ultra: A Premium Subscription Plan</li><li>(00:39:10) - Database Center</li><li>(00:40:32) - PostgreSQL on GKE</li><li>(00:43:32) - Google Cloud Introduces LLM-D for Large Language Inference</li><li>(00:47:00) - Spring Boot: AI in Java 1.0</li><li>(00:49:33) - Google Cloud: Bringing AI Studio to Cloud Run</li><li>(00:51:12) - Google's Vertex AI for Creative Content Generation</li><li>(00:52:30) - Two Gemini Stories In One Week</li><li>(00:52:46) - Microsoft's Build 2020 Prediction</li><li>(00:55:14) - Microsoft's App Services Platform Announcement</li><li>(00:57:24) - Microsoft's Cloud Announcement</li><li>(00:59:25) - Azure AI Foundry: New Features, Changes</li><li>(01:01:58) - Microsoft Fabric and Azure Data Portfolio: Powering the Next AI</li><li>(01:04:22) - Microsoft Discovery: Accelerating Research and Development (New Platform)</li><li>(01:06:35) - Microsoft, GitHub Copilot: Agentic DevOps</li><li>(01:09:33) - Oracle Launches E6 Cloud Compute</li><li>(01:11:32) - Week in the Cloud: Starting Late</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 305 of The Cloud Pod – where the forecast is always cloudy! How did you do on your Microsoft Build Predictions? As badly as us? Plus we’ve got news on AWS service changes, a lifecycle catch up page for all those services that bought the farm, tons of Gemini news (seriously, like a lot) and even some AI for .NET. 
Welcome to the cloud pod- and thanks for joining us! 
Titles we almost went with this week:

Google’s Jules: An AI Gem for Cloud Devs  
Autonomous Agents of Code: Jules’ Excellent Adventure in the Google Cloud
Gemini 2.5 Shoots for the Stars with Cosmic-Sized AI Upgrades
Resistance is Futile: OpenAI Assimilates Your Codebase 
AWS Transformers: Rise of the Agentic AI 
Teaching an old .NET dog new Linux tricks
CodeBuild Puts Docker Builds in Hyperdrive
Inspector Gadget’s New Trick: Mapping Container Vulnerabilities
Yo Dawg, I Heard You Like Scanning Containers…
Google Cranks AI to 11 with New Ultra Plan
I, For One, Welcome Our New AI Ultra Overlords
The Inference Engine That Could: llm-d Chugs Ahead with Kubernetes-Native 
      Scaling
Scaling Inference to Infinity and Beyond with Google Cloud’s llm-d
Google Cloud and Spring AI: A Match Made in Java-n
The Fast and the Serverless: Cloud Run Drifts into AI Studio Territory
SQL Server 2025: A Vector Victor, Not a Scalar Failure
AI will solve my life problems of having money in my pocket
I used to scan all the containers but now I will just scan yours

AI Is Going Great – or How ML Makes Money 
01:50 Jules: Google’s autonomous AI coding agent

Jules is an autonomous AI agent that can read code, understand intent, and make code changes on its own. 
It goes beyond AI coding assistants to operate independently.
It clones code into a secure Google Cloud VM, allowing it to understand the full context of a project. This enables it to write tests, build features, fix bugs, and more.
Jules operates asynchronously in the background, presenting its plan and reasoning when complete. This allows developers to focus on other tasks while it works.
Integration with GitHub enables Jules to work directly in existing workflows without extra setup or context switching. Developers can steer and give feedback throughout the process.
For cloud developers, Jules demonstrates the rapid advancement of AI for coding moving from prototype to product. Its cloud-based parallel execution enables efficient handling of complex, multi-file changes.
While in public beta, Jules is free with some usage limits. This allows developers to experiment with this cutting-edge AI coding agent and understand its potential to accelerate development on Google Cloud.

02:56  Ryan – “More and more, as new tools get released, it’s just going to change the way anything gets written… it’s getting more and more capable.” 
05:45 Introducing Flow: Google’s AI filmmaking tool designed for Veo

Flow is an AI-powered filmmaking tool custom-designed for Google’s advanced video, image and language models (Veo, Imagen, Gemini). It allows creators to generate cinem...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[305: AWS Breaks Up with Unpopular Services - "It's Not You, It's Me"]]>
                </itunes:title>
                                    <itunes:episode>305</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 305 of The Cloud Pod – where the forecast is always cloudy! How did you do on your Microsoft Build Predictions? As badly as us? Plus we’ve got news on AWS service changes, a lifecycle catch up page for all those services that bought the farm, tons of Gemini news (seriously, like a lot) and even some AI for .NET. </p>
<p>Welcome to the cloud pod- and thanks for joining us! </p>
<h3>Titles we almost went with this week:</h3>
<ul>
<li>Google’s Jules: An AI Gem for Cloud Devs  </li>
<li>Autonomous Agents of Code: Jules’ Excellent Adventure in the Google Cloud</li>
<li>Gemini 2.5 Shoots for the Stars with Cosmic-Sized AI Upgrades</li>
<li>Resistance is Futile: OpenAI Assimilates Your Codebase </li>
<li>AWS Transformers: Rise of the Agentic AI </li>
<li>Teaching an old .NET dog new Linux tricks</li>
<li>CodeBuild Puts Docker Builds in Hyperdrive</li>
<li>Inspector Gadget’s New Trick: Mapping Container Vulnerabilities</li>
<li>Yo Dawg, I Heard You Like Scanning Containers…</li>
<li>Google Cranks AI to 11 with New Ultra Plan</li>
<li>I, For One, Welcome Our New AI Ultra Overlords</li>
<li>The Inference Engine That Could: llm-d Chugs Ahead with Kubernetes-Native </li>
<li>      Scaling</li>
<li>Scaling Inference to Infinity and Beyond with Google Cloud’s llm-d</li>
<li>Google Cloud and Spring AI: A Match Made in Java-n</li>
<li>The Fast and the Serverless: Cloud Run Drifts into AI Studio Territory</li>
<li>SQL Server 2025: A Vector Victor, Not a Scalar Failure</li>
<li>AI will solve my life problems of having money in my pocket</li>
<li>I used to scan all the containers but now I will just scan yours</li>
</ul>
<h2>AI Is Going Great – or How ML Makes Money </h2>
<p>01:50 <a href="https://blog.google/technology/google-labs/jules/">Jules: Google’s autonomous AI coding agent</a></p>
<ul>
<li style="font-weight:400;"><a href="http://jules.google/">Jules</a> is an autonomous AI agent that can read code, understand intent, and make code changes on its own. </li>
<li style="font-weight:400;">It goes beyond AI coding assistants to operate independently.</li>
<li style="font-weight:400;">It clones code into a secure Google Cloud VM, allowing it to understand the full context of a project. This enables it to write tests, build features, fix bugs, and more.</li>
<li style="font-weight:400;">Jules operates asynchronously in the background, presenting its plan and reasoning when complete. This allows developers to focus on other tasks while it works.</li>
<li style="font-weight:400;">Integration with <a href="https://github.com/">GitHub</a> enables Jules to work directly in existing workflows without extra setup or context switching. Developers can steer and give feedback throughout the process.</li>
<li style="font-weight:400;">For cloud developers, Jules demonstrates the rapid advancement of AI for coding moving from prototype to product. Its cloud-based parallel execution enables efficient handling of complex, multi-file changes.</li>
<li style="font-weight:400;">While in public beta, Jules is free with some usage limits. This allows developers to experiment with this cutting-edge AI coding agent and understand its potential to accelerate development on <a href="https://cloud.google.com/">Google Cloud</a>.</li>
</ul>
<p>02:56  Ryan – “More and more, as new tools get released, it’s just going to change the way anything gets written… it’s getting more and more capable.” </p>
<p>05:45 <a href="https://blog.google/technology/ai/google-flow-veo-ai-filmmaking-tool/">Introducing Flow: Google’s AI filmmaking tool designed for Veo</a></p>
<ul>
<li style="font-weight:400;"><a href="http://flow.google/">Flow</a> is an AI-powered filmmaking tool custom-designed for Google’s advanced video, image and language models (<a href="https://www.axios.com/2025/05/23/google-ai-videos-veo-3">Veo</a>, <a href="https://deepmind.google/models/imagen/">Imagen</a>, <a href="https://gemini.google.com/">Gemini</a>). It allows creators to generate cinematic video clips and scenes.</li>
<li style="font-weight:400;">The tool leverages cloud AI capabilities to make AI video generation more accessible. Creators can describe their vision in plain language, and bring their own image/video assets.</li>
<li style="font-weight:400;">Key features include camera controls, scene editing/extension, asset management, and a library of example clips. This aims to enable a new wave of AI-assisted filmmaking.</li>
<li style="font-weight:400;">Flow is an evolution of Google’s earlier VideoFX experiment, now productized for Google AI cloud subscribers. It’s an example of applied ML moving from research into cloud products and services.</li>
<li style="font-weight:400;">Potential use cases include storyboarding, pre-visualization, and final rendered clips for both amateurs and professional filmmakers. Early collaborations demonstrate applications in short films.</li>
<li style="font-weight:400;">For cloud providers and developers, Flow showcases how foundational AI models can be packaged into vertical applications. It represents an emerging class of AI tools built on cloud infrastructure.</li>
<li style="font-weight:400;">The ‘so what’: Flow demonstrates tangible progress in making generative AI accessible to creatives, powered by the scale and ease-of-use of the cloud. It signals the disruptive potential of cloud AI to reshape content creation industries.</li>
<li style="font-weight:400;">As of right now, Flow is available to users of Google AI Pro and <a href="https://blog.google/products/google-one/google-ai-ultra">Google AI Ultra</a>.</li>
</ul>
<p>06:53  Ryan – “This is another area – like coding – it’s going to change movie making and directing; because not only do you need to have the vision in your head, but you have to be good at the prompt engineering to get it out.” </p>
<p> </p>
<p>07:37 <a href="https://blog.google/technology/google-deepmind/gemini-universal-ai-assistant/">Google I/O 2025: Gemini as a universal AI assistant</a></p>
<ul>
<li style="font-weight:400;">Google is extending its multimodal foundation model, <a href="https://cloud.google.com/vertex-ai/generative-ai/docs/models/gemini/2-5-pro">Gemini 2.5 Pro</a>, into a “world model” that can understand, simulate, plan and imagine like the human brain. What could go wrong? </li>
<li style="font-weight:400;">Gemini is showing emerging capabilities to simulate environments, understand physics, and enable robots to grasp and follow instructions. </li>
<li style="font-weight:400;">The goal is to make Gemini a universal AI assistant that can perform tasks, handle admin, make recommendations, and enrich people’s lives across any device.</li>
<li style="font-weight:400;">Google is integrating live AI capabilities from <a href="https://deepmind.google/models/project-astra/">Project Astra</a> like video understanding, screen sharing and memory into products like <a href="https://gemini.google/overview/gemini-live/">Gemini Live</a>, Search, and the <a href="https://ai.google.dev/gemini-api/docs/live">Live API</a>. </li>
<li style="font-weight:400;"><a href="https://deepmind.google/models/project-mariner/">Project Mariner</a> is a research prototype exploring agentic capabilities, with a system of agents that can multitask up to 10 different things like looking up info, making bookings and purchases. </li>
<li style="font-weight:400;">These AI developments aim to make AI more personal, proactive, and powerful to boost productivity and usher in a new era of discovery.</li>
<li style="font-weight:400;">For cloud, this points to a future where highly capable AI agents and models can be accessed as a service to enhance any application with intelligent assistance.</li>
<li style="font-weight:400;">The implications are that cloud AI is evolving from single-purpose APIs to multi-skilled AI assistants that developers can leverage. Businesses should consider how universal AI agents could transform their products and customer experiences.</li>
</ul>
<p> </p>
<p>08:28  Justin – “I can’t wait for an assistant – my own personal JARVIS.” </p>
<p>09:50 <a href="https://blog.google/technology/google-deepmind/google-gemini-updates-io-2025/">Google I/O 2025: Updates to Gemini 2.5 from Google DeepMind</a></p>
<ul>
<li style="font-weight:400;">Google announced major updates to its <a href="https://deepmind.google/technologies/gemini/?_gl=1*1hcx28i*_up*MQ..*_ga*OTE5NDY4NDk5LjE3NDc1NzI2Mzk.*_ga_LS8HVHCNQ0*czE3NDc1NzI2MzgkbzEkZzAkdDE3NDc1NzI2MzgkajAkbDAkaDA.">Gemini 2.5</a> large language models, including the <a href="https://blog.google/technology/google-deepmind/gemini-model-thinking-updates-march-2025/?utm_source=deepmind.google&amp;utm_medium=referral&amp;utm_campaign=gdm&amp;utm_content=">2.5 Pro</a> and <a href="https://deepmind.google/models/gemini/flash/">2.5 Flash</a> versions, which are leading benchmarks for coding, reasoning, learning, and more.</li>
<li style="font-weight:400;">New capabilities to the models include native audio output for more natural conversations, advanced security safeguards against prompt injection attacks, and the ability to use tools and access computers. </li>
<li style="font-weight:400;">An experimental “<a href="https://deepmind.google/models/gemini/pro">Deep Think</a>” mode enables enhanced reasoning for highly complex math and coding tasks.</li>
<li style="font-weight:400;">Developer experience improvements include thought summaries for transparency, adjustable thinking budgets for cost control, and support for <a href="https://modelcontextprotocol.io/introduction">Model Context Protocol</a> (MCP) tools.</li>
<li style="font-weight:400;">The models are available in Google’s cloud AI platforms like <a href="https://cloud.google.com/vertex-ai">Vertex AI</a>, and the Gemini API for businesses and developers to build intelligent applications</li>
<li style="font-weight:400;">The rapid progress and expanding capabilities of large language models have major implications for unlocking new AI use cases and experiences across industries</li>
<li style="font-weight:400;">The ‘so what’: Google’s Gemini models represent the state-of-the-art in large language model performance and are poised to enable a new wave of intelligent applications leveraging natural conversations, reasoning, coding and more. Businesses and developers should pay close attention as language AI rapidly becomes an essential cloud computing technology.</li>
</ul>
<p> </p>
<p>11:43 <a href="https://arstechnica.com/ai/2025/05/google-deepmind-creates-super-advanced-ai-that-can-invent-new-algorithms/">Google DeepMind creates super-advanced AI that can invent new </a><a href="https://arstechnica.com/ai/2025/05/google-deepmind-creates-super-advanced-ai-that-can-invent-new-algorithms/">algorithms – Ars Technica</a></p>
<ul>
<li style="font-weight:400;"><a href="https://deepmind.google/discover/blog/alphaevolve-a-gemini-powered-coding-agent-for-designing-advanced-algorithms/">AlphaEvolve</a> is a new AI coding agent from <a href="https://deepmind.google/discover/blog/alphaevolve-a-gemini-powered-coding-agent-for-designing-advanced-algorithms/">Google DeepMind</a> based on their <a href="https://gemini.google.com/">Gemini</a> large language models, with the addition of an “evolutionary” approach to evaluate and improve algorithms.</li>
<li style="font-weight:400;">It uses an automatic evaluation system to generate multiple solutions to a problem, analyze each one, and iteratively focus on and refine the best solution.</li>
<li style="font-weight:400;">Unlike previous DeepMind AIs trained extensively on a single knowledge domain, AlphaEvolve is a general-purpose AI to aid research on any programming or algorithmic problem.</li>
<li style="font-weight:400;">Google has already started deploying AlphaEvolve across its business, with positive results.</li>
<li style="font-weight:400;">For cloud computing, AlphaEvolve could enable more intelligent, efficient and robust cloud services and applications by optimizing underlying algorithms and architectures.</li>
<li style="font-weight:400;">Businesses and developers could leverage AlphaEvolve to tackle complex problems and accelerate R&amp;D in fields like scientific computing, analytics, AI/ML, etc. on the cloud.</li>
<li style="font-weight:400;">AlphaEvolve represents an important step towards using AI to augment human intelligence in solving big challenges in math, science and computing.</li>
</ul>
<p>13:25  Justin – “The other AIs doing all the programming work, this is creating the new algorithms, and then we’re getting quantum computing which is just going to figure out all the possibilities and figure out that we’re just going to die at this point…” </p>
<p>14:08 <a href="https://arstechnica.com/ai/2025/05/openai-introduces-codex-its-first-full-fledged-ai-agent-for-coding/">OpenAI introduces Codex, its first full-fledged AI agent for coding</a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/">OpenAI</a> has released <a href="https://openai.com/index/introducing-codex/">Codex</a>, an AI agent that can generate production-ready code based on natural language prompts from developers.</li>
<li style="font-weight:400;">Codex runs in a containerized environment that mirrors the user’s codebase and development setup.</li>
<li style="font-weight:400;">Developers can provide an “AGENTS.md” file to give Codex additional context and guidance on project standards. </li>
<li style="font-weight:400;">Codex is built on the codex-1 model, a variant of <a href="https://openai.com/index/introducing-o3-and-o4-mini/">OpenAI’s o3 reasoning model</a> that was trained via reinforcement learning on a broad set of coding tasks.</li>
<li style="font-weight:400;">For cloud developers, Codex could automate routine programming work, boosting productivity.</li>
<li style="font-weight:400;">Businesses could leverage Codex to rapidly prototype cloud applications and services.</li>
<li style="font-weight:400;">Codex represents a major step towards AI systems becoming full-fledged software development partners working alongside human programmers. </li>
<li style="font-weight:400;">While still in research preview, Codex points to a future where AI is deeply integrated into the cloud application development lifecycle.</li>
<li style="font-weight:400;">We’re currently not spending the money on this one – so if any of our listeners out there are using this, we’d love to hear about your experiences. </li>
<li style="font-weight:400;">RIP to everyone’s jobs. </li>
</ul>
<h2>Cloud Tools</h2>
<p>16:11 <a href="https://www.hashicorp.com/en/blog/introducing-hashicorp-validated-patterns-for-product-use-cases">Hashicorp: Introducing Hashicorp Validated Patterns For Product Use </a><a href="https://www.hashicorp.com/en/blog/introducing-hashicorp-validated-patterns-for-product-use-cases">Cases</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.hashicorp.com/en/blog/introducing-hashicorp-validated-patterns-for-product-use-cases">HashiCorp Validated Patterns</a> provide pre-built, validated solutions for common use cases using <a href="https://www.hashicorp.com/en">HashiCorp</a> tools like <a href="https://developer.hashicorp.com/terraform">Terraform</a>, <a href="https://www.hashicorp.com/products/vault">Vault</a>, <a href="https://www.hashicorp.com/products/consul">Consul</a>, and <a href="https://developer.hashicorp.com/nomad">Nomad</a>. </li>
<li style="font-weight:400;">They help accelerate time-to-value by providing a starting point for building and deploying production-ready infrastructure and apps in the cloud.</li>
<li style="font-weight:400;">Patterns cover core use cases, like service networking, zero trust security, multi-cloud deployments, Kubernetes deployments, and more.</li>
<li style="font-weight:400;">Validated Patterns integrate with major cloud platforms including <a href="https://www.bing.com/aclk?ld=e8EAzSuWnFoAiaBymZpkldvzVUCUwwVMMS6CrO-WNDpf0SIkvFkepq1B9YR514K_DC2AWURc0GB23QvaMN0o53_ciNqRDVedIMBwiYahWo8XvvyJx2BVP_6xmcaJyXN5gResak2TLgpD1Y_tiJo7KiqT2fLyaFSHZnoXnke55O-1KDdSddZgsmM4H7lny_vP9pJpHvTw&amp;u=aHR0cHMlM2ElMmYlMmZhd3MuYW1hem9uLmNvbSUyZmZyZWUlMmYlM2Z0cmslM2Q0NTFkNDM1Ni0xODg2LTQyZjktYTFhZS0wOGY4Y2ZhMWJjMGIlMjZzY19jaGFubmVsJTNkcHMlMjZzX2t3Y2lkJTNkQUwhNDQyMiExMCE3MTEyNDg5NDE5NTAyMCEhISE3MTEyNTQyMTc4NDYzNCEhNDgyNTEwNzU0ITExMzc5OTU1MzYwNTU4NTclMjZlZl9pZCUzZDk3MmQ5ZDY3NzMzYTEyMDUzMjYxZjU1MWUzZGUwN2ExJTNhRyUzYXMlMjZtc2Nsa2lkJTNkOTcyZDlkNjc3MzNhMTIwNTMyNjFmNTUxZTNkZTA3YTE&amp;rlid=972d9d67733a12053261f551e3de07a1">AWS</a>, <a href="https://portal.azure.com/">Azure</a>, and <a href="https://www.bing.com/aclk?ld=e81bOpY4vJCfLtgsQzmnh9ZjVUCUxHc5mXr6p_2RtaFf_mDtbmCn6CZ3Ll-plrJ2i6_Q3Qh7S9ZxWDAwOxSqs2JZJ3wDTYd7E_BGSYBS0bXwJ_S4VqI96vCN8cDnPpy1LEMlarJ0df4VQDSvsPMMju9ILOGNh4Gj5A446ocWZTvvMgoEfwHVoaYc9E9SZ9Lx1xQSScRw&amp;u=aHR0cHMlM2ElMmYlMmZjbG91ZC5nb29nbGUuY29tJTJmZ2NwJTNmdXRtX3NvdXJjZSUzZGJpbmclMjZ1dG1fbWVkaXVtJTNkY3BjJTI2dXRtX2NhbXBhaWduJTNkbmEtVVMtYWxsLWVuLWRyLWJrd3MtYWxsLWFsbC10cmlhbC1lLWRyLTE3MTAxMzQlMjZ1dG1fY29udGVudCUzZHRleHQtYWQtbm9uZS1hbnktREVWX2MtQ1JFXy1BREdQX0Rlc2slMmIlMjU3QyUyYkJLV1MlMmItJTJiRVhBJTJiJTI1N0MlMmJUeHQtQ29yZS1HZW5lcmFsJTJiR0NQLUtXSURfNDM3MDAwNjMzNDE4NDMyOTYta3dkLTc3MjQwODk2MDc1NjQxJTNhbG9jLTE5MCUyNnV0bV90ZXJtJTNkS1dfZ29vZ2xlJTI1MjBjbG91ZC1TVF9nb29nbGUlMmJjbG91ZCUyNmdjbGlkJTNkMDU4YTk4OWZkY2MyMWExMjk3NDhmMGE5NGRjYjA1MjklMjZnY2xzcmMlM2QzcC5kcyUyNm1zY2xraWQlM2QwNThhOTg5ZmRjYzIxYTEyOTc0OGYwYTk0ZGNiMDUyOQ&amp;rlid=058a989fdcc21a129748f0a94dcb0529">Google Cloud Platform</a>. What, no Oracle? </li>
<li style="font-weight:400;">Validated Patterns solve the problem of figuring out best practices and recommended architectures when using HashiCorp tools for common scenarios.</li>
<li style="font-weight:400;">The patterns are fully open source and customizable, allowing users to adapt them to their specific needs. </li>
<li style="font-weight:400;">This matters for YOU – the cloud professional – because it makes it faster and easier to properly implement HashiCorp tools in production by leveraging curated, validated solutions. </li>
</ul>
<p>17:02  Matt – “I looked a little bit more into the article… they’re like, cool. Terraform with Prisma Cloud by Palo Alto Networks. Maybe that’s a good idea? I don’t know, I just feel like there’s gonna be someone that runs a Terraform destroyer, takes down your time in Prisma Cloud. Feels like a bad life choice.”</p>
<h2>AWS</h2>
<p>18:22 <a href="https://aws.amazon.com/about-aws/whats-new/2025/05/aws-service-changes/">AWS service changes</a></p>
<ul>
<li style="font-weight:400;">It’s a big week for killing things off… RIP. </li>
<li style="font-weight:400;">AWS is ending support for several services including <a href="https://aws.amazon.com/pinpoint/">Amazon Pinpoint</a>, <a href="https://aws.amazon.com/iq/experts/">AWS IQ</a>, <a href="https://aws.amazon.com/iot-analytics/">IoT Analytics</a>, <a href="https://docs.aws.amazon.com/iotevents/latest/developerguide/understanding-aws-iotevents-entries.html">IoT Events</a>, <a href="https://docs.aws.amazon.com/simspaceweaver/latest/userguide/what-is.html">SimSpace Weaver</a>, <a href="https://aws.amazon.com/panorama/">Panorama</a>, <a href="https://docs.aws.amazon.com/inspector/v1/userguide/inspector_introduction.html">Inspector Classic</a>, <a href="https://aws.amazon.com/connect/voice-id/">Connect Voice ID</a>, and <a href="https://docs.aws.amazon.com/dms/latest/userguide/dms_fleet.advisor-end-of-support.html">DMS Fleet Advisor</a>. </li>
<li style="font-weight:400;">End of support means these services will no longer be available after specific announced dates.</li>
<li style="font-weight:400;">AWS will provide customers with detailed migration guidance and support to transition to alternative services. </li>
<li style="font-weight:400;">Some services, like <a href="https://www.datacenterdynamics.com/en/news/aws-retires-private-5g-service/">AWS Private 5G</a> and <a href="https://aws.amazon.com/datasync/discovery/">DataSync Discovery</a>,  have already reached the end of support and are no longer accessible.</li>
<li style="font-weight:400;">This announcement matters because ending support for services can significantly impact customers who rely on them, and requires careful planning to migrate.</li>
<li style="font-weight:400;">Customers should review the end of support dates and migration paths in the linked documentation for each affected service.</li>
<li style="font-weight:400;">The AWS Product Lifecycle page provides more details on end of support timelines and options: <a href="https://aws.amazon.com/products/lifecycle">https://aws.amazon.com/products/lifecycle</a>​</li>
</ul>
<p>19:15 <a href="https://aws.amazon.com/blogs/aws/introducing-the-aws-product-lifecycle-page-and-aws-service-availability-updates/">Introducing the AWS Product Lifecycle page and AWS service availability </a><a href="https://aws.amazon.com/blogs/aws/introducing-the-aws-product-lifecycle-page-and-aws-service-availability-updates/">updates</a></p>
<ul>
<li style="font-weight:400;">AWS launched a new <a href="https://aws.amazon.com/products/lifecycle/">Product Lifecycle page</a> that provides a centralized view of upcoming changes to AWS service availability, including services closing to new customers, services announcing end of support, and services that have reached end of support.</li>
<li style="font-weight:400;">The page helps customers stay informed about service changes that may impact their workloads and plan migrations more efficiently by consolidating lifecycle information in one place.</li>
<li style="font-weight:400;">Several services are closing to new customers after June 20, 2025 but will continue to operate for existing users, while other services have announced specific end of support dates.</li>
<li style="font-weight:400;">Services that have already reached end of support and are no longer accessible include AWS Private 5G and AWS DataSync Discovery</li>
<li style="font-weight:400;">The Product Lifecycle page integrates with existing resources like service documentation pages that provide detailed migration guidance for services being discontinued</li>
<li style="font-weight:400;">Having a single reference for service lifecycle information reduces time spent tracking down updates across different pages and allows customers to focus on their core business</li>
<li style="font-weight:400;">Checking the Product Lifecycle page regularly along with the What’s New with AWS page is recommended to stay on top of important availability changes</li>
<li style="font-weight:400;">This page is missing ones previously announced, but it’s a good place to start. </li>
</ul>
<p>21:09 Justin – “Sometimes they build stuff to see if it sticks to the wall, and maybe it does for one or two customers, but then no one else is interested, and I think that’s a death knell for a lot of these things.” </p>
<p>22:50 <a href="https://aws.amazon.com/blogs/opensource/introducing-strands-agents-an-open-source-ai-agents-sdk/">Introducing Strands Agents, an Open Source AI Agents SDK</a></p>
<ul>
<li style="font-weight:400;"><a href="https://strandsagents.com/">Strands Agents</a> is an open source SDK that simplifies building AI agents by leveraging advanced language models to plan, chain thoughts, call tools, and reflect. </li>
<li style="font-weight:400;">Developers can define an agent with just a prompt and list of tools.</li>
<li style="font-weight:400;">It integrates with <a href="https://aws.amazon.com/bedrock/">Amazon Bedrock</a> models that support tool use and streaming, as well as models from <a href="https://www.anthropic.com/">Anthropic</a>, <a href="https://www.llama.com/">Meta’s Llama</a>, and other providers. </li>
<li style="font-weight:400;">Strands can run anywhere.</li>
<li style="font-weight:400;">The model-driven approach of Strands reduces complexity compared to frameworks requiring complex agent workflows and orchestration. This enables faster development and iteration on AI agents.</li>
<li style="font-weight:400;">Use cases include conversational agents, event-triggered agents, scheduled agents, and continuously running agents. </li>
<li style="font-weight:400;">Strands provides deployment examples for <a href="https://aws.amazon.com/lambda/">AWS Lambda</a>, <a href="https://aws.amazon.com/fargate/">Fargate</a>, and <a href="https://aws.amazon.com/ec2/">EC2</a>.</li>
<li style="font-weight:400;">For The Cloud Pod listeners, Strands Agents dramatically lowers the bar to building practical AI agents on AWS by providing an open source, model-driven framework to define, test and deploy agents that leverage state-of-the-art language models. Teams at AWS already use it in production.</li>
<li style="font-weight:400;">Strands Agents project on <a href="https://github.com/strands-agents">GitHub</a>: <a href="https://github.com/strands-agents">https://github.com/strands-agents</a>​</li>
<li style="font-weight:400;">Pricing: Varies based on usage of underlying models and AWS services. (General estimate, pricing not provided in article. YMMV.)</li>
</ul>
<p>23:49 Ryan – “I hope we don’t get too many more of these to be honest, because now OpenAI has one, Google has one, Amazon now has one – it feels like great, we’ve got a whole bunch of open source options that do the same thing. And it’s like, instead of collaborating in the open space, in the open source market, they’re creating their own competing versions of it. And it’s going to make things diverge, which I don’t like.”</p>
<p>25:43 <a href="https://aws.amazon.com/blogs/aws/aws-transform-for-net-the-first-agentic-ai-service-for-modernizing-net-applications-at-scale/">AWS Transform for .NET, the first agentic AI service for modernizing .NET </a><a href="https://aws.amazon.com/blogs/aws/aws-transform-for-net-the-first-agentic-ai-service-for-modernizing-net-applications-at-scale/">applications at scale</a></p>
<ul>
<li style="font-weight:400;">AWS Transform for <a href="https://aws.amazon.com/blogs/aws/category/programing-language/dot-net/">.NET</a> is a new AI-powered service that automates porting .NET Framework applications to cross-platform .NET, making modernization faster and less error-prone. This matters because ported apps are 40% cheaper to run on <a href="https://www.linux.org/pages/download/">Linux</a>, have 1.5-2x better performance, and 50% better scalability.</li>
<li style="font-weight:400;">It integrates with source code repositories like GitHub, <a href="https://gitlab.com/users/sign_in">GitLab</a>, <a href="https://bitbucket.org/">Bitbucket</a> and provides experiences through a web UI for large-scale portfolio transformation and a Visual Studio extension for individual projects. </li>
<li style="font-weight:400;">New capabilities include support for private <a href="https://www.nuget.org/">NuGet</a> packages, porting MVC Razor views, executing ported unit tests, cross-repo dependency detection, and detailed transformation reports.</li>
<li style="font-weight:400;">Enterprises with large portfolios of legacy .NET Framework apps that want to modernize to Linux – in order to reduce costs and improve performance/scalability – will benefit most. </li>
<li style="font-weight:400;">Individual developers can also use it to port specific projects.</li>
<li style="font-weight:400;">For The Cloud Pod listeners, this automates a previously manual, time-consuming process of porting .NET apps to Linux. It showcases how AWS is innovating by applying AI to solve real customer challenges around app modernization at scale.</li>
<li style="font-weight:400;">Official service page: <a href="https://aws.amazon.com/transform/net/">https://aws.amazon.com/transform/net/</a>​ </li>
<li style="font-weight:400;">Pricing: No additional charge for AWS Transform itself. Standard pricing applies for any AWS resources used to run the ported applications. (General estimate based on article.)</li>
</ul>
<p>33:00 <a href="https://aws.amazon.com/blogs/aws/accelerate-ci-cd-pipelines-with-the-new-aws-codebuild-docker-server-capability/">Accelerate CI/CD pipelines with the new AWS CodeBuild Docker Server </a><a href="https://aws.amazon.com/blogs/aws/accelerate-ci-cd-pipelines-with-the-new-aws-codebuild-docker-server-capability/">capability | AWS News Blog</a></p>
<ul>
<li style="font-weight:400;">Yes, another way to run Docker in Amazon. </li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/codebuild/">AWS CodeBuild</a>‘s new <a href="https://www.docker.com/">Docker</a> Server capability provisions a dedicated, persistent Docker server within a CodeBuild project in order to accelerate Docker image builds.</li>
<li style="font-weight:400;">It centralizes image building to a remote host with consistent caching, reducing wait times and increasing efficiency (up to 98% faster builds in example.)</li>
<li style="font-weight:400;">The persistent Docker server maintains layer caches between builds, especially beneficial for large, complex Docker images with many layers</li>
<li style="font-weight:400;">Integrates seamlessly with existing CodeBuild projects – simply enable the Docker Server option when creating or editing a project.</li>
<li style="font-weight:400;">Supports both x86 (Linux) and ARM architectures</li>
<li style="font-weight:400;">Ideal for CI/CD pipelines that frequently build and deploy Docker images, dramatically improving throughput.</li>
<li style="font-weight:400;">Pricing varies based on Docker Server compute type; be sure to check the CodeBuild pricing page for details. </li>
<li style="font-weight:400;">Available in all regions where CodeBuild is offered.</li>
<li style="font-weight:400;">For teams heavily using Docker in their build pipelines, this new CodeBuild capability can provide a major speed boost and efficiency gain with minimal setup or workflow changes required. Faster builds mean faster deployments. You’re welcome. </li>
</ul>
<p>34:02  Justin – “Right now, if you have CodeBuild and you want to build on a Docker server, you have to connect to an ECS or Fargate instance that’s inside of a VPC elsewhere. So you had to do peering to where you code build environments. And now you can basically run this as a fully managed Docker server inside the code build environment. So you don’t have to do all those extra connectivity steps. That’s the advantage here.”</p>
<p>34:50 <a href="https://aws.amazon.com/blogs/aws/amazon-inspector-enhances-container-security-by-mapping-amazon-ecr-images-to-running-containers/">Amazon Inspector enhances container security by mapping Amazon ECR </a><a href="https://aws.amazon.com/blogs/aws/amazon-inspector-enhances-container-security-by-mapping-amazon-ecr-images-to-running-containers/">images to running containers</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/category/security-identity-compliance/amazon-inspector/">Amazon Inspector</a> now maps <a href="https://aws.amazon.com/ecr/?trk=fccf147c-636d-45bf-bf0a-7ab087d5691a&amp;sc_channel=el">Amazon ECR container images</a> to running containers in <a href="https://aws.amazon.com/blogs/aws/category/compute/amazon-elastic-container-service/">Amazon ECS</a> and EKS, providing visibility into which images are actively deployed and their usage patterns.</li>
<li style="font-weight:400;">This enhancement allows security teams to prioritize fixing vulnerabilities based on severity and actual runtime usage of the container images.</li>
<li style="font-weight:400;">Inspector shows the cluster <a href="https://docs.aws.amazon.com/IAM/latest/UserGuide/reference-arns.html">ARN</a>, number of EKS pods/ECS tasks an image is deployed to, and last run time to help prioritize fixes.</li>
<li style="font-weight:400;">Vulnerability scanning is extended to minimal base images like scratch, distroless, and Chainguard images, and supports additional ecosystems like <a href="https://go.dev/doc/toolchain">Go</a>, <a href="https://www.oracle.com/java/technologies/">Oracle JDK</a>, <a href="https://tomcat.apache.org/">Tomcat</a>, <a href="https://wordpress.org/">WordPress</a> and more.</li>
<li style="font-weight:400;">This enables comprehensive security scanning even for highly optimized container environments, eliminating the need for multiple tools.</li>
<li style="font-weight:400;">The features work across single AWS accounts, cross-account setups, and AWS Organizations via delegated admin for centralized vulnerability management.</li>
<li style="font-weight:400;">Available now in all regions where Amazon Inspector is offered at no additional cost, so that’s a plus. </li>
<li style="font-weight:400;">The enhancements significantly improve container security posture by focusing on vulnerabilities in images that are actively running, not just sitting in a repository.</li>
</ul>
<h2>GCP</h2>
<p>36:38 <a href="https://blog.google/products/google-one/google-ai-ultra/">Google announces AI Ultra subscription plan</a></p>
<ul>
<li style="font-weight:400;"><a href="http://one.google.com/ai?utm_source=g1&amp;utm_medium=web&amp;utm_campaign=google_ai_plan_blog">Google AI Ultra</a> is a new premium subscription plan providing access to Google’s most advanced AI models and features, including <a href="https://gemini.google.com/">Gemini</a>, <a href="https://labs.google/flow/about">Flow</a>, <a href="https://labs.google/fx/tools/whisk">Whisk</a>, <a href="https://notebooklm.google/">NotebookLM</a>, and more.</li>
<li style="font-weight:400;">Offers the highest usage limits and early access to cutting-edge capabilities like <a href="https://deepmind.google/models/veo/">Veo 3</a> video generation and <a href="https://deepmind.google/models/gemini/pro/">Deep Think 2.5</a> Pro enhanced reasoning mode</li>
<li style="font-weight:400;">Integrates Google AI directly into apps like Gmail, Docs, Chrome browser for seamless AI assistance </li>
<li style="font-weight:400;">Includes <a href="https://www.youtube.com/premium">YouTube Premium</a> and 30 TB of Google One storage, which, let’s be honest. They’re just trying to justify the cost here. Youtube Premium? Really? </li>
<li style="font-weight:400;">The plan targets filmmakers, developers, researchers and power users demanding “the best Google AI has to offer”.</li>
<li style="font-weight:400;">Costs $249.99/month with a 50% off intro offer for the first 3 months, U.S. only initially. We DO love a good promo code, but we fully expected this to be the new norm of $100 a month. </li>
<li style="font-weight:400;">Expands Google’s AI offerings to compete with Microsoft, Amazon, OpenAI and others in the rapidly growing generative AI market.</li>
<li style="font-weight:400;">They’ll still charge you for other stuff, don’t worry. </li>
</ul>
<p>40:46 <a href="https://cloud.google.com/blog/products/databases/database-center-is-now-generally-available/">Database Center is now generally available</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/database-center/docs/overview">Database Center</a> is an AI-powered unified fleet management solution that simplifies monitoring, optimization, and security for database fleets on GCP</li>
<li style="font-weight:400;">It provides a single pane of glass view across <a href="https://cloud.google.com/sql">Cloud SQL</a>, <a href="https://cloud.google.com/alloydb/docs/overview">AlloyDB</a>, <a href="https://cloud.google.com/spanner">Spanner</a>, <a href="https://cloud.google.com/bigtable/docs/overview">Bigtable</a>, <a href="https://cloud.google.com/memorystore">Memorystore</a>, and <a href="https://firebase.google.com/docs/firestore/">Firestore</a> databases.</li>
<li style="font-weight:400;">Proactively identifies risks and provides intelligent recommendations to optimize performance, reliability, cost, compliance and security.</li>
<li style="font-weight:400;">Introduces an AI-powered natural language chat interface to ask questions, resolve issues, and get optimization recommendations</li>
<li style="font-weight:400;">Leverages <a href="https://gemini.google.com/">Google’s Gemini</a> foundation models to enable assistive performance troubleshooting, of course. </li>
<li style="font-weight:400;">DC allows creating custom views, tracking historical data on database resources and issues, and centralizing database alerts.</li>
<li style="font-weight:400;">Competes with database management offerings from AWS and Azure, but differentiates with AI-powered insights and tight integration with GCP’s database and AI/ML services. </li>
<li style="font-weight:400;">Key use cases include enterprises managing large fleets of databases powering critical applications that need unified visibility and optimization</li>
<li style="font-weight:400;">There is no additional cost for core features, but premium features like Gemini-based performance/cost recommendations require <a href="https://cloud.google.com/products/gemini/cloud-assist">Gemini Cloud Assist</a>.</li>
<li style="font-weight:400;">Advanced security requires <a href="https://cloud.google.com/security/products/security-command-center">Security Command Center</a> subscription, which is VERY pricey, so be wary. </li>
</ul>
<p>41:47  Ryan – “While I really like this feature, I want to make fun of it just because it’ll be like a lot of the other Google services where it’ll just be very confusing to the end user – where they won’t really know which service they’re using under the covers. They’ll click a button, they’ll set up a whole bunch of stuff up, and then they’ll get a bill that has AlloyDB on it and they’ll be like, I don’t understand what this is at all. So I look forward to that conversation.”</p>
<p>42:18 <a href="https://cloud.google.com/blog/products/containers-kubernetes/gke-data-cache-now-ga-accelerates-stateful-apps/">GKE Data Cache, now GA, accelerates stateful apps | Google Cloud Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/kubernetes-engine/docs/how-to/persistent-volumes/data-cache">GKE Data Cache</a> is a new managed solution that accelerates read-heavy stateful apps on GKE by intelligently caching data from persistent disks on high-speed local SSDs.</li>
<li style="font-weight:400;">It can provide up to 480% higher transactions/sec and 80% lower latency for <a href="https://www.postgresql.org/">PostgreSQL</a> on GKE.</li>
<li style="font-weight:400;">It simplifies implementing a high-performance cache layer vs complex manual setup, and supports all read/write Persistent Disk types.</li>
<li style="font-weight:400;">Competes with offerings like Amazon ElastiCache and Azure Cache for Redis, but is more tightly integrated with GKE and Persistent Disks.</li>
<li style="font-weight:400;">There are potential cost savings by allowing use of smaller persistent disks and less memory, while still achieving high read performance. Just remember, those local disks go away when the server dies. </li>
<li style="font-weight:400;">Key use cases include databases, analytics platforms, content management systems, developer environments that need fast startup.</li>
<li style="font-weight:400;">Based on local SSD usage, <a href="https://cloud.google.com/compute/disks-image-pricing#localssds)">pricing</a> varies by configuration. E.g. The 375GB local SSD is $95.40/month. </li>
<li style="font-weight:400;">Also, I’d like to point out that once again Ryan is trying to convince Justin to run things in containers that shouldn’t be in containers. Cody would like a word. </li>
</ul>
<p>45:30 <a href="https://cloud.google.com/blog/products/ai-machine-learning/enhancing-vllm-for-distributed-inference-with-llm-d/">Enhancing vllm for distributed inference with llm-d</a></p>
<ul>
<li style="font-weight:400;">Google Cloud is introducing llm-d, an open-source project that enhances the vLLM inference engine to enable distributed and disaggregated inference for large language models (LLMs) in a <a href="https://cloud.google.com/kubernetes-engine">Kubernetes</a>-native way.</li>
<li style="font-weight:400;">llm-d makes inference more cost-effective and easier to scale by incorporating a vLLM-aware inference scheduler, support for disaggregated serving to handle longer requests, and a multi-tier KV cache for intermediate values.</li>
<li style="font-weight:400;">Early tests by Google Cloud using llm-d show 2x improvements in time-to-first-token for use cases like code completion.</li>
<li style="font-weight:400;">llm-d is a collaboration between Google Cloud, <a href="https://www.redhat.com/en">Red Hat</a>, <a href="https://research.ibm.com/">IBM Research</a>, <a href="http://www.nvidia.com/page/home.html">NVIDIA</a>, <a href="https://www.bing.com/aclk?ld=e86nOgJeYzOLmQLWeTtCr7ITVUCUyV-nYEe_Q0wTeSxNvcG_gSoJNmnxtLOPH2mVMgQn83VBDImxpmZG1eV-6a9G2Isx927bRgfgJRj72BZIJcY3XZPfVFlxcxZMfjogOLS9fQjIRdYvfKDmbNSZMQfB9wsN5PVa8UuwXnl61nL5tl_c7f-9-Edss521lwabddRb83cQ&amp;u=aHR0cHMlM2ElMmYlMmZ3d3cuY29yZXdlYXZlLmNvbSUyZiUzZnV0bV9zb3VyY2UlM2RiaW5nJTI2dXRtX3Rlcm0lM2Rjb3Jld2VhdmUlMjZ1dG1fbWVkaXVtJTNkY3BjJTI2dXRtX2NhbXBhaWduJTNkNDM2MDExMzA0JTI2bXNjbGtpZCUzZGM5NDY2ZTkxOWIyODE0ZTc0OWE3MDZhNzI4NzA1MmZi&amp;rlid=c9466e919b2814e749a706a7287052fb">CoreWeave</a>, <a href="https://www.amd.com/en/support/download/drivers.html">AMD</a>, <a href="https://www.cisco.com/">Cisco</a>, <a href="https://huggingface.co/">Hugging Face</a>, <a href="https://www.intel.com/content/www/us/en/homepage.html">Intel</a>, <a href="https://docs.aws.amazon.com/lambda/latest/dg/welcome.html">Lambda</a>, and <a href="https://mistral.ai/">Mistral AI</a>, all leveraging Google’s proven technology in securely serving AI at scale.</li>
<li style="font-weight:400;">It works across <a href="https://pytorch.org/">PyTorch</a> and <a href="https://docs.jax.dev/en/latest/quickstart.html">JAX frameworks</a> and supports both GPU and <a href="https://cloud.google.com/kubernetes-engine/docs/tutorials/serve-vllm-tpu">Google Cloud TPU accelerators</a>, providing flexibility and choice.</li>
<li style="font-weight:400;">Deploying llm-d on Google Cloud enables low-latency, high-performance inference by integrating with Google’s global network, GKE AI capabilities, and AI Hypercomputer across software and hardware.</li>
<li style="font-weight:400;">Key use cases include agentic AI workflows and reasoning models that require highly scalable and efficient inference.</li>
</ul>
<p>As AI moves from prototyping to large-scale deployment, efficient inference becomes critical. llm-d tackles this challenge head-on, optimizing vLLM to drastically improve performance and cost-effectiveness for demanding LLM workloads. It showcases Google Cloud’s leadership in AI infrastructure and commitment to open innovation.</p>
<ul>
<li style="font-weight:400;"><strong><em>Show editor note: Remember in the 300th episode blog post where I said I was doing so much better understanding all the technical information? Yeah. I take it back. </em></strong></li>
</ul>
<p>47:48   Ryan – “I wonder if this is capitalizing on… did the community look at Vertex AI and some of the things that they’ve sort of ‘productized’ and be like, how are you doing it? And then started the collaboration that way? It’d be kind of fun to be a fly on the wall and how this was made.”</p>
<p>49:17 <a href="https://cloud.google.com/blog/topics/developers-practitioners/google-cloud-and-spring-ai-10/">Google Cloud and Spring AI 1.0</a></p>
<ul>
<li style="font-weight:400;"><a href="https://spring.io/blog/2025/04/10/spring-ai-1-0-0-m7-released">Spring AI 1.0</a> enables seamless integration of AI capabilities into Java applications running on <a href="https://spring.io/projects/spring-boot">Spring Boot</a>, allowing enterprises to leverage AI without complex integrations.</li>
<li style="font-weight:400;">Supports various AI models for image generation, audio transcription, semantic search, and chatbots.</li>
<li style="font-weight:400;">Provides tools to enhance chat models with memory, external functions, private data injection, vector stores, accuracy evaluation, and cross-service connectivity via the Model Context Protocol (MCP.)</li>
<li style="font-weight:400;">Integrates with Google Cloud’s Vertex AI platform and Gemini models, though specific comparisons to other cloud AI offerings are not provided.</li>
<li style="font-weight:400;">Utilizes Google Cloud’s AlloyDB or Cloud SQL for scalable, highly-available PostgreSQL databases with pgVector capabilities to support vector similarity searches.</li>
<li style="font-weight:400;">Key use cases include modernizing enterprise Java applications with AI capabilities across various industries already using Spring Boot.</li>
<li style="font-weight:400;">Developers should care as it significantly lowers the barrier to entry for incorporating AI into their Java applications, with familiar Spring abstractions and starter dependencies</li>
</ul>
<p>50:14   Ryan – “I guess Spring Boot’s better as a framework for Java apps than some things that have come before it. It’s done a good job of standardizing a lot of Java startups…so I guess if you do the same thing with AI integration perhaps it will be a little easier?” </p>
<p>51:49 <a href="https://cloud.google.com/blog/products/ai-machine-learning/ai-studio-to-cloud-run-and-cloud-run-mcp-server/">AI Studio to Cloud Run and Cloud Run MCP server</a></p>
<ul>
<li style="font-weight:400;">AI Studio now allows deploying apps directly to <a href="https://cloud.google.com/run?e=48754805&amp;hl=en">Cloud Run</a> with one click, making it faster and easier to go from idea to shareable app.</li>
<li style="font-weight:400;"><a href="https://deepmind.google/models/gemma/">Gemma 3</a> models can be deployed from <a href="https://ai.google.dev/aistudio">AI Studio</a> to Cloud Run, enabling easy scaling of Gemma projects to production on serverless infrastructure with GPU support</li>
<li style="font-weight:400;">The new Cloud Run MCP server lets MCP-compatible AI agents (like <a href="https://claude.ai/login">Claude</a>, <a href="https://copilot.microsoft.com/">Copilot</a>, Google Gen AI SDK) deploy apps to Cloud Run, empowering AI-assisted development.</li>
<li style="font-weight:400;">These integrations streamline the AI app development workflow on GCP, from building and testing in AI Studio to production deployment on Cloud Run’s scalable serverless platform.</li>
<li style="font-weight:400;">Cloud Run’s granular billing and free tier make hosting AI Studio apps very cost-effective, with estimates starting at $0/mo with 2M free requests, then pay-per-use after that. </li>
<li style="font-weight:400;">Automated deployment from AI agents via the MCP server is a differentiator vs. other clouds, leveraging GCP’s strength in AI.</li>
<li style="font-weight:400;">Rapid prototyping and deployment of AI-powered apps, scaling Gemma/LLM workloads, AI agent-based development are some of the key features. </li>
<li style="font-weight:400;">Developers and businesses looking to quickly build and deploy AI apps at scale without infrastructure overhead should take note of these new capabilities that demonstrate GCP’s expanding and integrating AI/ML portfolio.</li>
</ul>
<p>52:47   Justin – “MCP is like the new ClickOps.” </p>
<h2>Azure</h2>
<p>55:14  Remember Last week when Matt made us do Build Predictions? Well, as predicted – we did horribly. </p>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;">Ryan</li>
</ul>
</li>
</ul>
<ul>
<li>Announce an enhancement to GitHub Copilot, that allows agentic code development and agentic tasks. </li>
</ul>
<ul>
<li>Full Coding Agent</li>
</ul>
<ul>
<li>Agent Mode in Github Copilot</li>
</ul>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;">Quantum Computing – Double down on Majorna and quantum computing capabilities. </li>
<li style="font-weight:400;">Augmented/Virtual Reality for Teams  (Right subject, wrong cloud.)</li>
</ul>
</li>
<li style="font-weight:400;">Matt
<ul>
<li style="font-weight:400;">New Version of the ARM processor Cobalt</li>
<li style="font-weight:400;">New generation of Surface hardware</li>
<li style="font-weight:400;">Major update to the App Services Platform in Azure  </li>
</ul>
</li>
<li style="font-weight:400;">Justin
<ul>
<li style="font-weight:400;">Microsoft will launch their own LLM</li>
</ul>
</li>
</ul>
</li>
</ul>
<ul>
<li>Microsoft Office Copilot upgrade with MCP inclusion in it.</li>
</ul>
<ul>
<li>Agentspaces or Glean Type Competitor</li>
</ul>
<ul>
<li style="font-weight:400;">Specifically, Satya Nadella mentioned that Microsoft 365 Copilot can now search across data from various applications, including Salesforce. (16:53)</li>
</ul>
<ul>
<li>Number of times copilot will be mentioned in the keynote</li>
</ul>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;">55 Justin</li>
<li style="font-weight:400;">75 Matt</li>
</ul>
</li>
</ul>
</li>
</ul>
<ul>
<li>62 Ryan – Actual Number 69 (If you didn’t chuckle we can’t be friends.)</li>
</ul>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;">1 Jonathan </li>
</ul>
</li>
</ul>
<p>Big Congrats to Ryan for winning – at Azure predictions. Lotto? No. Azure? </p>
<p>     Yes.  <a href="https://www.youtube.com/watch?v=LdE3WlQ__GY">https://www.youtube.com/watch?v=LdE3WlQ__GY</a></p>
<p>1:01:58 <a href="https://azure.microsoft.com/en-us/blog/azure-ai-foundry-your-ai-app-and-agent-factory/">Azure AI Foundry: Your AI App and agent factory | Microsoft Azure Blog</a></p>
<ul>
<li style="font-weight:400;">Azure AI Foundry is an end-to-end platform for building and deploying AI apps and agents. </li>
<li style="font-weight:400;">It provides a unified development experience across code (Visual Studio Code) collaboration (GitHub) and cloud (Azure).</li>
<li style="font-weight:400;">It offers a growing catalog of state-of-the-art AI models, including <a href="https://aka.ms/grok-announcement">Grok 3</a>, Flux Pro 1.1, <a href="https://aka.ms/SoraBuildBlogFinal">Sora</a>, and 10,000+ open-source models from <a href="http://aka.ms/FoundryHuggingFace">Hugging Face</a>. </li>
<li style="font-weight:400;">A new model router optimizes model selection.</li>
<li style="font-weight:400;">The <a href="https://azure.microsoft.com/en-us/blog/azure-ai-foundry-your-ai-app-and-agent-factory/">Azure AI Foundry Agent Service</a> (now GA) enables designing, deploying and scaling production-grade AI agents. It integrates with 1,400+ enterprise data sources and platforms like Microsoft 365, Slack, Twilio.</li>
<li style="font-weight:400;">Multi-agent orchestration allows agents to collaborate on complex workflows across clouds. Agentic retrieval in Azure AI Search improves answer relevance by 40% for multi-part questions. </li>
<li style="font-weight:400;">Enterprise-grade features include end-to-end observability, first-class identity management via Microsoft Entra Agent ID, and built-in responsible AI guardrails.</li>
<li style="font-weight:400;">Foundry Local is a new runtime for building offline, cross-platform AI apps on Windows and Mac. Integration with <a href="https://learn.microsoft.com/en-us/azure/azure-arc/overview">Azure Arc</a> enables central management of edge AI.</li>
<li style="font-weight:400;">Compared to AWS and GCP, Azure AI Foundry offers tighter integration with Microsoft’s developer tools and enterprise platforms. It targets customers building enterprise AI workflows.</li>
<li style="font-weight:400;">Azure AI Foundry aims to democratize AI development with an integrated, full-stack platform. Its agent orchestration, enterprise features, and edge runtime differentiate it. </li>
<li style="font-weight:400;">For companies already using Azure and Microsoft 365, it could accelerate adoption of generative AI in their apps and processes.</li>
</ul>
<p>1:04:33 <a href="https://azure.microsoft.com/en-us/blog/powering-the-next-ai-frontier-with-microsoft-fabric-and-the-azure-data-portfolio/">Powering the next AI frontier with Microsoft Fabric and the Azure data </a><a href="https://azure.microsoft.com/en-us/blog/powering-the-next-ai-frontier-with-microsoft-fabric-and-the-azure-data-portfolio/">portfolio  | Microsoft Azure Blog</a></p>
<ul>
<li style="font-weight:400;"><a href="https://app.fabric.microsoft.com/">Microsoft Fabric</a> and Azure data services are being enhanced to power the next generation of AI applications that combine analytical, transactional, and operational data in structured and unstructured forms.</li>
<li style="font-weight:400;"><a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=2LQ1d2HEcs4KAsg4r1JRvhZjT_BrwN18dFFYbLl6ItqOYoE7Y53r5dId_vRcUWTTzkWI-2UZxmpqWVDIWdDUIPQIyCjD_Q1PJLjVRG_cBTiBi07AvTrr5-rU6b_UAIEg.dJ8t4_gN9ty7-33hZp6lTA&amp;eddgt=A80tHTAodVhQBsu7xgdtGw%3D%3D&amp;rut=11b00c7d09ee6cbc7c1efd669c2e35b3b9789cf0935f20176db354f413907d5e&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8X-_61hfkwJwCspeAsqfzLjVUCUxZUPgWMlkWAM-tCiqVrgsZwXgNkMp0ad-mg1-5etFLeddycHQdbz6meFE0EBtZy9eWu4-L7lcq_qCaqSI4Tfb5iSttnMjm5qM4t1I7VIWsThYfhVImsu80FWyMXimMgyURyBpIK5X9dz3mUeedhUpEivaXVSCRi9cE6mrZYu_NxoxibywhCX6AdQZV4k0x5SGl9k_-M8WApE8iaE4paOWXqUKOwI0t0eMUrChOu77qhaxATpaaokY9eZJVPlqTk4v6fungs7HtuzCrIbx-np5-nqQXuw4mP1LA7C9L64SrZsKEPI0yW9bZ-7X_w1xyr40RA67NgqCXL0RByxu5F02kDZF1_RTgypV6TMvZObnTyqWIJInut6ZlYjX7i6-psABogdvFGoZeQRZd1cmYh_ij6ECMAcnWO-PA5g2o9PbxMsL94RMnZqQ9UQ98R5zhSoYoiLj3mlOJyjgnZU81bZwu_Y_kMbDshPax_gEYQ1SXbVuWfPSvujyfVmcivo4uIOPsKZFdYoMcZe-fv93bjXNCrZnSuiXTxEdiOxFIEnvEBEH33_8OuyHeucJtYXQJdJulbJNagF-glski_O6lHh8e83Yvmhe60lnnBRgvahC8SM1YjJXYhL9x5NoFFA3jyYrqqCauv3s5KuyIK2MnHDAF-P2uEdSX_dHzE1eguz5Ccp22WAE8HZUovRmT9n1r5GIirkapoL-WeMzYVYIHte_rz1L34MwhQLsi7yHiEnUpZQ%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0MzklMjZjYW1wJTNkMTY4ODA3JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDE5MjElMjZjcml0ZXJpYWlkJTNka3dkLTc5MjM0MTYzMzgyOTUwJTNhbG9jLTE5MCUyNmNhbXBhaWduaWQlM2Q1OTAyMzczNTglMjZsb2NwaHklM2Q4MDUxMSUyNmFkZ3JvdXBpZCUzZDEyNjc3Mzg1NTkwNTE0NDclMjZjaWQlM2Q3OTIzMzc2NDM5NDIzOCUyNmtkdiUzZGMlMjZrZXh0JTNkJTI2a3BnJTNkJTI2a3BpZCUzZCUyNnF1ZXJ5U3RyJTNkY29zbW9zJTI1MjBkYiUyNnVybCUzZGh0dHBzJTNhJTJmJTJmYXp1cmUubWljcm9zb2Z0LmNvbSUyZmVuLXVzJTJmcHJvZHVjdHMlMmZjb3Ntb3MtZGIlMmYlM2ZlZl9pZCUzZF9rXzk5YzE2ZWM3NjdiYzE0NTMxOTFlMDZmMWRiMTEwM2M1X2tfJTI2T0NJRCUzZEFJRGNtbTVlZHN3ZHV1X1NFTV9fa185OWMxNmVjNzY3YmMxNDUzMTkxZTA2ZjFkYjExMDNjNV9rXyUyNm1zY2xraWQlM2Q5OWMxNmVjNzY3YmMxNDUzMTkxZTA2ZjFkYjExMDNjNQ%26rlid%3D99c16ec767bc1453191e06f1db1103c5&amp;vqd=4-92302310461445770988795548452057989839&amp;iurl=%7B1%7DIG%3D8604F8FFC6C8480EA773B3C66BBA66B2%26CID%3D1C2CE49A84186F992F06F16085846EED%26ID%3DDevEx%2C5048.1">Cosmos DB</a> NoSQL database is now available in Microsoft Fabric to handle semi-structured data for AI apps, in addition to SQL databases. </li>
<li style="font-weight:400;">Pricing starts at $0.25/hour for serverless instances.</li>
<li style="font-weight:400;">A new “digital twin builder” low-code tool allows creating virtual replicas of physical and logical entities to enable analytics, simulations and process automation.</li>
<li style="font-weight:400;">Power BI is getting a new Copilot experience to allow users to chat with their data and ask questions; this will also integrate with Microsoft 365 Copilot.</li>
<li style="font-weight:400;">SQL Server 2025 preview adds vector database capabilities and integrations with AI frameworks like LangChain to power intelligent apps. Pricing varies based on cores and edition.</li>
<li style="font-weight:400;">The PostgreSQL extension for VS Code now includes GitHub Copilot for AI assistance writing queries. Azure Database for PostgreSQL adds high-performance vector indexing.</li>
<li style="font-weight:400;">Azure Cosmos DB and Azure Databricks now integrate with Azure AI Foundry to store conversation data and power AI solutions</li>
<li style="font-weight:400;">Microsoft is partnering with SAP on the <a href="https://www.sap.com/sea/products/data-cloud.html">SAP Business Data Cloud</a> and <a href="https://www.sap.com/products/data-cloud/databricks.html">SAP Databricks</a> on Azure initiatives to help customers innovate on SAP data</li>
<li style="font-weight:400;">These enhancements position Azure as a leader in converging databases, analytics and AI compared to point solutions from AWS and GCP, targeting enterprise customers building next-gen AI applications.</li>
</ul>
<p>1:06:15   Matt- “The big thing here is CosmoDB – that felt like a little bit of a gap in the past.” </p>
<p>1:07:09 <a href="https://azure.microsoft.com/en-us/blog/transforming-rd-with-agentic-ai-introducing-microsoft-discovery/">Transforming R&amp;D with agentic AI: Introducing Microsoft Discovery</a></p>
<ul>
<li style="font-weight:400;"><a href="https://azure.microsoft.com/en-us/blog/transforming-rd-with-agentic-ai-introducing-microsoft-discovery/">Microsoft Discovery</a> is a new enterprise AI platform that aims to accelerate research and development (R&amp;D) by enabling scientists to collaborate with specialized AI agents and a graph-based knowledge engine. </li>
<li style="font-weight:400;">Microsoft says it can help drive scientific outcomes faster and more accurately.</li>
<li style="font-weight:400;">Discovery integrates with Azure infrastructure and services to provide enterprise-grade trust, compliance, governance and extensibility. </li>
<li style="font-weight:400;">Researchers can bring their own models, tools and datasets. It also leverages innovations from Microsoft Research and will integrate future capabilities like reliable quantum computing.</li>
<li style="font-weight:400;">The platform introduces a new “agentic AI” paradigm where people and AI agents cooperatively refine knowledge and experimentation iteratively in real-time. The AI can deeply reason over nuanced scientific data, specialize across domains, and learn and adapt.</li>
<li style="font-weight:400;">While AWS and GCP offer some AI/ML tools for research, Microsoft Discovery appears to be a more comprehensive, specialized platform focused on the full R&amp;D lifecycle and scientific reasoning. The agentic AI approach also seems novel.</li>
<li style="font-weight:400;">Target customers include R&amp;D teams in industries like chemistry, materials, pharma, manufacturing, semiconductors and more. Microsoft is partnering with companies like GSK, Estée Lauder, NVIDIA, Synopsys and others.</li>
<li style="font-weight:400;">For Cloud Pod listeners, this shows how Microsoft is applying advanced AI to help enterprises accelerate scientific innovation, a key economic engine. It demonstrates Azure’s AI/ML capabilities and how Microsoft is partnering across industries.</li>
</ul>
<p>1:09:37 <a href="https://azure.microsoft.com/en-us/blog/agentic-devops-evolving-software-development-with-github-copilot-and-microsoft-azure/">Agentic DevOps: Evolving software development with GitHub Copilot and </a><a href="https://azure.microsoft.com/en-us/blog/agentic-devops-evolving-software-development-with-github-copilot-and-microsoft-azure/">Microsoft Azure</a></p>
<ul>
<li style="font-weight:400;"><a href="https://duckduckgo.com/y.js?ad_domain=github.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=s-QWCUI6k6Qf0xyWZarj8TAXuHSIMU--RNA7wlN1qUsjcJ7XeSk5_s917VvKg_WqGtpnRKeKM7-KI4uyhy4KIuHbP2wogxubjH5wYtjBs9w9DWh2xva4uISw5OHPvAn2.U2iHumk_rtokdZZ6lrtBMg&amp;eddgt=CG0102cCDUdShwxd2n002Q%3D%3D&amp;rut=e1cb42432996db45e3c96ecd29fe92c020d65a1f5271589808b8222a5ffccf13&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8_u3dM-ByAYouY4z2E3K_ijVUCUypOUxUPgENocn2Go-9iCxzwVzeivf0AJpQ12XY1RDZES6GzsXjP38bfSPEwXZDcP24NT8JHHKWVexS5SLNHw-PRnkZwCXk2Pu0Mnnc_x24r3u1UyULOjHe8F-5Csq1HdJkJz1M5yYtFXSzvvcrsABEpygezYqAyNOdQJIafo6I0c3F-DL9FCTZ4joUelBsQCMuvyx3iOvOK60X8sjCOJo_5vQ-VeTExE2zMMZq_VauNxYjT9EcvSulxy1f1ArewDTKLpfABxueNmsrLUQywZZ9ub6faML5oc3twPjCnvmP0bn9AWXYY5dWiEuH7qWhtRkqTagHh_pOrOb3A4pVaki4D9ATVKyuzfmOY3rzwHjuRh9A4SL0DKp_j4DY-dXxxJ-GFuxP9OM4w1mJCxteeR2TfgbNGTgtRMl3IsjiCiUHg_UZrEf3w0NVFPYfT_gJNp4NaPisOq94XLRsHRvGGR5BTS-96k85fxPLhSD63TV1WbGXluu4FGEj4Ut_UyMaB7J7Mb5t6zNkK0ejO4r7-X-lsKbVmpXVBux_BqZ3wh4iSMBafGX0grqzaTet2A5rH2InEa-9j3zQ0F5AVGSSFazimnocDr3DSsCG4HLpfFlL8FGOOoc4agD2onCBu1XEcv35gH93sl0NQk-6r0cdbAG0SPjq6tfHpUklUzYF49BV4ul60ccYOabdGWFf8h5OhQYE7-Z_VNEa2GNgf7KDo3SZO4TycpLt3z4JGkWNrjTrkw%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0NDUlMjZjYW1wJTNkMTUwMjY2JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNDAxNzEzNzglMjZjcml0ZXJpYWlkJTNka3dkLTc5MjM0MTYzNDc2MDA4JTNhbG9jLTE5MCUyNmNhbXBhaWduaWQlM2Q1OTAwODA2NTglMjZsb2NwaHklM2Q3OTcxMiUyNmFkZ3JvdXBpZCUzZDEyNjc3Mzc5MDE0NDgwNzElMjZjaWQlM2Q3OTIzMzc2NDM3NDM4MSUyNmtkdiUzZGMlMjZrZXh0JTNkJTI2a3BnJTNkJTI2a3BpZCUzZCUyNnF1ZXJ5U3RyJTNkZ2l0aHViJTI1MjBjb3BpbG90JTI2dXJsJTNkaHR0cHMlM2ElMmYlMmZnaXRodWIuY29tJTJmZmVhdHVyZXMlMmZjb3BpbG90JTNmZWZfaWQlM2Rfa181MjVlYjQ4YTBkYjMxZWQ3ZDBiMWRjYmNiMzg3MGQ2MF9rXyUyNk9DSUQlM2RBSURjbW1iMTUwdmJ2MV9TRU1fX2tfNTI1ZWI0OGEwZGIzMWVkN2QwYjFkY2JjYjM4NzBkNjBfa18lMjZtc2Nsa2lkJTNkNTI1ZWI0OGEwZGIzMWVkN2QwYjFkY2JjYjM4NzBkNjA%26rlid%3D525eb48a0db31ed7d0b1dcbcb3870d60&amp;vqd=4-215487117149378751849885773557283073924&amp;iurl=%7B1%7DIG%3DDF2B9FBCB54F4D289F05D466A46D1B26%26CID%3D0072ABA310B2609D202CBE5911DD61E0%26ID%3DDevEx%2C5048.1">GitHub Copilot</a> is evolving into an AI-powered coding agent that collaborates with developers across the entire software development lifecycle, from planning to production. GOOD LUCK.</li>
<li style="font-weight:400;">The new agentic DevOps approach reimagines DevOps by having intelligent agents automate and optimize each stage, while keeping developers in control.</li>
<li style="font-weight:400;">Agent mode in GitHub Copilot can analyze codebases, make multi-file edits, generate tests, fix bugs and suggest commands based on prompts.</li>
<li style="font-weight:400;">The new coding agent in Copilot acts as a peer programmer, taking on code reviews, tests, bug fixes and feature specs so developers can focus on high-value work.</li>
<li style="font-weight:400;">Azure is adding app modernization capabilities to Copilot to assess, update and remediate legacy Java, .NET and mainframe apps to reduce technical debt</li>
<li style="font-weight:400;">The new Azure Site Reliability Engineering (SRE) Agent monitors production apps 24/7, responds to incidents and troubleshoots autonomously to improve reliability.</li>
<li style="font-weight:400;">GitHub Models make it easy to experiment with and deploy AI models from various providers right from GitHub with enterprise guardrails</li>
<li style="font-weight:400;">Microsoft is open-sourcing the GitHub Copilot extensions in VS Code, reflecting their commitment to transparency and community-driven AI development.</li>
<li style="font-weight:400;">These agentic AI capabilities remove friction, reduce complexity and change the cost structure of software development while enabling developer creativity.</li>
</ul>
<p>1:06:15   Matt- “During the keynote they talked about (if) there’s a production outage and it automatically goes and scales and fixes it and then makes an issue that then it can self fix with their GitHub CoPilot agent. It’s really terrifying. You’re gonna wake up all of a sudden to an Azure bill of like $400,000, because it’d be like, hey, there’s a problem with your SQL…  All of a sudden I’m writing, you know, 128 V cores on my SQL Hyperscale cluster because someone’s DDoSing me. Feel like there’s gonna be things it’s gonna miss.”</p>
<p>1:12:46 <a href="https://blogs.oracle.com/cloud-infrastructure/post/oci-launches-e6-standard-compute-powered-by-amd">Oci Launches E6 Standard Compute Powered By Amd</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.oracle.com/cloud/">Oracle Cloud Infrastructure</a> (OCI) has launched new E6 Standard Compute instances powered by AMD EPYC processors, claiming “up to 55% better price-performance compared to similar compute offerings” – but specifics are vague and comparisons likely cherry-picked.</li>
<li style="font-weight:400;">E6 instances are “supposedly” ideal for workloads like web servers, application servers, batch processing, and distributed analytics – but these are generic use cases that any major cloud provider can easily handle. </li>
<li style="font-weight:400;">Oracle touts security benefits from using “in-house designed servers with built-in firmware-level security” – an improvement, but likely table stakes compared to security from AWS, Azure, GCP. </li>
<li style="font-weight:400;">E6 instances offer up to 128 OCPUs, 2,048 GB of RAM, and 1 PB of remote block storage – specs that match or trail other cloud providers, despite Oracle’s positioning as “industry-leading price performance.”</li>
<li style="font-weight:400;">Oracle claims E6 is “the best price-performance in the industry for scale-out workloads” – a bold claim that warrants deep skepticism without rigorous, independent benchmarking</li>
<li style="font-weight:400;">Pricing details are unclear beyond a starting price of “$0.075 per OCPU hour” – Oracle’s pricing is notoriously complex and opaque compared to major cloud rivals.</li>
<li style="font-weight:400;">Oracle is likely targeting existing Oracle database/software customers and trying to keep them in the Oracle ecosystem as they move to the cloud – but organizations are increasingly adopting multi-cloud strategies.</li>
<li style="font-weight:400;">For most organizations using AWS, Azure or GCP, there’s little reason to get excited – those clouds offer similar or better options, with more mature ecosystems and without Oracle lock-in risks.</li>
<li style="font-weight:400;">Oracle wants to stay relevant in cloud discussions with splashy “we’re the best!” announcements – but informed observers will remain healthily skeptical until proven otherwise.</li>
<li style="font-weight:400;">Show note editor Heather would like to remind Oracle fanboys (Are there any of those?) The snark factor in this one brought to you by AI. Star Wars and Zoolnader puns? All me. Oracle Snark? Justin’s AI prompts. </li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2051582/c1e-nj4jtd32wxadj94g-qdm7n89vsv48-j9x522.mp3" length="87372736"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 305 of The Cloud Pod – where the forecast is always cloudy! How did you do on your Microsoft Build Predictions? As badly as us? Plus we’ve got news on AWS service changes, a lifecycle catch up page for all those services that bought the farm, tons of Gemini news (seriously, like a lot) and even some AI for .NET. 
Welcome to the cloud pod- and thanks for joining us! 
Titles we almost went with this week:

Google’s Jules: An AI Gem for Cloud Devs  
Autonomous Agents of Code: Jules’ Excellent Adventure in the Google Cloud
Gemini 2.5 Shoots for the Stars with Cosmic-Sized AI Upgrades
Resistance is Futile: OpenAI Assimilates Your Codebase 
AWS Transformers: Rise of the Agentic AI 
Teaching an old .NET dog new Linux tricks
CodeBuild Puts Docker Builds in Hyperdrive
Inspector Gadget’s New Trick: Mapping Container Vulnerabilities
Yo Dawg, I Heard You Like Scanning Containers…
Google Cranks AI to 11 with New Ultra Plan
I, For One, Welcome Our New AI Ultra Overlords
The Inference Engine That Could: llm-d Chugs Ahead with Kubernetes-Native 
      Scaling
Scaling Inference to Infinity and Beyond with Google Cloud’s llm-d
Google Cloud and Spring AI: A Match Made in Java-n
The Fast and the Serverless: Cloud Run Drifts into AI Studio Territory
SQL Server 2025: A Vector Victor, Not a Scalar Failure
AI will solve my life problems of having money in my pocket
I used to scan all the containers but now I will just scan yours

AI Is Going Great – or How ML Makes Money 
01:50 Jules: Google’s autonomous AI coding agent

Jules is an autonomous AI agent that can read code, understand intent, and make code changes on its own. 
It goes beyond AI coding assistants to operate independently.
It clones code into a secure Google Cloud VM, allowing it to understand the full context of a project. This enables it to write tests, build features, fix bugs, and more.
Jules operates asynchronously in the background, presenting its plan and reasoning when complete. This allows developers to focus on other tasks while it works.
Integration with GitHub enables Jules to work directly in existing workflows without extra setup or context switching. Developers can steer and give feedback throughout the process.
For cloud developers, Jules demonstrates the rapid advancement of AI for coding moving from prototype to product. Its cloud-based parallel execution enables efficient handling of complex, multi-file changes.
While in public beta, Jules is free with some usage limits. This allows developers to experiment with this cutting-edge AI coding agent and understand its potential to accelerate development on Google Cloud.

02:56  Ryan – “More and more, as new tools get released, it’s just going to change the way anything gets written… it’s getting more and more capable.” 
05:45 Introducing Flow: Google’s AI filmmaking tool designed for Veo

Flow is an AI-powered filmmaking tool custom-designed for Google’s advanced video, image and language models (Veo, Imagen, Gemini). It allows creators to generate cinem...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2051582/c1a-k5d5-mk473np9fj3r-ltmz9n.jpg"></itunes:image>
                                                                            <itunes:duration>01:12:49</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2051582/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[304: It’s Chile Up Here in The Cloud!]]>
                </title>
                <pubDate>Thu, 22 May 2025 05:52:55 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2044366</guid>
                                    <link>https://tcpfm.castos.com/episodes/304-its-chile-up-here-in-the-cloud</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 304 of The Cloud Pod – where the forecast is always cloudy! Justin, Ryan and Matt are in the house tonight to bring you all the latest and greatest in Cloud and AI news, including AWS new Chilean region, the ongoing tug of war between Open AI and Microsoft, and even some K8 updates – plus an aftershow. Let’s get started! </p>
<h3>Titles we almost went with this week:</h3>
<ul>
<li>Open AI gets a COO delivered</li>
<li>Things get Chile with new regions</li>
<li>Observability and AI, I Q-uestion the logic</li>
<li>Cloud Pod tries to Microsoft Build predictions</li>
<li>K8 resizes pods on the fly</li>
<li>Microsoft strongly reinforces the AI Foundry</li>
<li>The Cloud Pod renegotiates the hosts’ contracts … we now have to pay the Cloud Pod to be on it </li>
</ul>
<h2>Follow Up </h2>
<p>01:53 <a href="https://blog.google/outreach-initiatives/public-policy/doj-search-remedies-may-2025/">DOJ’s extreme proposals will hurt consumers and America’s tech </a><a href="https://blog.google/outreach-initiatives/public-policy/doj-search-remedies-may-2025/">leadership</a> </p>
<ul>
<li style="font-weight:400;">We previously talked about the DOJ and <a href="https://www.bing.com/ck/a?!&amp;&amp;p=1dce9f7d9dd3eb415f6ac5f5f4a1f909643a18c2caf414d4ac2d34e3e1504a52JmltdHM9MTc0NzY5OTIwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=google&amp;u=a1aHR0cHM6Ly93d3cuZ29vZ2xlLmNvbS8&amp;ntb=1">Google</a> Antitrust lawsuit – and now the DOJ has wrapped up their <a href="https://blog.google/outreach-initiatives/public-policy/doj-search-remedies-apr-2025/">remedies hearing</a>, and Google has *not* been quiet about it.</li>
<li style="font-weight:400;">One of the claims is that the remedies would hurt browser choice, putting browsers like Firefox <a href="https://www.theverge.com/news/660548/firefox-google-search-revenue-share-doj-antitrust-remedies">out of business</a> completely. </li>
<li style="font-weight:400;">Google also claimed that data disclosure mandates would threaten user’s privacy – it would be MUCH safer if they could just sell it to you via their marketplace. </li>
<li style="font-weight:400;">We do agree that divesting Chrome would make things more complicated for people living in the Google Cloud. </li>
<li style="font-weight:400;">Really, what comes down to is that Google claims DOJ’s solutions are the wrong solutions – although to us, Google’s solutions aren’t much better. </li>
</ul>
<h2>AI – Or How ML Makes Money </h2>
<p>09:20 <a href="https://openai.com/index/leadership-expansion-with-fidji-simo/">OpenAI Expands Leadership with Fidji Simo</a> </p>
<p><a href="https://www.theinformation.com/articles/openai-talks-hire-senior-executive-major-leadership-role?rc=3t8xtd">OpenAI Hires Instacart CEO Simo For Major Leadership Role</a> </p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/">OpenAI</a> is hiring Fidji Simo as the CEO of applications, representing a major restructuring of leadership at the company. </li>
<li style="font-weight:400;">She was the CEO at <a href="https://www.bing.com/aclk?ld=e89C50KnB4bxovgfnrUdBnrjVUCUwEG_AD3T0DCBmqhWoM-VryfyqQIVD4NYjGE3YG07z7_ieSDj_DJsQbELXDG4I64lXQc8QEEAFKNunIV7i_XsQh9qeuLIOR3-y-KQguwgxpV2pMdrPczKlZ4iXQne4NtlExCctLoB4psN9_i0e0XSJnO3x3qrXCJMabImsILpunZw&amp;u=aHR0cHMlM2ElMmYlMmZ3d3cuaW5zdGFjYXJ0LmNvbSUyZiUzZnV0bV9tZWRpdW0lM2RzZW0lMjZ1dG1fc291cmNlJTNkaW5zdGFjYXJ0X2JpbmclMjZ1dG1fY2FtcGFpZ24lM2RhZF9kZW1hbmRfc2VhcmNoX2JyYW5kX1VTLUNBX2hlYWR0ZXJtX2V4YWN0JTI1MjZwaHJhc2VfZGVza3RvcF9BVURBQ1QlMjZ1dG1fY29udGVudCUzZGFjY291bnRpZC00NzAwMjM3NV9jYW1wYWlnbmlkLTY1MDI5NzI4Nl9hZGdyb3VwaWQtMTMyNDkxMzkwODk4NDQ0M19kZXZpY2UtY19hZGlkLTgyODA3MzQ4MzExNjEzX25ldHdvcmstbyUyNnV0bV90ZXJtJTNkbWF0Y2h0eXBlLWVfa2V5d29yZC1pbnN0YWNhcnRfdGFyZ2V0aWQta3dkLTgyODA4MjY0MzUzODI2JTNhbG9jLTQwODRfbG9jYXRpb25pZC04MDcxNCUyNmtza3dpZCUzZDUwMTk5NCUyNmtzYWRpZCUzZDQ1ODI3MTclMjZtc2Nsa2lkJTNkOTUxNTliMTc3ZThmMWVkMDNkM2ZmMzNhNGRmMTg5ZDY&amp;rlid=95159b177e8f1ed03d3ff33a4df189d6&amp;am..."></a></li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:00) - The Cloud Pod</li><li>(00:02:05) - Google Lashes Out Over DOJ's Antitrust Proposal</li><li>(00:06:19) - Does a Google Divested Chrome Affect the Internet?</li><li>(00:09:17) - OpenAI Expands Leadership Team</li><li>(00:10:22) - OpenAI's Nonprofit Status</li><li>(00:11:34) - OpenAI Announces OpenAI for Countries and Data Residency for</li><li>(00:13:58) - OpenAI in Tough Negotiations With Microsoft</li><li>(00:16:48) - Terraform's AWS Provider Hits 4 Billion Downloads</li><li>(00:17:41) - Amazon Terraform Provider 6.0 in Public Beta</li><li>(00:23:14) - Amazon Launches New AWS Region in Chile</li><li>(00:24:29) - Amazon Q Developer support to OpenSearch</li><li>(00:27:04) - Kubernetes 1.33 Release Notes</li><li>(00:31:23) - Does AWS have cloud commitment insurance?</li><li>(00:33:25) - Google's Gecko Tool for Generative AI</li><li>(00:35:54) - First Build Prediction: GitHub Copilot</li><li>(00:37:04) - Microsoft's LLM for OpenAI</li><li>(00:38:11) - Intel Announces New Quantum Computing Chip</li><li>(00:39:17) - Third Choice: Microsoft Office PC Updates</li><li>(00:40:30) - Top Three Office Products for 2020</li><li>(00:42:07) - Google, Microsoft's AI Competitor</li><li>(00:42:46) - The Number of Times Copilot Is Invited to Microsoft's Conference</li><li>(00:46:15) - Microsoft Giving Virtual Data Center Tours</li><li>(00:49:21) - Azure Storage Actions</li><li>(00:52:15) - How many storage accounts can I have in a subscription?</li><li>(00:54:46) - Azure Storage Actions</li><li>(00:59:41) - "Oh, I can't handle that!"</li><li>(01:00:14) - Red Hat Summit 2025 & Azure Migrate</li><li>(01:02:57) - Azure AI: Reinforcement Fine-tuning (RFT</li><li>(01:05:48) - Cloud Podcast: Week 3</li><li>(01:06:40) - Linux Kernels to Drop 486 CPUs</li><li>(01:09:29) - Can I Run Linux on a 486?</li><li>(01:14:07) - AMD vs Intel: Which Is The Best?</li><li>(01:16:21) - 486 compatibility in the Linux kernel</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 304 of The Cloud Pod – where the forecast is always cloudy! Justin, Ryan and Matt are in the house tonight to bring you all the latest and greatest in Cloud and AI news, including AWS new Chilean region, the ongoing tug of war between Open AI and Microsoft, and even some K8 updates – plus an aftershow. Let’s get started! 
Titles we almost went with this week:

Open AI gets a COO delivered
Things get Chile with new regions
Observability and AI, I Q-uestion the logic
Cloud Pod tries to Microsoft Build predictions
K8 resizes pods on the fly
Microsoft strongly reinforces the AI Foundry
The Cloud Pod renegotiates the hosts’ contracts … we now have to pay the Cloud Pod to be on it 

Follow Up 
01:53 DOJ’s extreme proposals will hurt consumers and America’s tech leadership 

We previously talked about the DOJ and Google Antitrust lawsuit – and now the DOJ has wrapped up their remedies hearing, and Google has *not* been quiet about it.
One of the claims is that the remedies would hurt browser choice, putting browsers like Firefox out of business completely. 
Google also claimed that data disclosure mandates would threaten user’s privacy – it would be MUCH safer if they could just sell it to you via their marketplace. 
We do agree that divesting Chrome would make things more complicated for people living in the Google Cloud. 
Really, what comes down to is that Google claims DOJ’s solutions are the wrong solutions – although to us, Google’s solutions aren’t much better. 

AI – Or How ML Makes Money 
09:20 OpenAI Expands Leadership with Fidji Simo 
OpenAI Hires Instacart CEO Simo For Major Leadership Role 

OpenAI is hiring Fidji Simo as the CEO of applications, representing a major restructuring of leadership at the company. 
She was the CEO at ]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[304: It’s Chile Up Here in The Cloud!]]>
                </itunes:title>
                                    <itunes:episode>304</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 304 of The Cloud Pod – where the forecast is always cloudy! Justin, Ryan and Matt are in the house tonight to bring you all the latest and greatest in Cloud and AI news, including AWS new Chilean region, the ongoing tug of war between Open AI and Microsoft, and even some K8 updates – plus an aftershow. Let’s get started! </p>
<h3>Titles we almost went with this week:</h3>
<ul>
<li>Open AI gets a COO delivered</li>
<li>Things get Chile with new regions</li>
<li>Observability and AI, I Q-uestion the logic</li>
<li>Cloud Pod tries to Microsoft Build predictions</li>
<li>K8 resizes pods on the fly</li>
<li>Microsoft strongly reinforces the AI Foundry</li>
<li>The Cloud Pod renegotiates the hosts’ contracts … we now have to pay the Cloud Pod to be on it </li>
</ul>
<h2>Follow Up </h2>
<p>01:53 <a href="https://blog.google/outreach-initiatives/public-policy/doj-search-remedies-may-2025/">DOJ’s extreme proposals will hurt consumers and America’s tech </a><a href="https://blog.google/outreach-initiatives/public-policy/doj-search-remedies-may-2025/">leadership</a> </p>
<ul>
<li style="font-weight:400;">We previously talked about the DOJ and <a href="https://www.bing.com/ck/a?!&amp;&amp;p=1dce9f7d9dd3eb415f6ac5f5f4a1f909643a18c2caf414d4ac2d34e3e1504a52JmltdHM9MTc0NzY5OTIwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=google&amp;u=a1aHR0cHM6Ly93d3cuZ29vZ2xlLmNvbS8&amp;ntb=1">Google</a> Antitrust lawsuit – and now the DOJ has wrapped up their <a href="https://blog.google/outreach-initiatives/public-policy/doj-search-remedies-apr-2025/">remedies hearing</a>, and Google has *not* been quiet about it.</li>
<li style="font-weight:400;">One of the claims is that the remedies would hurt browser choice, putting browsers like Firefox <a href="https://www.theverge.com/news/660548/firefox-google-search-revenue-share-doj-antitrust-remedies">out of business</a> completely. </li>
<li style="font-weight:400;">Google also claimed that data disclosure mandates would threaten user’s privacy – it would be MUCH safer if they could just sell it to you via their marketplace. </li>
<li style="font-weight:400;">We do agree that divesting Chrome would make things more complicated for people living in the Google Cloud. </li>
<li style="font-weight:400;">Really, what comes down to is that Google claims DOJ’s solutions are the wrong solutions – although to us, Google’s solutions aren’t much better. </li>
</ul>
<h2>AI – Or How ML Makes Money </h2>
<p>09:20 <a href="https://openai.com/index/leadership-expansion-with-fidji-simo/">OpenAI Expands Leadership with Fidji Simo</a> </p>
<p><a href="https://www.theinformation.com/articles/openai-talks-hire-senior-executive-major-leadership-role?rc=3t8xtd">OpenAI Hires Instacart CEO Simo For Major Leadership Role</a> </p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/">OpenAI</a> is hiring Fidji Simo as the CEO of applications, representing a major restructuring of leadership at the company. </li>
<li style="font-weight:400;">She was the CEO at <a href="https://www.bing.com/aclk?ld=e89C50KnB4bxovgfnrUdBnrjVUCUwEG_AD3T0DCBmqhWoM-VryfyqQIVD4NYjGE3YG07z7_ieSDj_DJsQbELXDG4I64lXQc8QEEAFKNunIV7i_XsQh9qeuLIOR3-y-KQguwgxpV2pMdrPczKlZ4iXQne4NtlExCctLoB4psN9_i0e0XSJnO3x3qrXCJMabImsILpunZw&amp;u=aHR0cHMlM2ElMmYlMmZ3d3cuaW5zdGFjYXJ0LmNvbSUyZiUzZnV0bV9tZWRpdW0lM2RzZW0lMjZ1dG1fc291cmNlJTNkaW5zdGFjYXJ0X2JpbmclMjZ1dG1fY2FtcGFpZ24lM2RhZF9kZW1hbmRfc2VhcmNoX2JyYW5kX1VTLUNBX2hlYWR0ZXJtX2V4YWN0JTI1MjZwaHJhc2VfZGVza3RvcF9BVURBQ1QlMjZ1dG1fY29udGVudCUzZGFjY291bnRpZC00NzAwMjM3NV9jYW1wYWlnbmlkLTY1MDI5NzI4Nl9hZGdyb3VwaWQtMTMyNDkxMzkwODk4NDQ0M19kZXZpY2UtY19hZGlkLTgyODA3MzQ4MzExNjEzX25ldHdvcmstbyUyNnV0bV90ZXJtJTNkbWF0Y2h0eXBlLWVfa2V5d29yZC1pbnN0YWNhcnRfdGFyZ2V0aWQta3dkLTgyODA4MjY0MzUzODI2JTNhbG9jLTQwODRfbG9jYXRpb25pZC04MDcxNCUyNmtza3dpZCUzZDUwMTk5NCUyNmtzYWRpZCUzZDQ1ODI3MTclMjZtc2Nsa2lkJTNkOTUxNTliMTc3ZThmMWVkMDNkM2ZmMzNhNGRmMTg5ZDY&amp;rlid=95159b177e8f1ed03d3ff33a4df189d6&amp;ntb=1">Instacart</a> prior to this new role. </li>
<li style="font-weight:400;">Altman will continue to oversee research and infrastructure teams that are core to the company’s AI development, while leaving the rest of the company to Simo. </li>
<li style="font-weight:400;">One of the key areas Simo will focus on is managing executives. Under Altman, turf wars festered and sometimes key decisions were delayed after receiving requests for computing or bigger headcounts. </li>
<li style="font-weight:400;">That history factored into some of the decisions to oust him and the departure of Mira Murati. </li>
<li style="font-weight:400;">Show editor note: The Information did Simo DIRTY when they chose that lead pic. </li>
</ul>
<p>11:43 <a href="https://openai.com/global-affairs/openai-for-countries/">Introducing OpenAI for Countries</a>  </p>
<p><a href="https://openai.com/index/introducing-data-residency-in-asia/">Introducing Data Residency in Asia</a></p>
<ul>
<li style="font-weight:400;">In addition to the leadership changes, they are also announcing OpenAI for countries, a new initiative within the stargate project. </li>
<li style="font-weight:400;">Through formalized infrastructure collaborations, and in in coordination with the US government, open AI will:
<ul>
<li style="font-weight:400;">Partner with countries to help build in-country-data center capacity</li>
<li style="font-weight:400;">Provided customized <a href="https://help.openai.com/en/articles/9903489-data-residency-for-chatgpt">ChatGPT</a> to citizens</li>
<li style="font-weight:400;">Continue evolving security and safety controls for AI models</li>
<li style="font-weight:400;">Together, raise and deploy a national startup fund</li>
<li style="font-weight:400;">This doesn’t sound ominous at all</li>
</ul>
</li>
<li style="font-weight:400;">Open AI is announcing data residency for Japan, India, Singapore and South Korea for Chat GPT Enterprise, ChatGPT EDU and the <a href="https://platform.openai.com/docs/guides/your-data">API platform</a>. </li>
<li style="font-weight:400;">This lets organizations meet local data sovereignty requirements when using OpenAI products in their businesses and building new solutions with AI. </li>
</ul>
<p>13:42  Justin – “They are supposed to be in other countries…but they could be built in the US on the Stargate infrastructure for other countries as well – that’s a possible scenario.” </p>
<p>14:10 <a href="https://techcrunch.com/2025/05/11/microsoft-and-openai-may-be-renegotiating-their-partnership/">Microsoft and OpenAI may be renegotiating their partnership</a></p>
<ul>
<li style="font-weight:400;">TechCrunch is reporting that OpenAI is in a <a href="https://www.ft.com/content/8d9e5149-7e4f-4886-a035-9d200204972a">tough negotiation with Microsoft</a>. </li>
<li style="font-weight:400;">The AI startup <a href="https://techcrunch.com/2025/05/05/openai-reverses-course-says-its-nonprofit-will-remain-in-control-of-its-business-operations/">is trying to restructure itself</a>, with its business arm in a for-profit public benefit corporation, while its non-profit board will remain in control. </li>
<li style="font-weight:400;">Microsoft is apparently the key holdout, and after investing $13B to date, they need to approve the restructuring.</li>
<li style="font-weight:400;">The main issue is how much Equity MS will receive in the for-profit entity, the companies are also apparently renegotiating their broader contract, with Microsoft offering to give up some of its equity in exchange for access to OpenAI tech developed after the current 2030 cutoff. </li>
<li style="font-weight:400;">These negotiations are complicated due to the increasing competitive pressure between the companies. 
<ul>
<li style="font-weight:400;"><a href="https://docs.github.com/en/enterprise-cloud@latest/copilot/using-github-copilot/ai-models/using-claude-sonnet-in-github-copilot">https://docs.github.com/en/enterprise-cloud@latest/copilot/using-github-copilot/ai-models/using-claude-sonnet-in-github-copilot</a> </li>
</ul>
</li>
</ul>
<p>14:48  Matt – “It’s amazing to me that Microsoft wants to put all of their eggs in the OpenAI basket.” </p>
<p> </p>
<h2>Cloud Tools </h2>
<p>17:03 <a href="https://www.hashicorp.com/en/blog/terraform-aws-provider-tops-4-billion-downloads-6-0-now-in-public-beta">Terraform AWS provider tops 4 billion downloads, 6.0 now in public beta</a>  </p>
<ul>
<li style="font-weight:400;">The <a href="https://developer.hashicorp.com/terraform/tutorials/aws-get-started">AWS Terraform</a> provider is the engine that continues to drive massive downloads, with them just eclipsing 4 billion downloads – with 569.3M just this year. </li>
<li style="font-weight:400;"><a href="https://www.bing.com/ck/a?!&amp;&amp;p=7f3bd558366390da0f35695286c95e0ba3264326e9390280ff7e720eed2f46bbJmltdHM9MTc0NzY5OTIwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=terraform+6.&amp;u=a1aHR0cHM6Ly9yZWdpc3RyeS50ZXJyYWZvcm0uaW8vcHJvdmlkZXJzL2hhc2hpY29ycC9hd3MvbGF0ZXN0L2RvY3MvZ3VpZGVzL3ZlcnNpb24tNi11cGdyYWRl&amp;ntb=1">The 6.0 Terraform provider</a> is now in public beta, bringing a lot of exciting changes to the provider. </li>
<li style="font-weight:400;">Enhanced Region Support:
<ul>
<li style="font-weight:400;">Previously, the Terraform AWS Provider only targeted a single AWS region. This limitation meant that practitioners had to update every configuration file individually if they wanted to change the configuration of a particular resource. For global companies, this could mean editing the same parameter in 32 separate configuration files for each region. </li>
<li style="font-weight:400;">Now you can support multiple regions all within a single configuration file. The new approach leverages an inject region attribute at the resource level to simplify configuration efforts.  This reduces the need to load multiple instances of the AWS provider, lowering memory usage overall. </li>
<li style="font-weight:400;">Some of the key highlights include:
<ul>
<li style="font-weight:400;">Single provider config. Reducing the need to load multiple instances of the provider and lowering memory usage</li>
<li style="font-weight:400;">Region attribute injection with the region argument</li>
<li style="font-weight:400;">Global resource exclusions — services like IAM, cloudfront and route 53 remain unaffected as they operate globally. </li>
<li style="font-weight:400;">Terraform plugin framework updates – adjustments to the AWS API client mechanism to support per region API client mappings</li>
<li style="font-weight:400;">Resource import enhancements to allow the @ suffice to allow importing of resources from different regions. </li>
<li style="font-weight:400;">Improved document and testing to ensure backward compatibility. </li>
</ul>
</li>
</ul>
</li>
<li style="font-weight:400;">EC2 Instance User Data Improvements
<ul>
<li style="font-weight:400;">Updating the diffs to show user_data changes instead of hashed Values (HALLELUJAH)</li>
<li style="font-weight:400;">But you’ll really want to make sure you don’t have secrets in user-data now. </li>
</ul>
</li>
<li style="font-weight:400;">Services being deprecated:
<ul>
<li style="font-weight:400;"><a href="https://www.bing.com/ck/a?!&amp;&amp;p=adf1a988d1d6e99729dd3c980d542c4d2d728191a1ae0c5f648f76a0a4c09d07JmltdHM9MTc0NzY5OTIwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=amazon+chime&amp;u=a1aHR0cHM6Ly9hcHAuY2hpbWUuYXdzLw&amp;ntb=1">Amazon Chime</a>, <a href="https://www.bing.com/ck/a?!&amp;&amp;p=6f5249a0c5f8374600014bf3aa6bfe40681b8281c1d24c28fca37a06b5b54494JmltdHM9MTc0NzY5OTIwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=cloudwatch+evidently&amp;u=a1aHR0cHM6Ly9hd3MuYW1hem9uLmNvbS9ibG9ncy9hd3MvY2xvdWR3YXRjaC1ldmlkZW50bHkv&amp;ntb=1">CloudWatch Evidently</a>, <a href="https://www.bing.com/ck/a?!&amp;&amp;p=2cf21688c2c4a56bba0ef8d31241e36eee8902a322aaf5fab868932b7347c883JmltdHM9MTc0NzY5OTIwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=amazon+elastic+transcoder&amp;u=a1aHR0cHM6Ly9hd3MuYW1hem9uLmNvbS9lbGFzdGljdHJhbnNjb2Rlci8&amp;ntb=1">Amazon Elastic Transcoder</a>, <a href="https://www.bing.com/ck/a?!&amp;&amp;p=3202765186e58f24f13a00336edeb02ec5a0f00f625dd98d6656b2774bfac16cJmltdHM9MTc0NzY5OTIwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=AWS+Elemental+mediastore&amp;u=a1aHR0cHM6Ly9hd3MuYW1hem9uLmNvbS9tZWRpYXN0b3JlLw&amp;ntb=1">AWS Elemental Mediastore</a></li>
<li style="font-weight:400;">Removed as already deprecated: Elastic Inference, Elastic Graphics, Opsworks Stacks, aws_simpledb_domains.</li>
</ul>
</li>
<li style="font-weight:400;">Other things of note:
<ul>
<li style="font-weight:400;">Will remove the S3 global endpoints in the providers</li>
</ul>
</li>
</ul>
<p>21:14  Justin – “You’re going to want to make sure you don’t have secrets in the user data; because this will not be hashed in the state file – they’ll now be in plain text in Terraform plan and Terraform apply dif.” </p>
<h2>AWS</h2>
<p>23:43 <a href="https://aws.amazon.com/blogs/aws/coming-soon-aws-south-america-chile-region/">In the works – AWS South America (Chile) Region</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/">AWS</a> announced plans to launch a new <a href="https://aws.amazon.com/about-aws/global-infrastructure/regions_az/">AWS region</a> in Chile by the end of 2026. </li>
<li style="font-weight:400;">The AWS Chile Region will consist of <a href="https://docs.aws.amazon.com/global-infrastructure/latest/regions/aws-availability-zones.html">three AZ’s</a> and will join the <a href="https://aws.amazon.com/blogs/aws/now-open-south-america-sao-paulo-region-ec2-s3-and-lots-more/">Sao Paulo</a> and <a href="https://aws.amazon.com/blogs/aws/now-open-aws-mexico-central-region/">Mexico</a> regions as the third in Latin America.</li>
</ul>
<p>24:55 <a href="https://aws.amazon.com/blogs/big-data/introducing-amazon-q-developer-in-amazon-opensearch-service/">Introducing Amazon Q Developer in Amazon OpenSearch Service</a></p>
<ul>
<li style="font-weight:400;">Many companies use <a href="https://docs.aws.amazon.com/opensearch-service/latest/developerguide/what-is.html">OpenSearch</a> to store operational and telemetry signal data. </li>
<li style="font-weight:400;">They use this data to monitor the health of their applications and infrastructure, however at scale the sheer volume and variety in data makes the process complex and time-consuming leading to high MTTRs. </li>
<li style="font-weight:400;">To address this, Amazon is introducing <a href="https://aws.amazon.com/blogs/big-data/category/amazon-q/amazon-q-developer/">Amazon Q Developer</a> support to OpenSearch.  </li>
<li style="font-weight:400;">This allows an AI-Assisted analysis, both new and experienced users can navigate complex operational data without training, analyze issues, and gain insights in a fraction of the time. </li>
<li style="font-weight:400;">Q Developer reduces MTTR by integrating generative AI capabilities directly into open search workflows. </li>
</ul>
<p>25:40  Ryan – “This is just adding natural text descriptions to the product; but couldn’t it just be a part of Open Search?”  </p>
<h2>GCP</h2>
<p>27:36  <a href="https://opensource.googleblog.com/2025/05/kubernetes-1.33-available-on-gke.html">Kubernetes 1.33 is available on GKE!</a></p>
<ul>
<li style="font-weight:400;"><a href="https://kubernetes.io/blog/2025/04/23/kubernetes-v1-33-release/">K8 1.33</a> is now available on <a href="https://www.bing.com/ck/a?!&amp;&amp;p=4747a50f93b5450c3551f34f35db5e24ce68b9097f6ffe239aaa51a9563310fcJmltdHM9MTc0NzY5OTIwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=GKE+rapid+channle&amp;u=a1aHR0cHM6Ly9jbG91ZC5nb29nbGUuY29tL2t1YmVybmV0ZXMtZW5naW5lL2RvY3MvcmVsZWFzZS1ub3Rlcy1yYXBpZA&amp;ntb=1">GKE Rapid Channel</a>. (Which hopefully none of you are using in production.) </li>
<li style="font-weight:400;">The 1.33 version has several enhancements including:
<ul>
<li style="font-weight:400;"><a href="https://kubernetes.io/docs/tasks/configure-pod-container/resize-container-resources/">In-Place Pod Resizing</a></li>
<li style="font-weight:400;"><a href="https://kubernetes.io/blog/2025/05/01/kubernetes-v1-33-dra-updates/">K8 Dynamic Resource Allocation</a></li>
<li style="font-weight:400;"><a href="https://cloud.google.com/kubernetes-engine/docs/deprecations/migrate-containerd-2">Containerd 2.0 runtime support</a></li>
<li style="font-weight:400;">Multiple Service <a href="https://github.com/kubernetes/enhancements/blob/master/keps/sig-network/1880-multiple-service-cidrs/README.md">Cidr Support</a></li>
<li style="font-weight:400;">Google itself contributed:
<ul>
<li style="font-weight:400;"><a href="https://github.com/kubernetes/enhancements/blob/master/keps/sig-api-machinery/4355-coordinated-leader-election/README.md">Coordinated Leader Election</a></li>
<li style="font-weight:400;"><a href="https://github.com/kubernetes/enhancements/tree/master/keps/sig-architecture/4330-compatibility-versions">Compatibility Versions</a></li>
<li style="font-weight:400;"><a href="https://kubernetes.io/docs/reference/instrumentation/zpages/">zPages</a></li>
<li style="font-weight:400;"><a href="https://github.com/kubernetes/enhancements/blob/master/keps/sig-api-machinery/5116-streaming-response-encoding/README.md">Streamline List responses</a></li>
<li style="font-weight:400;"><a href="https://github.com/kubernetes/enhancements/blob/master/keps/sig-api-machinery/4988-snapshottable-api-server-cache/README.md">Snapshottable API server cache</a></li>
<li style="font-weight:400;"><a href="https://github.com/kubernetes/enhancements/blob/master/keps/sig-api-machinery/5073-declarative-validation-with-validation-gen/README.md">Declarative Validation</a></li>
<li style="font-weight:400;"><a href="https://github.com/kubernetes/enhancements/blob/master/keps/sig-api-machinery/5080-ordered-namespace-deletion/README.md">Ordered Namespace Deletions</a></li>
</ul>
</li>
</ul>
</li>
</ul>
<p>29:58  Justin – “I do find it funny that it’s taken this long to get pod resizing. To be able to change the CPU memory request assigned to containers that are in a running pod seems like something that would have been needed a while ago.”</p>
<p>33:22 <a href="https://cloud.google.com/blog/products/ai-machine-learning/evaluate-your-gen-media-models-on-vertex-ai/">Evaluate your gen media models on Vertex AI</a></p>
<ul>
<li style="font-weight:400;">Google is releasing <a href="https://iclr.cc/virtual/2025/poster/30151">Gecko</a>, now available through Google Cloud’s <a href="https://cloud.google.com/vertex-ai/generative-ai/docs/models/evaluation-overview">Vertex AI evaluation service</a>. </li>
<li style="font-weight:400;">Gecko is a rubric-based and interpretable autorater for evaluating generative AI models that empowers developers with a more nuanced, customizable, and transparent way to assess the performance of image and video generation models. </li>
<li style="font-weight:400;">This is ideal to replace traditional human evaluation, while its the gold standard, it can be slow and costly, hindering rapid development cycles as Generative AI innovates rapidly. </li>
<li style="font-weight:400;">One of the challenges this Gecko solves is that when traditionally using auto-raters they lack the interpretability needed to understand model behavior and pinpoint areas for improvement. For instance, when evaluating a generated image depicts a text prompt, a single score doesn’t reveal WHY the model succeeded or failed. </li>
<li style="font-weight:400;">Gecko offers a fine-grained interpretable and customizable auto-rater.  This is based on a <a href="https://arxiv.org/abs/2404.16820">DeepMind</a> research paper, that an auto rater can reliably evaluate image and video generation across a range of skills, reducing the dependency on costly human judgement.  </li>
<li style="font-weight:400;">Notably, beyond its interoperability, Gecko exhibits strong performance and has already been instrumental in benchmarking the progress of leading models like <a href="https://deepmind.google/technologies/imagen-3/">Imagen</a>. </li>
</ul>
<h2>Azure</h2>
<p>Just so everyone is aware – Matt is making us do this, so here goes nothing…</p>
<p>34:56  Build Predictions</p>
<ul>
<li style="font-weight:400;">Ryan
<ul>
<li style="font-weight:400;">Announce an enhancement to GitHub Copilot, that allows agentic code development and agentic tasks. </li>
<li style="font-weight:400;">Quantum Computing – Double down on Majorna and quantum computing capabilities. </li>
<li style="font-weight:400;">Augmented/Virtual Reality for Teams</li>
</ul>
</li>
<li style="font-weight:400;">Matt
<ul>
<li style="font-weight:400;">New Version of the ARM processor Cobalt</li>
<li style="font-weight:400;">New generation of Surface hardware</li>
<li style="font-weight:400;">Major update to the App Services Platform in Azure  </li>
</ul>
</li>
<li style="font-weight:400;">Justin
<ul>
<li style="font-weight:400;">Microsoft will launch their own LLM</li>
<li style="font-weight:400;">Microsoft Office Copilot upgrade with MCP inclusion in it.</li>
<li style="font-weight:400;">Agentspaces or Glean Type Competitor</li>
</ul>
</li>
<li style="font-weight:400;">Number of times copilot will be mentioned in the keynote
<ul>
<li style="font-weight:400;">55 Justin</li>
<li style="font-weight:400;">75 Matt</li>
<li style="font-weight:400;">62 Ryan</li>
<li style="font-weight:400;">1 Jonathan (who isn’t here)</li>
</ul>
</li>
</ul>
<p>46:46 <a href="https://azure.microsoft.com/en-us/blog/microsofts-virtual-datacenter-tour-opens-a-door-to-the-cloud/">Microsoft’s Virtual Datacenter Tour opens a door to the cloud</a></p>
<ul>
<li style="font-weight:400;">If your auditors love touring datacenters, or if you have a general curiosity about what a datacenter looks like (Justin has absolutely no desire) Microsoft is giving you the new <a href="https://datacenters.microsoft.com/tour">virtual datacenter tour</a>, where customers can explore the infrastructure and datacenter design that powers over 60 datacenter regions and 300 plus data centers globally. </li>
<li style="font-weight:400;">Microsoft wishes they could take you to the datacenter but its prohibitive security, safety and staffing issues, so they’re bringing the datacenter to you with the new virtual datacenter tour microsite, that includes a 3d self-guided virtual journey that will allow you to interact with the MS datacenter firsthand. </li>
<li style="font-weight:400;">You can even check out recent innovations like Microsoft’s zero-water cooling datacenter design, which eliminates water use in datacenter cooling plus <a href="https://azure.microsoft.com/en-us/blog/quantum/2025/02/19/microsoft-unveils-majorana-1-the-worlds-first-quantum-processor-powered-by-topological-qubits/">Majorna 1</a> the world’s first quantum chip powered by a topological core. </li>
<li style="font-weight:400;">We do think it might be cool if this was available in Oculus or Meta quests or whatever VR thing is popular with the youths these days.</li>
</ul>
<p>49:50 <a href="https://www.microsoft.com/en-us/microsoft-cloud/blog/2025/05/07/empowering-multi-agent-apps-with-the-open-agent2agent-a2a-protocol/">Empowering multi-agent apps with the open Agent2Agent (A2A) protocol</a></p>
<ul>
<li style="font-weight:400;">Microsoft knows a good OSS project when it sees it and it wants you to know that it is committed to advancing open protocols like <a href="https://github.com/google/A2A">Agent2Agent (A2A)</a>, coming soon to <a href="https://ai.azure.com/">Azure AI Foundry</a> and <a href="https://www.microsoft.com/en-us/microsoft-copilot/microsoft-copilot-studio">CoPilot Studio</a>, which will enable agents to collaborate across clouds, platforms and organizational boundaries. </li>
<li style="font-weight:400;">As customers scale their AI systems, operability is no longer optional, says Microsoft. </li>
<li style="font-weight:400;">They are delivering with support for A2A
<ul>
<li style="font-weight:400;">Azure AI Foundry</li>
<li style="font-weight:400;">Copilot Studio</li>
</ul>
</li>
</ul>
<p>50:18 <a href="https://azure.microsoft.com/en-us/blog/unlock-seamless-data-management-with-azure-storage-actions-now-generally-available/">Unlock seamless data management with Azure Storage Actions—now </a><a href="https://azure.microsoft.com/en-us/blog/unlock-seamless-data-management-with-azure-storage-actions-now-generally-available/">generally available</a></p>
<ul>
<li style="font-weight:400;">Azure is announcing the GA of <a href="https://aka.ms/Azure-Storage-Actions-Productpage">Azure Storage Actions</a>, their fully managed platform that transforms how organizations automate data management tasks for <a href="https://www.bing.com/ck/a?!&amp;&amp;p=4e62e8dcb1e51ee28c60e9871d9cd1f1c89e4631dde5309b8a88c63e0bf62c10JmltdHM9MTc0Nzc4NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=azure+blob&amp;u=a1aHR0cHM6Ly9henVyZS5taWNyb3NvZnQuY29tL2VuLXVzL3Byb2R1Y3RzL3N0b3JhZ2UvYmxvYnMvP21zb2NraWQ9MGE4MTNhNjAxZDQ0NjQ5MDI0YTIyZmEyMWNhYTY1MDI&amp;ntb=1">Azure Blob</a> and <a href="https://www.bing.com/ck/a?!&amp;&amp;p=80592f2797e1e91dbfbac6e19bfd8af623acbc91807bec66488a90cec640ac5cJmltdHM9MTc0Nzc4NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=data+lake+storage&amp;u=a1aHR0cHM6Ly9sZWFybi5taWNyb3NvZnQuY29tL2VuLXVzL2F6dXJlL3N0b3JhZ2UvYmxvYnMvZGF0YS1sYWtlLXN0b3JhZ2UtaW50cm9kdWN0aW9u&amp;ntb=1">Data Lake Storage</a>. </li>
<li style="font-weight:400;">Today, customers use disparate tools to manage their data estates. Depending on dataset size and use cases, they may use analytics queries with inventory reports, write programs or scripts to list all objects and metadata, or subscribe to storage events or change feed for filtering. </li>
<li style="font-weight:400;">The key advantage of storage actions is:
<ul>
<li style="font-weight:400;">Eliminating complexity</li>
<li style="font-weight:400;">Boosting your efficiency</li>
<li style="font-weight:400;">Drive consistency</li>
<li style="font-weight:400;">Hands free operations</li>
</ul>
</li>
</ul>
<p>54:32 Matt – “In AWS terms a storage account is an S3 bucket – so each bucket you might want different things to happen in. And then in Azure, because they don’t really understand the cloud still, you can say this is one zone – versus multi zone versus – replicated to DR multi zone – versus replicate to DR single zone. And each of those has to be done at the storage account, AKA S3 bucket level, not the container level.”</p>
<p>1:00:59 <a href="https://azure.microsoft.com/en-us/blog/unlock-whats-next-microsoft-at-red-hat-summit-2025/">Unlock what’s next: Microsoft at Red Hat Summit 2025</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.redhat.com/en/summit">Red Hat Summit 2025</a> is around the corner, and Microsoft is a platinum sponsor. They will showcase several new capabilities:
<ul>
<li style="font-weight:400;">RHEL for WSL</li>
<li style="font-weight:400;">Azure Red Hat OpenShift</li>
<li style="font-weight:400;">RHEL Landing Zone for Azure</li>
<li style="font-weight:400;">Application awareness and wave planning in <a href="https://www.bing.com/ck/a?!&amp;&amp;p=e29e23ff980092009c58c564a7bb5a40905d35bcfa0d041fd5a68ceb28d37305JmltdHM9MTc0Nzc4NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=azure+migrae&amp;u=a1aHR0cHM6Ly9henVyZS5taWNyb3NvZnQuY29tL2VuLXVzL3Byb2R1Y3RzL2F6dXJlLW1pZ3JhdGUvP21zb2NraWQ9MGE4MTNhNjAxZDQ0NjQ5MDI0YTIyZmEyMWNhYTY1MDI&amp;ntb=1">Azure Migrate</a></li>
<li style="font-weight:400;">JBoss EAP on App Services</li>
<li style="font-weight:400;">JBoss EAP on Azure Virtual Machines</li>
</ul>
</li>
</ul>
<p>1:03:48 <a href="https://azure.microsoft.com/en-us/blog/announcing-new-fine-tuning-models-and-techniques-in-azure-ai-foundry/">Announcing new fine-tuning models and techniques in Azure AI </a> <a href="https://azure.microsoft.com/en-us/blog/announcing-new-fine-tuning-models-and-techniques-in-azure-ai-foundry/">Foundry</a></p>
<ul>
<li style="font-weight:400;">Azure is announcing three enhancements to model fine tuning with Azure AI foundry. 
<ul>
<li style="font-weight:400;">ReInforcement Fine-Tuning (RFT) with <a href="https://azure.microsoft.com/en-us/products/ai-foundry">o4-mini</a> (coming soon)</li>
<li style="font-weight:400;">Supervised Fine-Tuning (SFT) for the gpt-4.1-nano (available now)</li>
<li style="font-weight:400;"><a href="https://www.bing.com/ck/a?!&amp;&amp;p=192f6cc3643e02884b1ba608cfb97398cddb040096fa32a5af7f2119ed204875JmltdHM9MTc0Nzc4NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=llama+4+scout&amp;u=a1aHR0cHM6Ly93d3cubGxhbWEuY29tL21vZGVscy9sbGFtYS00Lw&amp;ntb=1">Llama 4</a> Scout Model (available now) </li>
</ul>
</li>
<li style="font-weight:400;">Reinforcement fine tuning introduces a new level of control for aligning model behavior with complex business logic, rewarding accurate reasoning and penalizing undesirable outputs, RFT improves model decision making in dynamic or high-stakes environments.</li>
<li style="font-weight:400;">RFT is best suited for use cases where adaptability, iterative learning and domain-specific behavior are essential. RFT should be considered in the following scenarios:
<ul>
<li style="font-weight:400;">Custom Rules where decision logic is highly specific to your organization and cannot be easily captured through static prompts or traditional training data. </li>
<li style="font-weight:400;">Domain specific operational standard where internal procedures diverge from industry norms and where success depends on adhering to those bespoke standards.  RFT’s can effectively encode procedural variations, such as extended timelines or modified compliance thresholds, into the model behavior. </li>
<li style="font-weight:400;">High decision-making complexity: RFT excels in domains with layered logic and variable rich decision trees. When outcomes depend on navigating numerous subcases or dynamically weighing multiple inputs, RFT helps models generalize across complexity and deliver more consistent, accurate decisions. </li>
</ul>
</li>
<li style="font-weight:400;">Supervised Fine Tuning allows you to install your models with company-specific tone, terminology, workflows and structured outputs — all tailored to your domain.  This is well suited for large scale workloads like:
<ul>
<li style="font-weight:400;">Customer support automation, where models handle thousands of tickets per hour with consistent tone and accuracy</li>
<li style="font-weight:400;">Internal knowledge assistants that follow company style and protocol in summarizing documentation or responding to FAQs. </li>
</ul>
</li>
</ul>
<p>1:06:19  Ryan – “It’s a continuance of the trend of more and more customization of these large language models. At the beginning, everyone was training their own bespoke models, but now with RAGs and RFTs and a whole bunch of grounding you can really tailor your existing model to your workload.” </p>
<h2>After Show </h2>
<p>1:07:22 <a href="https://arstechnica.com/gadgets/2025/05/linux-to-end-support-for-1989s-hottest-chip-the-486-with-next-release/">Linux to end support for 1989’s hottest chip, the 486, with next release – </a><a href="https://arstechnica.com/gadgets/2025/05/linux-to-end-support-for-1989s-hottest-chip-the-486-with-next-release/">Ars Technica</a> </p>
<ul>
<li style="font-weight:400;">First of all, we had no idea. </li>
<li style="font-weight:400;">Second… Can you even get 486 chips still? And the answer is yes second hand… but you could have bought brand new from Intel until 2007!!!!!!!</li>
<li style="font-weight:400;"><a href="https://distrowatch.com/search.php?ostype=All&amp;category=All&amp;origin=All&amp;basedon=All&amp;notbasedon=None&amp;desktop=All&amp;architecture=i386&amp;package=All&amp;rolling=All&amp;isosize=All&amp;netinstall=All&amp;language=All&amp;defaultinit=All&amp;status=Active#simpleresults">https://distrowatch.com/search.php?ostype=All&amp;category=All&amp;origin=All&amp;basedon=All&amp;notbasedon=None&amp;desktop=All&amp;architecture=i386&amp;package=All&amp;rolling=All&amp;isosize=All&amp;netinstall=All&amp;language=All&amp;defaultinit=All&amp;status=Active#simpleresults</a> </li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2044366/c1e-1xdxb53pjxa4mx9g-jpdgx4ojur2o-ldcenm.mp3" length="92270176"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 304 of The Cloud Pod – where the forecast is always cloudy! Justin, Ryan and Matt are in the house tonight to bring you all the latest and greatest in Cloud and AI news, including AWS new Chilean region, the ongoing tug of war between Open AI and Microsoft, and even some K8 updates – plus an aftershow. Let’s get started! 
Titles we almost went with this week:

Open AI gets a COO delivered
Things get Chile with new regions
Observability and AI, I Q-uestion the logic
Cloud Pod tries to Microsoft Build predictions
K8 resizes pods on the fly
Microsoft strongly reinforces the AI Foundry
The Cloud Pod renegotiates the hosts’ contracts … we now have to pay the Cloud Pod to be on it 

Follow Up 
01:53 DOJ’s extreme proposals will hurt consumers and America’s tech leadership 

We previously talked about the DOJ and Google Antitrust lawsuit – and now the DOJ has wrapped up their remedies hearing, and Google has *not* been quiet about it.
One of the claims is that the remedies would hurt browser choice, putting browsers like Firefox out of business completely. 
Google also claimed that data disclosure mandates would threaten user’s privacy – it would be MUCH safer if they could just sell it to you via their marketplace. 
We do agree that divesting Chrome would make things more complicated for people living in the Google Cloud. 
Really, what comes down to is that Google claims DOJ’s solutions are the wrong solutions – although to us, Google’s solutions aren’t much better. 

AI – Or How ML Makes Money 
09:20 OpenAI Expands Leadership with Fidji Simo 
OpenAI Hires Instacart CEO Simo For Major Leadership Role 

OpenAI is hiring Fidji Simo as the CEO of applications, representing a major restructuring of leadership at the company. 
She was the CEO at ]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2044366/c1a-k5d5-gp30j28vtj9v-zgevig.jpg"></itunes:image>
                                                                            <itunes:duration>01:16:54</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2044366/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[303: Someday You Will Find Me, Caught Beneath the AI Landslide, in a Champagne Premier Nova in The Sky]]>
                </title>
                <pubDate>Sun, 18 May 2025 20:29:59 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2042130</guid>
                                    <link>https://tcpfm.castos.com/episodes/303-someday-you-will-find-me-caught-beneath-the-ai-landslide-in-a-champagne-premier-nova-in-the-s</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 303 of The Cloud Pod – where the forecast is always cloudy! Justin, Ryan and exhausted dad Matt are here (and mostly awake) ready to bring the latest in cloud news! This week we’ve got more news from Nova, updates to Claude, earnings news, and a mini funeral for Skype – plus a new helping of Cloud Journey!</p>
<h3>Titles we almost went with this week:</h3>
<ul>
<li>Claude researches so Ryan can nap</li>
<li>The best AI for Nova Corps, Amazon Nova Premiere JB</li>
<li>If you can’t beat them, change the licensing terms and make them fork, and then </li>
<li>     reverse course… and profit</li>
<li>Q has invaded your IDE!!</li>
<li>Skype bites the dust</li>
</ul>
<h3>A big thanks to this week’s sponsor:</h3>
<h3>We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our Slack channel for more info. </h3>
<h2>Follow Up </h2>
<p>02:50 <a href="https://openai.com/index/sycophancy-in-gpt-4o/">Sycophancy in GPT-4o: What happened and what we’re doing about it</a></p>
<ul>
<li style="font-weight:400;">OpenAI wrote up a blog post about their sycophantic <a href="https://www.bing.com/ck/a?!&amp;&amp;p=3fc129c3812975421c5d14a2d2ca757c9dfa520a03d0c568fdb1c58e86512029JmltdHM9MTc0NzAwODAwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=gpt-4o&amp;u=a1aHR0cHM6Ly9vcGVuYWkuY29tL2luZGV4L2hlbGxvLWdwdC00by8&amp;ntb=1">Chat GPT 4o</a> upgrade last week, and they wanted to set the record straight. </li>
<li style="font-weight:400;">They made adjustments at improving the models default personality to make it feel more intuitive and effective across a variety of tasks. </li>
<li style="font-weight:400;">When shaping model behavior, they start with a baseline principle and instructions outlined in their model spec. </li>
<li style="font-weight:400;">They also teach their models how to apply these principles by incorporating user signals like thumbs up and thumbs down feedback on responses. </li>
<li style="font-weight:400;">In this update, though, they focused too much on short-term feedback and did not fully account for how users’ interactions with ChatGPT evolve. This skewed the results towards responses that were overly supportive – but disingenuous. </li>
<li style="font-weight:400;">Beyond rolling back the changes, they are taking steps to realign the model behavior, including refining core training techniques and system prompts to explicitly steer the model away from sycophancy. </li>
<li style="font-weight:400;">They also plan to build more guardrails to increase honesty and transparency principles in the model spec.</li>
<li style="font-weight:400;">Additionally, they plan to expand ways for users to test and give direct feedback before deployments.</li>
<li style="font-weight:400;">Lastly, OpenAI continues to expand evaluations building on the model sync and our ongoing research. </li>
</ul>
<p>04:43 Deep Research on Microsoft Hotpatching:</p>
<ul>
<li style="font-weight:400;">Yes, they’re grabbing money and screwing you. Basically. </li>
</ul>
<p>07:06  Justin – “I’m not going to give them any credit on this one. I appreciate that they created hotpatching, but I don’t like what you want to charge me for it.” </p>
<h2>General News</h2>
<p>It’s Earnings time – cue the sound effects!</p>
<p>08:03 <a href="https://www.businessinsider.com/alphabet-q1-earnings-2025-4">Alphabet’s Q1 earnings shattered analyst expectations, sending the stock </a><a href="https://www.businessinsider.com/alphabet-q1-earnings-2025-4">soaring. Google’s CEO credits its AI efforts</a></p>
<p><a href="https://blog.google/inside-google/message-ceo/alphabet-earnings-q1-2025/">Alphabet Q1 2025 earnings call: CEO Sundar Pichai’s remarks</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.bing.com/ck/a?!&amp;&amp;p=9f9b2e403bee420c748defa557cda24fc17ff704d4f22c213cf036c89b4c9a..."></a></li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:00) - The Cloud Pod</li><li>(00:00:54) - Manto Man's 3 Kids Announcement</li><li>(00:02:40) - Microsoft Hot Patching: Changes Coming soon to the Model</li><li>(00:07:36) - Before the Earnings, How to Prepare</li><li>(00:07:54) - Good Quarter for Microsoft, Google, and Amazon</li><li>(00:10:47) - Amazon AWS Sales Up 17%</li><li>(00:16:32) - Skype Is Dead</li><li>(00:18:18) - Claude's Cloud Research: How ML Makes Money</li><li>(00:20:22) - OpenAI Rescues Plan to Split Off and Become for Profit</li><li>(00:22:56) - Anthropic's $61.5 Million Stock Offer</li><li>(00:25:00) - Redis: Moving back to the SSPL</li><li>(00:30:19) - HP Terraform Premium</li><li>(00:33:09) - Amazon Nova Premiere Announced at AWS Revamp</li><li>(00:34:09) - Amazon's Cloud Commitment Insurance</li><li>(00:35:36) - Amazon Q Developer Introduces in VS Code</li><li>(00:37:27) - Amazon Q Developer in GitHub</li><li>(00:39:45) - EC2 Image Builder</li><li>(00:42:44) - Amazon EBS Snapshot: Fast Provisioned Rate for Volume Initial</li><li>(00:46:58) - Google Cloud: Vertex AI Prediction Dedicated Endpoints</li><li>(00:48:28) - Microsoft Copilot for Azure in April</li><li>(00:52:57) - Alexa's Small Language Models</li><li>(00:53:56) - OpenAI Announces New 5.4</li><li>(00:55:21) - Azure Portal</li><li>(00:59:01) - How to really become a Windows admin with Terraform</li><li>(01:03:44) - Microsoft Virtual Network Terminal Access Point (VNTAP) Public Preview</li><li>(01:07:01) - Oracle Touts the Cloud on the Sphere</li><li>(01:09:41) - Why Your Tagging Strategy Matters for the Cloud</li><li>(01:11:36) - Cloudsecurity: Tagging our Services</li><li>(01:19:16) - Amazon vs. GCP: Service Management & Tagging</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 303 of The Cloud Pod – where the forecast is always cloudy! Justin, Ryan and exhausted dad Matt are here (and mostly awake) ready to bring the latest in cloud news! This week we’ve got more news from Nova, updates to Claude, earnings news, and a mini funeral for Skype – plus a new helping of Cloud Journey!
Titles we almost went with this week:

Claude researches so Ryan can nap
The best AI for Nova Corps, Amazon Nova Premiere JB
If you can’t beat them, change the licensing terms and make them fork, and then 
     reverse course… and profit
Q has invaded your IDE!!
Skype bites the dust

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our Slack channel for more info. 
Follow Up 
02:50 Sycophancy in GPT-4o: What happened and what we’re doing about it

OpenAI wrote up a blog post about their sycophantic Chat GPT 4o upgrade last week, and they wanted to set the record straight. 
They made adjustments at improving the models default personality to make it feel more intuitive and effective across a variety of tasks. 
When shaping model behavior, they start with a baseline principle and instructions outlined in their model spec. 
They also teach their models how to apply these principles by incorporating user signals like thumbs up and thumbs down feedback on responses. 
In this update, though, they focused too much on short-term feedback and did not fully account for how users’ interactions with ChatGPT evolve. This skewed the results towards responses that were overly supportive – but disingenuous. 
Beyond rolling back the changes, they are taking steps to realign the model behavior, including refining core training techniques and system prompts to explicitly steer the model away from sycophancy. 
They also plan to build more guardrails to increase honesty and transparency principles in the model spec.
Additionally, they plan to expand ways for users to test and give direct feedback before deployments.
Lastly, OpenAI continues to expand evaluations building on the model sync and our ongoing research. 

04:43 Deep Research on Microsoft Hotpatching:

Yes, they’re grabbing money and screwing you. Basically. 

07:06  Justin – “I’m not going to give them any credit on this one. I appreciate that they created hotpatching, but I don’t like what you want to charge me for it.” 
General News
It’s Earnings time – cue the sound effects!
08:03 Alphabet’s Q1 earnings shattered analyst expectations, sending the stock soaring. Google’s CEO credits its AI efforts
Alphabet Q1 2025 earnings call: CEO Sundar Pichai’s remarks

]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[303: Someday You Will Find Me, Caught Beneath the AI Landslide, in a Champagne Premier Nova in The Sky]]>
                </itunes:title>
                                    <itunes:episode>303</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 303 of The Cloud Pod – where the forecast is always cloudy! Justin, Ryan and exhausted dad Matt are here (and mostly awake) ready to bring the latest in cloud news! This week we’ve got more news from Nova, updates to Claude, earnings news, and a mini funeral for Skype – plus a new helping of Cloud Journey!</p>
<h3>Titles we almost went with this week:</h3>
<ul>
<li>Claude researches so Ryan can nap</li>
<li>The best AI for Nova Corps, Amazon Nova Premiere JB</li>
<li>If you can’t beat them, change the licensing terms and make them fork, and then </li>
<li>     reverse course… and profit</li>
<li>Q has invaded your IDE!!</li>
<li>Skype bites the dust</li>
</ul>
<h3>A big thanks to this week’s sponsor:</h3>
<h3>We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our Slack channel for more info. </h3>
<h2>Follow Up </h2>
<p>02:50 <a href="https://openai.com/index/sycophancy-in-gpt-4o/">Sycophancy in GPT-4o: What happened and what we’re doing about it</a></p>
<ul>
<li style="font-weight:400;">OpenAI wrote up a blog post about their sycophantic <a href="https://www.bing.com/ck/a?!&amp;&amp;p=3fc129c3812975421c5d14a2d2ca757c9dfa520a03d0c568fdb1c58e86512029JmltdHM9MTc0NzAwODAwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=gpt-4o&amp;u=a1aHR0cHM6Ly9vcGVuYWkuY29tL2luZGV4L2hlbGxvLWdwdC00by8&amp;ntb=1">Chat GPT 4o</a> upgrade last week, and they wanted to set the record straight. </li>
<li style="font-weight:400;">They made adjustments at improving the models default personality to make it feel more intuitive and effective across a variety of tasks. </li>
<li style="font-weight:400;">When shaping model behavior, they start with a baseline principle and instructions outlined in their model spec. </li>
<li style="font-weight:400;">They also teach their models how to apply these principles by incorporating user signals like thumbs up and thumbs down feedback on responses. </li>
<li style="font-weight:400;">In this update, though, they focused too much on short-term feedback and did not fully account for how users’ interactions with ChatGPT evolve. This skewed the results towards responses that were overly supportive – but disingenuous. </li>
<li style="font-weight:400;">Beyond rolling back the changes, they are taking steps to realign the model behavior, including refining core training techniques and system prompts to explicitly steer the model away from sycophancy. </li>
<li style="font-weight:400;">They also plan to build more guardrails to increase honesty and transparency principles in the model spec.</li>
<li style="font-weight:400;">Additionally, they plan to expand ways for users to test and give direct feedback before deployments.</li>
<li style="font-weight:400;">Lastly, OpenAI continues to expand evaluations building on the model sync and our ongoing research. </li>
</ul>
<p>04:43 Deep Research on Microsoft Hotpatching:</p>
<ul>
<li style="font-weight:400;">Yes, they’re grabbing money and screwing you. Basically. </li>
</ul>
<p>07:06  Justin – “I’m not going to give them any credit on this one. I appreciate that they created hotpatching, but I don’t like what you want to charge me for it.” </p>
<h2>General News</h2>
<p>It’s Earnings time – cue the sound effects!</p>
<p>08:03 <a href="https://www.businessinsider.com/alphabet-q1-earnings-2025-4">Alphabet’s Q1 earnings shattered analyst expectations, sending the stock </a><a href="https://www.businessinsider.com/alphabet-q1-earnings-2025-4">soaring. Google’s CEO credits its AI efforts</a></p>
<p><a href="https://blog.google/inside-google/message-ceo/alphabet-earnings-q1-2025/">Alphabet Q1 2025 earnings call: CEO Sundar Pichai’s remarks</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.bing.com/ck/a?!&amp;&amp;p=9f9b2e403bee420c748defa557cda24fc17ff704d4f22c213cf036c89b4c9a2fJmltdHM9MTc0NzA5NDQwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=google&amp;u=a1aHR0cHM6Ly93d3cuZ29vZ2xlLmNvbS8&amp;ntb=1">Google</a> started us off the last week of April by hitting a grand slam of earnings performance! </li>
<li style="font-weight:400;"><a href="https://www.bing.com/ck/a?!&amp;&amp;p=08e7340fdf4af902a6c7b1dac93032c820402168d25f066bed23e2752e5cfce7JmltdHM9MTc0NzA5NDQwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=alphabet+earnings&amp;u=a1aHR0cDovL2FiYy54eXovaW52ZXN0b3Iv&amp;ntb=1">Alphabet exceeded revenue estimates</a> and shares were up in after hours trading. </li>
<li style="font-weight:400;">PES was 2.81 vs 2.01 expected on revenue of 90.23 Billion vs 89.1 billion expected.  </li>
<li style="font-weight:400;"><a href="https://www.bing.com/ck/a?!&amp;&amp;p=7e42d9b88de0421bd38da6c76adaf46b1f6cc726b7aa45cf07fab6b4e88289ffJmltdHM9MTc0NzA5NDQwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=Google+cloud&amp;u=a1aHR0cHM6Ly9jbG91ZC5nb29nbGUuY29tLw&amp;ntb=1">Google Cloud</a> revenue rose from 12.26 Billion to 12.31 billion. </li>
<li style="font-weight:400;">Sundar in his remarks pointed at the strong growth of their AI investments including adoption of <a href="https://www.bing.com/ck/a?!&amp;&amp;p=3fd2a214f06016e378bc2351513081f3757b7d149c8eb29ff50a074ca21d42b3JmltdHM9MTc0NzA5NDQwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=google+gemini+2.0&amp;u=a1aHR0cHM6Ly9nZW1pbmkuZ29vZ2xlLmNvbS8&amp;ntb=1">Gemini 2</a>. </li>
</ul>
<p>09:19 <a href="https://www.businessinsider.com/microsoft-q3-earnings-report-msft-stock-beats-expectations-2025-4">Microsoft stock surges after hours after the company blows past Q3 </a><a href="https://www.businessinsider.com/microsoft-q3-earnings-report-msft-stock-beats-expectations-2025-4">estimates</a> </p>
<ul>
<li style="font-weight:400;"><a href="https://www.bing.com/ck/a?!&amp;&amp;p=600beb028f1a6f23ca01c5c994410c254a7cef88d72a913d6b485719b55f5749JmltdHM9MTc0NzA5NDQwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=microsoft+earnings&amp;u=a1aHR0cHM6Ly93d3cubWljcm9zb2Z0LmNvbS9lbi11cy9pbnZlc3Rvci9kZWZhdWx0P21zb2NraWQ9MGE4MTNhNjAxZDQ0NjQ5MDI0YTIyZmEyMWNhYTY1MDI&amp;ntb=1">Microsoft</a> followed up with their earnings on the 30th, also crushing Wall Street estimates for their 3rd quarter.  </li>
<li style="font-weight:400;"><a href="https://www.bing.com/ck/a?!&amp;&amp;p=62c15b4365f4d8209873653528dba0085cc2da5e23bf05f8e952d7c426f933d5JmltdHM9MTc0NzA5NDQwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=microsoft+cloud&amp;u=a1aHR0cHM6Ly93d3cubWljcm9zb2Z0LmNvbS9lbi11cy9taWNyb3NvZnQtY2xvdWQvd2hhdC1pcy1taWNyb3NvZnQtY2xvdWQ_bXNvY2tpZD0wYTgxM2E2MDFkNDQ2NDkwMjRhMjJmYTIxY2FhNjUwMg&amp;ntb=1">Cloud</a> and AI are the essential inputs for every business to expand output, reduce costs and accelerate growth, which leads to lots of money for Microsoft. </li>
<li style="font-weight:400;">EPS was 3.46 vs 3.21 on 70.1 billion in revenue (68.48 expected).  </li>
<li style="font-weight:400;">Cloud Revenue was 42.4 billion vs 42.22 billion, and intelligent cloud was 26.8 billion vs 25.99 billion.</li>
</ul>
<p>10:28 <a href="https://www.businessinsider.com/amazon-earnings-call-report-amzn-stock-live-updates-2025-5">Amazon earnings recap: Company ‘maniacally focused on’ keeping prices </a><a href="https://www.businessinsider.com/amazon-earnings-call-report-amzn-stock-live-updates-2025-5">low amid light Q2 guidance</a> </p>
<p><a href="https://amazon2022tf.q4web.com/news/news-details/2025/Amazon-com-Announces-First-Quarter-Results/default.aspx">Amazon Announces First Quarter Results</a> </p>
<ul>
<li style="font-weight:400;"><a href="https://www.bing.com/ck/a?!&amp;&amp;p=fa3034eed63d53965bdf03816fa09662893107e186bf8bf663999453fd81e386JmltdHM9MTc0NzA5NDQwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=amazon+earnings&amp;u=a1aHR0cHM6Ly9pci5hYm91dGFtYXpvbi5jb20vbmV3cy1yZWxlYXNlL25ld3MtcmVsZWFzZS1kZXRhaWxzLzIwMjUvQW1hem9uLWNvbS1Bbm5vdW5jZXMtRmlyc3QtUXVhcnRlci1SZXN1bHRzL2RlZmF1bHQuYXNweA&amp;ntb=1">Amazon</a> is a bit more complicated as they will be heavily impacted by tariffs, but it appears it hasn’t caused any problem – at least not yet. </li>
<li style="font-weight:400;">Amazon also reported better-than-expected earnings on May 1st.  </li>
<li style="font-weight:400;">The company is heads down on keeping prices low in the coming months as tariffs take effect.  </li>
<li style="font-weight:400;">Jassy reiterated that their investments in <a href="https://www.bing.com/ck/a?!&amp;&amp;p=2673555db6e1afd88d1b9e3245c838fc80b85d0f973d81eb0792f3029a313a1fJmltdHM9MTc0NzA5NDQwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=amazon+AI&amp;u=a1aHR0cHM6Ly9hd3MuYW1hem9uLmNvbS9haS8&amp;ntb=1">AI</a> will pay off as more businesses turn to Amazon for their AI needs. </li>
<li style="font-weight:400;">Sales increased 9% in the quarter to 155.7 billion, up from 143.3 billion the year prior.  </li>
<li style="font-weight:400;"><a href="https://www.bing.com/aclk?ld=e8_KRtds5KG1e3K5hWwVZPijVUCUy27qiQ_JacH1Ti9lUb7vgwvEELjEzIIUkfxhsRHZ2qbDYULDDFNOiG3z8dZnzkIhX92tK4-aGJgsSDVLio5CfyEOJqDHpgihrV2WGtFKVFGxF4hNCVM_6ecMCFcEdEz0lhs0YJc-9ckHdAtmyXjsk9R_zA7ufMtxEzr5d5SV9Fmg&amp;u=aHR0cHMlM2ElMmYlMmZhd3MuYW1hem9uLmNvbSUyZmZyZWUlMmYlM2Z0cmslM2Q0NTFkNDM1Ni0xODg2LTQyZjktYTFhZS0wOGY4Y2ZhMWJjMGIlMjZzY19jaGFubmVsJTNkcHMlMjZzX2t3Y2lkJTNkQUwhNDQyMiExMCE3MTEyNDg5NDE5NTAyMCEhISE3MTEyNTQyMTc4NDYzNCEhNDgyNTEwNzU0ITExMzc5OTU1MzYwNTU4NTclMjZlZl9pZCUzZGFlMzFkODRlMTFhMTEwYTYyYzFmYjA2ZGUwNmU1ODE2JTNhRyUzYXMlMjZtc2Nsa2lkJTNkYWUzMWQ4NGUxMWExMTBhNjJjMWZiMDZkZTA2ZTU4MTY&amp;rlid=ae31d84e11a110a62c1fb06de06e5816&amp;ntb=1">AWS</a> sales increased 17% YOY to 29.3 billion. </li>
</ul>
<p>11:44  Justin – “I think a lot of companies are not estimating AI uplifts into their forecasts until they know for sure adoption and market and are they making money, etc.”</p>
<p>16:17 <a href="https://arstechnica.com/gadgets/2025/05/microsoft-officially-shuts-down-skype-redirects-all-app-users-to-teams/">RIP Skype (2003–2025), survived by multiple versions of Microsoft Teams</a></p>
<ul>
<li style="font-weight:400;">Skype is officially dead, we talked about it when it was announced back in February, but the ax has officially fallen.  </li>
<li style="font-weight:400;">We aren’t sad about it. </li>
<li style="font-weight:400;">*TAPS*</li>
</ul>
<h2>AI – Or How ML Makes Money </h2>
<p>18:45 <a href="https://arstechnica.com/ai/2025/05/claudes-ai-research-mode-now-runs-for-up-to-45-minutes-before-delivering-reports/">Claude’s AI research mode now runs for up to 45 minutes before delivering </a><a href="https://arstechnica.com/ai/2025/05/claudes-ai-research-mode-now-runs-for-up-to-45-minutes-before-delivering-reports/">reports</a></p>
<ul>
<li style="font-weight:400;">Last week <a href="https://www.anthropic.com/news/integrations">Anthropic updated Claude</a> and introduced research capabilities that will have Claude run for up to 45 minutes before delivering comprehensive reports. </li>
<li style="font-weight:400;">The company has also expanded its integration options, allowing <a href="https://www.bing.com/ck/a?!&amp;&amp;p=2781175e633b6930d8cf421cbf198a04228e574b06a918a5f198d89f648687d9JmltdHM9MTc0NzA5NDQwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=Anthropic+Claude&amp;u=a1aHR0cHM6Ly93d3cuYW50aHJvcGljLmNvbS9jbGF1ZGU&amp;ntb=1">Claude</a> to connect with popular third party services. </li>
<li style="font-weight:400;">Anthropic first announced its <a href="https://www.bing.com/ck/a?!&amp;&amp;p=9d48df92c682a780778fe2efe59757a88bc0f65590a415eb0dc1a7886172ab27JmltdHM9MTc0NzA5NDQwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=caude+research&amp;u=a1aHR0cHM6Ly93d3cuYW50aHJvcGljLmNvbS9uZXdzL3Jlc2VhcmNo&amp;ntb=1">Research</a> feature on April 15th, but now they have taken it a step further allowing it to conduct deeper investigations across hundreds of internal and external sources. </li>
<li style="font-weight:400;">When users toggle the research button, Claude breaks down complex tasks into smaller components, examines each one, and compiles a report with citations linking to original sources. </li>
<li style="font-weight:400;">Unfortunately this is only included in the $100 per month Max plan. </li>
<li style="font-weight:400;">Currently nobody at TCP has this plan. We’re waiting for Justin to bite the bullet and will report back when he does. </li>
</ul>
<p>19:42  Justin – “If they were to include unlimited API calls from Claude Code or from a Visual Studio plugin that would probably push me over the edge.” </p>
<p>20:44 <a href="https://arstechnica.com/ai/2025/05/openai-scraps-controversial-plan-to-become-for-profit-after-mounting-pressure/">OpenAI scraps controversial plan to become for-profit after mounting </a><a href="https://arstechnica.com/ai/2025/05/openai-scraps-controversial-plan-to-become-for-profit-after-mounting-pressure/">pressure</a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.bing.com/ck/a?!&amp;&amp;p=1a157bc9c58fe415399b263bb1ed13d75ab8dbeaea8344e967bfcad12dc406feJmltdHM9MTc0NzA5NDQwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=chatgpt&amp;u=a1aHR0cHM6Ly9vcGVuYWkuY29tL2NoYXRncHQvb3ZlcnZpZXcv&amp;ntb=1">ChatGPT</a> maker <a href="https://www.bing.com/ck/a?!&amp;&amp;p=d24a9f8e097187e0155f50d88b05c98f6b7b336e9b334a04fc57a0e6f8910aa3JmltdHM9MTc0NzA5NDQwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=open+ai&amp;u=a1aHR0cHM6Ly9vcGVuYWkuY29tLw&amp;ntb=1">OpenAI</a> has announced it will remain under the control of its nonprofit board, scrapping its controversial plan to split off its commercial operations as a for-profit company after mounting pressure from critics. </li>
<li style="font-weight:400;">Sam Altman blogged they made the decision after hearing from civic leaders and having discussions with the Attorneys General of California and Delaware. </li>
<li style="font-weight:400;">This move represents a shift in how OpenAI will be restructured. </li>
<li style="font-weight:400;">The <a href="https://arstechnica.com/information-technology/2024/09/openai-plans-tectonic-shift-from-nonprofit-to-for-profit-giving-altman-equity/">previous plan</a> would have established OpenAI as a public benefit corporation with the non-profit merely holding shares and having limited influence; the revised approach keeps the nonprofit firmly in control of operations. </li>
<li style="font-weight:400;">This doesn’t mean they aren’t changing the structure at all – they still plan to do a for-profit LLC under the non-profit, and will transition to a Public Benefit Corporation with the same mission, instead of their current complex capped profit structure, which made sense when it looked like there might be one dominant AGI effort. </li>
<li style="font-weight:400;">This is not a sale, but a change to the structure to something simpler. </li>
<li style="font-weight:400;">There may still be some uncertainties, such as OpenAI’s <a href="https://www.cnbc.com/2025/03/31/openai-closes-40-billion-in-funding-the-largest-private-fundraise-in-history-softbank-chatgpt.html">recent raise with Softbank</a> stipulated that it would reduce its contribution to 20B if it failed to restructure into a fully for-profit entity by the end of 2025. </li>
</ul>
<p>23:22 <a href="https://www.theinformation.com/articles/anthropic-buy-back-employee-shares-61-5-billion-valuation?rc=3t8xtd">Anthropic to Buy Back Employee Shares at $61.5 Billion Valuation</a></p>
<ul>
<li style="font-weight:400;">Anthropic reportedly offers to buy back shares from hundreds of former and current employees, the first transaction of its kind for the 4-year-old company. </li>
<li style="font-weight:400;">The buyback shows how integral these are to rewarding employees at fast-growing startups and retaining rare research talent in the AI talent war. </li>
<li style="font-weight:400;">For employees who have worked for the company for at least 2 years, their offering lets them sell up to 20% of their equity, with a maximum of $2 million each.  </li>
<li style="font-weight:400;">The buyback values the startup at 61.5 billion, the exact valuation of its recent March fundraising.  </li>
</ul>
<p>24:08  Ryan – “This says to me don’t sell – hold.” </p>
<h2>Cloud Tools </h2>
<p>25:31 <a href="https://redis.io/blog/agplv3/">Redis is now available under the AGPLv3 open source license</a> </p>
<ul>
<li style="font-weight:400;"><a href="https://duckduckgo.com/y.js?ad_domain=redis.io&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=rEyijpcZ8gOJzmjXrQQSObnc2wK2ZW6RbFr5wp1Ducz0-tHWXdq8r5ZoT9_rDY8JuYakEZLvFLVplQPOf_NEKE5i2_TEd_PSJ3r_hXl0rQj3V1MoxTT198qLUQ2eTigs.0rU-CaJUSP7OESl12u-9VA&amp;eddgt=_1TfYqwPoTo4dqHdqk6tdA%3D%3D&amp;rut=41a76f9020e1fe7e7271db10e9cc49ea8444a5d870c6cb986439d64ada2f2621&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De87OU3FqMVl3OrlTIX0w3LNTVUCUwLE_wtyBioJjO8sC8MYEfM8084x6jlcLGeT7JIDin3VKDDF2m0Iej9nmf5nKMbcZ97lSW_9bsRTPFHX2QOOd0hEmbL15NBKgmqVNsXnJrJJLpjKGnog6R-vEi8jtP9cRWnB1mP3JqJT4hoI6VvaJka681yFfz2-kbO_FzFNZNFcwbr49C1XgXiY8E0BZUQMxhszAdzsG3QB4IWM460_ir8_2i8LoSP800NJ10wV7UkYUXvYnR8KIUoMZ6ZWbGLtATbaCiZ-lHrRhiZeERYRYnKSMHGxfyDF6jWOqOIr1Z0QOBDe7ve9bJZqCcv_PQCoPOjK95Dac27Fkh88fF6rhb0MJhp6wkFfWseYaakMTa3F0WcO-vWi4N_eXruFPo-ChzskQ5_mNao9FvJHhhTDFYp7TpPfiaT2BEIr4PP8Xs8CpKbYhe6LTfo1WBrgZDtMxG3NeYS1qSCat3_VCqDACfA2XjaNRz6BJJYuiP8CiGcztouwrxwyM1FvFFz74Km3QBRCDXRcDT1VDMlTCsVj9y-qPSBxMchclVnhOwC3K6C3Nonp40ks4w9p3z0mwvb84LBK8psS5wtSgpr24vXAYKOJ668ycG2JzTMA8Aoy1EWRsqOl972RdCwlWFxI1RKJsGMUmeNWjnpQ2MxpLOhcdqP6iIHSIgvbPNXCYYJkiRD2TEJAv-vrHbviHPk8ZNRbizvvcPB5rTSRVbpVeYluN-IK2q7T3pvpXakr68w9hRZrg%26u%3DaHR0cHMlM2ElMmYlMmZyZWRpcy5pbyUyZmNsb3VkJTNmdXRtX2NhbXBhaWduJTNkYmdfc19jb3JlX2FtZXJfZW5fYnJhbmRfYWNxX3N0YXRpY181ODAzMjgwNjUlMjZ1dG1fc291cmNlJTNkYmluZyUyNnV0bV9tZWRpdW0lM2RjcGMlMjZ1dG1fY29udGVudCUzZHJlZGlzX2V4YWN0JTI2dXRtX3Rlcm0lM2QlMjZtc2Nsa2lkJTNkMGM2ZTg1MTNmYWE0MWI2YzkzOWJlZDM3ZmQ5ZDIwYmU%26rlid%3D0c6e8513faa41b6c939bed37fd9d20be&amp;vqd=4-111239261673731275461615970052579574622&amp;iurl=%7B1%7DIG%3DA9CDACD5B9BE4227832D6560F4F7E971%26CID%3D0AC55C1F1CA76690024B49F31D1D67C6%26ID%3DDevEx%2C5047.1">Redis</a> foiled those pesky hyperscalers by adopting SSPL to protect their business from cloud providers extracting value without reinvesting.  </li>
<li style="font-weight:400;">Redis says <a href="https://redis.io/blog/redis-adopts-dual-source-available-licensing/">moving to the SSPL</a> achieved their goal AWS and Google now maintaining their own fork, but they admit it hurt their relationship with the Redis community. </li>
<li style="font-weight:400;">Duh. </li>
<li style="font-weight:400;">SSPL is not truly open source because the OSI clarified it lacks the requisites to be an OSI-approved license.</li>
<li style="font-weight:400;">Following the SSPL change, Salvatore Sanfillipo decided to <a href="https://antirez.com/news/144">rejoin Redis</a> as a developer evangelist. </li>
<li style="font-weight:400;">The CEO Rowan Trolloope and him collaborated on new capabilities, company strategy and community engagement. </li>
<li style="font-weight:400;">The CEO, CTO and Salvatore and the core developers have decided to make some improvements to improve Redis going forward:
<ul>
<li style="font-weight:400;">Adding the OSI Approved <a href="https://www.gnu.org/licenses/agpl-3.0.en.html">AGPL</a> as an additional licensing option for Redis, starting with Redis 8</li>
<li style="font-weight:400;">Introducing Vector sets – the first new data type in years – created by Salvatore</li>
<li style="font-weight:400;">Integrated Redis stack technologies including JSON, Time Series, Probabilistic data types, Redis Query engine and more into Core Redis 8 under GPL.</li>
<li style="font-weight:400;">Delivered over 30 performance improvements with up to 87% faster commands 2x throughput</li>
<li style="font-weight:400;">Improved community engagement, particular with client ecosystem contributions. </li>
</ul>
</li>
</ul>
<p>27:14  Ryan – “We’ll see… There’s a lot of people who moved over to Valkey, and I don’t know that they’re going to be swapping back anytime soon.”</p>
<p>30:50 <a href="https://www.hashicorp.com/en/blog/announcing-hcp-terraform-premium-infrastructure-lifecycle-management-at-scale">Announcing HCP Terraform Premium: Infrastructure Lifecycle Management </a><a href="https://www.hashicorp.com/en/blog/announcing-hcp-terraform-premium-infrastructure-lifecycle-management-at-scale">at scale</a></p>
<ul>
<li style="font-weight:400;">If your <a href="https://www.hashicorp.com/en/products/terraform">HCP Terraform</a> solution wasn’t expensive enough, you can now get PREMIUM to extend the capabilities of HCP Terraform, offering powerful features that enable organizations to scale their infrastructure. </li>
<li style="font-weight:400;">Woohoo! PREMIUM! </li>
<li style="font-weight:400;">HCP Terraform Premium is designed to help enterprises with their <a href="https://www.hashicorp.com/en/resources/infrastructure-lifecycle-management-with-the-hashicorp-cloud-platform">Infrastructure Lifecycle Management</a> at high scale and includes everything from the standard and plus plans, with additional features:
<ul>
<li style="font-weight:400;">Private VCS access: Access private VCS repositories securely by ensuring that your source code and static credentials are not exposed over the public internet.</li>
<li style="font-weight:400;">Private policy enforcement: Apply and enforce internal security and compliance policies within private cloud environments.</li>
<li style="font-weight:400;">Private run tasks: Integrate Terraform workflows with internal systems securely, creating a seamless automation pipeline that aligns with your internal processes and policies.</li>
<li style="font-weight:400;">Module lifecycle management – Revocation: Streamline module management by revoking outdated or vulnerable modules.</li>
</ul>
</li>
<li style="font-weight:400;">All of this simplifies operations, improves security and lowers your TCO (per Hashi) and maybe increases your likelihood of outages, but that’s neither here nor there. </li>
</ul>
<p>32:09  Matthew – “The only thing that I like here is the revocation. I think that that’s cool. If you have credentials in your repo, I have better questions about why you have credentials in your repo – and what life choices you’ve already made from that one. And policy enforcement, there’s enough other add-ons that you can get without paying for this premium feature.”</p>
<h2>AWS</h2>
<p>33:44 <a href="https://aws.amazon.com/blogs/aws/amazon-nova-premier-our-most-capable-model-for-complex-tasks-and-teacher-for-model-distillation/">Amazon Nova Premier: Our most capable model for complex tasks and </a><a href="https://aws.amazon.com/blogs/aws/amazon-nova-premier-our-most-capable-model-for-complex-tasks-and-teacher-for-model-distillation/">teacher for model distillation</a> </p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/?nc2=h_lg">Amazon</a> is expanding the <a href="https://aws.amazon.com/blogs/aws/category/artificial-intelligence/amazon-machine-learning/amazon-bedrock/amazon-nova/">Nova</a> family of foundation models announced at AWS Re:invent with the GA of <a href="https://aws.amazon.com/ai/generative-ai/nova/understanding/?trk=e61dee65-4ce8-4738-84db-75305c9cd4fe&amp;sc_channel=el">Amazon Nova Premier.</a> </li>
<li style="font-weight:400;">Premier joins the existing Nova models in Amazon Bedrock. </li>
<li style="font-weight:400;">Similar to Nova Lite and Pro, premier can produce text, images and videos (excluding audio.) With its advanced capabilities, Nova premier excels at complex tasks that require deep understanding of context, multi-step planning, and precise execution across multiple tools and data sources.  </li>
<li style="font-weight:400;">It has a context length of 1 million tokens, allowing you to process long documents and large code bases. </li>
<li style="font-weight:400;">Nova Premier, combined with <a href="https://aws.amazon.com/bedrock/model-distillation/?trk=e61dee65-4ce8-4738-84db-75305c9cd4fe&amp;sc_channel=el">Bedrock Model Distillation</a>, allows you to create a capable, cost effective and low-latency version of Nova Pro, Lite and Micro for your specific needs. </li>
<li style="font-weight:400;">“Amazon Nova Premier has been outstanding in its ability to execute interactive analysis workflows, while still being faster and nearly half the cost compared to other leading models in our tests,” said <a href="https://www.linkedin.com/in/curtis-allen-68425566/">Curtis Allen</a>, Senior Staff Engineer at <a href="https://slack.com/">Slack</a>, “a company bringing conversations, apps, and customers together in one place.” (Sure, Jan)</li>
</ul>
<p>34:58  Justin – “You know what I was mostly disappointed about was that I did not find it on the LLM Leaderboard from <a href="https://lmarena.ai/">Chatbot Arena</a>, so either it didn’t score or hasn’t been tested.” </p>
<p>35:36 <a href="https://aws.amazon.com/blogs/aws/amazon-q-developer-elevates-the-ide-experience-with-new-agentic-coding-experience/">Amazon Q Developer elevates the IDE experience with new agentic coding </a><a href="https://aws.amazon.com/blogs/aws/amazon-q-developer-elevates-the-ide-experience-with-new-agentic-coding-experience/">experience</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/q/developer/?trk=4f1e9f0e-7b21-4369-8925-61f67341d27c&amp;sc_channel=el">Amazon Q Developer</a> introduces a new, interactive, agentic coding experience that is now available in the <a href="https://docs.aws.amazon.com/amazonq/latest/qdeveloper-ug/q-in-IDE.html?trk=4f1e9f0e-7b21-4369-8925-61f67341d27c&amp;sc_channel=el">IDE</a> for <a href="https://marketplace.visualstudio.com/items?itemName=AmazonWebServices.amazon-q-vscode">VS Code</a>.  </li>
<li style="font-weight:400;">This brings interactive coding capabilities, building upon existing prompt-based features. </li>
<li style="font-weight:400;">You now have a natural, real-time collaborative partner working alongside you while writing code, creating documentation, running tests and reviewing changes.</li>
<li style="font-weight:400;">Q developer transforms how you write and maintain code by providing transparent reasoning for its suggestions and giving you the choice between automated modifications or step-by-step confirmation of changes.  </li>
<li style="font-weight:400;">You can chat with Q in English, Mandarin, French, German, Italian, Japanese, Spanish, Korean, Hindi and Portuguese. </li>
<li style="font-weight:400;">The system uses your repository structure, files and documentation while giving you flexibility to interact seamlessly with natural dialog with your local development environment. This deep comprehension allows for more accurate and contextual assistance during development tasks. </li>
<li style="font-weight:400;">Q developer provides continuous status updates, as it works through tasks, and lets you choose between automated code modifications or step-by-step review, giving you complete control over the development process.</li>
</ul>
<p>37:32 <a href="https://aws.amazon.com/blogs/aws/amazon-q-developer-in-github-now-in-preview-with-code-generation-review-and-legacy-transformation-capabilities/">Amazon Q Developer in GitHub (in preview) accelerates code generation</a></p>
<ul>
<li style="font-weight:400;">Starting today, you can now use Amazon Q Developer in Github in preview. This allows for developers who use github, whether at work or for personal projects.  They can use Amazon Q developer for feature development, code reviews, and java code migration directly within the GitHub Interface. </li>
</ul>
<p>38:24  Ryan – “People use the web ID for more than just resolving merge conflicts?” </p>
<p>39:49 <a href="https://aws.amazon.com/about-aws/whats-new/2025/04/ec2-image-builder-integrates-ssm-parameter-store/">EC2 Image Builder now integrates with SSM Parameter Store</a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/imagebuilder/latest/userguide/tutorial-ssm-parameters-recipe.html">EC2 Image Builder</a> now integrates with <a href="https://docs.aws.amazon.com/systems-manager/latest/userguide/systems-manager-parameter-store.html">Systems Manager Parameter Store</a>, offering customers a streamlined approach for referencing SSM parameters in their image recipes, components and distribution configurations. </li>
<li style="font-weight:400;">This capability allows customers to dynamically select base images within their image recipes, easily use configuration data and sensitive information for components, and update their SSM parameters with the latest output images. </li>
<li style="font-weight:400;">Before this you had to specify AMI IDs in the image recipe to use custom base images, leading to a constant maintenance cycle when these base images had to be updated. </li>
<li style="font-weight:400;">Furthermore, customers were required to create custom scripts to update SSM parameters with output images and to utilize SSM parameter values in components, resulting in substantially lower overhead. </li>
</ul>
<p>42:53 <a href="https://aws.amazon.com/blogs/aws/accelerate-the-transfer-of-data-from-an-amazon-ebs-snapshot-to-a-new-ebs-volume/">Accelerate the transfer of data from an Amazon EBS snapshot to a new </a><a href="https://aws.amazon.com/blogs/aws/accelerate-the-transfer-of-data-from-an-amazon-ebs-snapshot-to-a-new-ebs-volume/">EBS volume</a></p>
<ul>
<li style="font-weight:400;">AWS is announcing the GA of Amazon EBS provisioned rate for volume initialization, a feature that accelerates the transfer of data from an EBS Snapshot, a highly durable backup of volumes stored in S3 to a new EBS volume.</li>
<li style="font-weight:400;">This allows you to create fully performance EBS volumes within a predictable amount of time. You can use this feature to speed up the initialization of hundreds of concurrent volumes and instances.  You can also use this feature when you need to recover from an existing EBS snapshot and need your EBS volume to be created and initialized as quickly as possible. </li>
<li style="font-weight:400;">This allows you to specify a specific rate between 100 MiB/2 and 300 MiB/s. You can specify this rate when the snapshot blocks are downloaded from S3 to the volume. </li>
</ul>
<h2>GCP</h2>
<p>47:05  <a href="https://cloud.google.com/blog/products/ai-machine-learning/reliable-ai-with-vertex-ai-prediction-dedicated-endpoints/">Reliable AI with Vertex AI Prediction Dedicated Endpoints</a></p>
<ul>
<li style="font-weight:400;">Google is announcing <a href="https://cloud.google.com/vertex-ai">Vertex AI</a> prediction dedicated endpoints, a new family of <a href="https://cloud.google.com/vertex-ai/docs/predictions/using-dedicated-endpoints">Vertex AI Prediction endpoints</a>, designed to address the needs of modern AI applications, including those related to large-scale generative AI models. </li>
<li style="font-weight:400;">These dedicated endpoints are engineered to help you build more reliability with the following new features:
<ul>
<li style="font-weight:400;">Native support for streaming inference</li>
<li style="font-weight:400;">gRPC protocol support</li>
<li style="font-weight:400;">Customizable request timeouts</li>
<li style="font-weight:400;">Optimized resource handling</li>
</ul>
</li>
<li style="font-weight:400;">In addition you can utilize these dedicated endpoints via Private Service Connect</li>
</ul>
<p>47:33  Ryan – “All this means to me is that the engineers that were supporting the service within Google were really sick of the two separate types of workloads that were going across these endpoints… I bet you it was a nightmare to predict load and support from that direction.” </p>
<h2>Azure</h2>
<p>48:42  <a href="https://azure.microsoft.com/en-us/blog/microsoft-cost-management-updates-april-2025/">Microsoft Cost Management updates—April 2025</a></p>
<ul>
<li style="font-weight:400;">Several enhancements for <a href="https://www.finops.org/introduction/what-is-finops/">Finops</a> professionals in the <a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=_s9jvja08nVrZiYCD9hkXNoQJ6SbJ-i1zIlnrx6fioQNQdr3vaiCyrO-X9YUI-_0DCQ7hKf3R-r1P8s5IWlt6vhOcGz1g_HMAgMMRAQq5l94oC7pLjg2yHIuj10lZU8r.HLNGGJkCSYbjBPtvg6r9pg&amp;eddgt=c9f7FWc4gQE3MUFxPJJX-Q%3D%3D&amp;rut=7aba88a61a5887d51d0778f9c732ca57a74c8984549a93bcbecd89e2189a817a&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8vvWkN08g0H0TvJ_3ZJL-2zVUCUzcLnKVTyUGYwme13YA5y1i0PtmNJF5d63ZJAGPaUNXvu-Ix3r58VqiltDgYQ06cJr8hMAj_h6r-ZguL44CVbZCwuZNqC26veSbl9qvydybV4Vq7m4vwZ_TDOZXhYAfb01DLXMxDUQDjUBKz_y_JU6IwGXQz-LflS4e6MSiHMb2FTdZ_019lJXYfmRcJ1ILzGUApabJoIJPOSClWMMzqcOQkJRpc_Th0mjSXHt9SbKxCNbqM8Y3sd0T9pFspgnMLzFwKC7U4RgUk2jsasNmTWfLowe2BRk2_GEf45l-x1djtRYETZ89Xv_3UZZY6hW33LLBUcfdV8voYmmhz7ZDkRzGIi1Ar6uz8YAUIBmqc6-jg-nx0um4DUdq72vNeWO1FPkIe3bBqyax252MTjtGw6DD5c0pELBSAJfvbE9e8li93QxZqWJP8S48z8Y7tfOf5jgYPMSW0dZXvzDjc4vTV7l4DMP4WFlw4FFmgEhhZ_Y28DkIM6891o8ekaSk2HvaMJh0UH6F-UXvjwawinyow1-dgEVhgMGAdv2j1VH6ztLxgucwOaZOd9U-9hAuAdAtGSAfxUMul1cYjXC7L_cPVp_TRtg-XqPHULTvVZRu9TxTW9lCYTjyBclkjyOxVfaE2KZ-Zotr53mdZi6baMCK4cLXPdCc8_yF8Ownogg5li0hgLWO0OuRTd_FAiT-NcNXaDjG_YUwiOHUdWh6GB_yHBPEdN2fTxH61tGaPjegd-RU4A%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0MzklMjZjYW1wJTNkMTY4ODA3JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDE5MjElMjZjcml0ZXJpYWlkJTNka3dkLTc4ODIxODQ2NDc2Njc1JTNhbG9jLTE5MCUyNmNhbXBhaWduaWQlM2Q1OTAyMzczNTglMjZsb2NwaHklM2Q4MDQ2MSUyNmFkZ3JvdXBpZCUzZDEyNjExNDE0ODk3MDI1NDUlMjZjaWQlM2Q3ODgyMTQ0ODU3MjY3MyUyNmtkdiUzZGMlMjZrZXh0JTNkJTI2a3BnJTNkJTI2a3BpZCUzZCUyNnF1ZXJ5U3RyJTNkYXp1cmUlMjZ1cmwlM2RodHRwcyUzYSUyZiUyZmF6dXJlLm1pY3Jvc29mdC5jb20lMmZlbi11cyUyZnByaWNpbmclMmZwdXJjaGFzZS1vcHRpb25zJTJmYXp1cmUtYWNjb3VudCUyZnNlYXJjaCUzZmVmX2lkJTNkX2tfYzY4ZDJkMThiMjQzMWQ0ZjYwNTg0NTNjOTM1NTUzY2Ffa18lMjZPQ0lEJTNkQUlEY21tNWVkc3dkdXVfU0VNX19rX2M2OGQyZDE4YjI0MzFkNGY2MDU4NDUzYzkzNTU1M2NhX2tfJTI2bXNjbGtpZCUzZGM2OGQyZDE4YjI0MzFkNGY2MDU4NDUzYzkzNTU1M2Nh%26rlid%3Dc68d2d18b2431d4f6058453c935553ca&amp;vqd=4-311430927781170421913882352909892453120&amp;iurl=%7B1%7DIG%3D82FC3D7E14F44FF09DEFD6E82AD74DC4%26CID%3D2A16FBDC5D70675B1812EE305C96669E%26ID%3DDevEx%2C5048.1">Azure</a> world in April.  </li>
<li style="font-weight:400;">First up is the GA of <a href="https://learn.microsoft.com/en-us/azure/copilot/overview">Microsoft Copilot for Azure</a>. You can ask natural language questions about your subscriptions, costs and drivers.  </li>
<li style="font-weight:400;">Also included are several enhancements for exports, including the ability to export price sheets, reservation recommendations, reservation details and reservation transactions, along with standard cost and usage data</li>
<li style="font-weight:400;">Support for <a href="https://learn.microsoft.com/en-us/cloud-computing/finops/focus/what-is-focus">FOCUS</a> is now GA.</li>
<li style="font-weight:400;">Export data in either CSV or Parquet formats.</li>
<li style="font-weight:400;">There are several new ways to save money in Microsoft Cloud, including AKS Cost recommendations, autoscale for vcore-based Azure <a href="https://duckduckgo.com/y.js?ad_domain=microsoft.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=tsTuer3vMsZ01gGW_f5mpLzTKQijEQT9K6WC6QahNxXSGZSA7ipova506jaMTF386X3B_RbWM28wNVoLGXh6GAcFelRv2uCxNkZlKBLu976iO9AcK6A4tGmpvWUmrFCE.JS-r9KQczafCdIYLkhF9VA&amp;eddgt=gmavVUAGBaCtIiXzys3QgQ%3D%3D&amp;rut=550c611b48187db9897d6210efc4a6af4e7d18cf709b9705037f1155a0122297&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De85Xez3qg1FmzK9fBsEzC_aDVUCUykrb7m6OG_f-C5p-pVnRKHdsjhKMyVXLPyw8EvlHdt1eKyT-9tNSC9lvoTSixpsGfz_oO7UZ5hz0rkPsHRPL7VO46F6iX-1_cuRsFQq7QktZPwGekLtMNhLigWIFypCFLs5AZ-IjJM157A41SXC7b2I4AuhdUvk0hmS8L8Sv3yct0bhtF62jzJEfQyrjzNVDkbdamZoUgwUHqz8O3GeVxqSridAcBgdanlebrzwx8M0vHOAk1WSw0Jl8mIR4f5ksGAkoZiXWulOXtjUcy4KszcbkRBRmEhnm7nh8maVJeRsLn_kxmhnUmOz5R_uGK5wwd2qUx_5PLzGTbJv7DX_CBvYpGw6uvfe8ax9OkWW66nNqRVytCxKS3UKFdPQiOV3QPO7PeUzm5ry4QKvlYJWBUQIYvJYfmcB6mCdfk9gIQ_LHVivC78ydGGDm0AJcNq79GoWUCh4dA-rvohVP8ZCSAxeJ7tpJcLHJl8qQebphg6EQJainoLrIHRm6G5kDPoXgUsuTMlIEWctaYy7a4E73DRYQ1Jh_OnZcTHEtTOBoTk0lKohU7yawJBmpVhGb-LPFKzwM1hyGfFNZ_1mxC-B0NRyauXd3yM3Mf31RSSC4e3SV2dOwdtLujJawPENCxanuENXfaukqnSQ0NN9zpi3iS0bO64XMevGgEx7-pqJdfOjH_f-ew_gWb4_pMFkqAkgvIctKYRlK69XsuxUVbIyITAVW8iyA8Pc332sp91uUmNRQ%26u%3DaHR0cHMlM2ElMmYlMmY1MzUwLnhnNGtlbi5jb20lMmZ0cmslMmZ2MSUzZnByb2YlM2Q0MzklMjZjYW1wJTNkMTY4ODA3JTI2a2N0JTNkbXNuJTI2a2NoaWQlM2QxNTkwMDE5MjElMjZjcml0ZXJpYWlkJTNka3dkLTc5MjM0MTYzMzgyOTUwJTNhbG9jLTE5MCUyNmNhbXBhaWduaWQlM2Q1OTAyMzczNTglMjZsb2NwaHklM2Q3OTc5NyUyNmFkZ3JvdXBpZCUzZDEyNjc3Mzg1NTkwNTE0NDclMjZjaWQlM2Q3OTIzMzc2NDM5NDIzOCUyNmtkdiUzZGMlMjZrZXh0JTNkJTI2a3BnJTNkJTI2a3BpZCUzZCUyNnF1ZXJ5U3RyJTNkY29zbW9zJTI1MjBkYiUyNnVybCUzZGh0dHBzJTNhJTJmJTJmYXp1cmUubWljcm9zb2Z0LmNvbSUyZmVuLXVzJTJmcHJvZHVjdHMlMmZjb3Ntb3MtZGIlMmYlM2ZlZl9pZCUzZF9rXzQwMWZiMTY1NWI4NTE0MjdkOGQ4ZDk0MTdhMTYyZDRmX2tfJTI2T0NJRCUzZEFJRGNtbTVlZHN3ZHV1X1NFTV9fa180MDFmYjE2NTViODUxNDI3ZDhkOGQ5NDE3YTE2MmQ0Zl9rXyUyNm1zY2xraWQlM2Q0MDFmYjE2NTViODUxNDI3ZDhkOGQ5NDE3YTE2MmQ0Zg%26rlid%3D401fb1655b851427d8d8d9417a162d4f&amp;vqd=4-312764821819636286844429701095769233637&amp;iurl=%7B1%7DIG%3DC5C1C20E4F9A4FCEAD247FF60D7569E9%26CID%3D3B1E5C2709606D74012B49CB08866C0A%26ID%3DDevEx%2C5048.1">Cosmos DB</a> for <a href="https://www.mongodb.com/">MongoDB</a>.  </li>
<li style="font-weight:400;">Troubleshoot disk performance with Copilot. </li>
<li style="font-weight:400;">On demand backups for Azure Database for <a href="https://azure.microsoft.com/en-us/pricing/details/postgresql/flexible-server/">PostgreSQL Flexible</a>, VM Hibernation on GPU VMs and <a href="https://learn.microsoft.com/en-us/azure/azure-netapp-files/azure-netapp-files-introduction">Azure Netapp</a> Files Flexible service in preview.</li>
</ul>
<p>51:25  Justin – “I look forward to exporting all my data into Parquet formats and just sending it to people randomly…figure it out bro!” </p>
<p>53:05 <a href="https://azure.microsoft.com/en-us/blog/one-year-of-phi-small-language-models-making-big-leaps-in-ai/">One year of Phi: Small language models making big leaps in AI</a></p>
<ul>
<li style="font-weight:400;">A year ago Microsoft introduced small language models (SLMs) to customers with the release of <a href="https://azure.microsoft.com/en-us/blog/introducing-phi-3-redefining-whats-possible-with-slms/">Phi-3</a>.  </li>
<li style="font-weight:400;">Now they are announcing the new <a href="https://techcommunity.microsoft.com/blog/aiplatformblog/introducing-phi-4-microsoft%E2%80%99s-newest-small-language-model-specializing-in-comple/4357090">Phi-4</a> family, including <a href="https://techcommunity.microsoft.com/blog/educatordeveloperblog/showcasing-phi-4-reasoning-a-game-changer-for-ai-developers/4409892">Phi-4-reasoning</a>, <a href="https://huggingface.co/microsoft/Phi-4-reasoning-plus">Phi-4-reasoning-plus</a>, and <a href="https://huggingface.co/microsoft/Phi-4-mini-reasoning">phi-4-mini-reasoning</a> marking a new era for small language models and once again redefining what is possible with small and efficient AI. </li>
<li style="font-weight:400;">These are all reasoning models trained to leverage inference-time scaling to perform complex tasks that demand multi-step decomposition and internal reflections. </li>
<li style="font-weight:400;">Phi-4-reasoning is a 14-billion parameter open-weight reasoning model that rivals much larger models on complex reasoning tasks. </li>
<li style="font-weight:400;">Trained via supervised fine-tuning of Phi-4 on carefully curated reasoning demonstrations from OpenAI o3-mini, Phi-4 reasoning generates detailed reasoning chains that effectively leverage additional inference-time compute. </li>
<li style="font-weight:400;">The model demonstrates that meticulous data curation and high-quality synthetic datasets allow smaller models to compete with larger counterparts. </li>
<li style="font-weight:400;">Phi-4-reasoning-plus builds on the phi-4 reasoning model further trained with reinforcement learning to utilize more inference-time compute, using 1.5x more tokens than Phi-4-reasoning, to deliver higher accuracy. </li>
<li style="font-weight:400;">The Phi-4-mini-reasoning is designed to meet the demand for a compact reasoning model. </li>
<li style="font-weight:400;">This transformer-based language model is optimized for mathematical reasoning, providing high-quality, step-by-step problem solving in environments with constrained computing or latency. Fine-tuning with synthetic data generated by the <a href="https://api-docs.deepseek.com/news/news250120">Deepseek-R1</a> model, phi-4-mini-reasoning balances efficiency with advanced reasoning ability. </li>
</ul>
<p>55:41 <a href="https://techcommunity.microsoft.com/blog/azuretoolsblog/announcing-public-preview-of-terraform-export-from-the-azure-portal/4409889">Announcing Public Preview of Terraform Export from the Azure Portal</a></p>
<ul>
<li style="font-weight:400;">Azure is announcing the preview of <a href="http://github.com/Azure/aztfexport">Terraform Export within the Azure</a> portal.  </li>
<li style="font-weight:400;">With this new feature, you can now easily export your existing Azure resources to be managed declaratively directly from the Azure Portal.  </li>
<li style="font-weight:400;">This will streamline IaC workflows, making it simpler to manage and automate your Azure resources via the <a href="https://registry.terraform.io/providers/hashicorp/azurerm/latest/docs">AzureRM</a> and <a href="https://registry.terraform.io/providers/Azure/azapi/latest/docs">AzAPI</a> providers.</li>
</ul>
<p>56:06  Matthew – “So, this is a feature that is useful when you are learning Terraform, or need to figure out what the settings are. Because, sometimes you don’t know what all the variables are when you’re going through it… So it’s fine if you’re trying to use it, but please don’t just take this code and use it in your infrastructure as code. You will hate yourself because everything is hard coded.”</p>
<p>1:03:52 <a href="https://techcommunity.microsoft.com/blog/azurenetworkingblog/azure-virtual-network-terminal-access-point-tap-public-preview-announcement/4405540">Azure virtual network terminal access point (TAP) public preview </a><a href="https://techcommunity.microsoft.com/blog/azurenetworkingblog/azure-virtual-network-terminal-access-point-tap-public-preview-announcement/4405540">announcement</a></p>
<ul>
<li style="font-weight:400;">Virtual Network TAP allows customers to continuously stream virtual machine network traffic to a network packet collector or analytics tool. </li>
<li style="font-weight:400;">Many security and performance tools rely on packet-level insights that are difficult to access in cloud environments. </li>
<li style="font-weight:400;">Virtual Network TAP bridges this gap by integrating with their industry partners to offer:
<ul>
<li style="font-weight:400;">Enhanced security and threat detection</li>
<li style="font-weight:400;">Performance monitoring and troubleshooting</li>
<li style="font-weight:400;">Regulatory compliance.</li>
</ul>
</li>
</ul>
<p>1:04:20  Justin – “I always appreciate when they say ‘this is for threat detection’ because we love to make our security tools the biggest risk in the whole business by sending all the data and all the packets there.” </p>
<h2>Oracle</h2>
<p>1:07:27 <a href="https://www.oracle.com/news/announcement/sphere-powers-its-ai-platform-with-oracle-database-23ai-2025-05-01/">Sphere Powers its AI Platform with Oracle Database 23ai</a></p>
<ul>
<li style="font-weight:400;">All the hyperscalers want to be doing stuff for the Sphere, from Google doing the Wizard of Oz movie, to apparently google providing <a href="https://www.oracle.com/database/23ai/">Oracle Database 23ai</a> on the <a href="https://www.oracle.com/autonomous-database/">Oracle Autonomous Database</a>.  </li>
<li style="font-weight:400;">In general we don’t really care that much, but thought it was funny, considering Google has regularly bought ads during Re:invent. </li>
</ul>
<h2>Cloud Journey</h2>
<p>1:09:59 <a href="https://medium.com/@keeganjustis/why-your-tagging-strategy-matters-on-aws-ab8c3b8335a6">Why Your Tagging Strategy Matters on AWS | by Keegan Justis | May, </a><a href="https://medium.com/@keeganjustis/why-your-tagging-strategy-matters-on-aws-ab8c3b8335a6">2025</a></p>
<ul>
<li style="font-weight:400;">Keegan Justis had a great medium post on why your tagging strategy matters on AWS. </li>
<li style="font-weight:400;">He highlights the benefits of tagging:
<ul>
<li style="font-weight:400;">Improved Cost Visibility and Accountability</li>
<li style="font-weight:400;">Effective Resource Ownership and Management</li>
<li style="font-weight:400;">Enhanced Security and Compliance</li>
<li style="font-weight:400;">Reliable Automation and Lifecycle Management</li>
<li style="font-weight:400;">Operational Clarity and Faster Troubleshooting</li>
<li style="font-weight:400;">Streamlined Multi-account and Multi-Team governance</li>
<li style="font-weight:400;">Reduced Manual Work and Better efficiency</li>
<li style="font-weight:400;">Simplified Onboarding and Knowledge Transfer</li>
</ul>
</li>
<li style="font-weight:400;">Recommended shared</li>
<li style="font-weight:400;">Enforcement of Tags</li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2042130/c1e-jkjku51p8zup3nkj-rk41k79zujrx-pibou5.mp3" length="101875456"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 303 of The Cloud Pod – where the forecast is always cloudy! Justin, Ryan and exhausted dad Matt are here (and mostly awake) ready to bring the latest in cloud news! This week we’ve got more news from Nova, updates to Claude, earnings news, and a mini funeral for Skype – plus a new helping of Cloud Journey!
Titles we almost went with this week:

Claude researches so Ryan can nap
The best AI for Nova Corps, Amazon Nova Premiere JB
If you can’t beat them, change the licensing terms and make them fork, and then 
     reverse course… and profit
Q has invaded your IDE!!
Skype bites the dust

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our Slack channel for more info. 
Follow Up 
02:50 Sycophancy in GPT-4o: What happened and what we’re doing about it

OpenAI wrote up a blog post about their sycophantic Chat GPT 4o upgrade last week, and they wanted to set the record straight. 
They made adjustments at improving the models default personality to make it feel more intuitive and effective across a variety of tasks. 
When shaping model behavior, they start with a baseline principle and instructions outlined in their model spec. 
They also teach their models how to apply these principles by incorporating user signals like thumbs up and thumbs down feedback on responses. 
In this update, though, they focused too much on short-term feedback and did not fully account for how users’ interactions with ChatGPT evolve. This skewed the results towards responses that were overly supportive – but disingenuous. 
Beyond rolling back the changes, they are taking steps to realign the model behavior, including refining core training techniques and system prompts to explicitly steer the model away from sycophancy. 
They also plan to build more guardrails to increase honesty and transparency principles in the model spec.
Additionally, they plan to expand ways for users to test and give direct feedback before deployments.
Lastly, OpenAI continues to expand evaluations building on the model sync and our ongoing research. 

04:43 Deep Research on Microsoft Hotpatching:

Yes, they’re grabbing money and screwing you. Basically. 

07:06  Justin – “I’m not going to give them any credit on this one. I appreciate that they created hotpatching, but I don’t like what you want to charge me for it.” 
General News
It’s Earnings time – cue the sound effects!
08:03 Alphabet’s Q1 earnings shattered analyst expectations, sending the stock soaring. Google’s CEO credits its AI efforts
Alphabet Q1 2025 earnings call: CEO Sundar Pichai’s remarks

]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2042130/c1a-k5d5-47kq7zkkb6qd-4nvdk8.jpg"></itunes:image>
                                                                            <itunes:duration>01:24:54</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2042130/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[302: It’s So Hot, Even Windows is Hotpatching]]>
                </title>
                <pubDate>Thu, 08 May 2025 15:47:43 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2027382</guid>
                                    <link>https://tcpfm.castos.com/episodes/302-its-so-hot-even-windows-is-hotpatching</link>
                                <description>
                                            <![CDATA[<p>Welcome to episode 302 of The Cloud Pod – where the forecast is always cloudy! This week Justin and Ryan are on hand to bring you all the latest in Cloud (and AI news.) We’ve got hotpatching, Project Greenland, and a rollback of GPT-4.o, which sort of makes us sad – and our egos are definitely less stroked. Plus Saas, containers, and outposts – all of this and more. Thanks for joining us in the cloud! </p>
<h3>Titles we almost went with this week:</h3>
<ul>
<li>The Cloud Pod was never accused of being sycophantic</li>
<li>2nd Gen outposts!?! I didn’t even know anyone was using Gen 1</li>
<li>AWS Outposts 2nd Gen… not with AI (GASP)</li>
<li>If you’re doing SaaS wrong, Google &amp; AWS have your back this week with new Features </li>
<li>Patching, so hot right now</li>
<li>Larger container sizes for Azure….  You don’t say</li>
<li>AWS Green reporting detects hotspots… surprisingly close to Maryland…..</li>
<li>Visual pipeline for Opensearch… I want to like this… but I just can’t</li>
</ul>
<h3>A big thanks to this week’s sponsor:</h3>
<h3>We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our Slack channel for more info. </h3>
<h2>General News </h2>
<p>01:37 <a href="https://cloud.google.com/blog/products/ai-machine-learning/sharing-new-dora-research-for-gen-ai-in-software-development/">Sharing new DORA research for gen AI in software development</a></p>
<ul>
<li style="font-weight:400;">The DORA team at Google has released a new report, “<a href="https://cloud.google.com/resources/content/dora-impact-of-gen-ai-software-development">Impact of Generative AI In Software Development</a>.” The report is based on data and developer interviews, and the report aims to move beyond hype to offer a proper perspective on AI’s impact on individuals, teams and organizations. </li>
<li style="font-weight:400;">Click on the link in our show notes to access the full report. However, Google has highlighted a few key points in the blog post.</li>
<li style="font-weight:400;">AI is Real – A staggering 89% of organizations are prioritizing the integration of AI into their applications, and 76% of technologists are already using AI in some part of their daily work. </li>
<li style="font-weight:400;">Productivity gains confirmed: Developers using Gen AI report significant increases in flow, productivity, and job satisfaction.  For instance, a 25% increase in AI adoption is associated with a 2.1% increase in individual productivity.</li>
<li style="font-weight:400;">Organization benefits are tangible: Beyond individual gains, Dora found strong correlations between AI adoption and improvements in crucial organizational metrics. A 25% increase in AI adoption is associated with increases in document quality, code quality, code review speeds and approval speeds. </li>
<li style="font-weight:400;">If you are looking to utilize AI in your development organization, they provide five practical approaches for both leaders and practitioners.
<ul>
<li style="font-weight:400;">Have transparent communications</li>
<li style="font-weight:400;">Empower developers with learning and experimentation</li>
<li style="font-weight:400;">Establish clear policies</li>
<li style="font-weight:400;">Rethink performance metrics</li>
<li style="font-weight:400;">Embrace fast feedback loops</li>
</ul>
</li>
</ul>
<p>045:06  Ryan – “Those are really good approaches, but really difficult to implement in practice. You know, in my day job, watching the company struggle to get a handle on AI from all the different angles you need to, from data protection, legal liability – just operationally – it’s very hard. So I think having a mature program where you’re rolling that out with intent and being very specific with your AI tasks I think will go a long way with a lot of companies.”  </p>
<h2>AI Is Going Great – Or How ML Makes Its Money </h2>
<p>08...</p>
<h3>Chapters</h3>
<ul><li>(00:00:00) - The Cloud Pod</li><li>(00:01:51) - A New Report on AI in Software Development</li><li>(00:02:57) - How to Use AI in the Development Organization</li><li>(00:08:37) - A Code Team's Journey</li><li>(00:09:04) - How OpenAI Is Making Money With AI</li><li>(00:12:13) - ChatGPT: The Chatbot's Syncophantic</li><li>(00:14:38) - Cloudflare: DDoS Attacks reached 1 TB per second</li><li>(00:18:22) - Cloudflare: Best DDoS Protection for $30 a month</li><li>(00:20:09) - Amazon's US East 1 Availability Zone Announcement</li><li>(00:25:41) - AWS AppSync Events</li><li>(00:30:23) - EKS Cluster Node Monitoring and Auto-Repair</li><li>(00:34:11) - Amazon Bedrock: Prompt Optimization (General Availability)</li><li>(00:36:50) - Amazon Q Business Integrations for Microsoft Word and Outlook</li><li>(00:39:05) - Amazon Serverless Reservations: New Discount for Analytics</li><li>(00:42:25) - Amazon OpenSearch Injection Pipelines</li><li>(00:44:50) - Amazon Announces Second Generation AWS Outpost Racks</li><li>(00:48:09) - Amazon Cloudfront SaaS Manager: Multi-Termite Webs</li><li>(00:52:34) - Amazon VPC Endpoints: 10 years too late</li><li>(00:54:06) - SaaS Runtime: Fully Managed by Google Cloud</li><li>(01:01:04) - On the Cloud: The IMS Blueprint</li><li>(01:03:44) - Google Cloud Database and LangChain Integrations now support Go Java and</li><li>(01:04:22) - OpenAI Unveils GPT Image 1 at Microsoft</li><li>(01:06:16) - How to Stop restarting your Windows Servers for Patching</li><li>(01:06:44) - Microsoft Hot Patching for Windows Server 25</li><li>(01:13:27) - Let it go.</li><li>(01:13:42) - Azure Confidential VMs</li><li>(01:16:43) - Azure: Large Container Sizes for ACI</li><li>(01:19:14) - DigitalOcean Launches Managed Caching for Valky</li><li>(01:20:28) - How Amazon Rescued Its GPU Crunch</li><li>(01:22:15) - Amazon's GPU Priority Process</li><li>(01:24:01) - NVIDIA GPUs, Storage, and Collaboration</li><li>(01:24:46) - Efficiency and confidentiality in the R&D environment</li><li>(01:26:31) - Amazon's GPU Orchestration System</li><li>(01:31:34) - Week in the Cloud: Longest Episode Yet</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 302 of The Cloud Pod – where the forecast is always cloudy! This week Justin and Ryan are on hand to bring you all the latest in Cloud (and AI news.) We’ve got hotpatching, Project Greenland, and a rollback of GPT-4.o, which sort of makes us sad – and our egos are definitely less stroked. Plus Saas, containers, and outposts – all of this and more. Thanks for joining us in the cloud! 
Titles we almost went with this week:

The Cloud Pod was never accused of being sycophantic
2nd Gen outposts!?! I didn’t even know anyone was using Gen 1
AWS Outposts 2nd Gen… not with AI (GASP)
If you’re doing SaaS wrong, Google & AWS have your back this week with new Features 
Patching, so hot right now
Larger container sizes for Azure….  You don’t say
AWS Green reporting detects hotspots… surprisingly close to Maryland…..
Visual pipeline for Opensearch… I want to like this… but I just can’t

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our Slack channel for more info. 
General News 
01:37 Sharing new DORA research for gen AI in software development

The DORA team at Google has released a new report, “Impact of Generative AI In Software Development.” The report is based on data and developer interviews, and the report aims to move beyond hype to offer a proper perspective on AI’s impact on individuals, teams and organizations. 
Click on the link in our show notes to access the full report. However, Google has highlighted a few key points in the blog post.
AI is Real – A staggering 89% of organizations are prioritizing the integration of AI into their applications, and 76% of technologists are already using AI in some part of their daily work. 
Productivity gains confirmed: Developers using Gen AI report significant increases in flow, productivity, and job satisfaction.  For instance, a 25% increase in AI adoption is associated with a 2.1% increase in individual productivity.
Organization benefits are tangible: Beyond individual gains, Dora found strong correlations between AI adoption and improvements in crucial organizational metrics. A 25% increase in AI adoption is associated with increases in document quality, code quality, code review speeds and approval speeds. 
If you are looking to utilize AI in your development organization, they provide five practical approaches for both leaders and practitioners.

Have transparent communications
Empower developers with learning and experimentation
Establish clear policies
Rethink performance metrics
Embrace fast feedback loops



045:06  Ryan – “Those are really good approaches, but really difficult to implement in practice. You know, in my day job, watching the company struggle to get a handle on AI from all the different angles you need to, from data protection, legal liability – just operationally – it’s very hard. So I think having a mature program where you’re rolling that out with intent and being very specific with your AI tasks I think will go a long way with a lot of companies.”  
AI Is Going Great – Or How ML Makes Its Money 
08...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[302: It’s So Hot, Even Windows is Hotpatching]]>
                </itunes:title>
                                    <itunes:episode>302</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p>Welcome to episode 302 of The Cloud Pod – where the forecast is always cloudy! This week Justin and Ryan are on hand to bring you all the latest in Cloud (and AI news.) We’ve got hotpatching, Project Greenland, and a rollback of GPT-4.o, which sort of makes us sad – and our egos are definitely less stroked. Plus Saas, containers, and outposts – all of this and more. Thanks for joining us in the cloud! </p>
<h3>Titles we almost went with this week:</h3>
<ul>
<li>The Cloud Pod was never accused of being sycophantic</li>
<li>2nd Gen outposts!?! I didn’t even know anyone was using Gen 1</li>
<li>AWS Outposts 2nd Gen… not with AI (GASP)</li>
<li>If you’re doing SaaS wrong, Google &amp; AWS have your back this week with new Features </li>
<li>Patching, so hot right now</li>
<li>Larger container sizes for Azure….  You don’t say</li>
<li>AWS Green reporting detects hotspots… surprisingly close to Maryland…..</li>
<li>Visual pipeline for Opensearch… I want to like this… but I just can’t</li>
</ul>
<h3>A big thanks to this week’s sponsor:</h3>
<h3>We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our Slack channel for more info. </h3>
<h2>General News </h2>
<p>01:37 <a href="https://cloud.google.com/blog/products/ai-machine-learning/sharing-new-dora-research-for-gen-ai-in-software-development/">Sharing new DORA research for gen AI in software development</a></p>
<ul>
<li style="font-weight:400;">The DORA team at Google has released a new report, “<a href="https://cloud.google.com/resources/content/dora-impact-of-gen-ai-software-development">Impact of Generative AI In Software Development</a>.” The report is based on data and developer interviews, and the report aims to move beyond hype to offer a proper perspective on AI’s impact on individuals, teams and organizations. </li>
<li style="font-weight:400;">Click on the link in our show notes to access the full report. However, Google has highlighted a few key points in the blog post.</li>
<li style="font-weight:400;">AI is Real – A staggering 89% of organizations are prioritizing the integration of AI into their applications, and 76% of technologists are already using AI in some part of their daily work. </li>
<li style="font-weight:400;">Productivity gains confirmed: Developers using Gen AI report significant increases in flow, productivity, and job satisfaction.  For instance, a 25% increase in AI adoption is associated with a 2.1% increase in individual productivity.</li>
<li style="font-weight:400;">Organization benefits are tangible: Beyond individual gains, Dora found strong correlations between AI adoption and improvements in crucial organizational metrics. A 25% increase in AI adoption is associated with increases in document quality, code quality, code review speeds and approval speeds. </li>
<li style="font-weight:400;">If you are looking to utilize AI in your development organization, they provide five practical approaches for both leaders and practitioners.
<ul>
<li style="font-weight:400;">Have transparent communications</li>
<li style="font-weight:400;">Empower developers with learning and experimentation</li>
<li style="font-weight:400;">Establish clear policies</li>
<li style="font-weight:400;">Rethink performance metrics</li>
<li style="font-weight:400;">Embrace fast feedback loops</li>
</ul>
</li>
</ul>
<p>045:06  Ryan – “Those are really good approaches, but really difficult to implement in practice. You know, in my day job, watching the company struggle to get a handle on AI from all the different angles you need to, from data protection, legal liability – just operationally – it’s very hard. So I think having a mature program where you’re rolling that out with intent and being very specific with your AI tasks I think will go a long way with a lot of companies.”  </p>
<h2>AI Is Going Great – Or How ML Makes Its Money </h2>
<p>08:55  <a href="https://openai.com/index/image-generation-api/">Introducing our latest image generation model in the API</a></p>
<ul>
<li style="font-weight:400;">You can now generate images via the <a href="https://openai.com/index/image-generation-api/">ChatGPT API via gpt-image-1</a>, enabling developers and businesses to easily integrate high-quality, professional-grade image generation directly in their tools and platforms.</li>
<li style="font-weight:400;">The GPT-image-1 API is <a href="https://openai.com/chatgpt/pricing/">priced per token</a>, with separate pricing for text and image tokens.  </li>
<li style="font-weight:400;">Text input is $5 per 1M tokens, Image input tokens are $10 per 1 million tokens, and Image output or generated images is $40 per 1M token.  </li>
</ul>
<p>09:47  Ryan – “It’s still tricky pricing these things out…forecasting these things in a way that you can coordinate as a business is really challenging.”</p>
<p>12:03 <a href="https://arstechnica.com/ai/2025/04/openai-rolls-back-update-that-made-chatgpt-a-sycophantic-mess/">OpenAI rolls back update that made ChatGPT a sycophantic mess</a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/chatgpt/overview/">ChatGPT</a> is becoming less of a suck up apparently. </li>
<li style="font-weight:400;">ChatGPT users have grown frustrated with the overly positive and complementary output generated by the model.  </li>
<li style="font-weight:400;">This rollback will occur on the GPT-4o model, which is the default model you get access to via ChatGPT.  </li>
<li style="font-weight:400;">OpenAI says that as you interact with the chatbot, OpenAI gathers data on the responses people like more, then the engineers revise the production model using a technique called reinforcement learning from human feedback.  </li>
<li style="font-weight:400;">However, that’s where things went off the rails – turning ChatGPT into the world’s biggest suck up. </li>
<li style="font-weight:400;">Users could present ChatGPT with completely terrible ideas or misguided claims, and it might respond “Wow, you’re a genius” or “This is on a whole different level.” Which, to be fair, “on a whole different level” doesn’t necessarily mean GOOD. </li>
<li style="font-weight:400;">Designing the model’s tone is important to make them something you want to chat with, and this sycophantic response process results in a toxic feedback loop. </li>
<li style="font-weight:400;">Claude is a little more realistic, but honestly – it’s sort of a let down. </li>
</ul>
<h2>Cloud Tools </h2>
<p>14:30 <a href="https://blog.cloudflare.com/ddos-threat-report-for-2025-q1/">Targeted by 20.5 million DDoS attacks, up 358% year-over-year: </a><a href="https://blog.cloudflare.com/ddos-threat-report-for-2025-q1/">Cloudflare’s 2025 Q1 DDoS Threat Report</a> </p>
<ul>
<li style="font-weight:400;"><a href="https://www.cloudflare.com/network/">Cloudflare</a> has released their Q1 <a href="http://www.ddosreport.com/">DDOS threat report</a>, and it isn’t great if you’re trying to protect internet resources. </li>
<li style="font-weight:400;">They even touched on a late breaking <a href="https://blog.cloudflare.com/ddos-threat-report-for-2025-q1/#hyper-volumetric-ddos-attacks">DDOS attack observed in April 2025</a> that are some of the largest publicly disclosed.  </li>
<li style="font-weight:400;">Cloudflare says they blocked an intense packet rate attack, peaking 4.8 billion packets per second, 52% higher than their previous benchmark, and also defended against a 6.5 tbps flood, matching the highest bandwidth reports ever reported. </li>
<li style="font-weight:400;">In the first quarter though:
<ul>
<li style="font-weight:400;">Blocked 20.5 Million DDOS attacks, representing 358% YoY increase and 198% quarter-over-quarter increase.</li>
<li style="font-weight:400;">One third of the attacks, 6.6 million, targeted the CloudFlare network infrastructure directly, as part of an 18-day multi-vector attack campaign.</li>
<li style="font-weight:400;">Furthermore, in the first quarter of 2025, Cloudflare blocked approximately 700 hyper-volumetric DDoS attacks that exceeded 1 Tbps or 1 BPSS or about eight attacks per day</li>
</ul>
</li>
</ul>
<p>15:57  Justin – “I was thinking about this earlier, actually. Typically DDoS attacks are compromised computers that are then used in these massive attacks, and they’re all controlled by botnets and this has been going on for over a decade now – and it just keeps getting worse… I mean, I’m a computer guy, so all my shit’s locked down and secure, and I have firewalls, but do normal people just go raw dogging on the internet and their computers get hacked and compromised all the time?” </p>
<h2>AWS</h2>
<p>20:09 <a href="https://aws.amazon.com/blogs/aws/in-the-works-new-availability-zone-in-maryland-for-us-east-n-virginia-region/">In the works – New Availability Zone in Maryland for US East (Northern </a><a href="https://aws.amazon.com/blogs/aws/in-the-works-new-availability-zone-in-maryland-for-us-east-n-virginia-region/">Virginia) Region</a></p>
<ul>
<li style="font-weight:400;">This might explain the recent update to the API to include location in the response, with <a href="https://aws.amazon.com/">Amazon</a> announcing a new <a href="https://docs.aws.amazon.com/global-infrastructure/latest/regions/aws-availability-zones.html">Availability Zone</a> for US-East in Maryland vs Virginia. </li>
<li style="font-weight:400;">Today the AWS US-East region is 6 Availability Zones, now with this new Maryland zone opening in 2026, they will have 7 AZ’s connected by high-bandwidth, low-latency network connections over dedicated, fully redundant fiber. </li>
<li style="font-weight:400;">With this new AZ joining an ever growing list of new regions including New Zealand, KSA, Taiwan and the AWS European Sovereign Cloud AWS is investing heavily in Datacenter capacity. </li>
</ul>
<p>25:40 <a href="https://aws.amazon.com/blogs/aws/enhance-real-time-applications-with-aws-appsync-events-data-source-integrations/">Enhance real-time applications with AWS AppSync Events data source </a><a href="https://aws.amazon.com/blogs/aws/enhance-real-time-applications-with-aws-appsync-events-data-source-integrations/">integrations</a>  </p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/appsync/product-details/#AWS_AppSync_Events">AWS AppSync</a> events now support data source integrations for channel namespaces, enabling developers to create more sophisticated real-time applications. </li>
<li style="font-weight:400;">With the new capabilities, you can associate <a href="https://aws.amazon.com/lambda/">AWS Lambda functions</a>, <a href="https://aws.amazon.com/dynamodb/">Amazon DynamoDB</a> tables, <a href="https://aws.amazon.com/rds/aurora/">Amazon Aurora</a> databases and other data sources with channel namespace handlers. </li>
<li style="font-weight:400;">Leveraging <a href="https://docs.aws.amazon.com/appsync/latest/eventapi/event-api-welcome.html">AppSync events</a> you can build rich, real-time applications with features like data validation, event transformation and persistent storage of events. </li>
<li style="font-weight:400;">You can integrate these event flow workflows by transforming and filtering events using Lambda functions or save batches of events to DynamoDB using the new <a href="https://docs.aws.amazon.com/appsync/latest/eventapi/runtime-reference.html">AppSync_JS</a> batch utilities. </li>
</ul>
<p>26:45  Ryan – “I kind of like this thing because it’s a little bit of putting a Band-Aid on your around your managed application, but sure is powerful when you can use it.”</p>
<p>29:49 <a href="https://aws.amazon.com/blogs/containers/amazon-eks-introduces-node-monitoring-and-auto-repair-capabilities/">Amazon EKS introduces node monitoring and auto repair capabilities</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/eks/">EKS</a> now provides node monitoring and auto repair capabilities. </li>
<li style="font-weight:400;">This new feature enables automatic detection and remediation of node-level issues in EKS clusters, improving your availability and reliability of K8 apps.</li>
<li style="font-weight:400;">There are two components responsible for detecting node failures:
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/about-aws/whats-new/2024/12/node-health-monitoring-auto-repair-amazon-eks/">The Node Monitoring Agent</a>– that detects a wide range of issues 
<ul>
<li style="font-weight:400;">It is bundled into the container image that runs as a daemonSet in all worker nodes.  </li>
<li style="font-weight:400;">The agent communicates any issue it finds by updating the status of the K8 node object in the cluster and by emitting K8 events. </li>
<li style="font-weight:400;">Detects GPU Failures related to Hardware Issues, Driver Issues, Memory Problems or unexpected performance drops</li>
<li style="font-weight:400;"><a href="https://kubernetes.io/docs/tasks/debug/debug-cluster/monitor-node-health/">Kubelet Health</a></li>
<li style="font-weight:400;">ContainerD issues</li>
<li style="font-weight:400;">Networking CNI problems, missing route table entries and packet drop issues</li>
<li style="font-weight:400;">Disk Space and I/O errors</li>
<li style="font-weight:400;">CPU throttling, memory pressure and overall system load</li>
<li style="font-weight:400;">Kernel panics</li>
</ul>
</li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/eks/latest/userguide/node-health.html">Node Repair System</a>: This is a backend component that collects health information and repairs worker nodes. 
<ul>
<li style="font-weight:400;">System either replaces or reboots nodes in response to the conditions within, at most 30 minutes</li>
<li style="font-weight:400;">If a GPU failure is detected it will replace or reboot that node within, at most, 10 minutes.</li>
<li style="font-weight:400;">Repair actions are logged and can be audited</li>
<li style="font-weight:400;">Repair system respects user-specific disruption controls, such as Pod Disruption budgets. If zonal shift is activated in your EKS cluster, then node auto repair actions are halted</li>
</ul>
</li>
</ul>
</li>
</ul>
<p>32:29  Ryan – “I do like that it’s built in to the existing agent, you know, in terms of those health checks. And hopefully that the thresholds and the tuning of this is, you know, tunable where you can set it. Or it’s just completely like hands off running and it just works like magic. That would also be acceptable.”</p>
<p>33:42 <a href="https://aws.amazon.com/about-aws/whats-new/2025/04/prompt-optimization-amazon-bedrock-generally-available/">Prompt Optimization in Amazon Bedrock now generally available</a></p>
<ul>
<li style="font-weight:400;">Prompt Optimization in Bedrock is now GA.  </li>
<li style="font-weight:400;">Prompt engineering is the process of designing prompts to guide FMs to generate relevant responses. </li>
<li style="font-weight:400;">These prompts must be customized for each FM according to its best practices and guidelines, which is a time-consuming process that delays application development. </li>
<li style="font-weight:400;">Prompt optimization can now automatically rewrite prompts for better performance and more concise responses on <a href="https://www.anthropic.com/">Anthropic</a>, <a href="https://www.llama.com/">Llama</a>, <a href="https://nova.amazon.com/">Nova</a>, <a href="https://www.deepseek.com/">Deepseek</a>, <a href="https://mistral.ai/">Mistral</a>, and <a href="https://aws.amazon.com/bedrock/amazon-models/titan/">Titan Models</a>. </li>
<li style="font-weight:400;">You can compare optimized prompts against original versions without deployment and save them in <a href="https://aws.amazon.com/bedrock/prompt-management/">Amazon Bedrock Prompt Management</a> for prompt lifecycle management. </li>
<li style="font-weight:400;">Prompt Optimization will take $0.030 per 1000 tokens. Want more info on pricing? You can find that <a href="https://aws.amazon.com/bedrock/pricing/">here</a>. </li>
</ul>
<p>34:22  Justin – “This is one of those things you create the prompts, you optimize them once for each of the models, and they don’t really change all that often. That’s the guidelines that change.”</p>
<p>36:20 <a href="https://aws.amazon.com/about-aws/whats-new/2025/04/upgrades-amazon-q-business-m365-word-outlook/">AWS announces upgrades to Amazon Q Business integrations for M365 </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/04/upgrades-amazon-q-business-m365-word-outlook/">Word and Outlook</a></p>
<ul>
<li style="font-weight:400;">AWS announced upgrades to its <a href="https://aws.amazon.com/q/business/">Amazon Q business</a> integrations for M365 Word and Outlook to enhance their utility when performing document and email-centered tasks. </li>
<li style="font-weight:400;">The upgrade includes company knowledge access, image file attachment support, and expanded prompt context windows. </li>
<li style="font-weight:400;">With company knowledge support, users can now ask questions about their company’s indexed data directly through the Word and Outlook integrations, allowing them to instantly find relevant information when drafting their documents and emails without needing to switch context. </li>
<li style="font-weight:400;">We are *shocked* that you’re not locked into Microsoft’s AI capabilities.</li>
</ul>
<p>38:42 <a href="https://aws.amazon.com/about-aws/whats-new/2025/04/serverless-reservations-discounted-pricing-option-amazon-redshift-serverless/">Announcing Serverless Reservations, a new discounted pricing option for </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/04/serverless-reservations-discounted-pricing-option-amazon-redshift-serverless/">Amazon Redshift Serverless</a></p>
<ul>
<li style="font-weight:400;"><a href="https://duckduckgo.com/y.js?ad_domain=amazon.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=SObHLOVeXW4xke-uMY9ZXUw2YqpvlmQglK0xGfKyca23J8_RCJ2cDAy2GAChhluNORyviLDct8AoebidAnkNA-eFxoReIdbczrSAmq8erhO5DZbTpE4xS63bq0xlYA8a.oBLp4UHXIJwI-b1WworgBA&amp;eddgt=hZtk5KXVQqlswWTsZVM8Wg%3D%3D&amp;rut=ec82c80f02f8768d138a77c41e118505e34861248ab71a091adb725459440dd6&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8v2uv7zFXiCXTKd264k__GzVUCUyVNsNBfMW7AnNr_gaXiNZji8clN69zIUnI5i5eCrS3bBFtqpM-XnFYXuI-ElDwBpGwPHPYkrJ2Ekmx2bFwjnna215VVjnA_VSVJSgW325WW5qw1J-QNYLUoVOhsayHpKnDiNBL2gc_SajeJrD_NEZ0DLjjOrLvTWpGUYAjEtQzcnthxGZKby8WUI_ntmihL_3TdP9POtTm21cq8FTp_FyA6WlModaDgenoG2o7WpR_oDi1LZppSlXDknguyXNCplpkY-pvUMW0GMV5sHCIXtG9GlGNrHLFFrrTyInlp6yesMONTJ2IDcqVIPfiQMcRKAmYYdHFmkfqvoeKAabbfzVAiuzaCb8yuy7UYzB39UHefx7KobKjwOlRIkw91DmnFG6hw274T9Iqs9lTQK1txdF6Ip_Wlij9IoKGd01aAhpQgax40-WnUOklrV6w7x1dlLHdvmlRreMMBS06r5tXehbe2gCbHui-gu8kLbXMOz0bVMQRshfCxqG3Idse7dS2L-BFgilPVlMswbcOFvdCIcMQ4o5kzj72sFK1op09o3k60SGGrm2IPPz-fx60hHt3Ij0JZ_RIususUsomDqvGVSNu_MbJKgQv6042_5XURi-JZreGJTSB011U5DFWApeiWc8X4hDRmscpn4b-grcGCjh0qORq5nMbrKHK9JKgdpx3U2PPpUneUp_xx5jMwAMuVPAP75OfpjNSnFS-fLW2862W2GpzYAm2iaFZuN1e87QYXQ%26u%3DaHR0cHMlM2ElMmYlMmZwaXhlbC5ldmVyZXN0dGVjaC5uZXQlMmY0NDIyJTJmY3ElM2Zldl9zaWQlM2QxMCUyNmV2X2xuJTNkYW1hem9uJTI1MjByZWRzaGlmdCUyNmV2X2x0eCUzZCUyNmV2X2x4JTNka3dkLTcxMzMxNTc2MzczNDIwJTNhbG9jLTcxMjI4JTI2ZXZfY3J4JTNkNzEzMzEwNTIyNTE3MDklMjZldl9tdCUzZGUlMjZldl9kdmMlM2RjJTI2ZXZfcGh5JTNkODA1OTYlMjZldl9sb2MlM2QlMjZldl9jeCUzZDQ4MjUxMDc0MSUyNmV2X2F4JTNkMTE0MTI5NDA3MDgwODIyOCUyNmV2X2V4JTNkJTI2ZXZfZWZpZCUzZDVlMmExMDEwODMyNzEwYmRkM2I3NzgwYWUwYzljMTUwJTNhRyUzYXMlMjZ1cmwlM2RodHRwcyUyNTNBJTI1MkYlMjUyRmF3cy5hbWF6b24uY29tJTI1MkZwbSUyNTJGcmVkc2hpZnQlMjUzRnRyayUyNTNEMTkyMzUzZmYtMDc5Mi00OTc4LTlkOTktNzg0NzY0MmM2ZjJhJTI1MjZzY19jaGFubmVsJTI1M0RwcyUyNTI2c19rd2NpZCUyNTNEQUwhNDQyMiExMCE3MTMzMTA1MjI1MTcwOSEhISE3MTMzMTU3NjM3MzQyMCEhNDgyNTEwNzQxITExNDEyOTQwNzA4MDgyMjglMjUyNmVmX2lkJTI1M0Q1ZTJhMTAxMDgzMjcxMGJkZDNiNzc4MGFlMGM5YzE1MCUyNTNBRyUyNTNBcyUyNm1zY2xraWQlM2Q1ZTJhMTAxMDgzMjcxMGJkZDNiNzc4MGFlMGM5YzE1MA%26rlid%3D5e2a1010832710bdd3b7780ae0c9c150&amp;vqd=4-213519883894775796767383074904437390766&amp;iurl=%7B1%7DIG%3D0EFE19A961634D41AABDC588F2C1DB83%26CID%3D346C20914CA460AF241335754DCB61E6%26ID%3DDevEx%2C5047.1">Amazon Redshift</a> now offers Serverless Reservations for Redshift Serverless, a new discounted pricing option that helps you save up to 24% and gain greater cost predictability for your analytics workload. </li>
<li style="font-weight:400;">With Serverless Reservations, you can commit to a specific number of Redshift Processing Units (RPUs) for a one-year term, and choose between two <a href="https://aws.amazon.com/redshift/pricing?p=pm&amp;c=rs&amp;z=4">payment options</a>: a no-upfront option that provides a 20% discount for on-demand rates, or an all-upfront option that provides a 24% discount. </li>
</ul>
<p>39:06  Justin – “Save all the monies!” </p>
<p>39:37 <a href="https://aws.amazon.com/about-aws/whats-new/2025/04/aws-transfer-family-terraform-module-sftp-endpoints/">AWS Transfer Family introduces Terraform module for deploying SFTP </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/04/aws-transfer-family-terraform-module-sftp-endpoints/">server endpoints</a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/aws-transfer-family/">AWS Transfer Family</a> introduces a <a href="https://github.com/aws-ia/terraform-aws-transfer-family">Terraform module</a> for deploying managed file transfer (MFT) server endpoints backed by <a href="https://aws.amazon.com/s3/">Amazon S3</a>. </li>
<li style="font-weight:400;">This enables you to leverage IaC to automate and streamline centralized provisioning of MFT servers and users at scale. </li>
<li style="font-weight:400;">AWS Transfer Family provides a fully-managed file transfer for SFTP, AS2, FTPS, FTP and Web Browser-based interfaces directly into and out of AWS storage services. </li>
</ul>
<p>39:57  Justin – “If you’re using FTP you should stop immediately.” </p>
<p>42:10 <a href="https://aws.amazon.com/about-aws/whats-new/2025/04/guided-visual-pipeline-builder-amazon-opensearch-ingestion/">Introducing a guided visual pipeline builder for Amazon OpenSearch </a><a href="https://aws.amazon.com/about-aws/whats-new/2025/04/guided-visual-pipeline-builder-amazon-opensearch-ingestion/">Ingestion</a>      </p>
<ul>
<li style="font-weight:400;">Amazon is releasing a new visual user interface for creating and editing <a href="https://docs.aws.amazon.com/opensearch-service/latest/developerguide/ingestion.html">Amazon OpenSearch Ingestion</a> pipelines on the AWS console</li>
<li style="font-weight:400;">This new capability gives you a guided visual workflow, automatic permission creations, and enhanced real-time validations to streamline the pipeline development process. </li>
<li style="font-weight:400;">The new workflow simplifies pipeline development, reducing setup time and minimizing errors, making it easier to ingest, transform, and route data to <a href="https://aws.amazon.com/what-is/opensearch/">Amazon OpenSearch</a> Service. </li>
</ul>
<p>43:02  Justin – “All of Ryan’s grey hair in his goatee and the reason why I have no color in my goatee is because of ElasticSearch.” </p>
<p>44:35 <a href="https://aws.amazon.com/blogs/aws/announcing-second-generation-aws-outposts-racks-with-breakthrough-performance-and-scalability-on-premises/">Announcing second-generation AWS Outposts racks with breakthrough </a><a href="https://aws.amazon.com/blogs/aws/announcing-second-generation-aws-outposts-racks-with-breakthrough-performance-and-scalability-on-premises/">performance and scalability on-premises</a></p>
<ul>
<li style="font-weight:400;">Amazon is announcing the second generation of <a href="https://aws.amazon.com/outposts/rack/">AWS Outpost Racks</a>, which marks the latest innovation from AWS for edge computing.</li>
<li style="font-weight:400;">The new generation includes support for the latest x86 powered <a href="https://aws.amazon.com/ec2">EC2</a> instances, simplified network scaling and configurations, and accelerated networking instances designed specifically for ultra-low latency and high-throughput workloads. </li>
<li style="font-weight:400;">The enhancements deliver greater performance for a broad range of on-premise workloads, as well as delivering greater performance for a broad range of on-premises workloads, such as core trading systems of financial services and telecom 5G core networks. </li>
<li style="font-weight:400;">Multiple customers have taken advantage of Outposts, including AthenaHealth, FanDuel, Riot Games, etc. </li>
<li style="font-weight:400;">The second generation outpost rack can provide low latency, local data processing, or data residency needs, such as game servers for multiplayer online games, customer transaction data, medical record, industrial and manufacturing control systems, telecom BSS, and edge inference of a variety of ML models. </li>
<li style="font-weight:400;">Justin is impressed that they didn’t slather AI all over this. Missed opportunity! </li>
<li style="font-weight:400;">You can get the 7th generation of X86 processors on outpost racks (C7I, M7I, and R7I optimized instances)</li>
<li style="font-weight:400;">They note that Support for more latest generation EC2 and GPU enabled instances is coming soon (which we guess explains the lack of AI.)</li>
</ul>
<p>45:40 Justin – “You know what this announcement doesn’t say a thousand times? No AI. Not a single mention of it. They did mention inference for a variety of ML models, and they do specifically call out CPU based ML models, and that’s because none of these instances support GPUs yet…but they do promise that they are coming soon – both the latest generation EC2 and GPU enabled instances.”</p>
<p>48:16 <a href="https://aws.amazon.com/blogs/aws/reduce-your-operational-overhead-today-with-amazon-cloudfront-saas-manager/">Reduce your operational overhead today with Amazon CloudFront SaaS </a><a href="https://aws.amazon.com/blogs/aws/reduce-your-operational-overhead-today-with-amazon-cloudfront-saas-manager/">Manager</a></p>
<ul>
<li style="font-weight:400;">Amazon is announcing the GA of <a href="https://aws.amazon.com/blogs/aws/category/networking-content-delivery/amazon-cloudfront/">Amazon CloudFront</a> <a href="https://aws.amazon.com/blogs/aws/category/saas/">SaaS Manager</a>, a new feature that helps <a href="https://aws.amazon.com/what-is/saas/">SaaS providers</a>, web development platform providers, and companies with multiple brands and websites to efficiently manage delivery across multiple domains.  </li>
<li style="font-weight:400;">Cloudfront SaaS manager addresses critical challenge organizations face: managing tenant websites at scale, each requiring TLS certificates, Distributed denial of service (DDoS) protection and performance monitoring</li>
<li style="font-weight:400;">With Cloudfront SaaS manager, web development platform providers and enterprise SaaS providers who manage a large number of domains will use simple API’s and reusable configurations that use CloudFront edge locations worldwide, AWS WAF, and <a href="https://aws.amazon.com/certificate-manager/">AWS Certificate Manager</a>. </li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/developer/application-security-performance/articles/saas/">Multi-Tenant SaaS deployments</a> is a strategy where a single cloudfront distribution serves content for multiple distinct tenants (users or organizations.)  CloudFront SaaS Manager utilizes a new template-based distribution model, known as a multi-tenant distribution, to serve content across multiple domains while sharing configuration and infrastructure. However, if supporting single websites or applications, a standard distribution would be better or recommended. </li>
<li style="font-weight:400;">A template distribution defines the base configuration that will be used across domains ,such as the origin configurations, cache behaviors, and security settings.  </li>
<li style="font-weight:400;">Each template distribution has a distribution tenant to represent domain-specific origin paths or origins domain names, including web access control list overrides and custom TLS certificates. </li>
</ul>
<p>50:05  Justin – “So now you have a very complicated set of CloudFront configurations because every one of them has to have its own CloudFront configuration – because you did custom URL vanity URLs. But now you can use this to help you make that less toil, which is appreciated, but it’s also a *terrible* model. And I don’t recommend it for a SaaS application if you can help it.”</p>
<p>52:22 <a href="https://aws.amazon.com/about-aws/whats-new/2025/04/amazon-route-53-profiles-vpc-endpoints/">Amazon Route 53 Profiles now supports VPC endpoints</a></p>
<ul>
<li style="font-weight:400;">AWS announced support for VPC endpoints in <a href="https://aws.amazon.com/route53/">Route 53</a> profiles, allowing you to create, manage, and share private hosted zones for interface VPC endpoints across multiple VPCs and AWS accounts within your organization.  </li>
<li style="font-weight:400;">This enhancement for Amazon Route 53 profiles simplifies the management of VPC endpoints by streamlining the process of creating and associating interface VPC endpoint managed private zones (PHZs) with VPCs and AWS accounts, without requiring manual association.</li>
</ul>
<h2>GCP</h2>
<p>53:56 <a href="https://cloud.google.com/blog/products/application-modernization/introducing-saas-runtime/">Introducing SaaS Runtime</a></p>
<ul>
<li style="font-weight:400;">We missed this announcement at Google Next, but they unveiled the <a href="https://youtu.be/PVz_NKIXMUY">preview of SaaS Runtime</a>, a fully managed <a href="https://www.bing.com/aclk?ld=e8AKrUwN04m11y30UGBgK85zVUCUzCkEsWNQvTBEcAe6lQeiKgpZdZf5-gQK7Sq69SSBHZXrzL3UoW8MC0EsUEOKw08yX3-ab7W32iCyv-8Ckj_Cgo_YCit6skpiErmO7BsoetAOKIx-RCdp5XKm4j18dcHmoBhguv7VsRMoADmNwfSNgA9_sXK7es6NUqzt_qhxYkCQ&amp;u=aHR0cHMlM2ElMmYlMmZjbG91ZC5nb29nbGUuY29tJTJmZ2NwJTNmdXRtX3NvdXJjZSUzZGJpbmclMjZ1dG1fbWVkaXVtJTNkY3BjJTI2dXRtX2NhbXBhaWduJTNkbmEtVVMtYWxsLWVuLWRyLWJrd3MtYWxsLWFsbC10cmlhbC1lLWRyLTE3MTAxMzQlMjZ1dG1fY29udGVudCUzZHRleHQtYWQtbm9uZS1hbnktREVWX2MtQ1JFXy1BREdQX0Rlc2slMmIlMjU3QyUyYkJLV1MlMmItJTJiRVhBJTJiJTI1N0MlMmJUeHQtQ29yZS1HZW5lcmFsJTJiR0NQLUtXSURfNDM3MDAwNjMzNDE4NDMyOTYta3dkLTc3MjQwODk2MDc1NjQxJTNhbG9jLTE5MCUyNnV0bV90ZXJtJTNkS1dfZ29vZ2xlJTI1MjBjbG91ZC1TVF9nb29nbGUlMmJjbG91ZCUyNmdjbGlkJTNkODA4ZDBmMWU2ZGY3MTQ4ZDBlNDQ2NzA0MGIxOWVlZWIlMjZnY2xzcmMlM2QzcC5kcyUyNm1zY2xraWQlM2Q4MDhkMGYxZTZkZjcxNDhkMGU0NDY3MDQwYjE5ZWVlYg&amp;rlid=808d0f1e6df7148d0e4467040b19eeeb">Google Cloud</a> service management platform designed to simplify and automate the complexities of infrastructure operations, enabling SaaS providers to focus on their core business. </li>
<li style="font-weight:400;">Based on their internal platform for serving millions of users across multiple tenants, SaaS runtime leverages their extensive experience managing services at Google Scale.  </li>
<li style="font-weight:400;">SaaS runtime helps you model your SaaS environment, accelerate deployments and streamline operations with a rich set of tools to manage at scale, with automation at its core. </li>
<li style="font-weight:400;">SaaS Runtime vision includes:
<ul>
<li style="font-weight:400;">Launch quickly, customize and iterate:  SaaS Runtime empowers you with pre-built customizable blueprints, allowing for rapid iteration and deployment. You can easily integrate AI architecture blueprints into existing systems through simple data model abstractions.</li>
<li style="font-weight:400;">Automate operations, observe and scale tenants: As a fully managed service, SaaS runtime allows automation at scale. Starting from your current continuous integration/continuous delivery (CI/CD) pipeline, onboard to SaaS runtime and then scale it to simplify service management, tenant observability and operations across both cloud and edge environments. </li>
<li style="font-weight:400;">Integrate, optimize, and expand rapidly: SaaS Runtime is integrated into Google Cloud, allowing developers to design applications using the new <a href="https://cloud.google.com/application-design-center/docs/overview">Application Design Center</a>. </li>
<li style="font-weight:400;">These applications can then be deployed via the <a href="https://cloud.google.com/marketplace">Google Cloud Marketplace</a>. Once deployed across tenants, their performance can be monitored with <a href="https://cloud.google.com/stackdriver/docs">Cloud Observability</a> and the <a href="https://cloud.google.com/products/app-hub">App Hub</a>. </li>
</ul>
</li>
</ul>
<p>55:33  Justin – “This is for a SaaS company that literally deploys an instance for each customer. It’s an expensive pattern number one, but sometimes customers like this, because it makes it very easy to say, well, these are your direct costs, and so you should pay for them. This is a model that Jira uses. This is the model that ServiceNow uses – where you’re getting a dedicated app server in addition to a dedicated database server. And so yeah – this is to manage all of that at scale… But this really isn’t how you should do it.” </p>
<p>1:03:49 <a href="https://cloud.google.com/blog/products/databases/google-cloud-database-and-langchain-integrations-support-go-java-and-javascript/">Google Cloud Database and LangChain integrations support Go, Java, </a><a href="https://cloud.google.com/blog/products/databases/google-cloud-database-and-langchain-integrations-support-go-java-and-javascript/">and JavaScript</a></p>
<ul>
<li style="font-weight:400;">Three new language support <a href="https://github.com/googleapis?q=google-langchain&amp;type=all&amp;language=&amp;sort=">integrations for LangChain</a> are available for <a href="https://go.dev/">Go</a>, <a href="https://cloud.google.com/java/">Java</a> and Javascript</li>
<li style="font-weight:400;">Each package supports Vector stores for semantic search of databases, Chat message history to enable chains to recall previous conversations and document loader for loading documents from your enterprise data. </li>
</ul>
<h2>Azure</h2>
<p>1:04:20 <a href="https://azure.microsoft.com/en-us/blog/unveiling-gpt-image-1-rising-to-new-heights-with-image-generation-in-azure-ai-foundry/">Unveiling GPT-image-1: Rising to new heights with image generation in </a><a href="https://azure.microsoft.com/en-us/blog/unveiling-gpt-image-1-rising-to-new-heights-with-image-generation-in-azure-ai-foundry/">Azure AI Foundry</a></p>
<ul>
<li style="font-weight:400;">We get it. You’re excited. </li>
<li style="font-weight:400;">Microsoft is thrilled to announce the launch of <a href="https://ai.azure.com/explore/models">GPT-image-1</a>, the latest and most advanced image generation model.  </li>
<li style="font-weight:400;">Our API is available now to all gated customers: <a href="https://aka.ms/oai/gptimage1access">limited access model application</a>, and playground is coming early next week.  </li>
<li style="font-weight:400;">This groundbreaking model sets a new standard in generating high-quality images, solving complex prompts and offering zero-shot capabilities in various scenarios. 
<ul>
<li style="font-weight:400;">Granular Instruction Response</li>
<li style="font-weight:400;">Text Rendering</li>
<li style="font-weight:400;">Image Input Acceptance</li>
</ul>
</li>
<li style="font-weight:400;">GPT image 1 supports multiple modalities:
<ul>
<li style="font-weight:400;">Text-to-image</li>
<li style="font-weight:400;">Image-to-image</li>
<li style="font-weight:400;">Text transformation</li>
<li style="font-weight:400;">Inpainting</li>
</ul>
</li>
</ul>
<p>1:06:16 <a href="https://www.microsoft.com/en-us/windows-server/blog/2025/04/24/tired-of-all-the-restarts-get-hotpatching-for-windows-server/">Tired of all the restarts? Get hotpatching for Windows Server</a></p>
<ul>
<li style="font-weight:400;"><a href="https://techcommunity.microsoft.com/blog/windowsservernewsandbestpractices/windows-server-hotpatching-is-here/3174930">Hotpatching for Windows Server 2025</a>, made available in preview in 2024, will become generally available as a subscription service on July 1st, 2025 (because you’re not already paying for the Microsoft licensing.)  </li>
<li style="font-weight:400;">One of the key updates in the latest release of Windows Server 2025 is the addition of hybrid and multi cloud capabilities, aligned with Azure’s adaptive cloud approach. </li>
<li style="font-weight:400;">Hotpatching, we are taking what was previously an Azure-only capability and now making it available to Windows Server machines outside of Azure through <a href="https://learn.microsoft.com/en-us/azure/azure-arc/">Azure Arc</a>.  </li>
<li style="font-weight:400;">Hotpatching is a new way to install <a href="https://www.microsoft.com/en-us/evalcenter/evaluate-windows-server-2025">Windows Server 2025</a> updates that does not require a reboot after installation, by patching the in-memory code of running processes without need to restart the process</li>
<li style="font-weight:400;">Some of the benefits of hotpatching include the following:
<ul>
<li style="font-weight:400;">Higher availability with fewer reboots</li>
<li style="font-weight:400;">Faster deployment of updates as the packages are smaller, install faster, and have easier patch orchestration with Azure Update Manager</li>
<li style="font-weight:400;">Hotpatch packages install without the need to schedule a reboot, so they can happen sooner. This can decrease the window of vulnerability which can result if an administrator normally delaying an update and restart after a Windows security update is released. </li>
</ul>
</li>
<li style="font-weight:400;">Hotpatching is available at no charge to preview now, but starting in July with the subscription launch, hotpatching for Windows Server 2025 will be offered at a subscription of $1.50 per CPU core per month. </li>
<li style="font-weight:400;">To make this work, though, the service must be connected to Azure Arc.</li>
</ul>
<p>1:07:57  Ryan – “I hope that there’s a technical reason, because it feels like a cash grab. On one hand, I get it – they’re solving operational problems they have by managing their workloads on Azure, and this is an enhancement that comes directly out of managing servers with that scale, which is fantastic. The fact that they put it as a subscription on Arc makes me feel a little dirty about it.”</p>
<p>1:13:53 <a href="https://techcommunity.microsoft.com/blog/azureconfidentialcomputingblog/announcing-preview-for-the-next-generation-of-azure-intel%C2%AE-tdx-confidential-vms/4404625">Announcing preview for the next generation of Azure Intel® TDX </a><a href="https://techcommunity.microsoft.com/blog/azureconfidentialcomputingblog/announcing-preview-for-the-next-generation-of-azure-intel%C2%AE-tdx-confidential-vms/4404625">Confidential VMs</a></p>
<ul>
<li style="font-weight:400;">Azure is announcing the preview of their next generation of confidential VM’s powered by the 5th gen <a href="https://www.intel.com/content/www/us/en/products/details/processors/xeon.html">Intel Xeon processor</a> (Emerald Rapids) with <a href="https://www.intel.com/content/www/us/en/products/docs/accelerator-engines/trust-domain-extensions.html">Intel Trust Domain Extensions</a> (TDX).  </li>
<li style="font-weight:400;">This enables organizations to bring confidential workloads to the cloud without code changes to applications. The supported SKUs include the general purpose DCesv6-series and the memory optimized ECesv6-series.  </li>
<li style="font-weight:400;">Confidential VM’s are designed for tenants with <a href="https://www.intel.com/content/www/us/en/developer/articles/technical/intel-trust-domain-extensions.html">high security</a> and confidentiality requirements, providing a strong, attestable, hardware-enforced boundary. </li>
<li style="font-weight:400;">They ensure that your data and applications stay private and encrypted even while in use, keeping your sensitive code and other data encrypted in memory during processing.</li>
</ul>
<p>1:17:09 <a href="https://techcommunity.microsoft.com/blog/appsonazureblog/announcing-public-preview-of-larger-container-sizes-on-azure-container-instances/4394587">Announcing Public Preview of Larger Container Sizes on Azure Container </a><a href="https://techcommunity.microsoft.com/blog/appsonazureblog/announcing-public-preview-of-larger-container-sizes-on-azure-container-instances/4394587">Instances</a></p>
<ul>
<li style="font-weight:400;">Azure is announcing the preview of larger container sizes of <a href="https://learn.microsoft.com/en-us/azure/container-instances/">Azure Container Instances</a>.  </li>
<li style="font-weight:400;">Customers can now deploy workloads with higher vCPU and memory for standard containers, confidential containers, containers with virtual networks, and containers utilizing virtual nodes to connect to <a href="https://learn.microsoft.com/en-us/azure/aks/">AKS</a>.  </li>
<li style="font-weight:400;">ACI now supports vCPU counts greater than 4 and memory capacities greater than 16, with the new maximum being 32 vCPU and 256gb for standard containers and 32vcpu and 192gb of confidential containers </li>
</ul>
<p>1:18:09  Ryan – “I’m just surprised they got away with it for as long as they did. Because I went on the same journey you did, which was to point and laugh – they only have four? Cause I’ve never seen a workload need more than four CPUs, but everyone asked for more than four.”</p>
<h2>Other Clouds</h2>
<p>1:19:47 <a href="https://www.digitalocean.com/blog/introducing-managed-valkey">Introducing DigitalOcean Managed Caching for Valkey, The New Evolution </a> <a href="https://www.digitalocean.com/blog/introducing-managed-valkey">of Managed Caching</a> </p>
<ul>
<li style="font-weight:400;"><a href="https://www.bing.com/aclk?ld=e8qdAcSFhyfm4N-wTWyQG7_DVUCUxJ2C54uIxzc0N44Q90Uz14q-ZCWwD7euvXX6Kq8cEcERPHGE7AVCuEWnhwD1IFYjMmgrYKhw-BVAO4P3-axYKPu7v7caP3ar3Bulggfw481rvGT-cxYGVK91kgq20wGBrzszkTxXTdMFYfjQQELPchzqWWlkp0JygeZFwK_uAPMQ&amp;u=aHR0cHMlM2ElMmYlMmZ3d3cuZGlnaXRhbG9jZWFuLmNvbSUyZnByb2R1Y3RzJTJmYWktbWwlMmYxLWNsaWNrLW1vZGVscyUzZnV0bV9hZGdyb3VwJTNkJTI2dXRtX2tleXdvcmQlM2R3d3cuZGlnaXRhbG9jZWFuLmNvbSUyNnV0bV9tYXRjaHR5cGUlM2RiJTI2dXRtX2FkcG9zaXRpb24lM2QlMjZ1dG1fY3JlYXRpdmUlM2QlMjZ1dG1fcGxhY2VtZW50JTNkJTI2dXRtX2RldmljZSUzZGMlMjZ1dG1fbG9jYXRpb25pJTNkJTI2dXRtX2xvY2F0aW9uJTNkODA3MTQlMjZtc2Nsa2lkJTNkNWQ2MDgxMWQ3NTU1MThjOGU3MzdlOTA3ZGYzOGE4OTclMjZ1dG1fc291cmNlJTNkYmluZyUyNnV0bV9tZWRpdW0lM2RjcGMlMjZ1dG1fY2FtcGFpZ24lM2RwbWF4X25iX2h1Z2dpbmdmYWNlX2dlbmVyaWNrZXl3b3Jkc191c2FfZW4lMjZ1dG1fdGVybSUzZHd3dy5kaWdpdGFsb2NlYW4uY29tJTI2dXRtX2NvbnRlbnQlM2RHZW5lcmljJTI1MjBLZXl3b3Jkcw&amp;rlid=5d60811d755518c8e737e907df38a897">Digital Ocean</a> has launched a managed caching for <a href="https://valkey.io/">Valkey</a> offering, which is their managed database service that seamlessly replaces Managed Caching (previous <a href="https://azure.microsoft.com/en-us/products/managed-redis/?msockid=0a813a601d44649024a22fa21caa6502">Managed Redis</a>). </li>
<li style="font-weight:400;">The offering is compatible with <a href="https://valkey.io/blog/valkey-8-0-0-rc1/">Valkey 8.0</a> and <a href="https://redis.io/docs/latest/operate/rs/release-notes/rs-7-2-4-releases/">Redis 7.2.4</a> and is meant to be a drop in replacement for their managed caching database service while offering enhanced functionality for fast and efficient data storage. </li>
</ul>
<p>1:20:11  Ryan – “I like to hear DigitalOcean coming up with these managed services. And so if you have a workload on DigitalOcean you don’t have to manage your own service offering on compute. You can take advantage of these things. It’s great. I’d like to see more competition in this marketplace.”</p>
<h2>Cloud Journey</h2>
<p>1:20:50 <a href="https://www.businessinsider.com/amazon-strategy-overcome-gpu-shortages-nvidia-2025-4">‘Project Greenland’: How Amazon Overcame a GPU Crunch</a></p>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;">Interesting project Amazon is working on related to AI chip crunch.</li>
<li style="font-weight:400;">Amazon retail business had a big problem, it couldn’t get enough GPU’s to power its crucial inference/training workloads. </li>
<li style="font-weight:400;">With projects hitting delays, Amazon revamped internal processes and technology to solve the problem.</li>
<li style="font-weight:400;">The solution was <a href="https://www.projectgreenland.com/">Project Greenland</a>, a centralized GPU capacity pool to better manage and allocate its limited GPU supply. </li>
<li style="font-weight:400;">GPU’s are too valuable to be given out on a first come, first serve basis.  Instead, distribution should be determined based on ROI layered with common sense considerations and provide for long-term growth of the company’s free cash flow” per internal guidelines.</li>
<li style="font-weight:400;">Two years since the shortage began, GPU’s remain scarce, but Amazon’s efforts to tackle the problem may be paying off, with internal forecasts suggesting the crunch would ease this year with chip availability expected to improve</li>
</ul>
</li>
</ul>
<ul>
<li style="font-weight:400;">“Amazon has ample GPU capacity to continue innovating for our retail business and other customers across the company,” the spokesperson said. “AWS recognized early on that generative AI innovations are fueling rapid adoption of cloud computing services for all our customers, including Amazon, and we quickly evaluated our customers’ growing GPU needs and took steps to deliver the capacity they need to drive innovation.”</li>
<li style="font-weight:400;">Amazon demands hard data and return on investment proof for all internal GPU requests. </li>
<li style="font-weight:400;">Initiatives are prioritized and ranked for GPU allocation based on several factors, including the completeness of data provided and the financial benefit per GPU.  Projects must be shovel-ready, or approved for development, and prove they are competitive in the race to market.  They also must provide a timeline for when benefits are expected to be realized. </li>
<li style="font-weight:400;">If your system doesn’t provide the return on investment the GPU’s are redistributed to the next project/program.</li>
<li style="font-weight:400;">They codified this process into official “tenets” or internal guidelines that individual teams or projects create for faster decision making.  The tenets emphasize a strong return on investment, selective approvals and push for speed and efficiency. 
<ol>
<li style="font-weight:400;">ROl + High Judgment thinking is required for GPU usage prioritization. GPUs are too valuable to be given out on a first-come, first-served basis. Instead, distribution should be determined based on ROl layered with common sense considerations, and provide for the long-term growth of the Company’s free cash flow. Distribution can happen in bespoke infrastructure or in hours of a sharing/pooling tool.</li>
<li style="font-weight:400;">Continuously learn, assess, and improve: We solicit new ideas based on continuous review and are willing to improve our approach as we learn more.</li>
<li style="font-weight:400;">Avoid silo decisions: Avoid making decisions in isolation; instead, centralize the tracking of GPUs and GPU related initiatives in one place.</li>
<li style="font-weight:400;">Time is critical: Scalable tooling is a key to moving fast when making distribution decisions which, in turn, allows more time for innovation and learning from our experiences.</li>
<li style="font-weight:400;">Efficiency feeds innovation: Efficiency paves the way for innovation by encouraging optimal resource utilization, fostering collaboration and resource sharing.</li>
<li style="font-weight:400;">Embrace risk in the pursuit of innovation: Acceptable level of risk tolerance will allow to embrace the idea of ‘failing fast’ and maintain an environment conducive to Research and Development.</li>
<li style="font-weight:400;">Transparency and confidentiality: We encourage transparency around the GPU allocation methodology through education and updates on the wiki’s while applying confidentiality around sensitive information on R&amp;D and ROI shareable with only limited stakeholders. We celebrate wins and share lessons learned broadly.</li>
<li style="font-weight:400;">GPUs previously given to fleets may be recalled if other initiatives show more value. Having a GPU doesn’t mean you’ll get to keep it.</li>
</ol>
</li>
<li style="font-weight:400;">To manage all of this they built project greenland.  Its described as a centralized GPU orchestration platform to share GPU capacity across teams and maximize utilization. </li>
<li style="font-weight:400;">It can track GPU usage per initiative, share idle servers and implement clawbacks to reallocate chips to more urgent projects. </li>
<li style="font-weight:400;">The system also simplifies networking setup and security updates, while alerting employees and leaders to projects with low GPU usage. </li>
</ul>
<h2>Closing</h2>
<p>And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</p>]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2027382/c1e-dd5dfm78dmt3np46-5zxr7105u1o6-hysjlb.mp3" length="111215296"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 302 of The Cloud Pod – where the forecast is always cloudy! This week Justin and Ryan are on hand to bring you all the latest in Cloud (and AI news.) We’ve got hotpatching, Project Greenland, and a rollback of GPT-4.o, which sort of makes us sad – and our egos are definitely less stroked. Plus Saas, containers, and outposts – all of this and more. Thanks for joining us in the cloud! 
Titles we almost went with this week:

The Cloud Pod was never accused of being sycophantic
2nd Gen outposts!?! I didn’t even know anyone was using Gen 1
AWS Outposts 2nd Gen… not with AI (GASP)
If you’re doing SaaS wrong, Google & AWS have your back this week with new Features 
Patching, so hot right now
Larger container sizes for Azure….  You don’t say
AWS Green reporting detects hotspots… surprisingly close to Maryland…..
Visual pipeline for Opensearch… I want to like this… but I just can’t

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our Slack channel for more info. 
General News 
01:37 Sharing new DORA research for gen AI in software development

The DORA team at Google has released a new report, “Impact of Generative AI In Software Development.” The report is based on data and developer interviews, and the report aims to move beyond hype to offer a proper perspective on AI’s impact on individuals, teams and organizations. 
Click on the link in our show notes to access the full report. However, Google has highlighted a few key points in the blog post.
AI is Real – A staggering 89% of organizations are prioritizing the integration of AI into their applications, and 76% of technologists are already using AI in some part of their daily work. 
Productivity gains confirmed: Developers using Gen AI report significant increases in flow, productivity, and job satisfaction.  For instance, a 25% increase in AI adoption is associated with a 2.1% increase in individual productivity.
Organization benefits are tangible: Beyond individual gains, Dora found strong correlations between AI adoption and improvements in crucial organizational metrics. A 25% increase in AI adoption is associated with increases in document quality, code quality, code review speeds and approval speeds. 
If you are looking to utilize AI in your development organization, they provide five practical approaches for both leaders and practitioners.

Have transparent communications
Empower developers with learning and experimentation
Establish clear policies
Rethink performance metrics
Embrace fast feedback loops



045:06  Ryan – “Those are really good approaches, but really difficult to implement in practice. You know, in my day job, watching the company struggle to get a handle on AI from all the different angles you need to, from data protection, legal liability – just operationally – it’s very hard. So I think having a mature program where you’re rolling that out with intent and being very specific with your AI tasks I think will go a long way with a lot of companies.”  
AI Is Going Great – Or How ML Makes Its Money 
08...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2027382/c1a-k5d5-wwxn160jtx12-wy8dqo.jpg"></itunes:image>
                                                                            <itunes:duration>01:32:41</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2027382/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[301: The Cloud Pod PartyRocks in the House Tonight]]>
                </title>
                <pubDate>Fri, 02 May 2025 04:37:47 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2023544</guid>
                                    <link>https://tcpfm.castos.com/episodes/301-the-cloud-pod-partyrocks-in-the-house-tonight-1</link>
                                <description>
                                            <![CDATA[
<h3>Chapters</h3>
<ul><li>(00:00:00) - The Cloud Pod</li><li>(00:01:07) - How to Write a 300-Episode Recap With AI</li><li>(00:03:05) - We're Turning 300 Episodes Down</li><li>(00:07:39) - Reinventing: The Future of the Podcast</li><li>(00:13:52) - Google Wins Antitrust Case vs. DOJ</li><li>(00:21:18) - Google's Proposal for the Antitrust Case</li><li>(00:24:29) - OpenAI Launches OpenAI03 and O4 Mini</li><li>(00:30:27) - GPT 4.1 and 4.0 Mini</li><li>(00:34:25) - GitHub Cloud and Copilot Announcements</li><li>(00:35:37) - Copilot for Business vs. Personal: Should You Buy Pro+</li><li>(00:38:50) - Amazon VPC Route Server</li><li>(00:42:32) - AWS Security Reference Architecture Code Examples for Generative AI</li><li>(00:45:08) - Amazon Nova Sonic: New Gen AI Model for Voice-enabled Applications</li><li>(00:46:55) - Thank You or No Thank You?</li><li>(00:47:40) - Novasonic's Nova Real 1.1 security video</li><li>(00:51:03) - Amazon AWS S3 Express 1 Zone Price Cut</li><li>(00:53:07) -  AWS STS now automatically serves all requests to the global endpoint in</li><li>(00:56:13) - Gemini Cloud Assist: Spring Cleaning with FinOps Hub</li><li>(00:58:23) - Google's New VM Store for Valkey</li><li>(00:59:36) - Microsoft releases new capabilities to Azure AI</li><li>(01:01:15) - Azure Storage Driver Update & New Capabilities for AI</li><li>(01:02:06) - Llama 4 models now available in Azure AI</li><li>(01:03:31) - Microsoft Azure OpenAI: GPT 4.1, 4.</li><li>(01:04:43) - Copilot in Azure Announces General Availability</li><li>(01:06:41) - Azure Cloud's Hybrid Connection Manager in Public Preview</li><li>(01:07:38) - One-Bit AI Models Won't Need Supercomputers</li><li>(01:09:46) - Microsoft's SQL Server Migration to hyperscale</li><li>(01:12:42) - Oracle: My Public Cloud Was Hacked</li><li>(01:14:49) - Oracle's PR for the Hacking</li><li>(01:18:20) - A Week in the Cloud</li><li>(01:18:57) - Week in Cloud: Cloud Apps</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[301: The Cloud Pod PartyRocks in the House Tonight]]>
                </itunes:title>
                                    <itunes:episode>301</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2023544/c1e-dd5dfm7j79f3np46-ndnr4pxmfg5g-grkbsw.mp3" length="95405056"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2023544/c1a-k5d5-xxok8zd9um5g-zosqrq.jpg"></itunes:image>
                                                                            <itunes:duration>01:19:31</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2023544/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[300: The Next Chapter: How Google’s Next-Level Next Event Nexted All Our Next Expectations – and What’s Next Now That Next Is Past]]>
                </title>
                <pubDate>Thu, 17 Apr 2025 17:34:34 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2015419</guid>
                                    <link>https://tcpfm.castos.com/episodes/300-google-next-results</link>
                                <description>
                                            <![CDATA[<p><span style="font-weight:400;">Welcome to episode 300 of The Cloud Pod – where the forecast is always cloudy! According to the title, this week’s show is taking place inside of a Dr. Suess book, but don’t despair – we’re not going to make you eat green eggs and ham, but we WILL give you the low down on all things Vegas. Well, Google’s Next event which recently took place in Vegas anyway. Did you make any Next predictions? </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">☁️This is the CLOUDPOD Episode 300</span></li>
<li><span style="font-weight:400;">️Tonight we dine in the Cloud</span></li>
<li><span style="font-weight:400;">The Next Chapter</span></li>
<li><span style="font-weight:400;">Now in Preview: Episode 300</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>GCP</b></h2>
<p><b>Pre-Next</b></p>
<p><b>02:35 </b><a href="https://arstechnica.com/gadgets/2025/04/google-shakes-up-gemini-leadership-google-labs-head-taking-the-reins/" target="_blank" rel="noreferrer noopener"><b>Google shakes up Gemini leadership, Google Labs head taking the reins</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">There was a lot of Gemini news at Next – but we’ll get to all that. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In this particular case, there’s an employee shakeup. </span><span style="font-weight:400;">Sissie Hsiao is stepping down from leading the Google team, and is being replaced by Josh Woodward, who is currently leading the Google Labs. </span></li>
</ul>
<p><b>04:35 </b><a href="https://cloud.google.com/blog/products/storage-data-transfer/filestore-instance-replication-now-available/" target="_blank" rel="noreferrer noopener"><b>Filestore instance replication now available</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">GCP says customers have been asking for help in meeting business and regulatory goals, and so they are releasing Filestore instance replication.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This new feature offers an efficient replication point objective (RPO) that can reach 30 minutes for data change rates of 100 MB/sec.</span></li>
</ul>
<p><b>05:16 </b><a href="https://cloud.google.com/blog/products/containers-kubernetes/multi-cluster-orchestrator-for-cross-region-kubernetes-workloads/" target="_blank" rel="noreferrer noopener"><b>Multi-Cluster Orchestrator for cross-region Kubernetes workloads</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">The public preview of Multi-Cluster Orchestrator was recently announced.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This </span><span style="font-weight:400;">lets platform and application teams optimize resource utilization, enhance application resilience, and accelerate innovation in complex, multi-cluster environments.</span><span style="font-weight:400;"> </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The need for effective multi-cluster management has become essential as organizations increasingly use Kubernetes to deploy and manage their applications; Challenges such as resource scarcity, ensuring high availability, and managing deployments across diverse environments create significant operational overhead.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Multi-Cluster Orchestrator addresses these challenges by providing a centralized orchestration layer that abstracts away the complexities of underlying Kubernetes infrastructure matching workloads with capacity across regions.</span>...</li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:00) - The Cloud Pod: Episode 300</li><li>(00:00:38) - Ryan's Absence at CES 2017</li><li>(00:01:53) - Episode 300</li><li>(00:02:30) - Google Shuffles Up Their Gemini Team</li><li>(00:05:08) - GKE: Multi Cluster Orchestrator for Kubernetes</li><li>(00:09:37) - Google I/O 2019: The Conference Schedule</li><li>(00:12:22) - The Wizard of Oz Event at Google's Sphere</li><li>(00:15:01) - The Wizard of Oz Movie Made With AI</li><li>(00:18:56) - The Wizard of Oz: The Sphere</li><li>(00:20:24) - Day 1, keynote</li><li>(00:20:49) - Google Cloud Next: The First Google TPU for Inference &</li><li>(00:25:33) - Google Agent Spaces: Unified Enterprise Search and Intelligence</li><li>(00:31:38) - Google's Video, Speech and Music, Generative AI</li><li>(00:35:42) - Inferring with AWS' GKE</li><li>(00:38:33) - Python's AI Agent Development Kit</li><li>(00:43:13) - Agent to Agent</li><li>(00:47:52) - Google Cloud Keynote</li><li>(00:51:18) - A Day in the Life</li><li>(00:51:38) - Gemini Cloud Conference 2018: Small Announcements</li><li>(00:56:52) - Google Cloud Network: Cross-Cloud Interconnect</li><li>(01:02:14) - Google Cloud Storage Pools: More Storage, More Intelligence</li><li>(01:03:01) - Migration from SQL Server to PostgreSQL using DMS</li><li>(01:06:42) - Google Next: Predicting The Winners</li><li>(01:09:08) - Microsoft's Ignite Conference Recap & More</li><li>(01:13:04) - AI Conference Prediction</li><li>(01:16:06) - Google Next: A Year 2 in Vegas</li><li>(01:18:14) - Black Mirror: The First Season Review</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 300 of The Cloud Pod – where the forecast is always cloudy! According to the title, this week’s show is taking place inside of a Dr. Suess book, but don’t despair – we’re not going to make you eat green eggs and ham, but we WILL give you the low down on all things Vegas. Well, Google’s Next event which recently took place in Vegas anyway. Did you make any Next predictions? 
Titles we almost went with this week:

☁️This is the CLOUDPOD Episode 300
️Tonight we dine in the Cloud
The Next Chapter
Now in Preview: Episode 300

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
GCP
Pre-Next
02:35 Google shakes up Gemini leadership, Google Labs head taking the reins 

There was a lot of Gemini news at Next – but we’ll get to all that. 
In this particular case, there’s an employee shakeup. Sissie Hsiao is stepping down from leading the Google team, and is being replaced by Josh Woodward, who is currently leading the Google Labs. 

04:35 Filestore instance replication now available

GCP says customers have been asking for help in meeting business and regulatory goals, and so they are releasing Filestore instance replication.
This new feature offers an efficient replication point objective (RPO) that can reach 30 minutes for data change rates of 100 MB/sec.

05:16 Multi-Cluster Orchestrator for cross-region Kubernetes workloads

The public preview of Multi-Cluster Orchestrator was recently announced.
This lets platform and application teams optimize resource utilization, enhance application resilience, and accelerate innovation in complex, multi-cluster environments. 
The need for effective multi-cluster management has become essential as organizations increasingly use Kubernetes to deploy and manage their applications; Challenges such as resource scarcity, ensuring high availability, and managing deployments across diverse environments create significant operational overhead.
Multi-Cluster Orchestrator addresses these challenges by providing a centralized orchestration layer that abstracts away the complexities of underlying Kubernetes infrastructure matching workloads with capacity across regions....]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[300: The Next Chapter: How Google’s Next-Level Next Event Nexted All Our Next Expectations – and What’s Next Now That Next Is Past]]>
                </itunes:title>
                                    <itunes:episode>300</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p><span style="font-weight:400;">Welcome to episode 300 of The Cloud Pod – where the forecast is always cloudy! According to the title, this week’s show is taking place inside of a Dr. Suess book, but don’t despair – we’re not going to make you eat green eggs and ham, but we WILL give you the low down on all things Vegas. Well, Google’s Next event which recently took place in Vegas anyway. Did you make any Next predictions? </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">☁️This is the CLOUDPOD Episode 300</span></li>
<li><span style="font-weight:400;">️Tonight we dine in the Cloud</span></li>
<li><span style="font-weight:400;">The Next Chapter</span></li>
<li><span style="font-weight:400;">Now in Preview: Episode 300</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>GCP</b></h2>
<p><b>Pre-Next</b></p>
<p><b>02:35 </b><a href="https://arstechnica.com/gadgets/2025/04/google-shakes-up-gemini-leadership-google-labs-head-taking-the-reins/" target="_blank" rel="noreferrer noopener"><b>Google shakes up Gemini leadership, Google Labs head taking the reins</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">There was a lot of Gemini news at Next – but we’ll get to all that. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In this particular case, there’s an employee shakeup. </span><span style="font-weight:400;">Sissie Hsiao is stepping down from leading the Google team, and is being replaced by Josh Woodward, who is currently leading the Google Labs. </span></li>
</ul>
<p><b>04:35 </b><a href="https://cloud.google.com/blog/products/storage-data-transfer/filestore-instance-replication-now-available/" target="_blank" rel="noreferrer noopener"><b>Filestore instance replication now available</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">GCP says customers have been asking for help in meeting business and regulatory goals, and so they are releasing Filestore instance replication.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This new feature offers an efficient replication point objective (RPO) that can reach 30 minutes for data change rates of 100 MB/sec.</span></li>
</ul>
<p><b>05:16 </b><a href="https://cloud.google.com/blog/products/containers-kubernetes/multi-cluster-orchestrator-for-cross-region-kubernetes-workloads/" target="_blank" rel="noreferrer noopener"><b>Multi-Cluster Orchestrator for cross-region Kubernetes workloads</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">The public preview of Multi-Cluster Orchestrator was recently announced.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This </span><span style="font-weight:400;">lets platform and application teams optimize resource utilization, enhance application resilience, and accelerate innovation in complex, multi-cluster environments.</span><span style="font-weight:400;"> </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The need for effective multi-cluster management has become essential as organizations increasingly use Kubernetes to deploy and manage their applications; Challenges such as resource scarcity, ensuring high availability, and managing deployments across diverse environments create significant operational overhead.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Multi-Cluster Orchestrator addresses these challenges by providing a centralized orchestration layer that abstracts away the complexities of underlying Kubernetes infrastructure matching workloads with capacity across regions.</span></li>
</ul>
<p><b>06:26</b> <a href="https://cloud.google.com/blog/products/containers-kubernetes/benchmarking-a-65000-node-gke-cluster-with-ai-workloads/" target="_blank" rel="noreferrer noopener"><b>GKE at 65,000 nodes: Evaluating performance for simulated mixed AI workloads</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Recently GKE announced it can now support up to 65,000 nodes (up from 15,000.) </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Saint Carrie be with your CFO. </span></li>
</ul>
<p><b>09:15</b> <a href="https://blog.google/products/gemini/how-we-built-gemini-robotics/" target="_blank" rel="noreferrer noopener"><b>How we built the new family of Gemini Robotics models</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Worried about Skynet taking over? Now is the time to check out these articles so you can learn about our robot overlord’s weaknesses. </span></li>
</ul>
<p><b>09:58 Tuesday Night</b></p>
<p><span style="font-weight:400;">Was anyone else weirded out by the scheduling? Did any listeners actually stay until the end on Friday? If you, we’d love to hear from you. </span></p>
<p><b>13:30</b> <a href="https://blog.google/products/google-cloud/sphere-wizard-of-oz/" target="_blank" rel="noreferrer noopener"><b>The AI magic behind Sphere’s upcoming ‘The Wizard of Oz’ experience</b></a><b> </b></p>
<p><a href="https://www.youtube.com/watch?v=f01dsTigSmw" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">https://www.youtube.com/watch?v=f01dsTigSmw</span></a><span style="font-weight:400;"> </span></p>
<p><span style="font-weight:400;">*Show note writer Heather is a curator at a museum that showcases Hollywood’s early history – and is VERY interested to see how the film world feels about this AI rebuilding of such a beloved classic. Some interesting discussions are definitely coming! </span></p>
<p><b>21:11</b> <b>Next Day 1 Keynote</b></p>
<p><a href="https://blog.google/products/google-cloud/ironwood-tpu-age-of-inference/" target="_blank" rel="noreferrer noopener"><b>Ironwood: The first Google TPU for the age of inference</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">A seventh-generation Tensor Processing Unit (TPU) — our most performant and scalable custom AI accelerator to date, and the first designed specifically for inference.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It is one of several new components of </span><a href="https://cloud.google.com/blog/products/compute/whats-new-with-ai-hypercomputer" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Google Cloud AI Hypercomputer</span></a><span style="font-weight:400;"> architecture, which optimizes hardware and software together for the most demanding AI workloads. With Ironwood, developers can also leverage Google’s own </span><a href="https://blog.google/technology/ai/introducing-pathways-next-generation-ai-architecture/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Pathways</span></a><span style="font-weight:400;"> software stack to reliably and easily harness the combined computing power of tens of thousands of Ironwood TPUs.</span></li>
</ul>
<p><i><span style="font-weight:400;">24:30  Ryan – “</span></i><span style="font-weight:400;">So I was sort of surprised because they did spend a lot of time talking about inference and this chip handling inference concerns. I thought that was real. I mean, it’s just not the way that we’ve been talking about these custom AI chips in the past, right? It’s definitely been all about model training and building all these things. And the inference is more about running these very large models. And so there did seem to be a huge focus on performance and end user experience with AI development all the way through the conference.”</span></p>
<p><a href="https://blog.google/products/workspace/cloud-next-2025-workspace-gemini/" target="_blank" rel="noreferrer noopener"><b>Google Workspace adds new AI tools to Docs, Sheets, Chat and more.</b></a></p>
<p><b>26:04 </b> <a href="https://cloud.google.com/blog/products/ai-machine-learning/google-agentspace-enables-the-agent-driven-enterprise?e=48754805" target="_blank" rel="noreferrer noopener"><b>Google Agentspace enables the agent-driven enterprise</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google says that in order to scale, businesses need AI-ready information ecosystems, and that’s why they’re launching Google Agentspace. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This product puts the latest Google foundation models, powerful agents, and actionable enterprise knowledge in the hands of employees. With Agentspace, employees and agents can find information from across their organization, synthesize and understand it with Gemini’s multimodal intelligence, and act on it with AI agents. </span></li>
</ul>
<p><i><span style="font-weight:400;">27:24  Ryan – “Well, so it SEEMS really cool, until you get through the hard edges…a lot of it really relies on your utilization of Chrome Enterprise Premium, and so that’s a whole workspace ecosystem that if you’re not bought into you’ve got a whole lot of heavy lifting to make that work.” </span></i></p>
<p><b>32:18 </b><a href="https://blog.google/products/google-cloud/cloud-next-gen-ai-vertex-ai-updates/" target="_blank" rel="noreferrer noopener"><b>New video, image, speech and music generative AI tools are coming to </b></a><a href="https://blog.google/products/google-cloud/cloud-next-gen-ai-vertex-ai-updates/" target="_blank" rel="noreferrer noopener"><b>Vertex AI.</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google’s new text to music AI offering, Lyria, is now in private preview. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">This means customers can generate complete, production-ready assets starting with a text prompt.</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Veo 2 has new editing and camera controls features available in preview with allowlist that help enterprise customers refine and repurpose video content with precision.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Chirp 3 now includes Instant Custom Voice, a new way to create custom voices with just 10 seconds of audio input.</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Hopefully soon we’ll be ready to present our new and improved Jonathan with the help of Veo 2. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Imagen 3 has improved image generation and inpainting capabilities for reconstructing missing or damaged portions of an image and is making object removal edits even higher quality.</span></li>
</ul>
<p><a href="https://blog.google/products/google-cloud/cloud-next-ai-hypercomputer-updates/" target="_blank" rel="noreferrer noopener"><b>AI Hypercomputer updates from Google Cloud Next 25</b></a></p>
<p><b>36:46</b> <a href="https://google.github.io/adk-docs/" target="_blank" rel="noreferrer noopener"><b>Agent Development Kit</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">This open-source framework simplifies the process of building sophisticated multi-agent systems while maintaining precise control over agent behavior. Agent Development Kit supports the </span><a href="https://modelcontextprotocol.io/introduction" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Model Context Protocol </span></a><span style="font-weight:400;">(MCP) which provides a unified way for AI models to access and interact with various data sources and tools, rather than requiring custom integrations for each. </span></li>
</ul>
<p><b>43:30</b> <a href="https://developers.googleblog.com/en/a2a-a-new-era-of-agent-interoperability/" target="_blank" rel="noreferrer noopener"><b>Agent 2 Agent (A2A)</b></a> <span style="font-weight:400;">   </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">We’re proud to be the first hyperscaler to create an open </span><a href="https://developers.googleblog.com/en/a2a-a-new-era-of-agent-interoperability/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Agent2Agent protocol</span></a><span style="font-weight:400;"> to help enterprises support multi-agent ecosystems, so agents can communicate with each other, regardless of the underlying framework or model. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">More than 50 partners, including Accenture, Box, Deloitte, Salesforce, SAP, ServiceNow, and TCS are actively contributing to defining this protocol, representing a shared vision of multi-agent systems. </span></li>
</ul>
<p><b>Google Unified Security</b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">This solution brings together our visibility, threat detection, AI powered security operations, continuous virtual red-teaming, the most trusted enterprise browser, and Mandiant expertise — in one converged security solution running on a planet-scale data fabric. </span></li>
</ul>
<p><a href="https://cloud.google.com/blog/products/networking/connect-globally-with-cloud-wan-for-the-ai-era?e=48754805" target="_blank" rel="noreferrer noopener"><b>Cloud WAN</b></a></p>
<p><span style="font-weight:400;">Meta Llama 4</span></p>
<p><b>Other Day 1 items:</b></p>
<p><a href="https://blog.google/technology/research/geospatial-reasoning/" target="_blank" rel="noreferrer noopener"><b>We’re introducing a new way to analyze geospatial data.</b></a></p>
<p><b>48:20 Next Day 2 Keynote</b></p>
<p><b>52:16 Vertex AI Agent Engine</b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Vertex AI Agent Engine (formerly known as LangChain on Vertex AI or Vertex AI Reasoning Engine) is a fully managed Google Cloud service enabling developers to deploy, manage, and scale AI agents in production. Agent Engine handles the infrastructure to scale agents in production so you can focus on creating intelligent and impactful applications. Vertex AI Agent Engine offers:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Fully managed: Deploy and scale agents with a managed runtime that provides robust security features including VPC-SC compliance and comprehensive end-to-end management capabilities. Gain CRUD access to multi-agent applications that use Google Cloud Trace (supporting OpenTelemetry) for performance monitoring and </span><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/agent-engine/manage/tracing" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">tracing</span></a><span style="font-weight:400;">. To learn more, see </span><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/agent-engine/deploy" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">deploy an agent</span></a><span style="font-weight:400;">.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Quality and evaluation: Ensure agent quality with the integrated </span><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/agent-engine/evaluate" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Gen AI Evaluation service</span></a><span style="font-weight:400;">.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Simplified development: Vertex AI Agent Engine abstracts away low-level tasks such as application server development and configuration of authentication and IAM, allowing you to focus on the unique capabilities of your agent, such as its behavior, tools, and model parameters. Furthermore, your agents can use any of the models and tools, such as </span><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/multimodal/function-calling" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">function calling</span></a><span style="font-weight:400;">, in Vertex AI.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Framework agnostic: Enjoy flexibility when deploying agents that you build using different python frameworks including </span><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/agent-engine/develop/adk" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Agent Development Kit</span></a><span style="font-weight:400;">, </span><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/agent-engine/develop/langgraph" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">LangGraph</span></a><span style="font-weight:400;">, </span><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/agent-engine/develop/langchain" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Langchain</span></a><span style="font-weight:400;">, </span><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/agent-engine/develop/ag2" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AG2</span></a><span style="font-weight:400;">, and </span><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/agent-engine/develop/llama-index/query-pipeline" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">LlamaIndex</span></a><span style="font-weight:400;">. If you already have an existing agent, you can adapt it to run on Vertex AI Agent Engine using the </span><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/agent-engine/develop/custom" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">custom template</span></a><span style="font-weight:400;"> in our SDK. Otherwise, you can develop an agent from scratch using one of the </span><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/agent-engine/develop/overview" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">framework-specific templates</span></a><span style="font-weight:400;"> we provide.</span></li>
</ul>
</li>
</ul>
<p><a href="https://cloud.google.com/blog/products/data-analytics/data-analytics-innovations-at-next25" target="_blank" rel="noreferrer noopener"><b>Data analytics innovations at Next’25 | Google Cloud Blog</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google introduced advancements in data analytics, emphasizing the integration of AI and intelligent agents to enhance data accessibility and decision-making. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Key innovations include specialized agents for various roles, AI-assisted data science workflows, and an autonomous data foundation in BigQuery that supports unstructured data and open formats like Iceberg.</span></li>
</ul>
<p><a href="https://developers.google.com/gemini-code-assist/docs/write-code-gemini" target="_blank" rel="noreferrer noopener"><b>Gemini Code Assist in IDEs</b></a></p>
<p><span style="font-weight:400;">Software Engineering Agents – now in preview! </span></p>
<p><a href="https://cloud.google.com/blog/topics/google-cloud-next/google-cloud-next-2025-wrap-up/" target="_blank" rel="noreferrer noopener"><b>Google Cloud Next 2025 Wrap Up</b></a><span style="font-weight:400;">:</span></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/blog/products/application-development/an-application-centric-ai-powered-cloud" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">An application-centric, AI-powered cloud | Google Cloud Blog</span></a></li>
<li style="font-weight:400;"><a href="https://cloud.google.com/blog/products/chrome-enterprise/chrome-expands-ai-powered-enterprise-search-and-enterprise-browser-protections/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Chrome Expands AI-Powered Enterprise Search and Enterprise Browser Protections | Google Cloud Blog</span></a></li>
<li style="font-weight:400;"><a href="https://cloud.google.com/blog/products/databases/accelerate-your-analytics-with-new-bigtable-sql-capabilities/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Accelerate your analytics with new Bigtable SQL capabilities | Google Cloud Blog</span></a></li>
<li style="font-weight:400;"><a href="https://cloud.google.com/blog/products/databases/migrating-sql-server-databases-is-now-easier-with-gemini/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Migrating SQL server databases is now easier with Gemini. Here’s how | Google Cloud Blog</span></a></li>
<li style="font-weight:400;"><a href="https://cloud.google.com/blog/products/data-analytics/announcing-intelligent-unified-governance-in-bigquery/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Announcing intelligent unified governance in BigQuery | Google Cloud Blog</span></a></li>
<li style="font-weight:400;"><a href="https://cloud.google.com/blog/products/databases/alloydb-ai-drives-innovation-from-the-database/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AlloyDB AI drives innovation for application developers</span></a><span style="font-weight:400;">      </span></li>
</ul>
<p><b>1:07:22  Google Next Predictions</b></p>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Ryan – No Points. Sad. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Responsible AI, in Console/Service/SDK to enable and/or visualize your responsible AI creation or usage</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Endpoint Security Tools (Crowdstrike, Patch Management/Vulnerability)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Won’t be announcing anything new service announcements just enhancements for AI/Gemini/Etc.</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Justin – 1 Point </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AI Agents specialized for Devops, K8, Devops capability</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Next Generation of TPU GPU’s optimized Optimized Multi-modal</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Unification or Major Enhancement of Anthos &amp; GKE Enterprise </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Matt – All the points</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Green AI</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">3 not-AI specific keynotes</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Cloud WAN</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Hyperdisk Exapools</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Next Gen Customer Engagement Suite</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">AI security thing that is not Endpoint. More Guardrails. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google Unified Security</span></li>
</ul>
</li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Honorable Mentions – nada </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Industry verticalization for AI LLM Models. Fine Tuning Marketplace or special model for specific industry/use case</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Personal Assistant for Workspace productivity </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Multicloud tooling</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Number of times AI or ML said on stage</span></li>
</ul>
</li>
</ul>
<ul>
<li><b>Opening Keynote 38 times</b></li>
</ul>
<ul>
<li><b>Developer Keynote: 63 Times</b></li>
</ul>
<ul>
<li><b>Total: 101</b></li>
</ul>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Matt: 52 </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Justin: 97</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Ryan: 1</span></li>
</ul>
</li>
</ul>
<h2><span style="font-weight:400;">Closing</span></h2>
<p><span style="font-weight:400;">And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</span></p>
]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2015419/c1e-o838u2w36dtjk80r-9jrrjndobp17-y2pidj.mp3" length="96361696"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 300 of The Cloud Pod – where the forecast is always cloudy! According to the title, this week’s show is taking place inside of a Dr. Suess book, but don’t despair – we’re not going to make you eat green eggs and ham, but we WILL give you the low down on all things Vegas. Well, Google’s Next event which recently took place in Vegas anyway. Did you make any Next predictions? 
Titles we almost went with this week:

☁️This is the CLOUDPOD Episode 300
️Tonight we dine in the Cloud
The Next Chapter
Now in Preview: Episode 300

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
GCP
Pre-Next
02:35 Google shakes up Gemini leadership, Google Labs head taking the reins 

There was a lot of Gemini news at Next – but we’ll get to all that. 
In this particular case, there’s an employee shakeup. Sissie Hsiao is stepping down from leading the Google team, and is being replaced by Josh Woodward, who is currently leading the Google Labs. 

04:35 Filestore instance replication now available

GCP says customers have been asking for help in meeting business and regulatory goals, and so they are releasing Filestore instance replication.
This new feature offers an efficient replication point objective (RPO) that can reach 30 minutes for data change rates of 100 MB/sec.

05:16 Multi-Cluster Orchestrator for cross-region Kubernetes workloads

The public preview of Multi-Cluster Orchestrator was recently announced.
This lets platform and application teams optimize resource utilization, enhance application resilience, and accelerate innovation in complex, multi-cluster environments. 
The need for effective multi-cluster management has become essential as organizations increasingly use Kubernetes to deploy and manage their applications; Challenges such as resource scarcity, ensuring high availability, and managing deployments across diverse environments create significant operational overhead.
Multi-Cluster Orchestrator addresses these challenges by providing a centralized orchestration layer that abstracts away the complexities of underlying Kubernetes infrastructure matching workloads with capacity across regions....]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2015419/c1a-k5d5-dmzj5kqpujrr-bao0em.jpg"></itunes:image>
                                                                            <itunes:duration>01:20:18</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2015419/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[299: We Predict Next, for Next Week’s, Next-Level Google Next Event. What’s Next?]]>
                </title>
                <pubDate>Sun, 06 Apr 2025 15:07:50 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2007523</guid>
                                    <link>https://tcpfm.castos.com/episodes/299-google-next-predictions</link>
                                <description>
                                            <![CDATA[<p><span style="font-weight:400;">Welcome to episode 299 of The Cloud Pod – where the forecast is always cloudy! Google Next is quickly approaching, and you know what that means – it’s time for predictions! Who will win this year’s Crystal Ball award? Only time and the main stage will tell. Join Matthew, Justin, and Ryan as they break down their thoughts on what groundbreaking (and less groundbreaking) announcements are in store for us. </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">OpenAI and Anthropic join forces? </span></li>
<li><span style="font-weight:400;">Its 2025, and AWS is still trying to make Jumbo packets happen</span></li>
<li><span style="font-weight:400;">Beanstalk and Ruby’s Updates!! They’re Alive!!!</span></li>
<li><span style="font-weight:400;">Google Colossus or how to expect a colossal cloud outage someday.</span></li>
<li><span style="font-weight:400;">‍The Cloud Pod gives an ode to Peter</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>AI Is Going Great – Or How ML Makes All Its Money  </b></h2>
<p><b>02:27 </b><a href="https://techcrunch.com/2025/03/26/openai-adopts-rival-anthropics-standard-for-connecting-ai-models-to-data/?guccounter=1&amp;guce_referrer=aHR0cHM6Ly9zdGF0aWNzLnRlYW1zLmNkbi5vZmZpY2UubmV0Lw&amp;guce_referrer_sig=AQAAAMULJizr7n-_gBSKfdsDiTPE9qtep1v_PiQpQvUhBgo5b9on9Hd_b7q7I2ueppMHmPwfQYrAYuiExWsdsFYWbMRqkt_WKTbw-TsJz_TK1I4APKe2dgj34qUHVuWypV_IpWsynuM_9z4awSMAfZ7EPNXmDNKOznv7-XDHYYK-b3yH" target="_blank" rel="noreferrer noopener"><b>OpenAI adopts rival Anthropic’s standard for connecting AI models to data</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OpenAI</span></a><span style="font-weight:400;"> is embracing Anthropic’s standard for connecting AI assistants to the systems where the data resides.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">By adapting </span><a href="https://techcrunch.com/2024/11/25/anthropic-proposes-a-way-to-connect-data-to-ai-chatbots/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Anthropic’s Model Context Protocol</span></a><span style="font-weight:400;"> or MCP across its products, including the desktop app for </span><a href="https://chat.openai.com/chat" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">ChatGPT</span></a><span style="font-weight:400;">.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">MCP is an open source standard that helps AI models produce better, more relevant responses to certain queries. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Sam Altman says that people love MCP and they are excited to add support across their products and that it is available today in the Agents SDK and support for the ChatGPT desktop and Response API is coming soon.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">MCP lets models draw data from sources like business tools and software to complete tasks, as well as from content repositories and app development environments. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">We found two helpful articles that may help demystify this whole concept. </span></li>
</ul>
<p><a href="https://addyo.substack.com/p/mcp-what-it-is-and-why-it-matters" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">MCP: What It Is and Why It Matters – by Addy Osmani</span></a></p>
<p><a href="https://medium.com/google-cloud/meet-mcp-your-llms-super-helpful-assistant-83bd85f04150"></a></p>
<h3>Chapters</h3>
<ul><li>(00:00:00) - Prediction: Google Next</li><li>(00:02:39) - OpenAI Arches MCP Standard for AI</li><li>(00:07:42) - Databricks announces support for Anthropic Cloud 3.7</li><li>(00:11:08) - AWS WAF for Amplify Hosted Sites</li><li>(00:17:16) - Amazon EC2: Jumbo Frames and Full AWS Connection</li><li>(00:20:02) - Ruby 3.4 Support on AWS Lambda</li><li>(00:23:36) - Amazon API Gateway now supports dual-stack IPv4 & IPv6</li><li>(00:26:37) - Amazon EKS Community Add Ons Catalog</li><li>(00:30:02) - Beanstalk: Not Dead, but</li><li>(00:31:59) - Amazon Launches Amazon Nova at New Website</li><li>(00:35:17) - Google Next: Attendance Prediction & More</li><li>(00:38:12) - The AI and Machine Learning Contest</li><li>(00:40:12) - Google's 'Responsive AI'</li><li>(00:42:19) - On The Future of AI</li><li>(00:43:00) - Predictions: Microsoft Will Announce 5 New Features During the 2020 Conference</li><li>(00:46:29) - GK Enterprise: Unification or Non-AI?</li><li>(00:47:27) - AI Tech Announcement at Hudo</li><li>(00:48:48) - Google IO 2018: Industry Verticalization, Personal Assistant</li><li>(00:50:29) - Google's Cloud Announcement</li><li>(00:50:56) - How many times can I say AI or ML on stage?</li><li>(00:51:36) - Google Cloud Backup and Security: Two Things</li><li>(00:53:37) - Google and Mlogical to Accelerate Mainframe Application Modernization</li><li>(00:55:58) - Google's Colossus: The Cloud Storage System</li><li>(01:02:39) - AI assisted BigQuery Data Preparation now generally available</li><li>(01:04:04) - Microsoft Azure Backup Storage Billing Change</li><li>(01:06:19) - Microsoft's 'Fabric' for Business Intelligence</li><li>(01:07:26) - Microsoft Purview: How to Keep Up with DLP Alerts</li><li>(01:10:43) - Oracle Cloud: How Much Does 131,000 Nvidia GV300</li><li>(01:13:45) - OCI Bare Metal and Flex VM Instances Now Available</li><li>(01:15:21) - Oracle's bare metal server pricing vs. Windows: How many regions</li><li>(01:17:17) - Oracle Cloud Breach: How Can They Pass Responsibility?</li><li>(01:18:23) - Cloud Pod</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 299 of The Cloud Pod – where the forecast is always cloudy! Google Next is quickly approaching, and you know what that means – it’s time for predictions! Who will win this year’s Crystal Ball award? Only time and the main stage will tell. Join Matthew, Justin, and Ryan as they break down their thoughts on what groundbreaking (and less groundbreaking) announcements are in store for us. 
Titles we almost went with this week:

OpenAI and Anthropic join forces? 
Its 2025, and AWS is still trying to make Jumbo packets happen
Beanstalk and Ruby’s Updates!! They’re Alive!!!
Google Colossus or how to expect a colossal cloud outage someday.
‍The Cloud Pod gives an ode to Peter

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
AI Is Going Great – Or How ML Makes All Its Money  
02:27 OpenAI adopts rival Anthropic’s standard for connecting AI models to data

OpenAI is embracing Anthropic’s standard for connecting AI assistants to the systems where the data resides.  
By adapting Anthropic’s Model Context Protocol or MCP across its products, including the desktop app for ChatGPT.  
MCP is an open source standard that helps AI models produce better, more relevant responses to certain queries. 
Sam Altman says that people love MCP and they are excited to add support across their products and that it is available today in the Agents SDK and support for the ChatGPT desktop and Response API is coming soon.
MCP lets models draw data from sources like business tools and software to complete tasks, as well as from content repositories and app development environments. 
We found two helpful articles that may help demystify this whole concept. 

MCP: What It Is and Why It Matters – by Addy Osmani
]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[299: We Predict Next, for Next Week’s, Next-Level Google Next Event. What’s Next?]]>
                </itunes:title>
                                    <itunes:episode>299</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p><span style="font-weight:400;">Welcome to episode 299 of The Cloud Pod – where the forecast is always cloudy! Google Next is quickly approaching, and you know what that means – it’s time for predictions! Who will win this year’s Crystal Ball award? Only time and the main stage will tell. Join Matthew, Justin, and Ryan as they break down their thoughts on what groundbreaking (and less groundbreaking) announcements are in store for us. </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">OpenAI and Anthropic join forces? </span></li>
<li><span style="font-weight:400;">Its 2025, and AWS is still trying to make Jumbo packets happen</span></li>
<li><span style="font-weight:400;">Beanstalk and Ruby’s Updates!! They’re Alive!!!</span></li>
<li><span style="font-weight:400;">Google Colossus or how to expect a colossal cloud outage someday.</span></li>
<li><span style="font-weight:400;">‍The Cloud Pod gives an ode to Peter</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>AI Is Going Great – Or How ML Makes All Its Money  </b></h2>
<p><b>02:27 </b><a href="https://techcrunch.com/2025/03/26/openai-adopts-rival-anthropics-standard-for-connecting-ai-models-to-data/?guccounter=1&amp;guce_referrer=aHR0cHM6Ly9zdGF0aWNzLnRlYW1zLmNkbi5vZmZpY2UubmV0Lw&amp;guce_referrer_sig=AQAAAMULJizr7n-_gBSKfdsDiTPE9qtep1v_PiQpQvUhBgo5b9on9Hd_b7q7I2ueppMHmPwfQYrAYuiExWsdsFYWbMRqkt_WKTbw-TsJz_TK1I4APKe2dgj34qUHVuWypV_IpWsynuM_9z4awSMAfZ7EPNXmDNKOznv7-XDHYYK-b3yH" target="_blank" rel="noreferrer noopener"><b>OpenAI adopts rival Anthropic’s standard for connecting AI models to data</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OpenAI</span></a><span style="font-weight:400;"> is embracing Anthropic’s standard for connecting AI assistants to the systems where the data resides.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">By adapting </span><a href="https://techcrunch.com/2024/11/25/anthropic-proposes-a-way-to-connect-data-to-ai-chatbots/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Anthropic’s Model Context Protocol</span></a><span style="font-weight:400;"> or MCP across its products, including the desktop app for </span><a href="https://chat.openai.com/chat" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">ChatGPT</span></a><span style="font-weight:400;">.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">MCP is an open source standard that helps AI models produce better, more relevant responses to certain queries. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Sam Altman says that people love MCP and they are excited to add support across their products and that it is available today in the Agents SDK and support for the ChatGPT desktop and Response API is coming soon.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">MCP lets models draw data from sources like business tools and software to complete tasks, as well as from content repositories and app development environments. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">We found two helpful articles that may help demystify this whole concept. </span></li>
</ul>
<p><a href="https://addyo.substack.com/p/mcp-what-it-is-and-why-it-matters" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">MCP: What It Is and Why It Matters – by Addy Osmani</span></a></p>
<p><a href="https://medium.com/google-cloud/meet-mcp-your-llms-super-helpful-assistant-83bd85f04150" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Meet MCP: Your LLM’s Super-Helpful Assistant!</span></a></p>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Justin particularly loves Addy Osmani’s blog, as they start out with a simple ELI5 on understanding MCP. We’re going to quote verbatim: </span></li>
</ul>
</li>
</ul>
<ul>
<li style="font-weight:400;"><i><span style="font-weight:400;">“Imagine you have a single universal plug that fits all your devices – that’s essentially what the Model Context Protocol (MCP) is for AI. MCP is an </span></i><a href="https://www.anthropic.com/news/model-context-protocol" target="_blank" rel="noreferrer noopener"><i><span style="font-weight:400;">open standard</span></i></a><i><span style="font-weight:400;"> (think “USB-C for AI integrations”) that allows AI models to connect to many different apps and data sources in a consistent way. In simple terms, MCP lets an AI assistant talk to various software tools using a common language, instead of each tool requiring a different adapter or custom code.”</span></i></li>
</ul>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">So, what does this mean in practice? If you’re using an AI coding assistant like Cursor or Windsurf, MCP is the shared protocol that lets that assistant use external tools on your behalf. For example, with MCP an AI model could fetch information from a database, edit a design in Figma, or control a music app – all by sending natural-language instructions through a standardized interface. You (or the AI) no longer need to manually switch contexts or learn each tool’s API; the MCP “translator” bridges the gap between human language and software commands.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In a nutshell, MCP is like giving your AI assistant a universal remote control to operate all your digital devices and services. Instead of being stuck in its own world, your AI can now reach out and press the buttons of other applications safely and intelligently. This common protocol means one AI can integrate with thousands of tools as long as those tools have an MCP interface – eliminating the need for custom integrations for each new app. The result: your AI helper becomes far more capable, able to not just chat about things but take actions in the real software you use.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The problem your’re solving:</span><span style="font-weight:400;"><br />
</span><span style="font-weight:400;">Without MCP, integrating an AI assistant with external tools is a bit like having a bunch of appliances each with a different plug and no universal outlet. Developers were dealing with fragmented integrations everywhere. For example, your AI IDE might use one method to get code from GitHub, another to fetch data from a database, and yet another to automate a design tool – each integration needing a custom adapter. Not only is this labor-intensive, it’s brittle and doesn’t scale. As Anthropic put it:</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">“even the most sophisticated models are constrained by their isolation from data – trapped behind information silos…Every new data source requires its own custom implementation, making truly connected systems difficult to scale.”</span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">04:45  Justin – “Basically, I consider this to be SQL for AI.”</span></i></p>
<p><b>07:43</b> <a href="https://www.databricks.com/blog/anthropic-claude-37-sonnet-now-natively-available-databricks" target="_blank" rel="noreferrer noopener"><b>Announcing Anthropic Claude 3.7 Sonnet is natively available in Databricks</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.databricks.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Databricks</span></a><span style="font-weight:400;"> is coming in late to the party with support for </span><a href="https://www.anthropic.com/claude/sonnet" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Claude 3.7 Sonnet</span></a></li>
<li style="font-weight:400;"><span style="font-weight:400;">Databricks is excited to announce that Anthropic Claude 3.7 Sonnet is now natively available in Databricks across AWS, Azure and GCP.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">For the first time, you can securely access Claude’s advanced reasoning, planning and agentic capabilities directly within Databricks. </span></li>
</ul>
<p><b>08:53</b> <a href="https://www.theinformation.com/articles/openai-goes-ghibli-techs-secret-chats?rc=3t8xtd" target="_blank" rel="noreferrer noopener"><b>OpenAI Goes Ghibli, Tech’s Secret Chats</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">We talked last week about </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=b350505b7dea5640f4ea32980f46152879609c7d01126dc8c6bbe2660fad9868JmltdHM9MTc0MzYzODQwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=chatgpt&amp;u=a1aHR0cHM6Ly9vcGVuYWkuY29tL2NoYXRncHQvb3ZlcnZpZXcv&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">ChatGPT</span></a><span style="font-weight:400;">’s new image capabilities but everyone is not as pleased with the results. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">ChatGPT can make a pretty realistic version of Studio Ghibli’s unique cartoon/anime style which will probably get OpenAI sued over copyright infringement.</span></li>
</ul>
<h2><b>AWS</b></h2>
<p><b>11:17</b> <a href="https://aws.amazon.com/blogs/aws/firewall-support-for-aws-amplify-hosted-sites/" target="_blank" rel="noreferrer noopener"><b>Firewall support for AWS Amplify hosted sites</b></a> <span style="font-weight:400;">  ​</span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">You can now integrate </span><a href="https://aws.amazon.com/waf" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS WAF</span></a><span style="font-weight:400;"> with </span><a href="https://aws.amazon.com/amplify/hosting/?trk=4b29643c-e00f-4ab6-ab9c-b1fb47aa1708&amp;sc_channel=el" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Amplify Hosting</span></a><span style="font-weight:400;">.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Web application owners are constantly working to protect their applications from a variety of threats. Previously, if you wanted to implement a robust security posture for your Amplify Hosted applications, you needed to create architectures using </span><a href="https://aws.amazon.com/cloudfront/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Cloudfront</span></a><span style="font-weight:400;"> Distributions with AWS WAF protection, which required additional configuration steps, expertise and management overhead. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With the GA of AWS WAF for Amplify hosting, you can now directly attach a web app firewall to your AWS Amplify apps through a one-click integration in the </span><a href="https://console.aws.amazon.com/amplify/?trk=4b29643c-e00f-4ab6-ab9c-b1fb47aa1708&amp;sc_channel=el" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amplify Console</span></a><span style="font-weight:400;"> or using IaC.  This integration gives you access to the full range of AWS WAF capabilities including managed rules, which provide protection against common web exploits and vulnerabilities like SQL injection and cross-site scripting (XSS). You can also create your own custom rules based on your specific application needs. </span></li>
</ul>
<p><i><span style="font-weight:400;">12:19  Ryan – “</span></i><i><span style="font-weight:400;">This is one of those rough edges that you find the wrong way. So I’m glad they fixed this. If you’re using Amplify, I’m sure you don’t want to get down in the dirty in-network routing and how to implement the WAF. So you’re looking for something to apply the managed rules and protect yourself from bots and that kind of traffic. I imagine this is a great integration for those people that are using Amplify.”</span></i></p>
<p><b>17:35</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/03/amazon-ec2-bandwidth-jumbo-frames/" target="_blank" rel="noreferrer noopener"><b>Amazon EC2 now supports more bandwidth and jumbo frames to select </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2025/03/amazon-ec2-bandwidth-jumbo-frames/" target="_blank" rel="noreferrer noopener"><b>destinations</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/ec2/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon EC2</span></a><span style="font-weight:400;"> now supports up to the full EC2 instance bandwidth for inter-region VPC peering traffic and to AWS Direct Connect.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Additionally, EC2 supports jumbo frames up to 8500 Bytes for cross-region VPC peering. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Before today, the egress bandwidth for EC2 instances was limited to 50% of the aggregate bandwidth limit for the cases with 32 or more vCPUs and 5 Gbps for more minor instances.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Cross-region peering supported up to 1500 bytes. Now, customers can send bandwidth from the EC2 region or towards AWS direct connect at the full instance baseline specification or 5Gbps, whichever is greater. Customers can use jumbo frames across regions for peered VPCs. </span></li>
</ul>
<p><i><span style="font-weight:400;">18:17  Justin – “I can see some benefits, as much as I made fun of it, but it’s one of those things that you run into in weird outage scenarios…so it’s nice, especially for going between availability zones and cross region peering. ” </span></i></p>
<p><b>20:20</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/03/aws-lambda-support-ruby-3-4/" target="_blank" rel="noreferrer noopener"><b>AWS Lambda adds support for Ruby 3.4</b></a><b>  </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">RUBYS NOT DEAD!</span></li>
<li style="font-weight:400;"><a href="https://www.bing.com/ck/a?!&amp;&amp;p=d4339fd09da71d68e679abe3e2a97803d22560ab8a8858a8d60b4e937de4f9b7JmltdHM9MTc0MzYzODQwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=aws+lambda&amp;u=a1aHR0cHM6Ly9hd3MuYW1hem9uLmNvbS9sYW1iZGEv&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Lambda</span></a><span style="font-weight:400;"> now supports creating serverless apps using </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=4ad32f6af3f05f442c01013ac4d07e6ca4f28364605c274773d7310bcd18d620JmltdHM9MTc0MzYzODQwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=ruby+3.4&amp;u=a1aHR0cHM6Ly93d3cucnVieS1sYW5nLm9yZy9lbi9uZXdzLzIwMjQvMTIvMjUvcnVieS0zLTQtMC1yZWxlYXNlZC8&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Ruby 3.4</span></a><span style="font-weight:400;"> (released in February 2025).  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Developers can use Ruby 3.4 as both a managed runtime and a container base image, and AWS will automatically apply updates to the managed runtime and base image as they become available.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Ruby 3.4 is the latest LTS version of Ruby with support expected until March 2028. </span></li>
</ul>
<p><i><span style="font-weight:400;">20:56  Ryan – “I am astonished. I did not think that Ruby was a thing that was still supported.”</span></i></p>
<p><b>23:55</b> <a href="https://aws.amazon.com/blogs/aws/amazon-api-gateway-now-supports-dual-stack-ipv4-and-ipv6-endpoints/" target="_blank" rel="noreferrer noopener"><b>Amazon API Gateway now supports dual-stack (IPv4 and IPv6) endpoints</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon is finally launching </span><a href="https://docs.aws.amazon.com/whitepapers/latest/ipv6-on-aws/IPv6-on-AWS.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">IPv6</span></a><span style="font-weight:400;"> support for </span><a href="https://aws.amazon.com/api-gateway/?trk=d21a4eb6-d91f-4286-843a-d35b2a06a274&amp;sc_channel=el" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon API Gateway</span></a><span style="font-weight:400;"> across all endpoint types, custom domains, and management APIs, in all commercial and </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=86c38116eeb104378efe0ff9a1294fff649f58dd7d0a5fe92322a45905cfa43bJmltdHM9MTc0MzYzODQwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=AWS+gov+cloud&amp;u=a1aHR0cHM6Ly9hd3MuYW1hem9uLmNvbS9nb3ZjbG91ZC11cy8&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS GovCloud (US) regions</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can configure Rest, HTTP and </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=1cac69f789ae675968c7a8fc2cca4f5bd9859d5d449f52ffe8f83f9ad5eaff97JmltdHM9MTc0MzYzODQwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=websocket+apis&amp;u=a1aHR0cHM6Ly9kZXZlbG9wZXIubW96aWxsYS5vcmcvZW4tVVMvZG9jcy9XZWIvQVBJL1dlYlNvY2tldHNfQVBJ&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">WebSocket APIs</span></a><span style="font-weight:400;"> and custom domains to accept calls from IPv6 clients alongside the existing IPv4 support. You can also call the management API’s via IPv6 or IPv4 clients. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Remember that AWS is still charging you for the IPv4 and there is no way to remove the Ipv4 addresses. </span></li>
</ul>
<p><i><span style="font-weight:400;">24:45  Matthew – “It’s pretty required in mobile; that’s really the big area where you need it. Because the mobile networks have all gone IPv6.”</span></i></p>
<p><b>27:17</b> <a href="https://aws.amazon.com/blogs/containers/announcing-amazon-eks-community-add-ons-catalog/" target="_blank" rel="noreferrer noopener"><b>Announcing Amazon EKS community Add-ons catalog | Containers</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/eks/?trk=fccf147c-636d-45bf-bf0a-7ab087d5691a&amp;sc_channel=el" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">EKS</span></a><span style="font-weight:400;"> supports add ons that streamline support operations capabilities for K8 applications. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">These add ons come from AWS&lt; Partners and the OSS community. But discovery of these tools across multiple different avenues has resulted in chaos and security and misconfiguration risks. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">To fix this Amazon is releasing the </span><a href="https://docs.aws.amazon.com/eks/latest/userguide/community-addons.html?trk=fccf147c-636d-45bf-bf0a-7ab087d5691a&amp;sc_channel=el" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">community add-ons catalog</span></a><span style="font-weight:400;">, which provides a way to streamline cluster operations by integration popular community add-ons through native AWS management, broadening the choice of add-ons that users can install to their clusters directly using </span><a href="https://console.aws.amazon.com/eks/home#/clusters?trk=fccf147c-636d-45bf-bf0a-7ab087d5691a&amp;sc_channel=el" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">EKS console</span></a><span style="font-weight:400;">, </span><a href="https://aws.amazon.com/what-is/sdk/?trk=fccf147c-636d-45bf-bf0a-7ab087d5691a&amp;sc_channel=el" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS SDK</span></a><span style="font-weight:400;">, </span><a href="https://aws.amazon.com/cli/?trk=fccf147c-636d-45bf-bf0a-7ab087d5691a&amp;sc_channel=el" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">CLI</span></a><span style="font-weight:400;"> and </span><a href="https://aws.amazon.com/cloudformation/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">CloudFormation</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Some of the critical capabilities you can find in the add-on catalog include essential capabilities such as:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Metrics server</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Kube-state-metrics</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Prometheus-node-exporter</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Cert-manager</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">External-DNS</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">If you make an add-on you want to add, you can create an issue on the EKS roadmap GitHub requesting its inclusion. </span></li>
</ul>
<p><i><span style="font-weight:400;">28:04  Justin – “Those five examples all just seem like they should be a part of EKS. Just my personal opinion.”</span></i></p>
<p><b>29:34</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/03/amazon-bedrock-custom-model-import-real-time-cost-transparency/" target="_blank" rel="noreferrer noopener"><b>Amazon Bedrock Custom Model Import introduces real-time cost </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2025/03/amazon-bedrock-custom-model-import-real-time-cost-transparency/" target="_blank" rel="noreferrer noopener"><b>transparency</b></a><span style="font-weight:400;">  </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">When importing your customized foundational models on-demand to </span><a href="https://www.bing.com/aclk?ld=e8eZCnadHROog5ZF-2Ty3AnDVUCUyaIFRA6r7l1Gmj3aLrf0NetAKV06CtRbbe_uPs9xOgglK-boNuKMamm-NYZN1C1XikhV4icpBmS5nOWCoqbLJP144HhY7TxVl2Qhm0Z-kg_cSyVOdPicLoCYAUvxY5lwHsKAyaZzI9bXoYY60sFciKu8AW8vrrddEFhC2UolkrJw&amp;u=aHR0cHMlM2ElMmYlMmZhd3MuYW1hem9uLmNvbSUyZmJlZHJvY2slMmYlM2Z0cmslM2QyYzM1YzAwMC1jOGNlLTRjMWEtYWI3Ny03MTBhNTJmYTk5NTklMjZzY19jaGFubmVsJTNkcHMlMjZzX2t3Y2lkJTNkQUwhNDQyMiExMCE3MTgxMjExODM0NDk4MiEhISE3MTgxMjY2MzExNjA4MCEhNDg1NDMzOTg2ITExNDg5OTEzOTQzNzcxNDclMjZlZl9pZCUzZDMyZmUxOGNjZWI0ZDFhZjNhNjQ1N2YyMzgwOTUzNDljJTNhRyUzYXMlMjZtc2Nsa2lkJTNkMzJmZTE4Y2NlYjRkMWFmM2E2NDU3ZjIzODA5NTM0OWM&amp;rlid=32fe18cceb4d1af3a6457f238095349c&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Bedrock</span></a><span style="font-weight:400;">, you now get full transparency in the compute resources being used and calculate inference costs real-time. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This launch provides you with the minimum compute resources, custom model units, required to run the workload model prior to model invocation in the Bedrock console and through Bedrock APIs. As the models scale to handle more traffic, CloudWatch metrics provide real-time visibility into the inference costs by showing the total number of CMUs used. </span></li>
</ul>
<p><i><span style="font-weight:400;">30:05  Ryan – “The only common metric is money.”</span></i></p>
<p><b>30:33</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/03/aws-elastic-beanstalk-retrieving-secrets-configuration-secrets-manager-systems-manager/" target="_blank" rel="noreferrer noopener"><b>AWS Elastic Beanstalk now supports retrieving secrets and configuration </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2025/03/aws-elastic-beanstalk-retrieving-secrets-configuration-secrets-manager-systems-manager/" target="_blank" rel="noreferrer noopener"><b>from AWS Secrets Manager and AWS Systems Manager</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">See Matt – Beanstalk isn’t dead! </span></li>
<li style="font-weight:400;"><a href="https://duckduckgo.com/y.js?ad_domain=amazon.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=mY0nTSRZEn1qLU-eqqUVlwU4wCEp_f6trOqsASCH6MXNfV_L8gXKmTqZntzMJMQJ7r-QQnWxoD7wJ8w9hQ3M4XBlrM9KYAOMHpXs5zS4Wpsd5HOpaYru38P6NGzAa8RR.lTfWXtVFPwX7nLU07MmIFw&amp;eddgt=AzqnAzQiMwBP7vyCgZlhaQ%3D%3D&amp;rut=f778bd35b7b28c2ad5b03458e0c5112b226ddeda19c732140fa0792a1504460d&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8BFuRdU5d1HlXrVFCm5fSzTVUCUzWAozshfrHaThqP__r4D_pICQdZ8hHrTWXp5v_3-GDZCidXKnGu0KtPopFc2J2uMKby-13zdejwtAE6uG-qAsvLoFhIGIPCVhl5KovcN1FBKUvBFB0I80TmTVWggm1a6qmAluXbCvcX883K6iLgT76gtYWSLmsjeR1xp2DeaZuhZaWh_rfo19wLAWFqzzbQG4l6PO4y-s0q6PfRZLV_N9vEAMScxZpiZ6nvrKLttS-x397LubpWXMpwFIE18xgIX46e2dmZsp2ffzpo-nHMBXl4UxiYdzHvYRLNQofIaBhsovfjGEQUDM86a5qnJ12YmNn8b87zHA2_txMaj287nHHnBhtgIaNmsc_uMRJz0zHyhc_VFO6aCwPJl-rtgt_-MW-SHZPMpkNnk-ZFGsFWs6q2ulnDIme03jau2u5p2Sd3z9oavj9WGECTUXA7mkBNeH0AUBFWkMtd9tUcUlbQ88yWm9Yj9GQeROhtHUr1YzsopyYoG4MDwxO-bsFNQt04NFG9MDiViaIAX88R3PlKCljLWRlJ0KjM0a-dmTqM6r7kMK4mhcF4lc5UDby6jvahUElnmgaNMzH6J-rXMnNGLFe4QHYGcY-YDvp6MRxIXw53RzqYEzHSAI8mRVNWVDfHVoik1DywLf5GNCJxbB2iOABbk_OHATYIbaFohGiT_mI_BbqSEnIXXRI_PIT3DMZL1NNABgfTgZY2wA8pT0NkSk22G25SB4LXIbgOQuG9ntbKw%26u%3DaHR0cHMlM2ElMmYlMmZwaXhlbC5ldmVyZXN0dGVjaC5uZXQlMmY0NDIyJTJmY3ElM2Zldl9zaWQlM2QxMCUyNmV2X2xuJTNkd2hhdCUyNTIwaXMlMjUyMGVsYXN0aWMlMjUyMGJlYW5zdGFsayUyNmV2X2x0eCUzZCUyNmV2X2x4JTNka3dkLTcyMDg3NDk4NzQ2NjM3JTNhbG9jLTcxMjI4JTI2ZXZfY3J4JTNkNzIwODY5NjY3NTM5MTElMjZldl9tdCUzZHAlMjZldl9kdmMlM2RjJTI2ZXZfcGh5JTNkNzk0OTclMjZldl9sb2MlM2QlMjZldl9jeCUzZDQ4MjUyNDI4OCUyNmV2X2F4JTNkMTE1MzM4ODY5OTU2ODExMSUyNmV2X2V4JTNkJTI2ZXZfZWZpZCUzZDJkNzg4NmJjN2I4YjExMWI4ZjIzZTE1MzE3YTYzNTMyJTNhRyUzYXMlMjZ1cmwlM2RodHRwcyUyNTNBJTI1MkYlMjUyRmF3cy5hbWF6b24uY29tJTI1MkZlbGFzdGljYmVhbnN0YWxrJTI1MkYlMjUzRnRyayUyNTNENDQ0ZjQ4ZWMtNTE4ZC00MDI2LTk2ZTktYTA0ZDU3NzE0NzkyJTI1MjZzY19jaGFubmVsJTI1M0RwcyUyNTI2c19rd2NpZCUyNTNEQUwhNDQyMiExMCE3MjA4Njk2Njc1MzkxMSEhISE3MjA4NzQ5ODc0NjYzNyEhNDgyNTI0Mjg4ITExNTMzODg2OTk1NjgxMTElMjUyNmVmX2lkJTI1M0QyZDc4ODZiYzdiOGIxMTFiOGYyM2UxNTMxN2E2MzUzMiUyNTNBRyUyNTNBcyUyNm1zY2xraWQlM2QyZDc4ODZiYzdiOGIxMTFiOGYyM2UxNTMxN2E2MzUzMg%26rlid%3D2d7886bc7b8b111b8f23e15317a63532&amp;vqd=4-238411453195140526333280859916614082157&amp;iurl=%7B1%7DIG%3DADCB1C47DE4A4BAD8A831E039C6E7F33%26CID%3D02698E0B6B3B638D3F4C9BCF6A236262%26ID%3DDevEx%2C5048.1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Elastic Beanstalk</span></a><span style="font-weight:400;"> now enables customers to reference AWS Systems Manager Parameter Store Parameters and AWS Secrets Manager secrets in environmental variables. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This new integration provides developers with a native method for accessing data from these services in their applications. </span></li>
</ul>
<p><i><span style="font-weight:400;">31:04  Ryan – “It’s a crazy new feature for services that’s been around for a very long time.” </span></i></p>
<p><b>32:33</b> <a href="https://www.aboutamazon.com/news/innovation-at-amazon/amazon-nova-website-sdk?utm_source=ecsocial&amp;utm_medium=linkedin&amp;utm_term=36" target="_blank" rel="noreferrer noopener"><b>Amazon makes it easier for developers and tech enthusiasts to explore </b></a><a href="https://www.aboutamazon.com/news/innovation-at-amazon/amazon-nova-website-sdk?utm_source=ecsocial&amp;utm_medium=linkedin&amp;utm_term=36" target="_blank" rel="noreferrer noopener"><b>Amazon Nova, its advanced Gen AI models</b></a><b>  </b></p>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Check out </span><a href="https://nova.amazon.com" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">https://nova.amazon.com</span></a><span style="font-weight:400;"> can we kill Partyrock now? </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon has realized that while they’ve created numerous Generative AI applications including </span><a href="https://www.aboutamazon.com/news/devices/new-alexa-generative-artificial-intelligence" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Alexa+</span></a><span style="font-weight:400;">, </span><a href="https://www.aboutamazon.com/news/aws/amazon-q-generative-ai-assistant-aws" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Q</span></a><span style="font-weight:400;"> and </span><a href="https://www.aboutamazon.com/news/retail/amazon-rufus" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Rufus</span></a><span style="font-weight:400;">, as well as tools like Bedrock.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Using their cutting edge </span><a href="https://www.aboutamazon.com/news/aws/amazon-nova-artificial-intelligence-bedrock-aws" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Nova</span></a><span style="font-weight:400;"> engine, they are now rolling nova.amazon.com a new website for easy exploration of their foundational models. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">As well as they are introducing Amazon Nova Act, a new AI model trained to perform actions within a web browser. They’re releasing a research preview of the Amazon Nova Act SDK, which will allow developers to experiment with an early version of the new model. </span></li>
</ul>
</li>
<li style="font-weight:400;"><i><span style="font-weight:400;">“Nova.amazon.com puts the power of Amazon’s frontier intelligence into the hands of every developer and tech enthusiast, making it easier than ever to explore the capabilities of Amazon Nova,” </span></i><b>said Rohit Prasad, SVP of Amazon Artificial General Intelligence</b><b><i>.</i></b><i><span style="font-weight:400;"> “We’ve created this experience to inspire builders, so that they can quickly test their ideas with Nova models, and then implement them at scale in Amazon Bedrock. It is an exciting step forward for rapid exploration with AI, including bleeding-edge capabilities such as the Nova Act SDK for building agents that take actions on the web. We’re excited to see what they build and to hear their useful feedback.”</span></i></li>
</ul>
<h2><b>GCP</b></h2>
<p><b>36:04</b> <a href="https://cloud.withgoogle.com/next/25" target="_blank" rel="noreferrer noopener"><b>Google Next</b></a><b> is coming up VERY SOON!</b></p>
<p><span style="font-weight:400;">BRK2-024 – Workload-optimized data protection for mission-critical enterprise </span><span style="font-weight:400;">apps</span></p>
<p><span style="font-weight:400;">BRK1-028 – Unlock value for your workloads: Microsoft, Oracle, OpenShift and </span><span style="font-weight:400;">more</span></p>
<p><b>Google Next Predictions</b></p>
<ul>
<li><b>Ryan</b></li>
</ul>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Responsible AI, in Console/Service/SDK to enable and/or visualize your responsible AI creation or usage</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Endpoint Security Tools (Crowdstrike, Patch Management/Vulnerability)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Won’t be announcing anything new service announcements just enhancements for AI/Gemini/Etc.</span></li>
</ul>
</li>
</ul>
</li>
</ul>
<ul>
<li><b>Justin</b></li>
</ul>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AI Agents specialized for Devops, K8, Devops capability</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Next Generation of TPU GPU’s optimized Optimized Multi-modal</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Unification or Major Enhancement of Anthos &amp; GKE Enterprise </span></li>
</ul>
</li>
</ul>
</li>
</ul>
<ul>
<li><b>Matt</b></li>
</ul>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Green AI</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">3 not-AI specific keynotes</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">AI security thing that is not Endpoint. More Guardrails. </span></li>
</ul>
</li>
</ul>
</li>
</ul>
<ul>
<li><b>Honorable Mentions</b></li>
</ul>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Industry verticalization for AI LLM Models. Fine Tuning Marketplace or special model for specific industry/use case</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Personal Assistant for Workspace productivity </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Multi Cloud tooling</span></li>
</ul>
</li>
</ul>
</li>
</ul>
<ul>
<li><b>Number of times AI or ML said on stage</b></li>
</ul>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Matt: 52 </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Justin: 97</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Ryan: 1</span></li>
</ul>
</li>
</ul>
<p><b>52:08</b> <a href="https://cloud.google.com/blog/products/identity-security/secure-backups-with-threat-detection-and-remediation/" target="_blank" rel="noreferrer noopener"><b>Secure backups with threat detection and remediation | Google Cloud Blog</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is really nibbling on the edges of backups and disaster recovery, which I think is a sign that ransomware is still a big problem and concern for customers. </span></li>
<li style="font-weight:400;"><a href="https://cloud.google.com/blog/products/storage-data-transfer/backup-and-dr-service-adds-immutable-indelible-backups" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Backup vault</span></a><span style="font-weight:400;"> was announced last year as a powerful storage feature available as part of </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=872d48cd08b87f498568dbb1dd26553b09f019ac716b7da65235870d8222f4dcJmltdHM9MTc0MzYzODQwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=Google+Cloud+Backup&amp;u=a1aHR0cHM6Ly9jbG91ZC5nb29nbGUuY29tL2JhY2t1cC1kaXNhc3Rlci1yZWNvdmVyeS9kb2NzL2NvbmNlcHRzL2JhY2t1cC1kcg&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Google Cloud Backup and DR services</span></a><span style="font-weight:400;">.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The point is to secure backups against tampering and unauthorized deletion, and integrates with </span><a href="https://cloud.google.com/security/products/security-command-center?e=48754805" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Security Command Center</span></a><span style="font-weight:400;"> for real-time alerts on high risk actions. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">To further support security needs, they are deepening the integration between Google Backup and DR and security command center enterprise. This includes new detections including threats to the backup vault itself, and end to end workflows to help customers protect backup data. </span></li>
</ul>
<p><i><span style="font-weight:400;">33:53  Ryan – “</span></i><i><span style="font-weight:400;">I think not only is ransomware still a big issue, but also it’s hit the compliance round; it’s a question that comes up all the time in any kind of security audit or attestation – or even a customer walkthrough. It’s definitely an issue that’s in the front of people’s minds and something that’s annoying to fix in reality. So this is great.”</span></i></p>
<p><b>54:12</b> <a href="https://cloud.google.com/blog/products/infrastructure-modernization/mlogica-and-google-cloud-partner-on-mainframe-modernization/" target="_blank" rel="noreferrer noopener"><b>mLogica and Google Cloud partner on mainframe modernization</b></a><b>  </b><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">The mainframe is still kicking, and Google and </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=fa0765e651e6f12d191a0f1f36f1dd551a6b44a6750ff1aa5f4f111179fb3972JmltdHM9MTc0MzYzODQwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=mlogica&amp;u=a1aHR0cHM6Ly93d3cubWxvZ2ljYS5jb20v&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">mLogica</span></a><span style="font-weight:400;"> have announced an expanded partnership focused on accelerating and de-risking mainframe application modernization, combining mLogica’s </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=7489807951b5e2f3ecb7d7b3543c94f88e8c6222ffbcbca9ec908166379b5389JmltdHM9MTc0MzYzODQwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=liber*m&amp;u=a1aHR0cHM6Ly93d3cubWxvZ2ljYS5jb20vcHJvZHVjdHMvbGliZXItbS1tYWluZnJhbWUtbW9kZXJuaXphdGlvbi8&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">LIBER*M</span></a><span style="font-weight:400;"> automated code refactoring suite (available via marketplace) with </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=3f5f88378000324fb5ad38c62e2fcdc2d3030d3ad93c8108ade02a706e856bc2JmltdHM9MTc0MzYzODQwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=google+dual+run&amp;u=a1aHR0cHM6Ly9jbG91ZC5nb29nbGUuY29tL21haW5mcmFtZS1kdWFsLXJ1bi9kb2NzL2R1YWwtcnVuLW92ZXJ2aWV3&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Google Cloud Dual Run</span></a><span style="font-weight:400;"> for validation and de-risking offering a validated modernization path to their joint customers. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">LIBER*M provides automated assessment, code analysis, dependency mapping, and code transformation capabilities, and it supports multiple target languages and platforms, providing a crucial foundation for refactoring projects. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google Dual Run (I didn’t know this existed) enables the simultaneous operation of mainframe and cloud applications in parallel, letting you compare and validate refactored applications before cutting over. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This, along with powerful testing capabilities, enables a controlled phase transition, minimizes business disruption and substantially reduces the risks inherent in large-scale mainframe modernization projects. </span></li>
</ul>
<p><b>56:349 </b><a href="https://cloud.google.com/blog/products/storage-data-transfer/how-colossus-optimizes-data-placement-for-performance/" target="_blank" rel="noreferrer noopener"><b>How Colossus optimizes data placement for performance</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google has a great article about its foundational distributed storage system, Colossus storage platform.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google’s universal storage platform </span><a href="https://cloud.google.com/blog/products/bigquery/bigquery-under-the-hood" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Colossus</span></a><span style="font-weight:400;"> achieves throughput that rivals or exceeds the best parallel file systems, has the management and scale of an object storage system, and has an easy-to-use programming model that’s used by all Google teams. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Moreover, it does all this while serving the needs of products with incredibly diverse requirements, be it scale, affordability, throughput or latency. </span></li>
</ul>
<table>
<tbody>
<tr>
<td><b>Example application</b></td>
<td><b>I/O sizes</b></td>
<td><b>Expected performance</b></td>
</tr>
<tr>
<td><span style="font-weight:400;">BigQuery scans</span></td>
<td><span style="font-weight:400;">hundreds of KBs to tens of MBs</span></td>
<td><span style="font-weight:400;">TB/s</span></td>
</tr>
<tr>
<td><span style="font-weight:400;">Cloud Storage – standard</span></td>
<td><span style="font-weight:400;">KBs to tens of MBs</span></td>
<td><span style="font-weight:400;">100s of milliseconds</span></td>
</tr>
<tr>
<td><span style="font-weight:400;">Gmail messages</span></td>
<td><span style="font-weight:400;">less than hundreds of KBs</span></td>
<td><span style="font-weight:400;">10s of milliseconds</span></td>
</tr>
<tr>
<td><span style="font-weight:400;">Gmail attachments</span></td>
<td><span style="font-weight:400;">KBs to MBs</span></td>
<td><span style="font-weight:400;">seconds</span></td>
</tr>
<tr>
<td><span style="font-weight:400;">Hyperdisk reads</span></td>
<td><span style="font-weight:400;">KBs to hundreds of KBs</span></td>
<td><span style="font-weight:400;">&lt;1 ms</span></td>
</tr>
<tr>
<td><span style="font-weight:400;">YouTube video storage</span></td>
<td><span style="font-weight:400;">MBs</span></td>
<td><span style="font-weight:400;">seconds</span></td>
</tr>
</tbody>
</table>
<p> </p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">This flexibility shows up in publicly available google products. Things from </span><a href="https://cloud.google.com/kubernetes-engine/docs/how-to/persistent-volumes/hyperdisk-ml" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Hyper Disk ML</span></a><span style="font-weight:400;"> to tiered storage for Spanner.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Colossus was the evolution of </span><a href="https://static.googleusercontent.com/media/research.google.com/en//archive/gfs-sosp2003.pdf" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">GFS (Google File System)</span></a><span style="font-weight:400;">, the traditional colossus file system contained in a single datacenter.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Colossus simplified the GFS programming model to an append only storage system that combines file system familiar programming interface with the scalability of object storage. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The colossus metadata service is made up of “curators” that deal with interactive control operations like file creation and deletion, and “custodians,” which maintain the durability and availability of data as well as disk-space balancing. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Colossus clients interact with the curators for metadata and then directly store data on “D servers” which host its SSD or HDDs. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It’s also good to understand that Colossus is a zonal product, they build a single colossus filesystem per cluster, an internal building block of a Google Cloud Zone. Most data centers have one cluster and thus one colossus filesystem, regardless of how many workloads run inside the cluster.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Many Colossus file systems have multiple exabytes of storage, including two different filesystems that have in excess of 10 exabytes of storage each.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Demanding applications also need large amounts of IOPS and throughput. In fact, some of Google’s largest file systems regularly exceed read throughputs of 50 TB/s and write throughputs of 25 TB/s.  This is enough throughput to send more than 100 full-length 8k movies every second! </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Their single busiest cluster does over 600M IOPS, combined between read and write operations. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Previously when they talked about colossus they talked about how they place the hottest data on SSDs and balance the remaining data across all of the devices in the cluster.  This is more pertinent today, as over the years the SSDs have gotten more affordable, but still pose a substantial cost premium over blended fleets of SSD and HDD. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">To make it easier for their developers they have a L4 distributed SSD caching layer which dynamically picks the data that is most suitable for SSD. </span></li>
</ul>
<p><i><span style="font-weight:400;">33:53  Justin – “</span></i><i><span style="font-weight:400;">This is more pertinent today as over the years, the SSDs have gotten more affordable but still pose a substantial cost premium over blended fleets of SSD and HDD drives. To make it easier for developers, they have an L4 distributed SSD caching layer with dynamic PIX data that is most suitable for SSDs, so the developers don’t even have to think about the tiering. Take that, Amazon!”</span></i></p>
<p><b>1:03:26  </b><a href="https://cloud.google.com/blog/products/data-analytics/ai-assisted-bigquery-data-preparation-now-ga/" target="_blank" rel="noreferrer noopener"><b>AI-assisted BigQuery data preparation now GA</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/blog/products/data-analytics/introducing-ai-driven-bigquery-data-preparation" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">BigQuery data preparation</span></a><span style="font-weight:400;"> is now generally available. It also now integrates with </span><a href="https://cloud.google.com/bigquery/docs/workflows-introduction" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">BigQuery pipelines</span></a><span style="font-weight:400;">, letting you connect data ingestion and transformative tasks so you can create end-to-end data pipelines with incremental processing, all in a unified environment.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Features include:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Comprehensive transformation capabilities</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Data standardization</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Automated schema mapping</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">AI-suggested join keys for data enrichment</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Visual Data pipelines</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Data quality enforcement with error tables</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Streamlined deployment with github integrations</span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">1:03:59  Ryan – “Automated schema mapping is probably my biggest life work improvement.”</span></i></p>
<h2><b>Azure</b></h2>
<p><b>1:04:52 </b><a href="https://blog.fabric.microsoft.com/en-GB/blog/announcing-backup-storage-billing-for-sql-database-in-microsoft-fabric-what-you-need-to-know/" target="_blank" rel="noreferrer noopener"><b>Announcing backup storage billing for SQL database in Microsoft Fabric: </b></a><a href="https://blog.fabric.microsoft.com/en-GB/blog/announcing-backup-storage-billing-for-sql-database-in-microsoft-fabric-what-you-need-to-know/" target="_blank" rel="noreferrer noopener"><b>what you need to know</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Azure is back to charge you money for SQL backups.  Previously, your fabric capacity-based billing model included compute and data storage. By default, the system provides a full weekly backup, differential backup every 12 hours and transaction log backups every 10 minutes.  After April 1st, 2025, backup storage will also be billed, that exceeds the allocated DB size. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Listen. We get charging for this, but where we’re unclear is if this is configurable for the duration and period we want to store. So if it’s not configurable, this feels like a bit of a cost increase you can’t escape. </span></li>
</ul>
<p><i><span style="font-weight:400;">1:05:46  Matthew – “That’s probably what happened – they realized how much more storage this is actually using.” </span></i></p>
<p><b>1:08:12 </b><a href="https://techcommunity.microsoft.com/blog/microsoft-security-blog/announcing-alert-triage-agents-in-microsoft-purview-powered-by-security-copilot/4396041" target="_blank" rel="noreferrer noopener"><b>Announcing Alert Triage Agents in Microsoft Purview, powered by </b></a><a href="https://techcommunity.microsoft.com/blog/microsoft-security-blog/announcing-alert-triage-agents-in-microsoft-purview-powered-by-security-copilot/4396041" target="_blank" rel="noreferrer noopener"><b>Security Copilot</b></a><b>  </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft says that per their research that organizations face up to 66 alerts per day when it comes to Purview (DLP) alerts, up from 52 in 2023 with teams only really able to review about 63% of the alerts. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Given the sheer volume of data security alerts, it’s no surprise – per Microsoft – it’s hard to keep up. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">To help customers increase the efficacy of their data security programs, address key alerts and focus on the most critical data risks, Microsoft is thrilled to announce </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=1f49c474e0a99fd394a3ab58d9d81a0a72d900c7e65530d3affecbc3941542e2JmltdHM9MTc0MzYzODQwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=alert+triage+agents&amp;u=a1aHR0cHM6Ly90ZWNoY29tbXVuaXR5Lm1pY3Jvc29mdC5jb20vYmxvZy9taWNyb3NvZnQtc2VjdXJpdHktYmxvZy9hbm5vdW5jaW5nLWFsZXJ0LXRyaWFnZS1hZ2VudHMtaW4tbWljcm9zb2Z0LXB1cnZpZXctcG93ZXJlZC1ieS1zZWN1cml0eS1jb3BpbG90LzQzOTYwNDE&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Alert Triage Agents</span></a><span style="font-weight:400;"> in </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=95b71c22afa38762053a4e9b6a8f8f321a579b369c8ea8e4aa548de4b158c812JmltdHM9MTc0MzYzODQwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=microsoft+purview+data+loss+prevention&amp;u=a1aHR0cHM6Ly93d3cubWljcm9zb2Z0LmNvbS9lbi11cy9zZWN1cml0eS9idXNpbmVzcy9pbmZvcm1hdGlvbi1wcm90ZWN0aW9uL21pY3Jvc29mdC1wdXJ2aWV3LWRhdGEtbG9zcy1wcmV2ZW50aW9uP21zb2NraWQ9MGE4MTNhNjAxZDQ0NjQ5MDI0YTIyZmEyMWNhYTY1MDI&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Microsoft Purview Data Loss Prevention</span></a><span style="font-weight:400;"> (DLP) and Insider Risk Management (IRM).</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">These </span><a href="https://aka.ms/SecurityCopilotagents" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">autonomous security copilot capabilities</span></a><span style="font-weight:400;"> integrated directly into Microsoft Purview offer an agent-managed alert queue that identifies the DLP and IRM alerts that pose the greatest risk. </span></li>
</ul>
<p><i><span style="font-weight:400;">1:10:09  Ryan – “Doing something with DLP is really tricky, because you don’t want to all up in user’s data – but you want to make sure you are protected from data loss. So each one of these investigations for each one of these alerts is time consuming.” </span></i></p>
<h2><b>Oracle</b></h2>
<p><b>1:11:37 </b><a href="https://blogs.oracle.com/cloud-infrastructure/post/supercluster-nvidia-blackwell-dedicated-alloy" target="_blank" rel="noreferrer noopener"><b>Announcing New AI Infrastructure Capabilities with NVIDIA Blackwell for </b></a><a href="https://blogs.oracle.com/cloud-infrastructure/post/supercluster-nvidia-blackwell-dedicated-alloy" target="_blank" rel="noreferrer noopener"><b>Public, On-Premises, and Service Provider Clouds</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.oracle.com/cloud/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OCI</span></a><span style="font-weight:400;"> is making available the latest and greatest NVIDIA GB300 NVL72 and NVIDIA HGX B300 NVL16 with </span><a href="https://www.nvidia.com/en-us/data-center/technologies/blackwell-architecture/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Blackwell Ultra GPUs</span></a><span style="font-weight:400;">, providing early access to the AI acceleration. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can get the GB300, B300, in bare metal, or you can use super clusters with up to 131,072 NVIDIA GB300 Grace Blackwell Ultra Superchips as part of rack-scale NVIDIA GB300 NVL72 solutions.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Justin was trying to figure out what a supercluster would cost, but it wasn’t an option in the pricing calculator. However, he was able to pick 1 BM.GPU.GB200.4 with 4 GPUs and 756GB of memory running autonomous linux for $857,088 in Monthly on-demand cost. A bargain! </span></li>
</ul>
<p><i><span style="font-weight:400;">1:14:03  Justin – “I want to run Windows on it so I can open up task manager and see all the CPUs just scaling off </span></i><span style="font-weight:400;">.”</span></p>
<p><b>1:14:41 </b><a href="https://blogs.oracle.com/cloud-infrastructure/post/oci-launches-e6-standard-compute-powered-by-amd" target="_blank" rel="noreferrer noopener"><b>Oracle Launches OCI Compute E6 Standard Instances: 2X the </b></a><a href="https://blogs.oracle.com/cloud-infrastructure/post/oci-launches-e6-standard-compute-powered-by-amd" target="_blank" rel="noreferrer noopener"><b>Performance, Same Price</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">In more reasonably priced instances, the E6 Standard bare metal and flex virtual machine instances are now available, powered by the 5th-gen AMD EPYC processors. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">OCI is among the first cloud providers to offer them. (</span><i><span style="font-weight:400;">Among</span></i><span style="font-weight:400;"> is doing some heavy lifting here. Google was the *actual* first. Neither AWS or Azure have announced yet.)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Oracle is promising a performance of 2x that of the E5 at the same price. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They feature 2.7GHz base frequency with max boost up to 4.1GHz based on the zen-5  architecture. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">There are configurations from 1-126 OCPU and up to 3072 GB for bare metal and 1454 for virtual machines. </span></li>
</ul>
<p><i><span style="font-weight:400;">1:17:37  Justin – “</span></i><i><span style="font-weight:400;">$10,285 for a bare metal running autonomous Linux. So that’s actually not that bad. It does jump up to $27,000 if you go for Windows. Yeah, so not bad. I only added 100 gigs of disk space, because who needs more than that? Capacity reservation didn’t change the price.”</span></i></p>
<p><b>1:18:25 </b><a href="https://techcrunch.com/2025/03/31/oracle-under-fire-for-its-handling-of-separate-security-incidents/" target="_blank" rel="noreferrer noopener"><b>Oracle under fire for its handling of separate security incidents</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Oracle is under fire for potential security breaches.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The first one is related to Oracle Health; the breach impacts patient data. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Oracle blamed the Cerner breach on an old legacy server not yet migrated to Oracle Cloud. Sure, Jan. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The other breach may be on Oracle Cloud, and Oracle is being cagey. A hacker going by rose87168 posted on a cybercrime forum offering the data of 6 million oracle cloud customers, including authenticated data and encrypted passwords.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Several Oracle customers have confirmed that the data appears genuine, but Oracle has stated that there has been no breach, and the published credentials are not from the Oracle Cloud. Ok, so where did it come from? </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Cybersecurity Expert Kevin Beaumont writes: “</span><i><span style="font-weight:400;">This is a serious cybersecurity incident which impacts customers, in a platform managed by oracle. Oracle are attempting to wordsmith statements around Oracle Cloud and use very specific words to avoid responsibility. This is not ok.</span></i><span style="font-weight:400;">” </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Can’t be unbreakable if it’s breakable. </span></li>
</ul>
<h2><span style="font-weight:400;">Closing</span></h2>
<p><span style="font-weight:400;">And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</span></p>
]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2007523/c1e-mq1qaqknqoax9wk7-gp321869sw19-zxfmqf.mp3" length="96019456"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 299 of The Cloud Pod – where the forecast is always cloudy! Google Next is quickly approaching, and you know what that means – it’s time for predictions! Who will win this year’s Crystal Ball award? Only time and the main stage will tell. Join Matthew, Justin, and Ryan as they break down their thoughts on what groundbreaking (and less groundbreaking) announcements are in store for us. 
Titles we almost went with this week:

OpenAI and Anthropic join forces? 
Its 2025, and AWS is still trying to make Jumbo packets happen
Beanstalk and Ruby’s Updates!! They’re Alive!!!
Google Colossus or how to expect a colossal cloud outage someday.
‍The Cloud Pod gives an ode to Peter

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
AI Is Going Great – Or How ML Makes All Its Money  
02:27 OpenAI adopts rival Anthropic’s standard for connecting AI models to data

OpenAI is embracing Anthropic’s standard for connecting AI assistants to the systems where the data resides.  
By adapting Anthropic’s Model Context Protocol or MCP across its products, including the desktop app for ChatGPT.  
MCP is an open source standard that helps AI models produce better, more relevant responses to certain queries. 
Sam Altman says that people love MCP and they are excited to add support across their products and that it is available today in the Agents SDK and support for the ChatGPT desktop and Response API is coming soon.
MCP lets models draw data from sources like business tools and software to complete tasks, as well as from content repositories and app development environments. 
We found two helpful articles that may help demystify this whole concept. 

MCP: What It Is and Why It Matters – by Addy Osmani
]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2007523/c1a-k5d5-34dkvpjpt25j-9aydpw.jpg"></itunes:image>
                                                                            <itunes:duration>01:20:01</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2007523/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[298: BigQuery Gits it With Devops]]>
                </title>
                <pubDate>Wed, 02 Apr 2025 16:37:43 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2005858</guid>
                                    <link>https://tcpfm.castos.com/episodes/298-bigquery-gits-it-with-devops</link>
                                <description>
                                            <![CDATA[<p><span style="font-weight:400;">Welcome to episode 298 of The Cloud Pod – where the forecast is always cloudy! Justin, Matthew and Ryan are in the house (and still very much missing Jonathan) to bring you a  jam packed show this week, with news from Beijing to Virginia! Did you know Virginia was in the US? Amazon definitely wants you to know that. </span></p>
<p><span style="font-weight:400;">We’ve got updates from BigQuery Git Support and their new collab tools, plus all the AI updates you were hoping you’d miss. Tune in now! </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">The Cloud Pod now Recorded from Planet Earth</span></li>
<li><span style="font-weight:400;">☕Wait Java still exists?</span></li>
<li><span style="font-weight:400;">When will java just be coffee and not software</span></li>
<li><span style="font-weight:400;">Cloudflare Makes AI beat Mazes</span></li>
<li><span style="font-weight:400;">Replacing native mobile things with mobile web apps won’t fix your problems AWS</span></li>
<li><span style="font-weight:400;">Turn your security over to the bots</span></li>
<li><span style="font-weight:400;">The Cloud Pod is lost in the AI labyrinth </span></li>
<li><span style="font-weight:400;">AI security agents to secure the AI… wait recursion</span></li>
<li><span style="font-weight:400;">Durable + Stateless.. I don’t know if you know what those words means</span></li>
<li><span style="font-weight:400;">Click ops expands to our phones yay!</span></li>
<li><span style="font-weight:400;">The Cloud Pod is now a data analyst </span></li>
<li><span style="font-weight:400;">⁉️Gitops come to bigquery</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>AI Is Going Great – Or How ML Makes All Its Money  </b></h2>
<p><b>00:46 </b><a href="https://www.maginative.com/article/manus-a-new-ai-agent-from-china-is-going-viral-and-raising-big-questions/?utm_source=tldrnewsletter" target="_blank" rel="noreferrer noopener"><b>Manus, a New AI Agent From China is Going Viral—And Raising Big </b></a><a href="https://www.maginative.com/article/manus-a-new-ai-agent-from-china-is-going-viral-and-raising-big-questions/?utm_source=tldrnewsletter" target="_blank" rel="noreferrer noopener"><b>Questions</b></a><b>  </b></p>
<ul>
<li style="font-weight:400;"><a href="https://manus.im/?index=1&amp;ref=maginative.com" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Manus</span></a><span style="font-weight:400;"> is being described as “the first true autonomous AI agent” from China, capable of completing weeks of professional work in hours.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Developed by a team called Butterfly Effect with offices in Beijing and Wuhan, Manus functions as a truly autonomous agent that independently analyzes, plans, and executes complex tasks. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The system uses a multi-agent architecture powered by several distinct AI models, including </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=ca1d83c3edbc5a8955873507b05bb480d45fd61bf70b98e893fc7b62f56f8142JmltdHM9MTc0MzQ2NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=anthropic&amp;u=a1aHR0cHM6Ly93d3cuYW50aHJvcGljLmNvbS8&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Anthropic’s</span></a> <a href="https://www.bing.com/ck/a?!&amp;&amp;p=e6755b764bcf768179bd18024d260b3d2ba8261e79ee9fde33e3259f0db21560JmltdHM9MTc0MzQ2NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=claude+3.5&amp;u=a1aHR0cHM6..."></a></li></ul>
<h3>Chapters</h3>
<ul><li>(00:00:00) - Cloud Pod: Week 298</li><li>(00:00:56) - China's First Autonomous AI Agent Is Going Viral</li><li>(00:04:12) - Cloudflare's 'Artificial Labyrinth' to Stop Bots</li><li>(00:06:54) - OpenAI's ChatGPT 4.0 Image Generation</li><li>(00:10:46) - Bay Bridge vs Golden Gate</li><li>(00:11:26) - OpenAI's Speech Text and Text Speech Audio Transcription</li><li>(00:12:28) - Redis vs Valky: The Cloud-Tools Fork</li><li>(00:17:25) - Amazon AWS: More Geography on Regions and Availability Zones</li><li>(00:22:05) - Amazon Q & Quicksight: New Scales capability</li><li>(00:24:39) - Amazon OpenSearch OC2 and OM2 Instances Announce</li><li>(00:26:11) - OpenJDK24</li><li>(00:28:26) - AWS Mobile App: More Services, Less Adoption</li><li>(00:33:17) -  AWS Network Firewall: New Flow Management Features</li><li>(00:34:43) - Google Next</li><li>(00:36:43) - Google Cloud Backup: Data Protection Summary and AI Protection</li><li>(00:38:59) - Google's AI Toolbox for Databases</li><li>(00:41:17) - BigQuery Repos: Git Integration in BigQuery Studio</li><li>(00:45:31) - Google's Gemini 2.5 Pro Takes the Top L on the</li><li>(00:48:46) - Azure Functions: Public Preview</li><li>(00:52:13) - Nvidia Serverless GPUs: What You Need to Know</li><li>(00:53:59) - Nvidia's Nim Microservices for Azure AI</li><li>(00:57:16) - Microsoft Launches 6 AI Agents in Security Copilot</li><li>(01:02:02) - Oracle Introduces AI Agent Studio</li><li>(01:03:40) - Week in the Cloud: Google Cloud Next</li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 298 of The Cloud Pod – where the forecast is always cloudy! Justin, Matthew and Ryan are in the house (and still very much missing Jonathan) to bring you a  jam packed show this week, with news from Beijing to Virginia! Did you know Virginia was in the US? Amazon definitely wants you to know that. 
We’ve got updates from BigQuery Git Support and their new collab tools, plus all the AI updates you were hoping you’d miss. Tune in now! 
Titles we almost went with this week:

The Cloud Pod now Recorded from Planet Earth
☕Wait Java still exists?
When will java just be coffee and not software
Cloudflare Makes AI beat Mazes
Replacing native mobile things with mobile web apps won’t fix your problems AWS
Turn your security over to the bots
The Cloud Pod is lost in the AI labyrinth 
AI security agents to secure the AI… wait recursion
Durable + Stateless.. I don’t know if you know what those words means
Click ops expands to our phones yay!
The Cloud Pod is now a data analyst 
⁉️Gitops come to bigquery

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
AI Is Going Great – Or How ML Makes All Its Money  
00:46 Manus, a New AI Agent From China is Going Viral—And Raising Big Questions  

Manus is being described as “the first true autonomous AI agent” from China, capable of completing weeks of professional work in hours.
Developed by a team called Butterfly Effect with offices in Beijing and Wuhan, Manus functions as a truly autonomous agent that independently analyzes, plans, and executes complex tasks. 
The system uses a multi-agent architecture powered by several distinct AI models, including Anthropic’s ]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[298: BigQuery Gits it With Devops]]>
                </itunes:title>
                                    <itunes:episode>298</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p><span style="font-weight:400;">Welcome to episode 298 of The Cloud Pod – where the forecast is always cloudy! Justin, Matthew and Ryan are in the house (and still very much missing Jonathan) to bring you a  jam packed show this week, with news from Beijing to Virginia! Did you know Virginia was in the US? Amazon definitely wants you to know that. </span></p>
<p><span style="font-weight:400;">We’ve got updates from BigQuery Git Support and their new collab tools, plus all the AI updates you were hoping you’d miss. Tune in now! </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">The Cloud Pod now Recorded from Planet Earth</span></li>
<li><span style="font-weight:400;">☕Wait Java still exists?</span></li>
<li><span style="font-weight:400;">When will java just be coffee and not software</span></li>
<li><span style="font-weight:400;">Cloudflare Makes AI beat Mazes</span></li>
<li><span style="font-weight:400;">Replacing native mobile things with mobile web apps won’t fix your problems AWS</span></li>
<li><span style="font-weight:400;">Turn your security over to the bots</span></li>
<li><span style="font-weight:400;">The Cloud Pod is lost in the AI labyrinth </span></li>
<li><span style="font-weight:400;">AI security agents to secure the AI… wait recursion</span></li>
<li><span style="font-weight:400;">Durable + Stateless.. I don’t know if you know what those words means</span></li>
<li><span style="font-weight:400;">Click ops expands to our phones yay!</span></li>
<li><span style="font-weight:400;">The Cloud Pod is now a data analyst </span></li>
<li><span style="font-weight:400;">⁉️Gitops come to bigquery</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>AI Is Going Great – Or How ML Makes All Its Money  </b></h2>
<p><b>00:46 </b><a href="https://www.maginative.com/article/manus-a-new-ai-agent-from-china-is-going-viral-and-raising-big-questions/?utm_source=tldrnewsletter" target="_blank" rel="noreferrer noopener"><b>Manus, a New AI Agent From China is Going Viral—And Raising Big </b></a><a href="https://www.maginative.com/article/manus-a-new-ai-agent-from-china-is-going-viral-and-raising-big-questions/?utm_source=tldrnewsletter" target="_blank" rel="noreferrer noopener"><b>Questions</b></a><b>  </b></p>
<ul>
<li style="font-weight:400;"><a href="https://manus.im/?index=1&amp;ref=maginative.com" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Manus</span></a><span style="font-weight:400;"> is being described as “the first true autonomous AI agent” from China, capable of completing weeks of professional work in hours.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Developed by a team called Butterfly Effect with offices in Beijing and Wuhan, Manus functions as a truly autonomous agent that independently analyzes, plans, and executes complex tasks. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The system uses a multi-agent architecture powered by several distinct AI models, including </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=ca1d83c3edbc5a8955873507b05bb480d45fd61bf70b98e893fc7b62f56f8142JmltdHM9MTc0MzQ2NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=anthropic&amp;u=a1aHR0cHM6Ly93d3cuYW50aHJvcGljLmNvbS8&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Anthropic’s</span></a> <a href="https://www.bing.com/ck/a?!&amp;&amp;p=e6755b764bcf768179bd18024d260b3d2ba8261e79ee9fde33e3259f0db21560JmltdHM9MTc0MzQ2NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=claude+3.5&amp;u=a1aHR0cHM6Ly93d3cuYW50aHJvcGljLmNvbS9jbGF1ZGUvc29ubmV0&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Claude 3.5 Sonnet</span></a><span style="font-weight:400;"> and fine-tuned versions of </span><a href="https://www.bing.com/aclk?ld=e8n5A5-cOgsyq-lYeBlS3PZTVUCUz-ZKvbA5qqjZlbAuDPT4vsT_rxA8z4NIYOnb_pGOrmHX6xBa3YdpdLlPI7eVPXvsqE55tnlKwjixAyJ7kslHJsFnHYdzfD210VZGXDOz0rXcQEt49mWa3Y2F4OYnqOw6U2K3T_yQr6Yz56mUgU4Be6YLKd1KarmsOqBLSu5D0mTg&amp;u=aHR0cCUzYSUyZiUyZnd3dy5BbGliYWJhLmNvbSUzZnNyYyUzZHNlbV9iaW5nJTI2ZmllbGQlM2RVRyUyNmZyb20lM2RzZW1fYmluZyUyNmNtcGduJTNkNDg0OTYwNDcwJTI2YWRncnAlM2QxMjk1MjI1ODM3NzY3NTM2JTI2dGd0JTNka3dkLTgwOTUxODcwMTQyODY5JTNhbG9jLTE5MCUyNkt3ZElEJTNkODA5NTE4NzAxNDI4NjklMjZtdGNodHlwJTNkZSUyNmJkbXRjaHR5cCslM2RiZSUyNm50d3JrJTNkbyUyNmRldmljZSUzZGMlMjZjcmVhdGl2ZSUzZDgwOTUxNjcwMDU0MDQ2JTI2cDElM2RkZWZhdWx0JTI2cDIlM2RkZWZhdWx0JTI2cDMlM2RkZWZhdWx0JTI2UXVlcnklM2RhbGliYWJhJTI2bXNjbGtpZCUzZDhmODQ5NGZkNmQ2MTE0ZWYyNTY2ZjMxMWNhYzlhOGIw&amp;rlid=8f8494fd6d6114ef2566f311cac9a8b0&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Alibaba’s</span></a> <a href="https://www.bing.com/ck/a?!&amp;&amp;p=6efa478e44eda2003a0e9de2279f2ddc0c3a096af213bc812685b52eb39caa1fJmltdHM9MTc0MzQ2NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=qwen&amp;u=a1aHR0cHM6Ly9jaGF0LnF3ZW4uYWkv&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Qwen</span></a><span style="font-weight:400;">.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Unlike traditional chatbots, Manus can work on different tasks without needing frequent, step-by-step instructions, continuing to work in the background even when users close their computers</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">A unique feature is the “Manus’s Computer” window, which allows users to observe what the agent is doing and intervene at any point.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The company claims Manus outperforms </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=521698f51ea56e445ff2c1dd4f9154bbe0ce61af122d4224c44941c3c18c1235JmltdHM9MTc0MzQ2NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=open+ai&amp;u=a1aHR0cHM6Ly9vcGVuYWkuY29tLw&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OpenAI’s</span></a> <a href="https://www.bing.com/ck/a?!&amp;&amp;p=8dff5ecd071e309aab05dff715484b5a9ab1f583c0576dc6c692464802585718JmltdHM9MTc0MzQ2NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=open+ai+deep+research&amp;u=a1aHR0cHM6Ly9vcGVuYWkuY29tL2luZGV4L2ludHJvZHVjaW5nLWRlZXAtcmVzZWFyY2gv&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Deep Research</span></a><span style="font-weight:400;"> tool on the </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=e741a917b5a4b4b30a25f90489505d84b474306e9c10577683a30dafd44161f1JmltdHM9MTc0MzQ2NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=GAIA+benchmark&amp;u=a1aHR0cHM6Ly9odWdnaW5nZmFjZS5jby9nYWlhLWJlbmNobWFyaw&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">GAIA benchmark</span></a><span style="font-weight:400;">, a third-party measure of general AI assistants</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Early testing has shown mixed results – while some reviewers were impressed, others encountered bugs, error messages, and failures on practical tasks like ordering food or booking flights.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The system remains difficult to access due to limited server capacity, creating a scramble for invitation codes which are reportedly selling for thousands of dollars on Chinese reseller apps.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Manus has announced a strategic partnership with Alibaba’s Qwen team to help deal with the surge in traffic and expand its user base</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The emergence of Manus is raising questions about the global AI landscape, with some comparing it to January’s “DeepSeek moment” and questioning whether China has leapfrogged the US in AI development.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Privacy experts have raised concerns about data protection, noting uncertainty about server locations and potential data transfers to China.</span></li>
</ul>
<p><i><span style="font-weight:400;">02:16  Matthew – “It’s no different than giving all your personal information to ChatGPT. Sure, I don’t want to give it to China. But I also don’t like giving it to OpenAI either. </span></i></p>
<p><b>04:14</b> <a href="https://arstechnica.com/ai/2025/03/cloudflare-turns-ai-against-itself-with-endless-maze-of-irrelevant-facts/" target="_blank" rel="noreferrer noopener"><b>Cloudflare turns AI against itself with endless maze of irrelevant facts – Ars </b></a><a href="https://arstechnica.com/ai/2025/03/cloudflare-turns-ai-against-itself-with-endless-maze-of-irrelevant-facts/" target="_blank" rel="noreferrer noopener"><b>Technica</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.bing.com/ck/a?!&amp;&amp;p=99ecc817d5dd0e54eb7b2016eef6faacbe17b81a30035108cb02b6fed072d162JmltdHM9MTc0MzQ2NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=cloudflare&amp;u=a1aHR0cHM6Ly93d3cuY2xvdWRmbGFyZS5jb20v&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Cloudflare</span></a><span style="font-weight:400;"> has announced a new feature called “</span><a href="https://blog.cloudflare.com/ai-labyrinth/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AI Labyrinth</span></a><span style="font-weight:400;">” that aims to combat unauthorized AI data scraping by serving fake AI-generated content to bots. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The tool will attempt to thwart AI companies that crawl websites without permission to collect training data for LLM that power AI assistants like </span><a href="https://arstechnica.com/information-technology/2023/11/chatgpt-was-the-spark-that-lit-the-fire-under-generative-ai-one-year-ago-today/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">ChatGPT</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Instead of simply blocking the bots, Cloudflare’s new system lures them into a maze of realistic looking but irrelevant pages, wasting the crawlers computing resources. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The approach is a notable shift from the standard block-and-defend strategy used by most website protection services. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Cloudflare says blocking bots sometimes backfires because it alerts the crawlers operators that they’ve been detected. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">When Cloudflare detects unauthorized crawling, rather than blocking the request, it will link the bot to a series of AI-generated pages that are convincing enough to entice a crawler to traverse them. But while real looking, the content is not actually the content of the site they are protecting, so the crawler wastes time and resources.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The data is automatically generated by its </span><a href="https://developers.cloudflare.com/workers-ai/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Workers AI</span></a><span style="font-weight:400;"> service, a commercial platform that runs AI tasks.      </span></li>
</ul>
<p><i><span style="font-weight:400;">05:40  Ryan – “Yeah, is the hallucination in the model? Or is it the bad data that it’s being fed?”</span></i></p>
<p><b>07:05</b> <a href="https://openai.com/index/introducing-4o-image-generation/" target="_blank" rel="noreferrer noopener"><b>Introducing 4o Image Generation | OpenAI</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OpenAI</span></a><span style="font-weight:400;"> has long believed image generation should be a primary capability of their language models. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">That’s why they have built “the most advanced image generator yet” into GPT-4o, the result image generation that is beautiful, but also useful. For example: </span></li>
</ul>
<p><b>11:39</b> <a href="https://openai.com/index/introducing-our-next-generation-audio-models/" target="_blank" rel="noreferrer noopener"><b>Introducing next-generation audio models in the API</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Open AI</span></a><span style="font-weight:400;"> is launching a new speech to text and text to speech audio model in the API making it possible to build more powerful, customizable, and intelligent voice agents that offer real value. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The latest speech to text models set a new state of the art benchmark, outperforming existing solutions in accuracy and reliability — especially when dealing with accents, noisy environments and varying speech speeds. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">These enhancements are in the gpt-4o-transcribe and gpt-4o-mini-transcribe models with improvements to word error rate and better language recognition and accuracy, compared to the original whisper models. </span></li>
</ul>
<p><i><span style="font-weight:400;">Show note editor aside: As a historian (who specialized in Byzantine and early Medieval studies) tech jargon can sometimes be difficult for me to interpret just by ear. I can sometimes tell when the transcript is off, but sometimes I can’t, and more efficient transcripts would be awesome. </span></i></p>
<h2><b>Cloud Tools </b></h2>
<p><b>12:44</b> <a href="https://thenewstack.io/valkey-8-1s-performance-gains-disrupt-in-memory-databases/" target="_blank" rel="noreferrer noopener"><b>Valkey 8.1’s Performance Gains Disrupt In-Memory Databases</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">This article on </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=ad7b394032773f5ee0ea065f21f89c829e7a5aef48467376f4bae8e059996dedJmltdHM9MTc0MzQ2NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=valkey&amp;u=a1aHR0cHM6Ly92YWxrZXkuaW8v&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Valkey</span></a><span style="font-weight:400;"> caught Justin’s eye, as it’s been a year since </span><a href="https://www.bing.com/aclk?ld=e8qX0a77oGunz1cK6-Fbd1TTVUCUxgxCqKeU8w2ucEzs6tSBPb5v2E2FGTdF5xwupK1ZnIR1e9eq68PSlvXNBbiVjELIMnYUnAFND79yhkR5Re7y_Zya2ULMHdTcWfqkFA-1qxRoxDGWyGdfnSHqgVZvXR6Z-ijHNHYlFxK580zEDRKNtjjxnGHWgtO_aPKZohNjsDTA&amp;u=aHR0cHMlM2ElMmYlMmZyZWRpcy5pbyUyZmNsb3VkJTNmdXRtX2NhbXBhaWduJTNkYmdfc19jb3JlX2FtZXJ0MV9lbl9icmFuZF9hY3Ffc3RhdGljXzU4MDMyODA2NSUyNnV0bV9zb3VyY2UlM2RiaW5nJTI2dXRtX21lZGl1bSUzZGNwYyUyNnV0bV9jb250ZW50JTNkcmVkaXNfZXhhY3QlMjZ1dG1fdGVybSUzZCUyNm1zY2xraWQlM2RlOTJjNmQyMTg5YWYxNjFhNDlmMjBhY2QyNGVhOTA0ZQ&amp;rlid=e92c6d2189af161a49f20acd24ea904e&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Redis</span></a><span style="font-weight:400;"> announced they were dumping the </span><a href="https://opensource.org/license/bsd-3-clause" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">BSD 3-clause licenses</span></a><span style="font-weight:400;"> and adopting the </span><a href="https://redis.com/legal/rsalv2-agreement/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">RSALv2</span></a><span style="font-weight:400;"> and </span><a href="https://redis.com/legal/rsalv2-agreement/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">SSPLv1</span></a><span style="font-weight:400;"> licenses. This is the event that birthed the </span><a href="https://thenewstack.io/linux-foundation-forks-the-open-source-redis-as-valkey/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Valkey fork</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Apparently the Valkey fork is turning out to be highly successful, per a Percona research paper, 75% of surveyed Redis users are considering migration due to recent licensing changes, and of those considering migration 75% are already testing, considering or adopted valkey.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Third party Redis Developer companies like </span><a href="https://redisson.org/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Redisson</span></a><span style="font-weight:400;"> are supporting both Redis and Valkey.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It’s not just the licensing that’s driving, but at the </span><a href="https://events.linuxfoundation.org/lf-member-summit/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Linux Foundation Member Summit</span></a><span style="font-weight:400;">, said that Valkey is far faster thanks to incorporating enhanced multi-threading and scalability features. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">That wasn’t the </span><a href="https://thenewstack.io/valkey-will-not-just-be-a-redis-retread/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">original plan</span></a><span style="font-weight:400;">, as they wanted to keep the open source spirit, but also wanted the value to be more than just a fork.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Initially at the first contributor summit in Seattle where we got together developers and users to try to figure out what this new project would look like. At the time it was expected to focus on caching, but users said they wanted more, with Valkey being a high performance database of all sorts of distributed workloads, and although that would cause a lot of complexity, the new core team took that on. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They were successful with Valkey 8 redesigning Redis’s single threaded event loop threading model with a more sophisticated multithreaded approach to I/O Operations which resulted in a 3x improvement in performance as well as 20% reduction in the size of separate cache tables.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Beyond that they have been improving the core by adding rust to add memory safety. As well as changing internal algorithms to improve reliability and failover times.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">As well as they have rebuilt the key-value store from scratch to take better advantage of modern hardware based on work done at Google.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">A ton of this will come out as part of </span><a href="https://github.com/valkey-io/valkey/releases" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Valkey 8.1</span></a><span style="font-weight:400;">.  </span></li>
</ul>
<p><i><span style="font-weight:400;">16:18  Matthew – “The performance improvements here are massive…it’s pretty amazing what they’re able to do now.” If they keep improving, Redis is just going to slowly die off due to their own causes.” </span></i></p>
<h2><b>AWS</b></h2>
<p><b>17:49</b> <a href="https://aws.amazon.com/blogs/aws/now-available-geography-information-for-all-aws-regions-and-availability-zones/" target="_blank" rel="noreferrer noopener"><b>Detailed geographic information for all AWS Regions and Availability Zones </b></a><a href="https://aws.amazon.com/blogs/aws/now-available-geography-information-for-all-aws-regions-and-availability-zones/" target="_blank" rel="noreferrer noopener"><b>is now available | AWS News Blog</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Starting today, you can get more “granular” visibility of geography. Amazon says that due to data sovereignty, the need for more details is super important.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They have added Geography to the </span><a href="https://aws.amazon.com/about-aws/global-infrastructure/regions_az/?p=ngi&amp;loc=2" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Region and Availability Zones</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Virginia is in the United States of America, in case you didn’t know.</span></li>
</ul>
<p><i><span style="font-weight:400;">21:22  Matthew – “So maybe FanDuel didn’t know that US East-1 is in Virginia, and in Virginia they can’t do gambling? So they got a fine there, but they can do it in Ohio, so now they know US East-2 is in Ohio.”</span></i></p>
<p><i><span style="font-weight:400;">Listener note: Is this update important to you? We’d love to hear more about that! Slack, X, Bluesky…you know where to find us. </span></i></p>
<p><b>22:33</b> <a href="https://press.aboutamazon.com/2025/3/new-capability-of-amazon-q-in-quicksight-makes-every-employee-their-own-data-analyst" target="_blank" rel="noreferrer noopener"><b>New Capability of Amazon Q in QuickSight Makes Every Employee Their </b></a><a href="https://press.aboutamazon.com/2025/3/new-capability-of-amazon-q-in-quicksight-makes-every-employee-their-own-data-analyst" target="_blank" rel="noreferrer noopener"><b>Own Data Analyst</b></a></p>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS has announced that </span><a href="https://www.bing.com/aclk?ld=e8FWc-HDej23oK4fyeqnakUDVUCUzCNv8nEbjADSPWyYmWaUJdKVuSik1VZEBxU9pEKBM0DWD7KV8Kz7zZ7gZl1CvZ6er7BVIIW_ulir_i0WJUuC1CQCtw_7BeTWEUwGXwsVXPtB3OwD7Pmg24EU5PfTaHey-dZh3wgT_ovFmKqqi6aaLf4DtHGHByxh88eYVqV1lslQ&amp;u=aHR0cHMlM2ElMmYlMmZhd3MuYW1hem9uLmNvbSUyZnElMmYlM2Z0cmslM2QzMTgwYzViZi0zMWE0LTRiMTQtYjhkOC1iMzAyNjE1NWE3N2YlMjZzY19jaGFubmVsJTNkcHMlMjZzX2t3Y2lkJTNkQUwhNDQyMiExMCE3MjAxODI3Njc0Nzk4MSEhISE3MjAxODgyMjI5MTUzMyEhNDg1NDMzOTc0ITExNTIyODk5Mjg5MjAwMzAlMjZlZl9pZCUzZGU5YzRmYzMwNzFiMzE4N2E3YmFmOWJkY2MwZGUxODg3JTNhRyUzYXMlMjZtc2Nsa2lkJTNkZTljNGZjMzA3MWIzMTg3YTdiYWY5YmRjYzBkZTE4ODc&amp;rlid=e9c4fc3071b3187a7baf9bdcc0de1887&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Q</span></a><span style="font-weight:400;"> in </span><a href="https://www.bing.com/aclk?ld=e8FmMdCTtL-0m_gvZazh66aDVUCUw3S_In2p0DT6VfDVeaB2YwqT56BTMnCoTAIZI5dj1dO-FZoeT2SWFaNQRiij89xkTr8yibLRL4zbEzFO-l0acYzCG0D2LkxZk9mcXSuacn4hKGuRRQtLRET0U1mvG3xj897akpna4BiM9ymlfMPnugDJTETNb4f5UAMkJwTWCEnw&amp;u=aHR0cHMlM2ElMmYlMmZhd3MuYW1hem9uLmNvbSUyZnBtJTJmcXVpY2tzaWdodCUyZiUzZnRyayUzZDEzMGUxNDQxLWM5MjQtNDViOS04OTlmLTBiNWU5YTQxZjdlYyUyNnNjX2NoYW5uZWwlM2RwcyUyNnNfa3djaWQlM2RBTCE0NDIyITEwITcxNjc0NjUwMTI5MjA3ISEhITcxNjc1MTgwMDg5OTEyISE0ODI1MjQyOTEhMTE0Njc5MTYyOTU2NjA3MyUyNmVmX2lkJTNkNjc5ZDgxYWVjYjIzMTIyNWZjMzhlNjQyN2E1YWZmMjclM2FHJTNhcyUyNm1zY2xraWQlM2Q2NzlkODFhZWNiMjMxMjI1ZmMzOGU2NDI3YTVhZmYyNw&amp;rlid=679d81aecb231225fc38e6427a5aff27&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">QuickSight</span></a><span style="font-weight:400;"> unlocks the ability for any employee to perform expert-level data analysis using natural language, without the need for specialized skills or expertise. </span></li>
</ul>
</li>
<li style="font-weight:400;"><i><span style="font-weight:400;">“We are at the beginning of a workplace transformation driven by agents, and Amazon QuickSight is pioneering how this technology can break down the technical barriers between employees and their data,” </span></i><b>said Dilip Kumar, vice president of Amazon Q Business, AWS.</b><i><span style="font-weight:400;"> “With the new scenarios capability, everyone becomes their data analyst who can dive deep into their company data, helping them unlock insights, make better decisions, and explore countless possibilities faster than ever.”</span></i></li>
</ul>
<p><b>25:07</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/03/aws-or2-om2-instances-opensearch-service/" target="_blank" rel="noreferrer noopener"><b>AWS announces OR2 and OM2 instances for Amazon OpenSearch Service</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.bing.com/ck/a?!&amp;&amp;p=181acaade1bae6501ac09e18761992012aa972ce897a8d319da40e5c4eb128d3JmltdHM9MTc0MzQ2NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=amazon+opensearch&amp;u=a1aHR0cHM6Ly9hd3MuYW1hem9uLmNvbS93aGF0LWlzL29wZW5zZWFyY2gv&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Opensearch</span></a><span style="font-weight:400;"> service introduces new instances of OR2 and OM2, expanding the opensearch optimized instance family. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The OR2 delivers up to 26% higher indexing throughput than previous OR1 instances and 70% over R7g instances.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The new OM2 instances provide up to 15% higher indexing throughput compared to OR1 instances and 66% over m7g instances in internal benchmarks.</span></li>
</ul>
<p><i><span style="font-weight:400;">25:27  Ryan – “It’s funny to see these announcements, years after running a giant Elasticsearch project for awhile. These are all the struggles, and they’re getting addressed through OpenSearch and Amazon running a giant farm of these things.” </span></i></p>
<p><b>26:42</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/03/amazon-corretto-24-available/" target="_blank" rel="noreferrer noopener"><b>Amazon Corretto 24 is now generally available</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.bing.com/ck/a?!&amp;&amp;p=e14c38a51bf61c298bfd80dea4bcca3ac445ee867f3fd47f1ca7609a56d17ea5JmltdHM9MTc0MzQ2NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=corretto+24&amp;u=a1aHR0cHM6Ly9kb2NzLmF3cy5hbWF6b24uY29tL2NvcnJldHRvL2xhdGVzdC9jb3JyZXR0by0yNC11Zy9kb3dubG9hZHMtbGlzdC5odG1s&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Correto 24</span></a><span style="font-weight:400;"> has been released, which is the </span><a href="https://openjdk.org/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OpenJDK</span></a><span style="font-weight:400;"> 24 feature release. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The next LTS version will be Java SE 25, which comes out in September. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The current LTS is 21. Considering everyone (including Justin) is still on Java 8, it might be time to upgrade. </span></li>
</ul>
<p><b>28:59</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/03/aws-expanded-support-console-mobile-app/" target="_blank" rel="noreferrer noopener"><b>AWS announces expanded service support in the AWS Console Mobile App</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">If you are eternally disappointed in the AWS Mobile app and its limited coverage, the latest update might make you much happier:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">24 additional services are now available including Service Quotas, Cloudfront, SES, Cloud 9, and AWS Batch via an integrated mobile web browser experience in the Console mobile app.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Justin appreciates the effort – but mostly we’re just hoping they’re not abandoning mobile native completely for the mobile app. </span></li>
</ul>
</li>
</ul>
<p><b>33:32</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/03/aws-network-firewall-flow-management-feature/" target="_blank" rel="noreferrer noopener"><b>AWS Network Firewall introduces new flow management feature</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS is giving you a new flow management feature for </span><a href="https://docs.aws.amazon.com/network-firewall/latest/developerguide/what-is-aws-network-firewall.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Network Firewall</span></a><span style="font-weight:400;"> that enables customers to identify and control active network flows.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This feature introduces two key functions:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Flow capture – which allows point in time snapshots of active flows</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Flow Flush, which enables selective termination of specific connections.  </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">33:53  Justin – “</span></i><i><span style="font-weight:400;">So flow capture is just the networking team is sick of providing packet captures, I imagine. So now it’s self-service. makes perfect sense.”</span></i></p>
<h2><b>GCP</b></h2>
<p><b>33:04</b> <span style="font-weight:400;">Google Next is coming up in a few short weeks. Want to see Justin in person? And maybe even get some stickers? Check out these critical sessions: </span></p>
<p><span style="font-weight:400;">–BRK2-024 – Workload-optimized data protection for mission-critical enterprise apps</span></p>
<p><span style="font-weight:400;">–BRK1-028 – Unlock value for your workloads: Microsoft, Oracle, OpenShift and more</span></p>
<p><b>37:04</b> <a href="https://cloud.google.com/blog/products/storage-data-transfer/google-cloud-backup-and-dr-protection-summary/" target="_blank" rel="noreferrer noopener"><b>Introducing protection summary, a new Google Cloud Backup and DR </b></a><a href="https://cloud.google.com/blog/products/storage-data-transfer/google-cloud-backup-and-dr-protection-summary/" target="_blank" rel="noreferrer noopener"><b>feature</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/blog/products/storage-data-transfer/console-gains-data-protection-interface-for-backup-and-dr?e=48754805" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Data protection</span></a><span style="font-weight:400;"> is critical to your cloud strategy, and that includes </span><a href="https://cloud.google.com/backup-disaster-recovery" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">backups and DR</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Making sure your backups are set up correctly and aligned with your RPO/RTO requirements is critical. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">However, collecting the data in your complex cloud environment can be tricky. So Google is giving you a preview of the Protection Summary and the data protection tab, a new feature in </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=299caf9e8e93a27816afee0a900608a3ce09f3598c8d316f9bdbe00098c82288JmltdHM9MTc0MzQ2NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=google+cloud+backup&amp;u=a1aHR0cHM6Ly9jbG91ZC5nb29nbGUuY29tL2JhY2t1cC1kaXNhc3Rlci1yZWNvdmVyeS9kb2NzL2NvbmNlcHRzL2JhY2t1cC1kcg&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Google Cloud Backup and DR</span></a><span style="font-weight:400;"> that provides a centralized view of your backup configurations, helps you identify gaps in your data protection, and empowers you to take action to improve your resiliency. </span></li>
<li style="font-weight:400;"><a href="https://www.bing.com/ck/a?!&amp;&amp;p=cd998d3c60d9a2f699a5f3453dff2e40f37a5ecf6ac331cf6626f6b6723009feJmltdHM9MTc0MzQ2NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=google+cloud+protection+summary&amp;u=a1aHR0cHM6Ly9jbG91ZC5nb29nbGUuY29tL2JhY2t1cC1kaXNhc3Rlci1yZWNvdmVyeS9kb2NzL2JhY2t1cC1hZG1pbi9wcm90ZWN0aW9uLXN1bW1hcnk&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Protection summary</span></a><span style="font-weight:400;"> will quickly help you identify resources with no backup configuration. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Quickly configure backups for resources and then assess the backup configurations and vulnerability to ransomware.</span></li>
</ul>
<p><i><span style="font-weight:400;">38:25  Ryan – “</span></i><i><span style="font-weight:400;">That was the first thing I was thinking about when I read through this was the the terrible-ness that I did 12 years ago to plug in some sort of backup errors to a slack channel so that we could pass an audit for notifications. It was ridiculous.”</span></i></p>
<p><b>39:23</b> <a href="https://cloud.google.com/blog/topics/partners/expanding-gen-ai-toolbox-for-databases-with-hypermode/" target="_blank" rel="noreferrer noopener"><b>Expanding Gen AI Toolbox for Databases with Hypermode</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google recently announced the </span><a href="https://cloud.google.com/blog/products/ai-machine-learning/announcing-gen-ai-toolbox-for-databases-get-started-today?e=48754805" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">public beta of AI Toolbox for Databases</span></a><span style="font-weight:400;">, and today they are excited to expand its capabilities through a new partnership with </span><a href="https://hypermode.com/blog/dgraph-google-gen-ai-toolbox-databases" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Hypermode</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><a href="https://github.com/googleapis/genai-toolbox" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Gen AI Toolbox for Databases</span></a><span style="font-weight:400;"> is an open source server that empowers application developers to connect production-grade, agent-based generative AI (gen AI) applications to databases. Toolbox streamlines the creation, deployment and management of sophisticated gen AI tools capable of querying databases with secure access, robust observability, scalability and comprehensive management. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Currently, the toolbox supports </span><a href="https://cloud.google.com/products/alloydb?e=48754805&amp;hl=en" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AlloyDB</span></a><span style="font-weight:400;">, </span><a href="https://cloud.google.com/spanner?e=48754805&amp;hl=en" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Spanner</span></a><span style="font-weight:400;">, </span><a href="https://cloud.google.com/sql?e=48754805&amp;hl=en" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Cloud SQL for PostgreSQL, MySQL, and SQL Server</span></a><span style="font-weight:400;">, as well as self-managed MySQL and PostgreSQL. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Justin doesn’t know what Hypermode is, so this announcement isn’t for him. But if you do know what Hypermode is, then today is a good day! </span></li>
</ul>
<p><b>41:42</b> <a href="https://cloud.google.com/blog/products/data-analytics/bigquery-repositories-integrates-with-git/" target="_blank" rel="noreferrer noopener"><b>Announcing BigQuery repositories: Git-based collaboration in BigQuery </b></a><a href="https://cloud.google.com/blog/products/data-analytics/bigquery-repositories-integrates-with-git/" target="_blank" rel="noreferrer noopener"><b>Studio</b></a><b>  </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Modern data teams use </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=4a89ef173ba1ac65548c9d4cd7d2db513dac7976dbd52fa2b11b48aa85c473f3JmltdHM9MTc0MzQ2NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=Git&amp;u=a1aHR0cHM6Ly9naXQtc2NtLmNvbS9kb3dubG9hZHM&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Git</span></a><span style="font-weight:400;"> to collaborate effectively and adopt software engineering best practices for managing their data pipeline and analytics code.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">But most tools don’t offer integration with Git version control systems, making Git workflow feel out of reach.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This forces users to copy and paste code between UIs, which is not only time-consuming but also error prone. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">To help, they’re releasing in preview “</span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=d1fa10c521f534d1b50819f87175dd5eca2795ec198ac0498a18f6139e94f1f4JmltdHM9MTc0MzQ2NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=BigQuery+Repositories&amp;u=a1aHR0cHM6Ly9jbG91ZC5nb29nbGUuY29tL2Jsb2cvcHJvZHVjdHMvZGF0YS1hbmFseXRpY3MvYmlncXVlcnktcmVwb3NpdG9yaWVzLWludGVncmF0ZXMtd2l0aC1naXQ&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">BigQuery Repositories</span></a><span style="font-weight:400;">” a new experience in bigquery studio that helps data teams collaborate on code stored in git repositories. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">BigQuery repos provide a comprehensive set of features to integrate Git workflows directly into your BigQuery environment:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Setup new repos in BigQuery Studio where you can develop SQL queries, Notebooks, data preparation, data canvases, or text files with any file extension.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Connect your repositories to remote git hosts like GitHub, GitLab, and other popular Git platforms.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Edit the code in your repositories within a dedicated workspace, on your own copy of the code, before publishing changes to branches</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Perform most Git operations with a user-friendly interface that lets you inspect differences, commit changes, push updates, and create pull requests all within BigQuery Studio. </span></li>
</ul>
</li>
</ul>
<p><b>46:06</b> <a href="https://blog.google/technology/google-deepmind/gemini-model-thinking-updates-march-2025/" target="_blank" rel="noreferrer noopener"><b>Gemini 2.5: Our most intelligent AI model</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google has introduced </span><a href="https://deepmind.google/technologies/gemini" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Gemini 2.5</span></a><span style="font-weight:400;">, their most intelligent AI model. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The first 2.5 release is an experimental version of 2.5 Pro, which is state-of-the-art on a wide range of benchmarks and debuts at #1 on </span><a href="https://lmarena.ai/?leaderboard" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">LMArena</span></a><span style="font-weight:400;"> by a significant margin.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Gemini 2.5 models are thinking models, capable of reasoning through their thoughts before responding, resulting in enhanced performance and improved accuracy. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With Gemini 2.5, Google has achieved a new level of performance by combining a significantly enhanced base model with improved post-training. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Going forward, they will build thinking capabilities directly into all models, so they can handle more complex problems and support even more capable, context-aware agents. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google is very proud that 2.5 Pro takes the top of LMArena leaderboard</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Gemini 2.5 without test time techniques, like Majority voting, 2.5 leads in math and science benchmarks. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It also scores a state-of-the-art 18.8% across models without tools used on Humanity’s last exam, a dataset designed by hundreds of SME to capture the human frontier of knowledge and reasoning. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">2.5 will have a big leap over 2.0 on coding performance, as well as it excels at creating visually compelling web apps and agentic code applications, along with code transformation and editing. </span></li>
</ul>
<p><i><span style="font-weight:400;">47:27  Ryan – “Well, 2.o was a big fix over 1.5, so I’m hoping that it’s as big of an impact.”</span></i></p>
<h2><b>Azure</b></h2>
<p><b>49:23 </b><a href="https://techcommunity.microsoft.com/blog/appsonazureblog/announcing-the-public-preview-launch-of-azure-functions-durable-task-scheduler/4389670" target="_blank" rel="noreferrer noopener"><b>Announcing the public preview launch of Azure Functions durable task </b></a></p>
<p><a href="https://techcommunity.microsoft.com/blog/appsonazureblog/announcing-the-public-preview-launch-of-azure-functions-durable-task-scheduler/4389670" target="_blank" rel="noreferrer noopener"><b>scheduler</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft is announcing the public preview of Azure Functions Durable Task Scheduler. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This new Azure-managed backend is designed to provide high performance, improve reliability, reduce operational overhead, and simplify monitoring your stateful orchestrations. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Durable functions provide you a simplified way to develop complex, stateful and long-running apps in the serverless environment.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This allows developers to orchestrate multiple function calls without having to handle fault tolerance. It’s great for scenarios like orchestrating multiple agents, distributed transactions, big data processing, batch processing like ETL (Extract, Transform and Load), Async APis, and essentially any scenario that requires chaining function calls with state persistence. </span></li>
</ul>
<p><i><span style="font-weight:400;">47:27  Matthew – “I</span></i><i><span style="font-weight:400;">t’s step functions with a CloudWatch event that triggers it…It’s going to do everything that step functions can do.”</span></i></p>
<p><b>52:29</b> <a href="https://techcommunity.microsoft.com/blog/appsonazureblog/announcing-ga-for-azure-container-apps-serverless-gpus/4394302" target="_blank" rel="noreferrer noopener"><b>Announcing GA for Azure Container Apps Serverless GPUs | Microsoft </b></a><a href="https://techcommunity.microsoft.com/blog/appsonazureblog/announcing-ga-for-azure-container-apps-serverless-gpus/4394302" target="_blank" rel="noreferrer noopener"><b>Community Hub</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><a href="https://aka.ms/aca/serverless-gpus" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Azure Container Apps Serverless GPU’s</span></a><span style="font-weight:400;"> are now GA.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This allows you to seamlessly run your AI workloads on-demand with automatic scaling, optimized cold strat, per-second billing, and reduced operational overhead. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Nvidia powers the serverless GPU’s which allows you to seamlessly run billing with scale down to zero when not in use. Thus, reducing operational overhead to support easy real-time custom model inference and other GPU-accelerated workloads. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In addition this supports </span><a href="https://developer.nvidia.com/nim" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">NVIDIA NIM microservices</span></a><span style="font-weight:400;">, which are part of the </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=03b2d3fa1d17db813536804f7661d88a4e9c0f33127e03ca0a22d0ad1c2eab45JmltdHM9MTc0MzQ2NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=nvidia+ai+enterprise&amp;u=a1aHR0cHM6Ly93d3cubnZpZGlhLmNvbS9lbi11cy9kYXRhLWNlbnRlci9wcm9kdWN0cy9haS1lbnRlcnByaXNlLw&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Nvidia AI Enterprise</span></a><span style="font-weight:400;">, its a set of easy to use microservices designed for secure, reliable deployment of high-performance AI model inference at scale. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Key Benefits for Serverless GPU’s</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Scale-to zero GPUs: Support for serverless scaling of NVIDIA A100 and T4 GPUs.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Per-second billing: Pay only for the GPU compute you use.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Built-in data governance: Your data never leaves the container boundary.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Flexible compute options: Choose between NVIDIA A100 and T4 GPUs.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Middle-layer for AI development: Bring your own model on a managed, serverless compute platform and easily run your AI applications alongside your existing apps.</span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">47:27  Ryan – “I want to make fun of this, but I love the fact that it scales to zero. If I were making some sort of application, I’d go bankrupt without something like this in place, so I think it’s kind of neat.” </span></i></p>
<p><b>54:53</b> <a href="https://azure.microsoft.com/en-us/blog/microsoft-and-nvidia-accelerate-ai-development-and-performance/" target="_blank" rel="noreferrer noopener"><b>Microsoft and NVIDIA accelerate AI development and performance </b></a></p>
<p><a href="https://azure.microsoft.com/en-us/blog/accelerating-agentic-workflows-with-azure-ai-foundry-nvidia-nim-and-nvidia-agentiq/" target="_blank" rel="noreferrer noopener"><b>Accelerating agentic workflows with Azure AI Foundry, NVIDIA NIM, and NVIDIA AgentIQ</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft and NVIDIA have several enhancements to help shape the future of AI. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This includes integrating the newest </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=a3335be08daa3853ae6b150973d91d7c5740ee6a68624f8100260b65d2a202fdJmltdHM9MTc0MzQ2NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=blackwelll+platform&amp;u=a1aHR0cHM6Ly93d3cubnZpZGlhLmNvbS9lbi11cy9kYXRhLWNlbnRlci90ZWNobm9sb2dpZXMvYmxhY2t3ZWxsLWFyY2hpdGVjdHVyZS8&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Blackwell</span></a><span style="font-weight:400;"> platform on Azure AI, incorporating </span><a href="https://developer.nvidia.com/nim?sortBy=developer_learning_library%2Fsort%2Ffeatured_in.nim%3Adesc%2Ctitle%3Aasc" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">NVIDIA NIM microservices</span></a><span style="font-weight:400;"> into </span><a href="https://azure.microsoft.com/products/ai-foundry/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Azure AI Foundry</span></a><span style="font-weight:400;">, and empowering developers to accelerate their innovations and solve challenging problems. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">NIM provides optimized containers for more than two dozen popular foundation models, allowing developers to deploy generative AI applications and agents quickly. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">These new integrations can accelerate inference workloads for models available on Azure, providing significant performance improvements, greatly supporting the growing use of AI agents. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Key features include optimized model throughput for NVIDIA accelerated computing platforms, prebuilt microservices deployable anywhere and enhanced accuracy for specific use cases. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">General availability of GB200 V6 virtual machine series accelerated by NVIDIA GB200 NVL72 and NVIDIA Quantum Infiniband networking. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Once you have NVIDIA NIM deployed, Nvidia AgentIQ takes center stage with its open source toolkit designed to seamlessly connect, profile and optimize teams of AI agents, enabling your systems to run at peak performance. AgentIQ delivers:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Profiling and optimization</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Dynamic inference enhancements</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Integration with Semantic Kernel</span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">55:50 Justin – “</span></i><i><span style="font-weight:400;">It gives you the PyTorch type tools, all the different capabilities you might want to use to use your GPUs effectively, to do training or inference – all prebuilt into the NIM containers that are prebuilt for you. That’s what it is. They made it sound like it was special, but it’s not.”</span></i></p>
<p><b>58:08</b> <a href="https://www.microsoft.com/en-us/security/blog/2025/03/24/microsoft-unveils-microsoft-security-copilot-agents-and-new-protections-for-ai/" target="_blank" rel="noreferrer noopener"><b>Microsoft unveils Microsoft Security Copilot agents and new protections </b></a><a href="https://www.microsoft.com/en-us/security/blog/2025/03/24/microsoft-unveils-microsoft-security-copilot-agents-and-new-protections-for-ai/" target="_blank" rel="noreferrer noopener"><b>for AI</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Last year Microsoft launched </span><a href="https://www.microsoft.com/security/business/solutions/generative-ai-cybersecurity" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Security Copilot</span></a><span style="font-weight:400;"> to empower defenders to detect, investigate and respond to security incidents swiftly and accurately.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Now they are announcing Security Copilot with AI agents designed to autonomously assist with critical areas such as phishing, data security and identity management. The relentless pace and complexity of cyberattacks have surpassed human capacity and establishing AI agents is a necessity for modern security. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft’s </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=6fc0db22b9069df85d0f9ef038d91d7b7bc85506097995ad954ac727e76b40caJmltdHM9MTc0MzQ2NTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=0a813a60-1d44-6490-24a2-2fa21caa6502&amp;psq=microsoft+threat+intelligence&amp;u=a1aHR0cHM6Ly93d3cubWljcm9zb2Z0LmNvbS9lbi11cy9zZWN1cml0eS9ibG9nL3RvcGljL3RocmVhdC1pbnRlbGxpZ2VuY2UvP21zb2NraWQ9MGE4MTNhNjAxZDQ0NjQ5MDI0YTIyZmEyMWNhYTY1MDI&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Threat Intelligence</span></a><span style="font-weight:400;"> now processes 84 trillion signals per day, revealing the exponential growth in cyberattacks. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Today, they are launching 6 </span><a href="https://aka.ms/SecurityCopilotagents" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Security Copilot agents</span></a><span style="font-weight:400;"> built by Microsoft and 5 built by their partners available in preview in April. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The five agents from Microsoft:</span>
<ul>
<li style="font-weight:400;"><b>The Phishing Triage Agent in </b><a href="https://www.microsoft.com/security/business/microsoft-defender" target="_blank" rel="noreferrer noopener"><b>Microsoft Defender</b></a><span style="font-weight:400;"> triages phishing alerts accurately to identify real cyber threats and false alarms. It provides easy-to-understand explanations for its decisions and improves detection based on admin feedback.</span></li>
<li style="font-weight:400;"><b>Alert Triage Agents in </b><a href="http://aka.ms/DSIblog" target="_blank" rel="noreferrer noopener"><b>Microsoft Purview</b></a><span style="font-weight:400;"> triage data loss prevention and insider risk alerts, prioritize critical incidents, and continuously improve accuracy based on admin feedback.</span></li>
<li style="font-weight:400;"><b>Conditional Access Optimization Agent</b><span style="font-weight:400;"> in </span><a href="https://aka.ms/Secure2025/MicrosoftEntraNews" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Microsoft Entra</span></a><span style="font-weight:400;"> monitors for new users or apps not covered by existing policies, identifies necessary updates to close security gaps, and recommends quick fixes for identity teams to apply with a single click.</span></li>
<li style="font-weight:400;"><b>Vulnerability Remediation Agent in Microsoft Intune</b><span style="font-weight:400;"> monitors and prioritizes vulnerabilities and remediation tasks to address app and policy configuration issues and expedites Windows OS patches with admin approval.</span></li>
<li style="font-weight:400;"><b>Threat Intelligence Briefing Agent in Security Copilot</b><span style="font-weight:400;"> automatically curates relevant and timely threat intelligence based on an organization’s unique attributes and cyber threat exposure.</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">The five agentic solutions from partners include:</span>
<ul>
<li style="font-weight:400;"><b>Privacy Breach Response Agent by OneTrust</b><span style="font-weight:400;"> analyzes data breaches to generate guidance for the privacy team on how to meet regulatory requirements.</span></li>
<li style="font-weight:400;"><b>Network Supervisor Agent by Aviatrix</b><span style="font-weight:400;"> performs root cause analysis and summarizes issues related to VPN, gateway, or Site2Cloud connection outages and failures.</span></li>
<li style="font-weight:400;"><b>SecOps Tooling Agent by BlueVoyant</b><span style="font-weight:400;"> assesses a security operations center (SOC) and state of controls to make recommendations that help optimize security operations and improve controls, efficacy, and compliance.</span></li>
<li style="font-weight:400;"><b>Alert Triage Agent by Tanium</b><span style="font-weight:400;"> provides analysts with the necessary context to quickly and confidently make decisions on each alert.</span></li>
<li style="font-weight:400;"><b>Task Optimizer Agent by Fletch </b><span style="font-weight:400;">helps organizations forecast and prioritize the most critical cyberthreat alerts to reduce alert fatigue and improve security.</span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">59:42  Ryan – “</span></i><i><span style="font-weight:400;">So as the new security guy who’s learning all these tools and going through all the things that are in Microsoft Defender, I am very skeptical that this is going to actually solve any issues. But sweet Jesus, if it’s an improvement on what Microsoft Defender already does, it’d be welcome. The patterns and stuff that are detected natively in those tools just by default is not good enough, and so you have to spend a ton of time trolling through too much data to make these things work for anything other than forensic investigation after the fact.”</span></i></p>
<h2><b>Oracle</b></h2>
<p><b>1:03:02 </b><a href="https://www.oracle.com/news/announcement/oracle-introduces-ai-agent-studio-2025-03-20/" target="_blank" rel="noreferrer noopener"><b>Oracle Introduces AI Agent Studio</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Oracle has announced </span><a href="https://www.oracle.com/applications/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Oracle AI Agent studio for Fusion Applications</span></a><span style="font-weight:400;">, a comprehensive platform for creating, extending, deploying and managing AI agents and agent teams across your enterprise. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This is part of the Oracle Fusion Cloud Application Suite, the new AI Agent Studio provides easy-to-use tools for customers and partners to create customized AI agents that address complex business needs and can help drive new levels of productivity. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Oracle AI agent Studio includes:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Agent Template Libraries</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Agent Team Orchestration</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Agent Extensibility</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Choice of LLMs</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Native Fusion Integration</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Third-party system integration</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Trust and Security framework</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Validation and testing tools</span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">1:03:41  Matthew – “Oracle showed up to the AI Agent party.”</span></i></p>
<h2><span style="font-weight:400;">Closing</span></h2>
<p><span style="font-weight:400;">And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</span></p>
]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2005858/c1e-4919c17njxbmzon6-mkx8rxz4annk-rgurob.mp3" length="78025581"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 298 of The Cloud Pod – where the forecast is always cloudy! Justin, Matthew and Ryan are in the house (and still very much missing Jonathan) to bring you a  jam packed show this week, with news from Beijing to Virginia! Did you know Virginia was in the US? Amazon definitely wants you to know that. 
We’ve got updates from BigQuery Git Support and their new collab tools, plus all the AI updates you were hoping you’d miss. Tune in now! 
Titles we almost went with this week:

The Cloud Pod now Recorded from Planet Earth
☕Wait Java still exists?
When will java just be coffee and not software
Cloudflare Makes AI beat Mazes
Replacing native mobile things with mobile web apps won’t fix your problems AWS
Turn your security over to the bots
The Cloud Pod is lost in the AI labyrinth 
AI security agents to secure the AI… wait recursion
Durable + Stateless.. I don’t know if you know what those words means
Click ops expands to our phones yay!
The Cloud Pod is now a data analyst 
⁉️Gitops come to bigquery

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
AI Is Going Great – Or How ML Makes All Its Money  
00:46 Manus, a New AI Agent From China is Going Viral—And Raising Big Questions  

Manus is being described as “the first true autonomous AI agent” from China, capable of completing weeks of professional work in hours.
Developed by a team called Butterfly Effect with offices in Beijing and Wuhan, Manus functions as a truly autonomous agent that independently analyzes, plans, and executes complex tasks. 
The system uses a multi-agent architecture powered by several distinct AI models, including Anthropic’s ]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2005858/c1a-k5d5-z3d5n28wf994-w1ocqc.webp"></itunes:image>
                                                                            <itunes:duration>01:05:02</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                                    <podcast:chapters url="https://media-assets.castos.com/chapters/2005858/chapter-data.json"
                        type="application/json" />
                            </item>
                    <item>
                <title>
                    <![CDATA[297: Save the Date So You Can Get Some Skills – In AI!]]>
                </title>
                <pubDate>Wed, 26 Mar 2025 20:33:58 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/2000740</guid>
                                    <link>https://tcpfm.castos.com/episodes/297-save-the-date-ai-skills</link>
                                <description>
                                            <![CDATA[<p><span style="font-weight:400;">Welcome to episode 297 of The Cloud Pod – where the forecast is always cloudy! Justin, Ryan, and Matthew have beaten the black lung and are in the studio – ready to bring you all the latest and greatest in cloud and AI news! We’ve got Wiz buyouts (that security, it’s so hot right now!) Gemma 3, Glue 5 (but not 3 or 4) and Gemini Robots – plus looking forward to AI Skills Fest and Google Next, all this week on The Cloud Pod. </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">Google! Yer a WIZ—Ard</span></li>
<li><span style="font-weight:400;">Google Announces Network Security Integration… and that must include WIZ</span></li>
<li><span style="font-weight:400;">Gemini Robots…. What could go wrong </span></li>
<li><span style="font-weight:400;">️AI Data Studios … So Hot Right Now</span></li>
<li><span style="font-weight:400;">I want 32 Billion dollars</span></li>
<li><span style="font-weight:400;">Azure Follow AWS in bad life choices – mk</span></li>
<li><span style="font-weight:400;">Wait Glue is more than v2</span></li>
<li><span style="font-weight:400;">What happened to Glue 3 and 4?</span></li>
<li><span style="font-weight:400;">5th Try and AWS Glue still sucks</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>Follow Up </b></h2>
<p><b>01:05 </b><a href="https://www.nature.com/articles/d41586-025-00829-2" target="_blank" rel="noreferrer noopener"><b>Microsoft quantum computing claim still lacks evidence: physicists are </b></a><a href="https://www.nature.com/articles/d41586-025-00829-2" target="_blank" rel="noreferrer noopener"><b>dubious</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">A MS researcher presented results behind the company’s </span><a href="https://www.nature.com/articles/d41586-025-00527-z" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">controversial claim</span></a><span style="font-weight:400;"> to have created the first </span><a href="https://quantum.microsoft.com/en-us/insights/education/concepts/topological-qubits" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">topological qubits</span></a><span style="font-weight:400;"> – a long-sought goal of quantum computing. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Theorists said it’s a hard problem, and that it was a beautiful talk but the claims come without evidence, and people think they have gone overboard. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The Head of Quantum at Amazon was also highly skeptical:</span>
<ul>
<li style="font-weight:400;"><a href="https://www.businessinsider.com/amazon-exec-casts-doubt-microsoft-quantum-claims-2025-3" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">https://www.businessinsider.com/amazon-exec-casts-doubt-microsoft-quantum-claims-2025-3</span></a></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">02:09  Justin – “</span></i><i><span style="font-weight:400;">No one’s really buying Microsoft actually created a new topological qubit. There’s some doubt… basically they said that what they showed, which is a microscopic H-shaped aluminum wire on top of indium arsenide – a superconductor at ultra-cold temperatures, and the devices are designed to harness majoranas, previously undiscovered quasi-particles that are essential for topological qubits to work, and the goals for majoranas to appear at the four tips of the H-shaped wire emerging from reflective-behavior electrons, and these majorans in theory could be used to perform quantum computing that are resistant to informat...</span></i></p>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 297 of The Cloud Pod – where the forecast is always cloudy! Justin, Ryan, and Matthew have beaten the black lung and are in the studio – ready to bring you all the latest and greatest in cloud and AI news! We’ve got Wiz buyouts (that security, it’s so hot right now!) Gemma 3, Glue 5 (but not 3 or 4) and Gemini Robots – plus looking forward to AI Skills Fest and Google Next, all this week on The Cloud Pod. 
Titles we almost went with this week:

Google! Yer a WIZ—Ard
Google Announces Network Security Integration… and that must include WIZ
Gemini Robots…. What could go wrong 
️AI Data Studios … So Hot Right Now
I want 32 Billion dollars
Azure Follow AWS in bad life choices – mk
Wait Glue is more than v2
What happened to Glue 3 and 4?
5th Try and AWS Glue still sucks

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
Follow Up 
01:05 Microsoft quantum computing claim still lacks evidence: physicists are dubious

A MS researcher presented results behind the company’s controversial claim to have created the first topological qubits – a long-sought goal of quantum computing. 
Theorists said it’s a hard problem, and that it was a beautiful talk but the claims come without evidence, and people think they have gone overboard. 
The Head of Quantum at Amazon was also highly skeptical:

https://www.businessinsider.com/amazon-exec-casts-doubt-microsoft-quantum-claims-2025-3



02:09  Justin – “No one’s really buying Microsoft actually created a new topological qubit. There’s some doubt… basically they said that what they showed, which is a microscopic H-shaped aluminum wire on top of indium arsenide – a superconductor at ultra-cold temperatures, and the devices are designed to harness majoranas, previously undiscovered quasi-particles that are essential for topological qubits to work, and the goals for majoranas to appear at the four tips of the H-shaped wire emerging from reflective-behavior electrons, and these majorans in theory could be used to perform quantum computing that are resistant to informat...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[297: Save the Date So You Can Get Some Skills – In AI!]]>
                </itunes:title>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p><span style="font-weight:400;">Welcome to episode 297 of The Cloud Pod – where the forecast is always cloudy! Justin, Ryan, and Matthew have beaten the black lung and are in the studio – ready to bring you all the latest and greatest in cloud and AI news! We’ve got Wiz buyouts (that security, it’s so hot right now!) Gemma 3, Glue 5 (but not 3 or 4) and Gemini Robots – plus looking forward to AI Skills Fest and Google Next, all this week on The Cloud Pod. </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">Google! Yer a WIZ—Ard</span></li>
<li><span style="font-weight:400;">Google Announces Network Security Integration… and that must include WIZ</span></li>
<li><span style="font-weight:400;">Gemini Robots…. What could go wrong </span></li>
<li><span style="font-weight:400;">️AI Data Studios … So Hot Right Now</span></li>
<li><span style="font-weight:400;">I want 32 Billion dollars</span></li>
<li><span style="font-weight:400;">Azure Follow AWS in bad life choices – mk</span></li>
<li><span style="font-weight:400;">Wait Glue is more than v2</span></li>
<li><span style="font-weight:400;">What happened to Glue 3 and 4?</span></li>
<li><span style="font-weight:400;">5th Try and AWS Glue still sucks</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>Follow Up </b></h2>
<p><b>01:05 </b><a href="https://www.nature.com/articles/d41586-025-00829-2" target="_blank" rel="noreferrer noopener"><b>Microsoft quantum computing claim still lacks evidence: physicists are </b></a><a href="https://www.nature.com/articles/d41586-025-00829-2" target="_blank" rel="noreferrer noopener"><b>dubious</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">A MS researcher presented results behind the company’s </span><a href="https://www.nature.com/articles/d41586-025-00527-z" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">controversial claim</span></a><span style="font-weight:400;"> to have created the first </span><a href="https://quantum.microsoft.com/en-us/insights/education/concepts/topological-qubits" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">topological qubits</span></a><span style="font-weight:400;"> – a long-sought goal of quantum computing. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Theorists said it’s a hard problem, and that it was a beautiful talk but the claims come without evidence, and people think they have gone overboard. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The Head of Quantum at Amazon was also highly skeptical:</span>
<ul>
<li style="font-weight:400;"><a href="https://www.businessinsider.com/amazon-exec-casts-doubt-microsoft-quantum-claims-2025-3" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">https://www.businessinsider.com/amazon-exec-casts-doubt-microsoft-quantum-claims-2025-3</span></a></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">02:09  Justin – “</span></i><i><span style="font-weight:400;">No one’s really buying Microsoft actually created a new topological qubit. There’s some doubt… basically they said that what they showed, which is a microscopic H-shaped aluminum wire on top of indium arsenide – a superconductor at ultra-cold temperatures, and the devices are designed to harness majoranas, previously undiscovered quasi-particles that are essential for topological qubits to work, and the goals for majoranas to appear at the four tips of the H-shaped wire emerging from reflective-behavior electrons, and these majorans in theory could be used to perform quantum computing that are resistant to information loss, but no proof, no evidence, and they think Microsoft’s full of it.”</span></i></p>
<h2><b>General News </b></h2>
<p><b>04:12</b> <a href="https://cloud.google.com/blog/products/identity-security/google-announces-agreement-acquire-Wiz" target="_blank" rel="noreferrer noopener"><b>Google + Wiz: Strengthening Multicloud Security</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google has announced the signing of a definitive agreement to acquire Wiz. This will allow them to better provide business and governments with more choice in how they protect themselves. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google answers why now… and that they have seen their </span><a href="https://www.mandiant.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Mandiant</span></a><span style="font-weight:400;"> consultants witness the accelerating number and severity of breaches. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Most organizations are going digital, and most deployments are multi-cloud or hybrid. Both of which introduce complex management changes. This is occurring while software and AI platforms are becoming deeply embedded across products and operations. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Traditional approaches to cybersecurity struggles to keep up with this evolving landscape. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google points out that they have </span><a href="https://www.mandiant.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Threat Intelligence</span></a><span style="font-weight:400;">, </span><a href="https://cloud.google.com/blog/products/identity-security/introducing-google-security-operations-intel-driven-ai-powered-secops-at-rsa?e=48754805" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Security Operations</span></a><span style="font-weight:400;">, and </span><a href="https://cloud.google.com/security/consulting/mandiant-services?e=48754805&amp;hl=en" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Consulting</span></a><span style="font-weight:400;">, but Wiz provides them with a seamless cloud security platform that connects all major clouds and code environments to help prevent incidents from happening in the first place.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Wiz’s solution scans your environment, constructing a comprehensive graph of code, cloud resources, services and applications — along with the connections between them. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It identifies potential attack paths, prioritizes the most critical risks based on their impact, and empowers enterprise developers to secure applications before deployments. </span></li>
</ul>
<p><i><span style="font-weight:400;">06:24  Ryan – “</span></i><i><span style="font-weight:400;">‘m very surprised by this announcement just because they’ve been really touting the Mandiant and both Chronicle into the existing Security Center tools. And then a lot of these reasons why they’re saying WIS is better is specifically stuff that’s been added, like Security Center Enterprise.I Wonder what they had to have from Wiz. With all the security tools that are out there, you buy the market leader for as much money as that.”</span></i></p>
<p><b>09:07</b> <a href="https://apnews.com/article/google-alphabet-wiz-32-billion-e50fb41b9a84a1056a116f963e6efed0" target="_blank" rel="noreferrer noopener"><b>Google to buy cybersecurity firm Wiz for $32 billion, the largest deal in </b></a><a href="https://apnews.com/article/google-alphabet-wiz-32-billion-e50fb41b9a84a1056a116f963e6efed0" target="_blank" rel="noreferrer noopener"><b>company history</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google will buy cybersecurity firm Wiz for $32 billion for the tech giant’s in-house cloud computing amid burgeoning artificial intelligence growth. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The all-cash acquisition </span><a href="https://blog.google/inside-google/company-announcements/google-agreement-acquire-wiz/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">announced Tuesday</span></a><span style="font-weight:400;"> would be Google’s biggest in its 26-year history, and is the biggest deal of 2025 so far. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">“</span><i><span style="font-weight:400;">Wiz and Google Cloud are both fueled by the belief that cloud security needs to be easier, more accessible, more intelligent, and democratized, so more organizations can adopt and use cloud and AI securely</span></i><span style="font-weight:400;">,” Wiz CEO Assaf Rappaport said in a blog post.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Last summer Wiz rejected a $23B dollar bid from Google.  </span></li>
</ul>
<h2><b>AI Is Going Great – Or How ML Makes All Its Money </b></h2>
<p><b>10:23</b> <a href="https://openai.com/global-affairs/openai-proposals-for-the-us-ai-action-plan/" target="_blank" rel="noreferrer noopener"><b>OpenAI’s proposals for the U.S. AI Action Plan</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">OpenAI shared their recommendations with the White House Office of Science and T(OSTP) for the upcoming </span><a href="https://www.whitehouse.gov/briefings-statements/2025/02/public-comment-invited-on-artificial-intelligence-action-plan/#:~:text=The%20AI%20Action%20Plan%20will,from%20hindering%20private%20sector%20innovation." target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">US AI Action Plan</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">As Sam Altman, CEO, has </span><a href="https://ia.samaltman.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">written</span></a><span style="font-weight:400;"> they are on the cusp of what he considers the next leap in prosperity: the intelligence age. But to do that, they must ensure that people have freedom of intelligence, by which they mean the freedom to access and benefit from AI as it advances, protected from both autocratic powers that would take people’s freedoms away, and layers of laws and bureaucracy that would prevent the realization of them. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">So what exactly does Open AI Propose:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">A regulatory strategy that ensures the freedom to innovate</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">An export control strategy that exports democratic AI</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">A copyright strategy that promotes the freedom to learn</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">A strategy to seize the infrastructure opportunity to drive growth</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">An ambitious government adoption strategy </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">You can read more about </span><a href="https://cdn.openai.com/global-affairs/openai-us-economicblueprint-feb-2025-edu-update.pdf" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Open AI’s Economic Blueprint</span></a><span style="font-weight:400;"> and see the official submission </span><a href="https://cdn.openai.com/global-affairs/ostp-rfi/ec680b75-d539-4653-b297-8bcf6e5f7686/openai-response-ostp-nsf-rfi-notice-request-for-information-on-the-development-of-an-artificial-intelligence-ai-action-plan.pdf" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">here</span></a><span style="font-weight:400;">.  </span></li>
</ul>
<p><i><span style="font-weight:400;">11:14  Justin – “I love when the company that’s going to benefit the most makes all the laws…”</span></i></p>
<h2><b>AWS</b></h2>
<p><b>12:42</b> <a href="https://aws.amazon.com/blogs/aws/aws-pi-day-data-foundation-for-analytics-and-ai/" target="_blank" rel="noreferrer noopener"><b>AWS Pi Day 2025: Data foundation for analytics and AI</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">3/14 just passed us by and another AWS Pi day occurred, this is the first year the blog post hasn’t been written by Jeff Barr who stepped away from the blog at the end of 2024</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This year’s PI day was a focus on </span><a href="https://pages.awscloud.com/NAMER-field-OE-Pi-Day-2025-interest.html?trk=38374f5c-e876-4893-808d-21708ff61043&amp;sc_channel=el" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">accelerating analytics and AI innovation</span></a><span style="font-weight:400;"> with a unified data foundation on AWS.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Several announcements that we’ll cover here in a few minutes… But it’s Pi Day and we really just wanted to be wowed by crazy metrics.</span></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/s3/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">S3</span></a><span style="font-weight:400;"> currently holds 400 Trillion objects, exabytes of data, and processes a mind-blowing 150M requests per second. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">A decade ago they didn’t have 100 customers storing more than a Petabyte of data on S3, now they have 1000’s of customers who have surpassed the 1 PB milestone. </span></li>
</ul>
<p><i><span style="font-weight:400;">14:01  Matthew – “150 million requests per second! That’s crazy.”</span></i></p>
<p><b>14:14</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/03/amazon-s3-reduces-pricing-object-tagging/?trk=4b29643c-e00f-4ab6-ab9c-b1fb47aa1708&amp;sc_channel=el" target="_blank" rel="noreferrer noopener"><b>Amazon S3 reduces pricing for S3 object tagging by 35%</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">S3 is reducing </span><a href="https://aws.amazon.com/s3/pricing/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">pricing</span></a><span style="font-weight:400;"> for S3 Object Tagging by 35% in all AWS regions to $0.0065 per 10,000 tags per month.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Object Tags are key-value pairs applied to S3 objects that can be created, updated or deleted at any time during the lifetime of the object. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">S3 Object tags are used for a lot of use cases, including providing fine-grained IAM access, object lifecycle rules, and replication requirements between regions.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Along with </span><a href="https://aws.amazon.com/s3/features/metadata/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">S3 Metadata</span></a><span style="font-weight:400;">, you can easily capture and query custom metadata that is stored in object tags. </span></li>
</ul>
<p><i><span style="font-weight:400;">14:37  Justin – “</span></i><i><span style="font-weight:400;">And I was thinking to myself, hmm, why would they need this? Most people don’t tag their stuff in S3, but then they released a feature not too long ago called S3 metadata, which allows you to easily capture and query custom metadata from your data and then store that in the object tag. And so I’m going to guess a lot of customers were very surprised about how much their tags were costing them. so Amazon agreed and gave you a discount. So you’re welcome.”</span></i></p>
<p><b>16:14</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/03/amazon-s3-tables-create-query-table-s3-console/?trk=4b29643c-e00f-4ab6-ab9c-b1fb47aa1708&amp;sc_channel=el" target="_blank" rel="noreferrer noopener"><b>Amazon S3 Tables add create and query table support in the S3 console</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/AmazonS3/latest/userguide/s3-tables-regions-quotas.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon S3 tables</span></a><span style="font-weight:400;"> are now GA and support create and query table operations directly from the </span><a href="https://aws.amazon.com/s3/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">S3</span></a><span style="font-weight:400;"> console using </span><a href="https://docs.aws.amazon.com/athena/latest/ug/what-is.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Athena</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With this new feature, you can now create a table, populate it with data, and query it with just a few steps in the S3 console.</span></li>
</ul>
<p><i><span style="font-weight:400;">16:34  Justin – “Anything to make me not go to Athena is a win.”</span></i></p>
<p><b>20:42</b> <a href="https://aws.amazon.com/blogs/aws/collaborate-and-build-faster-with-amazon-sagemaker-unified-studio-now-generally-available/" target="_blank" rel="noreferrer noopener"><b>Collaborate and build faster with Amazon SageMaker Unified Studio, now </b></a><a href="https://aws.amazon.com/blogs/aws/collaborate-and-build-faster-with-amazon-sagemaker-unified-studio-now-generally-available/" target="_blank" rel="noreferrer noopener"><b>generally available</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon is announcing the GA of </span><a href="https://aws.amazon.com/sagemaker/unified-studio/?trk=c4ea046f-18ad-4d23-a1ac-cdd1267f942c&amp;sc_channel=el" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Sagemaker Unified Studio</span></a><span style="font-weight:400;">, a single data and AI development environment where you can find and access all of the data in your organization and act on it using the best tool for the job across virtually any use case.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Announced at Re:Invent last year, the studio is a single data and AI development environment. It brings together a wide range of tools and standalone apps including </span><a href="https://aws.amazon.com/athena/?trk=c4ea046f-18ad-4d23-a1ac-cdd1267f942c&amp;sc_channel=el" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Athena</span></a><span style="font-weight:400;">, </span><a href="https://aws.amazon.com/emr/?trk=c4ea046f-18ad-4d23-a1ac-cdd1267f942c&amp;sc_channel=el" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">EMR</span></a><span style="font-weight:400;">, </span><a href="https://aws.amazon.com/glue/?trk=c4ea046f-18ad-4d23-a1ac-cdd1267f942c&amp;sc_channel=el" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Glue</span></a><span style="font-weight:400;">, </span><a href="https://aws.amazon.com/redshift/?trk=c4ea046f-18ad-4d23-a1ac-cdd1267f942c&amp;sc_channel=el" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Redshift</span></a><span style="font-weight:400;">, </span><a href="https://aws.amazon.com/managed-workflows-for-apache-airflow/?trk=c4ea046f-18ad-4d23-a1ac-cdd1267f942c&amp;sc_channel=el" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Managed Workflows for Apache Airflow</span></a><span style="font-weight:400;"> and the existing </span><a href="https://aws.amazon.com/sagemaker/studio/?trk=c4ea046f-18ad-4d23-a1ac-cdd1267f942c&amp;sc_channel=el" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Sagemaker Studio</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In addition they have announced several enhancements including:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">New Capabilities for </span><a href="https://aws.amazon.com/blogs/aws/collaborate-and-build-faster-with-amazon-sagemaker-unified-studio-now-generally-available/#feat-bedrock" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Bedrock in the Sagemaker Unified Studio</span></a></li>
<li style="font-weight:400;"><span style="font-weight:400;"> Integration of the foundational models, including </span><a href="https://www.anthropic.com/claude/sonnet" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Claude 3.7 Sonnet</span></a><span style="font-weight:400;"> and </span><a href="https://www.deepseek.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">DeepSeek-R1</span></a><span style="font-weight:400;">, which enables data sourcing from S3 within projects for KB creation, extends guardrail functionality to flows and provides a streamlined user management interface for domain admins to manage model governance across AWS accounts. </span></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/collaborate-and-build-faster-with-amazon-sagemaker-unified-studio-now-generally-available/#feat-q" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Q Developer is now Generally Available in the Sagemaker Unified Studio</span></a><span style="font-weight:400;">, the most capable generative AI assistant for software development, streamlines development in Sagemaker Unified Studio by providing natural language, conversational interfaces that simplify tasks like writing SQL queries, building ETL jobs, troubleshooting and generating real-time code suggestions</span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">22:28  Ryan – “</span></i><i><span style="font-weight:400;">I’m sure there’s data teams that love this, right? This is a tool that is built for them. It’s built for data spelunking and reporting on those jobs  across large data as well. So I’m sure it makes a lot of sense if you’re in that world every day, but it’s what I’m just trying to do, like whatever my podunk use case is. Like, I just want to graph out how many people log in or use this feature, do this thing. Gets a little complex.”</span></i></p>
<p><b>23:25</b> <a href="https://aws.amazon.com/blogs/aws/amazon-s3-tables-integration-with-amazon-sagemaker-lakehouse-is-now-generally-available/" target="_blank" rel="noreferrer noopener"><b>Amazon S3 Tables integration with Amazon SageMaker Lakehouse is now </b></a><a href="https://aws.amazon.com/blogs/aws/amazon-s3-tables-integration-with-amazon-sagemaker-lakehouse-is-now-generally-available/" target="_blank" rel="noreferrer noopener"><b>generally available</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon S3 tables with Amazon </span><a href="https://aws.amazon.com/blogs/aws/simplify-analytics-and-aiml-with-new-amazon-sagemaker-lakehouse/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">SageMaker Lakehouse</span></a><span style="font-weight:400;"> is now generally available, providing a unified S3 table data access across various analytical engines and tools. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can access Sagemaker Lakehouse from </span><a href="https://aws.amazon.com/sagemaker/unified-studio/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;sc_channel=el" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon SageMaker Unified Studio</span></a><span style="font-weight:400;">, a single data and AI development environment that combines functionality and tools from AWS analytics and AI/ML Services. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">All S3 tables data integrated with SageMaker Lakehouse can be queried from SageMaker Unified Studio and engines such as Athena, EMR, Redshift, and Iceberg-compatible engines like Spark and </span><a href="https://iceberg.apache.org/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Iceberg</span></a><span style="font-weight:400;">. </span></li>
</ul>
<p><i><span style="font-weight:400;">23:48  Ryan – “</span></i><i><span style="font-weight:400;">and you’ll need the studio, right? Because you’ll need all those services so you can do nine different ways of doing ETL and try and run a report across all of it. Makes perfect sense.”</span></i></p>
<p><b>24:51</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/03/aws-glue-data-catalog-views-glue-5-0/" target="_blank" rel="noreferrer noopener"><b>Announcing support of AWS Glue Data Catalog views with AWS Glue 5.0</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS is announcing support for AWS Glue Data Catalog with AWS glue 5 for Apache Spark Jobs.  Seems like a sticky situation. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">AWS glue data catalog views allow customers to create views from Glue 5.0 spark jobs that can be queried from multiple engines without requiring access to referenced tables. </span></li>
</ul>
<p><b>26:24</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/03/amazon-route-53-traffic-flow-visual-editor-improve-dns-policy-editing/" target="_blank" rel="noreferrer noopener"><b>Amazon Route 53 Traffic Flow introduces a new visual editor to improve </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2025/03/amazon-route-53-traffic-flow-visual-editor-improve-dns-policy-editing/" target="_blank" rel="noreferrer noopener"><b>DNS policy editing</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon Route 53 traffic flow now offers an enhanced user interface for improving DNS traffic policy editing. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Route 53 traffic flow is a network traffic management feature which simplifies the process of creating and maintaining DNS records in large and complex configurations, by providing users with an interactive DNS policy management flow chart in their web browser. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With this release you can easily understand and change the way traffic is routed between users and endpoints using the new features of the visual editor.  </span></li>
</ul>
<p><i><span style="font-weight:400;">26:40  Matthew – “</span></i><i><span style="font-weight:400;">OK, so about 10 years ago when they updated the Route 53 console, I did it like it then. And every time I go into it today, I get mad at it because I can’t figure out how to put a DNS entry in. Because you have to like, select, be like, type, and do that. I’m so used to Terraform. And this just makes me mad thinking about how bad it’s going to be. All I want to do is just put an A record somewhere.”</span></i></p>
<p><b>30:02</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/03/aws-backup-logically-air-gapped-vault-amazon-fsx/" target="_blank" rel="noreferrer noopener"><b>AWS Backup adds logically air-gapped vault support for Amazon FSx</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon is announcing the availability of </span><a href="https://console.aws.amazon.com/backup/home" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Backup</span></a><span style="font-weight:400;"> logically air-gapped vault support for Amazon FSx for Lustre, Amazon FSx for Windows File Server, and Amazon FSx for OpenZFS. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Logically air-gapped vault is a type of AWS backup vault that allows secure sharing of backups across accounts and organizations, supporting direct restore to reduce recovery time for a data loss event.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">A logically air-gapped vault stores immutable backup copies that are locked by default, and isolated with encryption using AWS owned keys</span></li>
</ul>
<h2><b>GCP</b></h2>
<p><b>31:31 </b><a href="https://cloud.withgoogle.com/next/25" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Google Nex</span></a><span style="font-weight:400;">t is coming up in a few short weeks!</span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">April 9-11 at Mandalay Bay in Las Vegas. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Two courses you should definitely be aware of (for guaranteed Cloud Pod stickers):</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">BRK2-024 – Workload-optimized data protection for mission-critical enterprise apps</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">BRK1-028 – Unlock value for your workloads: Microsoft, Oracle, OpenShift and more</span></li>
</ul>
</li>
</ul>
<p><b>33:59</b> <a href="https://cloud.google.com/blog/products/networking/introducing-network-security-integration/" target="_blank" rel="noreferrer noopener"><b>Streamlined Security: Introducing Network Security Integration</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Many google cloud customers have deep investments in third party security tools, from appliances to saas applications. They enforce consistent policies across multiple clouds. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The challenge of these solutions is that each cloud application and environment comes with its unique paradigms and challenges. This may lead to network re-architecture, high cost of operations or difficulty meeting compliance requirements. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">To help address this, Google is announcing </span><a href="https://cloud.google.com/network-security-integration/docs" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Network Security Integration</span></a><span style="font-weight:400;"> to address these challenges.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This will allow you to integrate third-party network appliances or service deployments with your Google Cloud Workload while maintaining a consistent policy across hybrid and multi-cloud environments without changing your routing policies or network architecture. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">To do this, it leverages Generic Network virtualization encapsulation aka Geneve tunneling, to securely deliver traffic to third party inspection destinations without modifying the original packets. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In addition, the integration helps accelerate application deployments and compliance with a producer/consumer model. This allows infrastructure operations teams to provide collector infrastructure as a service to application development teams, enabling dynamic consumption of IaaS. Support for the hierarchical firewall policy management further enforces compliance without delays. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">There are two primary modes for Network Security Integration:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Out-of-band integration (GA): Mirrors desired traffic to a separate destination for offline analysis. Supporting the following use cases:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Implementing advanced network security – use advanced offline analysis to detect known attacks based on predetermined signature patterns, and also identify previously unknown attacks with anomaly-based detection.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Improved application available and performance – diagnose and analyze what’s going on over the wire instead of relying on application logs</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Support regularly and compliance requirements</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">In-band integration (preview): Directs specific traffic to a third-party security stack for inline inspection </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Integrate natively with Cloud Next Generation Firewall (NGFW) and Third-party firewall </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Insert your preferred network security solution into brownfield application environments</span></li>
</ul>
</li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Several partners have comments in this article including Palo Alto, Fortinet, Checkpoint, Trellix, Corelight, cpacket networks, netscout and extrahop</span></li>
</ul>
<p><i><span style="font-weight:400;">36:21  Ryan – “I’m trying to figure out if this is amazing – or a way to burn money.”</span></i></p>
<p><b>39:25</b> <a href="https://blog.google/technology/developers/gemma-3/" target="_blank" rel="noreferrer noopener"><b>Introducing Gemma 3: The most capable model you can run on a single </b></a><a href="https://blog.google/technology/developers/gemma-3/" target="_blank" rel="noreferrer noopener"><b>GPU or TPU</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is introducing the latest version of Gemma – </span><a href="https://ai.google.dev/gemma" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Gemma </span></a><span style="font-weight:400;">3, a collection of lightweight, state of the art open models built from the same research and technology that powers the Gemini 2.0 models.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">These are the most advanced, portable and responsibly developed open models yet.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They are designed to run fast directly on devices from phones and laptops to workstations, helping developers create AI applications, where people need them. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Gemma 3 comes in a range of sizes from 1B, 4B, 12B and 27B allowing you to choose the best model for the specific hardware and performance needs. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">New Capabilities of Gemma 3:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Built with the world’s best single-accelerator model: Gemma 3 delivers state of the art performance for its size, outperforming Llama3-405B, DeepSeek-V3, and o3-mini in preliminary preference evaluations on LMArena’s leaderboard.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Go Global in 140 Languages, with out of the box support for over 35 languages and pretrained support for over 140 languages.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Create AI with advanced text and visual reading capabilities to analyze images, text and short videos, opening up new possibilities for interactive and intelligent applications. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Handle complex tasks with an expanded context window: Gemma 3 offers a 128k-token context window to let your application process and understand vast amounts of information</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Create AI-driven workflows using function calling, which lets you automate tasks and build agentic experiences</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">High performance is delivered faster with quantized models, reducing the model size and computational requirements while maintaining high accuracy</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Alongside Gemma 3, they are also launching </span><a href="https://developers.googleblog.com/en/safer-and-multimodal-responsible-ai-with-gemma/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">ShieldGemma2</span></a><span style="font-weight:400;">, a powerful 4B image safety checker built on the Gemma 3 foundation. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">ShieldGemma2 provides a ready-made solution for image safety, outputting safety labels across three safety categories: Dangerous content, sexually explicit and violence.  </span></li>
</ul>
<p><i><span style="font-weight:400;">41:31  Ryan – “</span></i><i><span style="font-weight:400;">These smaller models are getting me into AI because my initial forays with the larger models, like, this is not going to work. I don’t really want huge hardware, but I want to have the ability to have a model locally in my own environment. These are great because they’re quick and you can run them on just normal PCs. They work better if you do have GPUs, but they still work even on CPU.”</span></i></p>
<p><b>42:38</b> <a href="https://cloud.google.com/blog/products/ai-machine-learning/announcing-gemma-3-on-vertex-ai/" target="_blank" rel="noreferrer noopener"><b>Announcing Gemma 3 on Vertex AI</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><a href="https://developers.googleblog.com/en/introducing-gemma3/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Gemma 3</span></a><span style="font-weight:400;"> is of course available on </span><a href="https://console.cloud.google.com/vertex-ai/publishers/google/model-garden/gemma3" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Vertex AI Model Garden</span></a><span style="font-weight:400;">, giving you immediate access for fine-tuning and deployments. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can quickly adapt Gemma 3 to your use case using Vertex AI’s pre-built containers and deployment tools. </span></li>
</ul>
<p><b>42:56</b> <a href="https://deepmind.google/discover/blog/gemini-robotics-brings-ai-into-the-physical-world/?utm_source=keywordsnippet&amp;utm_medium=referral" target="_blank" rel="noreferrer noopener"><b>Gemini Robotics brings AI into the physical world</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is introducing Gemini Robotics, their Gemini 2.0 based model designed for robotics at </span><a href="https://deepmind.google/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Google DeepMind</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They have been making progress in how their Gemini model solves complex problems through multi-modal reasoning across text, images, audio and video. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Gemini Robotics is an advanced vision-language-action (VLA) model that was built on Gemini 2.0 with the addition of physical actions as a new output modality for the purpose of directly controlling robots. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The second model is Gemini Robotics-ER, a Gemini model with advanced spatial understanding. It enables roboticists to run their own programs using Gemini’s embodied reasoning (ER) abilities. (Is anyone else relieved this is embodied reasoning vs. emergency room?)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Both of these models enable a variety of robots to perform a wider range of real-world tasks than ever before. As part of our efforts, they are partnering with </span><a href="https://apptronik.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Apptronik</span></a><span style="font-weight:400;"> to build the next generation of humanoid robots with Gemini 2.0.  </span></li>
</ul>
<p><i><span style="font-weight:400;">43:57  Ryan – “I’m not a nice person.One of my favorite things to do is yell at technology. The minute it has any kind of reasoning, this isn’t gonna go well for me.”</span></i></p>
<p><b>45:13</b> <a href="https://blog.google/products/gemini/new-gemini-app-features-march-2025/" target="_blank" rel="noreferrer noopener"><b>New Gemini app features, available to try at no cost</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Last week Ryan and Justin discussed how far behind Gemini seems to be in the market, and this week, Google is bringing new and upgraded features to Gemini Users, including </span><a href="https://openai.com/index/introducing-deep-research/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Deep Research</span></a><span style="font-weight:400;">, </span><a href="https://deepmind.google/technologies/gemini/flash-thinking/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">2.0 flash thinking</span></a><span style="font-weight:400;">, </span><a href="https://blog.google/products/gemini/google-gems-tips/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Gems</span></a><span style="font-weight:400;">, Apps and </span><a href="https://gemini.google.com/personalization" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">personalization</span></a><span style="font-weight:400;">.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The new upgraded version of 2.0 flash thinking gets the ability to upload files as well as longer context windows up to 1 million token context windows. 2.0 Flash thinking is a reasoning capability. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In December, they pioneered a new Gemini product with Deep Research. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">The goal was to save you hours of time as your personal AI research assistant, searching and synthesizing information from across the web in just minutes and helps you discover sources from across the web you may not have otherwise found.   </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Now, they are upgrading Deep Research with Gemini 2.0 flash thinking (experimental.)  This enhances Gemini’s capabilities across all research stages — from planning and searching to reasoning, analyzing and reporting — creating higher-quality, multi-page reports that are more detailed and insightful.  Gemini now shows its thoughts while it browses the web, giving you a real-time look into how it’s going to solve your research task. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The Gemini is getting a new experimental feature called Personalization in the model drop-down. You can then ask food-related questions, and it will look at your recent food-related searches or provide travel advice based on destinations I’ve previously Googled.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Gemini is now starting to be able to access calendars, notes, tasks and photos with the new Flash Thinking 2.0.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This allows Gemini to better tackle complex requests like prompts that involve multiple applications because the new model can better reason over the overall request, break it down into distinct steps and assess its own progress as it goes.  So say in a single prompt you can ask Gemini: Look up easy cookie recipes on YouTube, add the ingredients to my shopping list and find me a grocery store that is open nearby.   Soon in google Photos it’ll be able to look at your photos and create an itinerary based on where you took photos or tell you when your driver’s license expires, assuming you’ve taken a photo of it before. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Gems are now available to everyone, letting you create your own personal AI expert on any topic. They are starting to roll out for everyone. Get started with their premade gems or quickly create your own custom gems, like a translator, meal planner, or math coach. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Just go to Gems Manager on the desktop, write instructions, give it a name and then chat with it whenever you want. </span></li>
</ul>
<p><b>49:59</b> <a href="https://cloud.google.com/blog/products/data-analytics/cloud-composer-3-for-apache-airflow/" target="_blank" rel="noreferrer noopener"><b>Cloud Composer 3: The next generation of data pipeline orchestration</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is announcing the general availability of their 3rd attempt with Cloud Composer, </span><a href="https://cloud.google.com/composer/docs/composer-3/composer-overview" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Cloud Composer 3</span></a><span style="font-weight:400;"> the latest version of their fully managed Apache Airflow service.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This release represents a significant advancement in data pipeline orchestration, enabling data teams to streamline workflows, reduce operational overhead and accelerate time-to-value. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Cloud Composer has a host of new features:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Simplified networking: easily configure network settings with streamlined options, reducing complexity and management overhead</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Evergreen Versioning: to stay up to date with the latest cloud composer releases</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Hidden Infrastructure: focus on your data pipelines, not infrastructure. Cloud Composer 3 handles the underlying infra, allowing you to concentrate on building and running Dags</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Enhanced performance REliability</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Per Task CPU &amp; Memory Control</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Strengthen your security posture </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">50:48  Ryan – “</span></i><i><span style="font-weight:400;">When I first looked at Composer 2 trying to answer a research question for work, it was nothing more than a glorified deployment template. You still had to deploy all the Kubernetes, all the Amazon or all the Apache Airflow servers, all the infrastructure, all had to live within your project deployed on your network. If you needed to talk to another network, you had to plumb all the private service connects yourself and do all the things. So I’m really glad that GCP has finally figured out how to create a managed service.”</span></i></p>
<h2><b>Azure</b></h2>
<p><b>53:08</b> <a href="https://azure.microsoft.com/en-us/blog/microsoft-cost-management-updates-march-2025/" target="_blank" rel="noreferrer noopener"><b>Microsoft Cost Management updates—March 2025</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft has their monthly update for finops practitioners this month bringing several improvements;</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Optimizing </span><a href="https://learn.microsoft.com/en-us/azure/aks/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AKS</span></a><span style="font-weight:400;"> with new </span><a href="https://azure.microsoft.com/services/cost-management/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">cost analysis capabilities</span></a><span style="font-weight:400;"> allows you to get granular cost information on your AKS clusters. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The views provide you with visibility into the cost of namespaces and all aggregated costs on all of your resources. You just need to install the cost analysis add-on to your cluster to enable this. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">By deprecating the AWS connector on March 31st 2025, you will lose access to the connector and AWS cost and usage data stored in the cost management service, including historical data. (They won’t delete the CUR files in your S3 bucket though).  </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">They recommend moving to another reporting tool, or if you want the rollup in Azure to use standard </span><a href="https://focus.finops.org/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">FOCUS</span></a><span style="font-weight:400;"> format and analytical solution in the </span><a href="https://www.microsoft.com/en-us/microsoft-fabric/resources/data-101/what-is-fabric?msockid=111326c238f86e943a5e34a8396a6f52" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Microsoft Fabric</span></a><span style="font-weight:400;"> solution to analyze and report from various sources. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">You can now exchange Azure OpenAI service provisioned reservations and you can also still request refunds as well</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">If you have opinions about the future of cost reporting, and I’m sure some of you do, you can take the cost optimization survey to share that feedback. The link is in the blog post.</span></li>
</ul>
<p><b>56:44</b> <a href="https://techcommunity.microsoft.com/blog/microsoftlearnblog/announcing-the-microsoft-ai-skills-fest-save-the-date/4292269" target="_blank" rel="noreferrer noopener"><b>Announcing the Microsoft AI Skills Fest: Save the date!</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft AI Skills Fest is a global event this April and May designed to bring learners across the globe together to build their AI skills, from beginner explorers to the technology-gifted. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Registration opens March 24th, with the kickoff on April 8th. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">For tech professionals you’ll learn how to build AI-powered solutions using Microsoft AI apps and services quickly. Gain skills and experience working with agents, AI security, Azure AI Foundry, Github Copilot, Microsoft Fabric and more</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Kickoff is at 9:00 AM on April 8th in Australia and will be a full 24-hour globe-spanning event. They are even trying to break a Guinness world record for most users to take an online multi level artificial intelligence lesson in 24 hours. </span></li>
</ul>
<p><b>59:08</b> <a href="https://techcommunity.microsoft.com/blog/adformysql/azure-database-for-mysql-triggers-for-azure-functions-public-preview/4374399" target="_blank" rel="noreferrer noopener"><b>Azure Database for MySQL triggers for Azure Functions (Public Preview)</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Azure is announcing that you can now invoke an Azure Function based on changes to an Azure Database for MySQL table. This new capability is made possible through the Azure Database for MySQL trigger for Azure Functions now available in public preview. </span><span style="font-weight:400;"> </span></li>
</ul>
<p><i><span style="font-weight:400;">59:24  Justin – “PSA – If you’re using triggers in databases to do *anything* you should really rethink your architecture.”</span></i></p>
<h2><span style="font-weight:400;">Closing</span></h2>
<p><span style="font-weight:400;">And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</span></p>
]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/2000740/c1e-33g3ckrmpjfm1k3z-z3dj835nc794-krtxuh.mp3" length="81160275"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 297 of The Cloud Pod – where the forecast is always cloudy! Justin, Ryan, and Matthew have beaten the black lung and are in the studio – ready to bring you all the latest and greatest in cloud and AI news! We’ve got Wiz buyouts (that security, it’s so hot right now!) Gemma 3, Glue 5 (but not 3 or 4) and Gemini Robots – plus looking forward to AI Skills Fest and Google Next, all this week on The Cloud Pod. 
Titles we almost went with this week:

Google! Yer a WIZ—Ard
Google Announces Network Security Integration… and that must include WIZ
Gemini Robots…. What could go wrong 
️AI Data Studios … So Hot Right Now
I want 32 Billion dollars
Azure Follow AWS in bad life choices – mk
Wait Glue is more than v2
What happened to Glue 3 and 4?
5th Try and AWS Glue still sucks

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
Follow Up 
01:05 Microsoft quantum computing claim still lacks evidence: physicists are dubious

A MS researcher presented results behind the company’s controversial claim to have created the first topological qubits – a long-sought goal of quantum computing. 
Theorists said it’s a hard problem, and that it was a beautiful talk but the claims come without evidence, and people think they have gone overboard. 
The Head of Quantum at Amazon was also highly skeptical:

https://www.businessinsider.com/amazon-exec-casts-doubt-microsoft-quantum-claims-2025-3



02:09  Justin – “No one’s really buying Microsoft actually created a new topological qubit. There’s some doubt… basically they said that what they showed, which is a microscopic H-shaped aluminum wire on top of indium arsenide – a superconductor at ultra-cold temperatures, and the devices are designed to harness majoranas, previously undiscovered quasi-particles that are essential for topological qubits to work, and the goals for majoranas to appear at the four tips of the H-shaped wire emerging from reflective-behavior electrons, and these majorans in theory could be used to perform quantum computing that are resistant to informat...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/2000740/c1a-k5d5-6z15mn81fw56-7fmsoa.jpg"></itunes:image>
                                                                            <itunes:duration>01:07:38</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                            </item>
                    <item>
                <title>
                    <![CDATA[296: Google Forces AI Protection]]>
                </title>
                <pubDate>Fri, 21 Mar 2025 14:37:25 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/1997617</guid>
                                    <link>https://tcpfm.castos.com/episodes/google-forces-ai-protection</link>
                                <description>
                                            <![CDATA[<p><span style="font-weight:400;">Welcome to episode 296 of The Cloud Pod – where the forecast is always cloudy! Today is a twofer – Justin and Ryan are in the house to make sure you don’t miss out on any of today’s important cloud and AI news. From AI Protection, to Google Next, to Amazon Q Developer, we’ve got it all, this week on TCP! </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;"> Amazon Step Functions, walks step by step into my IDE</span></li>
<li><span style="font-weight:400;"> Deepseek seeks the truth of “is it serverless or servers”? </span></li>
<li><span style="font-weight:400;">️ Well Architected Reviews by AI… What will my solutions architects do now? </span></li>
<li><span style="font-weight:400;">⌨️ The cloud pod hosts steps over the Azure EU Data Boundary</span></li>
<li><span style="font-weight:400;">️ BYOIP to ALBs… only years too late for everyone.</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>General News </b></h2>
<p><b>01:02 </b><a href="https://www.hashicorp.com/en/blog/hashicorp-and-red-hat-better-together" target="_blank" rel="noreferrer noopener"><b>HashiCorp and Red Hat, better together</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Hashicorp has more details on its future, with the </span><a href="https://www.hashicorp.com/blog/hashicorp-officially-joins-the-ibm-family" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">recent IBM acquisition</span></a><span style="font-weight:400;"> in this blog post. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They talk about the wide range of Day 2 operations, including things like drift detection, image management and patching, rightsizing, and configuration management.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">As </span><a href="https://www.redhat.com/en/technologies/management/ansible" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Red Hat Ansible</span></a><span style="font-weight:400;"> is a purpose built operational management platform, it makes it easier to properly configure resources after the initial creation, but also to evolve the configuration after setup, and then execute ad-hoc playbooks to keep things running reliably and more securely at scale. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Some additional things they’re exploring, now that the acquisition has closed:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Red Hat Ansible Inventory generated dynamically by </span><a href="https://www.terraform.io/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Terraform</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Official Terraform modules for Redhat Ansible, making it easier to trigger terraform from Ansible Playbooks.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Redhat and Hashicorp officially support the Red Hat Ansible Provider for Terraform, making it easier to trigger Ansible from Terraform.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Evolving Terraform provisioners to support a more comprehensive set of lifecycle integrations.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Improved mechanisms to invoke Ansible Playbooks outside of the resource provisioning lifecycle</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Customers – not surprisingly – regularly inte...</span></li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 296 of The Cloud Pod – where the forecast is always cloudy! Today is a twofer – Justin and Ryan are in the house to make sure you don’t miss out on any of today’s important cloud and AI news. From AI Protection, to Google Next, to Amazon Q Developer, we’ve got it all, this week on TCP! 
Titles we almost went with this week:

 Amazon Step Functions, walks step by step into my IDE
 Deepseek seeks the truth of “is it serverless or servers”? 
️ Well Architected Reviews by AI… What will my solutions architects do now? 
⌨️ The cloud pod hosts steps over the Azure EU Data Boundary
️ BYOIP to ALBs… only years too late for everyone.

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
General News 
01:02 HashiCorp and Red Hat, better together 

Hashicorp has more details on its future, with the recent IBM acquisition in this blog post. 
They talk about the wide range of Day 2 operations, including things like drift detection, image management and patching, rightsizing, and configuration management.  
As Red Hat Ansible is a purpose built operational management platform, it makes it easier to properly configure resources after the initial creation, but also to evolve the configuration after setup, and then execute ad-hoc playbooks to keep things running reliably and more securely at scale. 
Some additional things they’re exploring, now that the acquisition has closed:

Red Hat Ansible Inventory generated dynamically by Terraform. 
Official Terraform modules for Redhat Ansible, making it easier to trigger terraform from Ansible Playbooks.
Redhat and Hashicorp officially support the Red Hat Ansible Provider for Terraform, making it easier to trigger Ansible from Terraform.
Evolving Terraform provisioners to support a more comprehensive set of lifecycle integrations.
Improved mechanisms to invoke Ansible Playbooks outside of the resource provisioning lifecycle


Customers – not surprisingly – regularly inte...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[296: Google Forces AI Protection]]>
                </itunes:title>
                                    <itunes:episode>296</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p><span style="font-weight:400;">Welcome to episode 296 of The Cloud Pod – where the forecast is always cloudy! Today is a twofer – Justin and Ryan are in the house to make sure you don’t miss out on any of today’s important cloud and AI news. From AI Protection, to Google Next, to Amazon Q Developer, we’ve got it all, this week on TCP! </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;"> Amazon Step Functions, walks step by step into my IDE</span></li>
<li><span style="font-weight:400;"> Deepseek seeks the truth of “is it serverless or servers”? </span></li>
<li><span style="font-weight:400;">️ Well Architected Reviews by AI… What will my solutions architects do now? </span></li>
<li><span style="font-weight:400;">⌨️ The cloud pod hosts steps over the Azure EU Data Boundary</span></li>
<li><span style="font-weight:400;">️ BYOIP to ALBs… only years too late for everyone.</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>General News </b></h2>
<p><b>01:02 </b><a href="https://www.hashicorp.com/en/blog/hashicorp-and-red-hat-better-together" target="_blank" rel="noreferrer noopener"><b>HashiCorp and Red Hat, better together</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Hashicorp has more details on its future, with the </span><a href="https://www.hashicorp.com/blog/hashicorp-officially-joins-the-ibm-family" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">recent IBM acquisition</span></a><span style="font-weight:400;"> in this blog post. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They talk about the wide range of Day 2 operations, including things like drift detection, image management and patching, rightsizing, and configuration management.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">As </span><a href="https://www.redhat.com/en/technologies/management/ansible" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Red Hat Ansible</span></a><span style="font-weight:400;"> is a purpose built operational management platform, it makes it easier to properly configure resources after the initial creation, but also to evolve the configuration after setup, and then execute ad-hoc playbooks to keep things running reliably and more securely at scale. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Some additional things they’re exploring, now that the acquisition has closed:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Red Hat Ansible Inventory generated dynamically by </span><a href="https://www.terraform.io/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Terraform</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Official Terraform modules for Redhat Ansible, making it easier to trigger terraform from Ansible Playbooks.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Redhat and Hashicorp officially support the Red Hat Ansible Provider for Terraform, making it easier to trigger Ansible from Terraform.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Evolving Terraform provisioners to support a more comprehensive set of lifecycle integrations.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Improved mechanisms to invoke Ansible Playbooks outside of the resource provisioning lifecycle</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Customers – not surprisingly – regularly integrate Vault and Openshift, and they have identified dozens of connection points that can add value, including:</span>
<ul>
<li style="font-weight:400;"><a href="https://developer.hashicorp.com/vault/docs/platform/k8s/vso" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Vault Secrets Operator</span></a><span style="font-weight:400;"> for OpenShift</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Etcd data encryption </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Argo CI/CD</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Istio Certificate issuance</span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">01:48  Justin – “That’s a lot of promise for Ansible there, that I’m not sure it completely lives up to…”</span></i></p>
<p><b>07:09</b> <a href="https://www.theinformation.com/briefings/justice-department-reiterates-demand-to-break-up-google?rc=3t8xtd" target="_blank" rel="noreferrer noopener"><b>Justice Department Reiterates Demand to Break Up Google</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">New Administration means new head of the DOJ – and we’re sure Google was hoping for a break in the Antitrust area.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Unfortunately for them, the Justice Department reiterated last week that many aspects of its proposed final judgement, including the prohibition of payments to Apple and other companies for a share of search revenue or preferential treatment, still stand, as does the demand that they sell their Chrome web browser.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They did, however, drop their request that Google be prohibited from making investments in AI companies like Anthropic. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This is a sign that the Justice Department may continue their aggressive antitrust stance started by the Biden administration. </span></li>
</ul>
<p><i><span style="font-weight:400;">08:12  Ryan – “</span></i><i><span style="font-weight:400;">The Chrome browser, if they have to sell it off, it’s going to be just a nightmare for them. They’ve put a lot into Chrome that’s not just browser-based. A lot of their zero trust for BeyondCorp has moved into that, into the Chrome enterprise and a whole bunch of sort…that’s gonna sting. But I mean, that’s, it also speaks to the you know, what the DOG is trying to accomplish, which is those things are very tied together and you have to use them.”</span></i></p>
<h2><b>AI Is Going Great, Or How ML Makes Money </b></h2>
<p><b>09:07</b> <a href="https://www.theinformation.com/articles/google-is-still-behind-in-ai-why?rc=3t8xtd" target="_blank" rel="noreferrer noopener"><b>Google Is Still Behind in AI. Why?</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AI isn’t going so well for everyone, from Apple (who has now delayed several exciting IOS features another year) to </span><a href="https://deepmind.google/technologies/gemini/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Google Gemini</span></a><span style="font-weight:400;">, who is falling further and further behind </span><a href="https://openai.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Open AI</span></a><span style="font-weight:400;"> and even Grok.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The Information points at the increasing disparity and the struggles of AI. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">So In general… Where do we feel AI is between the vendors? </span></li>
</ul>
<p><i><span style="font-weight:400;">11:18  Justin – “</span></i><i><span style="font-weight:400;">I think it’s good. Copilot, I feel is behind in some other areas, but like for code completion and scaffolding, I think it’s still doing a pretty good job. But, you know, there were an area, it’s still pretty weak as an agentic coding exercise, like being able to give it a prompt and have it write, you know, code pieces. That’s why people are, you know, doing a lot with cursor these days and they’re doing a lot with Claude CLI and you these things where they can do a lot more interesting things. so I suspect that that’s going to have to change this year for GitHub.”</span></i></p>
<p><b>13:35</b> <a href="https://www.theinformation.com/briefings/googles-ai-unit-reorganizes-product-work-announces-changes-to-gemini-app-team?rc=3t8xtd" target="_blank" rel="noreferrer noopener"><b>Google’s AI Unit Reorganizes Product Work, Announces Changes to </b></a><a href="https://www.theinformation.com/briefings/googles-ai-unit-reorganizes-product-work-announces-changes-to-gemini-app-team?rc=3t8xtd" target="_blank" rel="noreferrer noopener"><b>Gemini App Team</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google has disbanded its product impact unit, whose goal was to incorporate </span><a href="https://deepmind.google/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">DeepMind</span></a><span style="font-weight:400;"> research into Google products, as it attempts to streamline the process of creating AI products. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">DeepMind Leader Demis Hassabis wrote in an email to employees that the move was designed to optimize and simplify their product work, model development work, and product area engagements. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They also announced changes to the Gemini team, which has struggled to compete with Open AI.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google has hired former Meta VP of Product, Chris Strahar, to lead product on Gemini, and is adding product teams from Google’s more experimental multimodal assistant product </span><a href="https://deepmind.google/technologies/project-astra/%5C" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Astra</span></a><span style="font-weight:400;">, into Gemini. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They will also be moving Gemini to use models developed by DeepMind’s main post training teams rather than a chatbot specific team per the memo. </span></li>
</ul>
<p><b>14:58</b> <a href="https://openai.com/index/new-tools-for-building-agents/" target="_blank" rel="noreferrer noopener"><b>New tools for building agents</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">OpenAI is releasing the first set of tools to help developers and enterprises build useful and reliable agents.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Over the last year, they have introduced new model capabilities including reasoning, multimodal interactions, and new safety techniques, but customers have complained that turning these features into production ready agents was challenging, requiring extensive prompt iteration and custom orchestration logic without sufficient visibility or built-in support.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">To address these challenges, Open AI is launching a new set of APIs and tools to help build agentic applications:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">New </span><a href="https://platform.openai.com/docs/quickstart?api-mode=responses" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Response API</span></a><span style="font-weight:400;">, combining the simplicity of chat completion API with the tool use capabilities of the Assistant API for building Agents.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Built in tools including</span><a href="https://platform.openai.com/docs/guides/tools-web-search" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;"> web search</span></a><span style="font-weight:400;">, </span><a href="https://platform.openai.com/docs/guides/tools-file-search" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">file search</span></a><span style="font-weight:400;"> and </span><a href="https://platform.openai.com/docs/guides/tools-computer-use" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">computer use</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The new </span><a href="https://platform.openai.com/docs/guides/agents" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Agents SDK</span></a><span style="font-weight:400;"> to orchestrate single-agent and multi-agent workflows.</span></li>
<li style="font-weight:400;"><a href="https://platform.openai.com/docs/guides/agents#orchestration" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Integrated observability</span></a><span style="font-weight:400;"> tools to trace and inspect agent workflow execution.</span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">16:57  Justin – “Y</span></i><i><span style="font-weight:400;">ou know those Pinterest fails – you know, those those memes, I feel like I’ve done that with Agentic AIs left and right, like where I’m like, I have this cool idea, you know, like where I’ll read a watch a YouTube video and like how to automate this daily task. And then by the time I get through it, I’ve got this three quarters of the way created monstrosity of things shrug together with string and it’s never going to run reliably or repeatedly.”</span></i></p>
<p><b>18:11</b> <a href="https://gizmodo.com/microsofts-relationship-with-openai-is-not-looking-good-2000573293?utm_source=tldrnewsletter" target="_blank" rel="noreferrer noopener"><b>Microsoft’s Relationship With OpenAI Is Not Looking Good</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Things may not be going great with Microsoft and Open AI, with the latest report that Microsoft is developing its own in-house reasoning models to compete with OpenAI. The Information also says Microsoft has been testing models from Elon Musk’s xAI, Meta, and DeepSeek to replace ChatGPT in Copilot, its AI bot for the workplace. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft Copilot has </span><a href="https://www.emarketer.com/content/microsoft-copilot-fails-impress-businesses-due-high-costs-limited-results" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">received poor reception</span></a><span style="font-weight:400;"> in enterprises due to the high costs and limited results. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft even let OpenAI out of a contract that required it to use Azure for all of its hosting needs.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It may make sense in the long run if both companies continue to see themselves as competitors vs partners. </span></li>
</ul>
<p><i><span style="font-weight:400;">19:37  Justin – “</span></i><i><span style="font-weight:400;">Microsoft needs an office assistant. Those are different needs and potentially different models. And so I think that’s maybe where you’re seeing the divergence of interest, because of, they want to make, AGI at open AI and, know, really, that’s not what Microsoft wants. They would like to sell more office licenses at higher prices and that helps them with revenue. So they have different goals, perhaps, between the two of them.”</span></i></p>
<h2><b>Cloud Tools</b></h2>
<p><b>20:55</b> <a href="https://www.hashicorp.com/en/blog/vault-enterprise-1-19-reduces-risk-encryption-updates-automated-root-rotation" target="_blank" rel="noreferrer noopener"><b>Vault Enterprise 1.19 reduces risk with encryption updates and automated </b></a><a href="https://www.hashicorp.com/en/blog/vault-enterprise-1-19-reduces-risk-encryption-updates-automated-root-rotation" target="_blank" rel="noreferrer noopener"><b>root rotation</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Hashicorp Vault 1.19 is now GA, with enhanced security workflows, post-quantum computing features and long-term support. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Notable features in Vault Enterprise 1.19 include:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Module-Lattice-Based Digital Signature Standard (ML-DSA) Post Quantum Cryptography (PQC) support: </span><a href="https://developer.hashicorp.com/vault/docs/secrets/transit" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Transit secrets engine</span></a><span style="font-weight:400;"> adds support for ML-DSA PQC sign and verify functionality for experimental purposes.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Vault transit engine support for </span><a href="https://developer.hashicorp.com/vault/docs/secrets/transit#ed" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">ED25519</span></a><span style="font-weight:400;"> with pre-hashing: The vault transit engine now supports </span><a href="https://developer.hashicorp.com/vault/api-docs/secret/transit" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">ED25519PH</span></a><span style="font-weight:400;"> signing, which is commonly used in remote and embedded devices.</span></li>
<li style="font-weight:400;"><a href="https://developer.hashicorp.com/vault/tutorials/pki/pki-engine" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Constrained certificate authorities</span></a><span style="font-weight:400;"> (CA): Constrained CA’s reduce risk by providing isolation for PKI workloads.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Extended automated root rotation: Vault 1.19 extends its centralized rotation manager, which now provides a mechanism to automate rotation of root credentials for AWS, Azure, and Google Cloud auth methods and secret engines, along with LDAP and database plugins.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Additional UI support for Workload Identity Federation (WIF): Vault 1.19 now provides UI support for WIF on Google Cloud and Azure.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Long-term support (LTS): While Vault 1.16 enters one year of extended support, 1.19 represents Vault Enterprise’s second LTS release.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Seal-wrap </span><a href="https://developer.hashicorp.com/vault/docs/auth/approle" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AppRole</span></a><span style="font-weight:400;"> data for Federal Information Processing Standards (FIPS): FIPS-compliant Hardware Security Module (HSM) deployments.</span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">21:24  Justin – “So not quite production ready yet, but they’re getting ready for quantum as well.”</span></i></p>
<p><b>23:24</b> <a href="https://www.hashicorp.com/en/blog/terraform-migrate-now-generally-available" target="_blank" rel="noreferrer noopener"><b>Terraform migrate now generally available</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.hashicorp.com/en/blog/enabling-fast-safe-migration-to-hcp-terraform-with-terraform-migrate-tf-migrate" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Terraform Migrate</span></a><span style="font-weight:400;">, which we previously talked about, is now generally available – making it easy to move from Terraform Community Edition to HCP Terraform and Terraform Enterprise. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Designed to reduce manual effort and improve accuracy, it streamlines the migration process, helping teams Adopt HCP Terraform and Terraform Enterprise with confidence. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Key features include:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Automating state transfer</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">State refactoring</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Validation and Verification</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">In addition, they’ve expanded features such as Variable management and migration</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Gitlab integration</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Security and validation for Git Personal Access tokens</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Refined directory skipping</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Dry run mode</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Improved target branch naming</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">And optimizations for error handling, logging and debugging </span></li>
</ul>
<h2><b>AWS</b></h2>
<p><b>25:06</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/03/application-load-balancer-integration-vpc-ipam/" target="_blank" rel="noreferrer noopener"><b>Application Load Balancer announces integration with Amazon VPC IPAM</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">ALB allows you to provide a pool of Public IPV4 addresses for IP address assignment to load balancer nodes.  You can configure these via IPAM, and this can consist of BYOIP or contiguous IPv4 address blocks provided by Amazon.</span></li>
</ul>
<p><i><span style="font-weight:400;">26:01  Ryan – “</span></i><i><span style="font-weight:400;">That’s cool. didn’t quite catch on that this was a contiguous Amazon blocks…. You can provide a smaller range without actually having to go through and you know, sacrifice your first born and sell your liver for IP space. like, that’s pretty rad.”</span></i></p>
<p><b>28:00</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/03/aws-step-functions-workflow-studio-vs-code-ide/" target="_blank" rel="noreferrer noopener"><b>Announcing AWS Step Functions Workflow Studio for the VS Code IDE</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS Step Functions Workflow Studio is now available in </span><a href="https://aws.amazon.com/visualstudiocode/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Toolkit for Visual Studio Code</span></a><span style="font-weight:400;">, enabling you to visually create, edit and debug state machine workflows directly in your IDE. </span></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/step-functions/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Step Functions</span></a><span style="font-weight:400;"> are a visual workflow service capable of orchestrating over 14,000+ API actions from over 220 AWS services to build distributed applications and data processing workloads.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Workflow studio is a visual builder that allows you to compose workflows on canvas, while generating workflow definitions in the background. </span></li>
</ul>
<p><i><span style="font-weight:400;">28:33  Ryan – “</span></i><i><span style="font-weight:400;">I think it was two or three years ago I was an old man yelling at cloud. ‘You can just switch over.’ But now I am so addicted to everything being my ID. This is great. I won’t use studio to create a whole bunch of step functions, but debugging them? Oh yeah. Like it’s, it’s super helpful there. That’s pretty cool. I like it.”</span></i></p>
<p><b>29:12</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/03/aws-lambda-cloudwatch-logs-live-tail-vs-code-ide/?trk=e61dee65-4ce8-4738-84db-75305c9cd4fe&amp;sc_channel=el" target="_blank" rel="noreferrer noopener"><b>AWS Lambda adds support for Amazon CloudWatch Logs Live Tail in VS </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2025/03/aws-lambda-cloudwatch-logs-live-tail-vs-code-ide/?trk=e61dee65-4ce8-4738-84db-75305c9cd4fe&amp;sc_channel=el" target="_blank" rel="noreferrer noopener"><b>Code IDE</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS Lambda now supports </span><a href="https://docs.aws.amazon.com/lambda/latest/dg/monitoring-cloudwatchlogs-view.html#monitoring-live-tail" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Cloudwatch Logs Live Tail</span></a><span style="font-weight:400;"> in VS Code IDE through the AWS toolkit for visual studio code. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Live tail is an interactive log streaming and analytics capability which provides real-time visibility into logs, making it easier to develop and troubleshoot lambda functions.</span></li>
</ul>
<p><b>30:26</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/03/amazon-q-developer-cli-agent-command-line/" target="_blank" rel="noreferrer noopener"><b>Amazon Q Developer announces a new CLI agent within the command line</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon Q Developer announced an enhanced CLI agent within the Amazon Q command line interface (CLI) that allows you to have more dynamic conversations.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With this update, Amazon Q developer can now use the information in your CLI environment to help you read and write files locally, query AWS resources or create code. </span></li>
</ul>
<p><i><span style="font-weight:400;">31:10  Ryan – “</span></i><i><span style="font-weight:400;">Well, I mean, it would be nice to be able to natural language query your ginormous AWS infrastructure and have it just figure it out. Right. Like that would be fantastic if they can get there, but I don’t know if it’s there yet.”</span></i></p>
<p><b>31:56</b> <a href="https://aws.amazon.com/blogs/aws/deepseek-r1-now-available-as-a-fully-managed-serverless-model-in-amazon-bedrock/" target="_blank" rel="noreferrer noopener"><b>DeepSeek-R1 now available as a fully managed serverless model in </b></a><a href="https://aws.amazon.com/blogs/aws/deepseek-r1-now-available-as-a-fully-managed-serverless-model-in-amazon-bedrock/" target="_blank" rel="noreferrer noopener"><b>Amazon Bedrock</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">In January you could access </span><a href="https://www.deepseek.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">DeepSeek-R1</span></a><span style="font-weight:400;"> models that became available in </span><a href="https://aws.amazon.com/blogs/aws/deepseek-r1-models-now-available-on-aws/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Bedrock</span></a><span style="font-weight:400;">, through the </span><a href="https://aws.amazon.com/bedrock/marketplace/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">marketplace</span></a><span style="font-weight:400;"> or custom model import.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Now they’re making it easier to use </span><a href="https://aws.amazon.com/bedrock/deepseek" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">DeepSeek in Amazon Bedrock</span></a><span style="font-weight:400;"> through an expanded range of options, including a new serverless solution.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The fully managed DeepSeek-R1 model is now GA in Bedrock. </span></li>
</ul>
<p><i><span style="font-weight:400;">32:30  Justin – “</span></i><i><span style="font-weight:400;">You’ll be able to then tune these and do all kinds of other things as you go in the future and use RAG, et cetera, with DeepSeq. So if you’re okay with the ramifications, they may have stolen all their data from OpenAI. You can use DeepSeq in your product. Good luck to you.”</span></i></p>
<p><b>33:18</b> <a href="https://aws.amazon.com/blogs/machine-learning/accelerate-aws-well-architected-reviews-with-generative-ai/?trk=e61dee65-4ce8-4738-84db-75305c9cd4fe&amp;sc_channel=el" target="_blank" rel="noreferrer noopener"><b>Accelerate AWS Well-Architected reviews with Generative AI</b></a><span style="font-weight:400;">  </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Building cloud infrastructure baked on proven best practices promoting security, reliability, and cost efficiency.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">To achieve these goals, the </span><a href="https://aws.amazon.com/architecture/well-architected/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Well Architected Framework</span></a><span style="font-weight:400;"> provides comprehensive guidance for building and improving cloud architectures. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">As your system scales, conducting well architected framework reviews becomes more crucial, offering deeper insights and strategic value to help organizations optimize their growing cloud environments. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">To address these challenges, they have built a </span><a href="https://github.com/aws-samples/sample-well-architected-acceleration-with-generative-ai" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">WAFR Accelerator solution</span></a><span style="font-weight:400;"> that uses generative AI to help streamline and expedite the WAFR process. By automating the initial assessment and documentation process, the solution significantly reduces time spent on evaluations while providing consistent architecture assessments against AWS Well-Architected principles.  This allows teams to focus more on implementing improvements and optimizing AWS infrastructure. The solution incorporates the following features:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">RAG to create context aware detailed assessments</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">An interactive Chat interface</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Integrated with AWS well-architected tool which prepopulates workload information and initial assessment responses. </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">34:51  Ryan – “</span></i><i><span style="font-weight:400;">This has the potential of being really amazing. I have very mixed feelings about the well-architected framework process. I’ve done both the self-serve many times and even the walkthrough from technical account support. And I always just feel like it lacks the ability to find any real problems. Once you get past the like, you know, regional distribution and being able to rehydrate data sort of problems, it sort of falls down very quickly and, and doesn’t help solve, complex issues that may arrive due to conditions. And so I’m sort of hoping that, you know, introducing AI into this mix might give it that ability to sort of have a lot more context into your deployment as it’s asking you questions.”</span></i></p>
<p><b>39:21</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/03/amazon-bedrock-multi-agent-collaboration/" target="_blank" rel="noreferrer noopener"><b>Amazon Bedrock now supports multi-agent collaboration</b></a><span style="font-weight:400;">  </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS Announces the GA of multi-agent collaboration for </span><a href="https://aws.amazon.com/bedrock/agents/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Bedrock</span></a><span style="font-weight:400;">, allowing developers to create networks of specialized agents that communicate and coordinate under the guidance of a supervisor Agent. This new capability allows you to tackle more intricate, multi-step workflows and scale your AI-driven applications more effectively. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Bedrock multi-agent collaboration GA introduces key enhancements designed to improve scalability, flexibility and operational efficiency.  Inline agents allow you to dynamically adjust agent roles and behaviors at runtime, making workflows more adaptable as your business needs evolve. </span></li>
</ul>
<p><i><span style="font-weight:400;">39:38  Ryan – “</span></i><i><span style="font-weight:400;">Do you think that supervisor agent just stands around, doesn’t really do anything and then takes credit for all the other agents work?”</span></i></p>
<h2><b>GCP</b></h2>
<p><b>40:51 </b><a href="https://cloud.withgoogle.com/next/25" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Google Nex</span></a><span style="font-weight:400;">t is coming up in a few short weeks!</span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">April 9-11 at Mandalay Bay in Las Vegas. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Two courses you should definitely be aware of (for guaranteed Cloud Pod stickers)</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">BRK2-024 – Workload-optimized data protection for mission-critical enterprise apps</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">BRK1-028 – Unlock value for your workloads: Microsoft, Oracle, OpenShift and more</span></li>
</ul>
</li>
</ul>
<p><b>43:08</b> <a href="https://cloud.google.com/blog/products/containers-kubernetes/kubernetes-history-inspector-visualizes-cluster-logs/" target="_blank" rel="noreferrer noopener"><b>Meet Kubernetes History Inspector, a log visualization tool for Kubernetes </b></a><a href="https://cloud.google.com/blog/products/containers-kubernetes/kubernetes-history-inspector-visualizes-cluster-logs/" target="_blank" rel="noreferrer noopener"><b>clusters</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google has been directly confronting K8 troubleshooting challenges for years as they support large-scale, complex deployments. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google cloud support teams have developed deep expertise in diagnosing issues with K8 environments through routinely analyzing a vast number of customer support tickets, diving into user environments, and leveraging our collective knowledge to pinpoint the root cause of problems.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">To address this, they released </span><a href="https://github.com/GoogleCloudPlatform/khi" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Kubernetes History Inspector (KHI)</span></a><span style="font-weight:400;"> as open source to the community. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Effective K8 troubleshooting requires collecting, correlating, and analyzing these disparate log streams. Manually configuring logging for each of these components can be a significant burden, requiring careful attention to detail and a thorough understanding of the K8 ecosystem. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Collecting logs is the easy part, the real challenge lies in analyzing the logs. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Many issues in K8 are not revealed by a single obvious error message. Instead they’ll manifest as a chain of events, requiring a deep understanding of the causal relationships between numerous log entries across multiple components. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">KHI is a powerful tool that analyzes logs collected by cloud logging, extracts state information for each component, and visualizes it in a chronological timeline.  Furthermore, KHI links this timeline back to the raw log data, allowing you to track how each element evolved over time. </span></li>
</ul>
<p><i><span style="font-weight:400;">46:19  Justin – “</span></i><i><span style="font-weight:400;">Because like even in ECS, I’ve had this problem before where I’ve had like multiple containers that talk to each other and then like, my God, why do we this error? And it’s like, if I could see the state, I would have known that the other container crashed, which is why this error occurred in my container as a dependency on it. So like there’s definitely value in this visualization, but it’s not exactly how I would have visualized it. So like when I was reading through the article, I was very excited and then I saw the screenshots and I was like, huh, it’s not bad, but it’s definitely not how I thought it was going to look when I saw it.”</span></i></p>
<p><b>47:16</b> <a href="https://cloud.google.com/blog/products/infrastructure/google-cloud-launches-42nd-cloud-region-in-sweden/" target="_blank" rel="noreferrer noopener"><b>Hej Sverige! Google Cloud launches new region in Sweden</b></a></p>
<p><span style="font-weight:400;">(hey-j sver-ee-geh)</span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google’s new cloud region in Sweden is now open, it represents an investment by Google into Sweden’s future and Google’s ongoing commitment to empowering businesses and individuals with the power of the cloud.  This new region, the 42nd globally for Google, and 13th in europe, opens doors to opportunities for innovation, sustainability, and growth within sweden and across the globe.</span></li>
</ul>
<p><b>49:04</b> <a href="https://cloud.google.com/blog/products/identity-security/introducing-ai-protection-security-for-the-ai-era/" target="_blank" rel="noreferrer noopener"><b>Announcing AI Protection: Security for the AI era</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">As AI use increases, security remains a top concern, and they often hear that organizations are worried about risks that can come with rapid adoption. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google Cloud is committed to helping our customers confidently build and deploy AI in a secure, compliant and private manner. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google is making it easier to mitigate risk throughout the AI lifecycle. With their new </span><a href="http://cloud.google.com/security/securing-ai" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AI protection</span></a><span style="font-weight:400;">, a set of capabilities designed to safeguard AI workloads and data across clouds and models — irrespective of the platforms you choose to use. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">AI protection helps teams comprehensively manage AI risk by:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Discovering AI inventory in your environment and assessing it for potential vulnerabilities</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Securing AI assets with controls, policies and guardrails</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Managing threats against AI systems with detection, investigation, and response capabilities. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">AI protection is integrated with </span><a href="https://cloud.google.com/security/products/security-command-center" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">SCC</span></a><span style="font-weight:400;">, our Multi-cloud risk-management platform, so that security teams can get a centralized view of their AI posture and manage AI risks holistically in context with their other cloud risks. </span></li>
</ul>
<p><i><span style="font-weight:400;">50:28  Justin – “</span></i><i><span style="font-weight:400;">It pulls in a model armor, STP discovery, AI related toxic combinations, posture management for AI threat detection for AI, the notebook security scanner and the data security posture management. all into sec for this. Yeah. It’s pretty full featured out of the box, which I’m pretty impressed with for a Google product.”</span></i></p>
<p><b>50:54</b> <a href="https://cloud.google.com/blog/products/databases/introducing-tiered-storage-for-spanner/" target="_blank" rel="noreferrer noopener"><b>Introducing tiered storage for Spanner</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is announcing full managed tiered storage for Spanner, a new capability that lets you use larger datasets with Spanner by striking the right balance between cost and performance, while minimizing operational overhead through a simple, easy-to-use, interface. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Tiered storage with spanner addresses the challenge of hot and cold data, and allows you to tier based on hard disks that are 80% cheaper.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In addition to the cost savings you get ease of management, you get unified and consistent experience and flexibility and control. </span></li>
</ul>
<p><i><span style="font-weight:400;">51:31  Ryan – “</span></i><i><span style="font-weight:400;">This looks great. You know, the ability to have data stored cold and pay a lower price for it.”</span></i></p>
<h2><b>Azure</b></h2>
<p><b>51:57</b> <a href="https://azure.microsoft.com/en-us/blog/whats-new-in-azure-elastic-san/" target="_blank" rel="noreferrer noopener"><b>What’s new in Azure Elastic SAN</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">The least cloudiest service gets more features this week, released last year Azure Elastic San has new capabilities</span></li>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/storage/elastic-san/elastic-san-expand?tabs=azure-powershell-basesize%2Cazure-powershell-autoscale%2Cazure-powershell#autoscale-preview" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Autoscale for capacity</span></a><span style="font-weight:400;"> in public preview.</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Helps save you time by simplifying the management of the Elastic San, as you can set a policy for auto scaling your capacity when you are running out of storage rather than needing to actively track whether your storage is reaching its limits. </span></li>
</ul>
</li>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/storage/elastic-san/elastic-san-snapshots?tabs=azure-portal%22%20%5Co%20%22https://learn.microsoft.com/en-us/azure/storage/elastic-san/elastic-san-snapshots?tabs=azure-portal%22%20%5Ct%20%22_blank" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Snapshot support</span></a><span style="font-weight:400;"> is now GA. </span></li>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/storage/elastic-san/elastic-san-networking-concepts#data-integrity" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">CRC Protection</span></a><span style="font-weight:400;"> to maintain the integrity of your data</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Fully Validated and Optimized for costs with </span><a href="https://learn.microsoft.com/en-us/azure/azure-sql/virtual-machines/windows/failover-cluster-instance-azure-elastic-san-manually-configure?view=azuresql" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">SQL FCI</span></a><span style="font-weight:400;"> workloads</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Reduced TCO for Azure VMware on Elastic San</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Full AKS support</span></li>
</ul>
<p><i><span style="font-weight:400;">52:55  Ryan – “</span></i><i><span style="font-weight:400;">So if you’re using a storage shared model, running your database on in the container. Yeah, I don’t know. I mean, you know, these types of things are what I want. If I’m going to have to manage infrastructure at this level, I want it to be auto-scaling and fairly automatic.”</span></i></p>
<p><b>53:30</b> <a href="https://blogs.microsoft.com/on-the-issues/2025/02/26/microsoft-completes-landmark-eu-data-boundary-offering-enhanced-data-residency-and-transparency/" target="_blank" rel="noreferrer noopener"><b>Microsoft completes landmark EU Data Boundary, offering enhanced data </b></a><a href="https://blogs.microsoft.com/on-the-issues/2025/02/26/microsoft-completes-landmark-eu-data-boundary-offering-enhanced-data-residency-and-transparency/" target="_blank" rel="noreferrer noopener"><b>residency and transparency</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft has completed the </span><a href="https://www.microsoft.com/en-us/trust-center/privacy/european-data-boundary-eudb?msockid=3cf3cb184ba5674a3504de074a88665c" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">EU Data Boundary for the Microsoft Cloud</span></a><span style="font-weight:400;">, an industry leading solution that stores and processes public sector and commercial customer data in the EU and European Free Trade Association (EFTA.)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With the completion of the boundary, the European commercial and public sector customers are now able to store and process their customer data and pseudonymized personal data for Microsoft core cloud services including MS365, Dynamics 365, Power Platform and most Azure services within the EU and EFTA. </span></li>
</ul>
<p><i><span style="font-weight:400;">54:46  Ryan – “Hopefully it’s not just all duct tape and baling wire in the backend.”</span></i></p>
<p><b>55:04</b> <a href="https://techcommunity.microsoft.com/blog/appsonazureblog/azure-load-testing-celebrates-two-years-with-two-exciting-announcements/4389751" target="_blank" rel="noreferrer noopener"><b>Azure Load Testing Celebrates Two Years with Two Exciting </b></a><a href="https://techcommunity.microsoft.com/blog/appsonazureblog/azure-load-testing-celebrates-two-years-with-two-exciting-announcements/4389751" target="_blank" rel="noreferrer noopener"><b>Announcements!</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Azure Load Testing is celebrating its 2 year anniversary with a few announcements. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Starting March 1st, you’ll benefit from significant pricing changes including no monthly resource fee, eliminating the $10 monthly resource fee to help you save on overall costs. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">20% price reduction the cost per Virtual User Hour for &gt; 10,000 VUH is reduced from 7.5 cents to 6 cents, as well as the </span><a href="https://techcommunity.microsoft.com/blog/appsonazureblog/azure-load-testing-price-drop-and-usage-limits-to-supercharge-your-testing/4388534" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">consumption limit per resource</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They also are excited to announce </span><a href="https://techcommunity.microsoft.com/blog/appsonazureblog/run-locust-based-tests-in-azure-load-testing/4389373" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Locust-based tests</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This addition allows you to leverage the power, flexibility, and developer friendly nature of the Python-based Locust load testing framework, in addition to the already supported Apache Jmeter load testing framework.</span></li>
</ul>
<p><b>57:04</b> <a href="https://azure.microsoft.com/en-us/blog/announcing-the-responses-api-and-computer-using-agent-in-azure-ai-foundry/" target="_blank" rel="noreferrer noopener"><b>Announcing the Responses API and Computer-Using Agent in Azure AI </b></a><a href="https://azure.microsoft.com/en-us/blog/announcing-the-responses-api-and-computer-using-agent-in-azure-ai-foundry/" target="_blank" rel="noreferrer noopener"><b>Foundry</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://azure.microsoft.com/en-us/products/ai-foundry" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Azure Foundry</span></a><span style="font-weight:400;"> has added two new capabilities: responses API and the Computer Using Agent. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Covered in previous shows when OpenAI announced them… but don’t let Azure fool you into not thinking they’re innovating. </span></li>
</ul>
<h2><b>Oracle</b></h2>
<p><b>57:40</b> <a href="https://www.oracle.com/news/announcement/q3fy25-earnings-release-2025-03-10/" target="_blank" rel="noreferrer noopener"><b>Oracle Announces Fiscal 2025 Third Quarter Financial Results</b></a><b> </b></p>
<p><a href="https://www.marketwatch.com/story/oracle-won-some-big-cloud-contracts-heres-why-its-stock-is-falling-ec29af53" target="_blank" rel="noreferrer noopener"><b>Oracle won some big cloud contracts. Here’s why its stock is falling</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Oracle stock was a bit of a mixed bag to the analysts.  Fiscal third quarter earnings missed wall street expectations. Oracle shares surged last year amid the artificial-intelligence boom but are down 14% in 2025. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Oracle’s guidance for the fiscal fourth quarter was also below Wall Street’s expectations, implying fiscal 2025 revenue growth of 7.5% to 8% versus prior commentary of double digit growth, BNP Paribas analyst Stefan Slowinski pointed out in a note to clients. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Free cash flow is a bit of a challenge due to the large investments in AI which could lead to slower growth in the short term while they regain free cash flow.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Good luck, Azure.</span></li>
</ul>
<h2><span style="font-weight:400;">Closing</span></h2>
<p><span style="font-weight:400;">And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</span></p>
]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/1997617/c1e-zo9ob7wxg2cqmo3g-47dz4w7qtjq0-vrkxqd.mp3" length="71317859"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 296 of The Cloud Pod – where the forecast is always cloudy! Today is a twofer – Justin and Ryan are in the house to make sure you don’t miss out on any of today’s important cloud and AI news. From AI Protection, to Google Next, to Amazon Q Developer, we’ve got it all, this week on TCP! 
Titles we almost went with this week:

 Amazon Step Functions, walks step by step into my IDE
 Deepseek seeks the truth of “is it serverless or servers”? 
️ Well Architected Reviews by AI… What will my solutions architects do now? 
⌨️ The cloud pod hosts steps over the Azure EU Data Boundary
️ BYOIP to ALBs… only years too late for everyone.

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
General News 
01:02 HashiCorp and Red Hat, better together 

Hashicorp has more details on its future, with the recent IBM acquisition in this blog post. 
They talk about the wide range of Day 2 operations, including things like drift detection, image management and patching, rightsizing, and configuration management.  
As Red Hat Ansible is a purpose built operational management platform, it makes it easier to properly configure resources after the initial creation, but also to evolve the configuration after setup, and then execute ad-hoc playbooks to keep things running reliably and more securely at scale. 
Some additional things they’re exploring, now that the acquisition has closed:

Red Hat Ansible Inventory generated dynamically by Terraform. 
Official Terraform modules for Redhat Ansible, making it easier to trigger terraform from Ansible Playbooks.
Redhat and Hashicorp officially support the Red Hat Ansible Provider for Terraform, making it easier to trigger Ansible from Terraform.
Evolving Terraform provisioners to support a more comprehensive set of lifecycle integrations.
Improved mechanisms to invoke Ansible Playbooks outside of the resource provisioning lifecycle


Customers – not surprisingly – regularly inte...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/1997617/c1a-k5d5-ww6d43xotx42-pvhmbp.jpg"></itunes:image>
                                                                            <itunes:duration>00:59:26</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                            </item>
                    <item>
                <title>
                    <![CDATA[295: Skype follows Chime to the Grave]]>
                </title>
                <pubDate>Thu, 13 Mar 2025 14:12:30 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/1992506</guid>
                                    <link>https://tcpfm.castos.com/episodes/295-skype-follows-chime-to-the-grave</link>
                                <description>
                                            <![CDATA[<p style="text-align:center;"><span style="font-weight:400;">Welcome to episode 295 of The Cloud Pod – where the forecast is always cloudy! </span></p>
<p style="text-align:center;"><span style="font-weight:400;">Welp, it’s sayonara to Skype – and time to finally make the move to Teams. Hashi has officially moved to IBM, GPT 4.5 is out and people have…thoughts. Plus, Google has the career coach you need to make all your dreams come true.*</span></p>
<p style="text-align:center;"><span style="font-weight:400;">*Assuming those dreams are reasonable in a volatile economy. </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">Someday we’ll find it, the rainbow connection, the lovers, the cloud dreamers, and </span><span style="font-weight:400;">Me </span></li>
<li><span style="font-weight:400;">Dreamer, you know you are a dreamer</span></li>
<li><span style="font-weight:400;">☁️You may say I’m a cloud dreamer, but I’m not the only one</span></li>
<li><span style="font-weight:400;">May the skype shut down</span></li>
<li><span style="font-weight:400;">Q can tell me that my python skills are bad</span></li>
<li><span style="font-weight:400;">How many free code assistance does Ryan need to be a good developer: ALL OF </span><span style="font-weight:400;">THEM</span></li>
<li><span style="font-weight:400;">Oops honey I spent 1M dollars on oracle</span></li>
<li><span style="font-weight:400;">Latest Cloud Pod Reviews: “It’s a Lemon”</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>General News </b></h2>
<p><b>01:04 </b><a href="https://arstechnica.com/gadgets/2025/02/on-may-5-microsofts-skype-will-shut-down-for-good/" target="_blank" rel="noreferrer noopener"><b>On May 5, Microsoft’s Skype will shut down for good</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">In what we swear is the 9th death for Skype, Microsoft has announced that after 21 years (with 13 of those years under MS Control,) Skype will be no more. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">For real this time. Really. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">May 5th is the official last day of Skype, and they’ve indicated you can continue your calls and chats in Teams. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Starting now, you should be able to use your Skype login to get into Teams. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">For those of you who do this, you’ll see all your existing contacts and chats in Teams. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Alternatively, you can export your Skype data, specifically contacts, call history and chats. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Current subscribers to Skype Premium services will remain active until the end, but you will not be able to sign up for Skype at this time. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Skype dial pad credits will remain active in the web interface and inside Teams after May 5th so you can finish using those credits. </span></li>
</ul>
<p><i><span style="font-weight:400;">03:37  Matthew  – “</span></i><i><span style="font-weight:400;">I think there’s a lot of people and, you know, at least people I know in other countries to still use Skype, like pretty heavily for like cross country communications, things along those lines. So I think a lot of that is that there probably is still a good amount of people using it. And this is just, Hey, they’re trying to make it nicely...</span></i></p>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 295 of The Cloud Pod – where the forecast is always cloudy! 
Welp, it’s sayonara to Skype – and time to finally make the move to Teams. Hashi has officially moved to IBM, GPT 4.5 is out and people have…thoughts. Plus, Google has the career coach you need to make all your dreams come true.*
*Assuming those dreams are reasonable in a volatile economy. 
Titles we almost went with this week:

Someday we’ll find it, the rainbow connection, the lovers, the cloud dreamers, and Me 
Dreamer, you know you are a dreamer
☁️You may say I’m a cloud dreamer, but I’m not the only one
May the skype shut down
Q can tell me that my python skills are bad
How many free code assistance does Ryan need to be a good developer: ALL OF THEM
Oops honey I spent 1M dollars on oracle
Latest Cloud Pod Reviews: “It’s a Lemon”

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
General News 
01:04 On May 5, Microsoft’s Skype will shut down for good 

In what we swear is the 9th death for Skype, Microsoft has announced that after 21 years (with 13 of those years under MS Control,) Skype will be no more. 
For real this time. Really. 
May 5th is the official last day of Skype, and they’ve indicated you can continue your calls and chats in Teams. 
Starting now, you should be able to use your Skype login to get into Teams. 
For those of you who do this, you’ll see all your existing contacts and chats in Teams. 
Alternatively, you can export your Skype data, specifically contacts, call history and chats. 
Current subscribers to Skype Premium services will remain active until the end, but you will not be able to sign up for Skype at this time. 
Skype dial pad credits will remain active in the web interface and inside Teams after May 5th so you can finish using those credits. 

03:37  Matthew  – “I think there’s a lot of people and, you know, at least people I know in other countries to still use Skype, like pretty heavily for like cross country communications, things along those lines. So I think a lot of that is that there probably is still a good amount of people using it. And this is just, Hey, they’re trying to make it nicely...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[295: Skype follows Chime to the Grave]]>
                </itunes:title>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p style="text-align:center;"><span style="font-weight:400;">Welcome to episode 295 of The Cloud Pod – where the forecast is always cloudy! </span></p>
<p style="text-align:center;"><span style="font-weight:400;">Welp, it’s sayonara to Skype – and time to finally make the move to Teams. Hashi has officially moved to IBM, GPT 4.5 is out and people have…thoughts. Plus, Google has the career coach you need to make all your dreams come true.*</span></p>
<p style="text-align:center;"><span style="font-weight:400;">*Assuming those dreams are reasonable in a volatile economy. </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">Someday we’ll find it, the rainbow connection, the lovers, the cloud dreamers, and </span><span style="font-weight:400;">Me </span></li>
<li><span style="font-weight:400;">Dreamer, you know you are a dreamer</span></li>
<li><span style="font-weight:400;">☁️You may say I’m a cloud dreamer, but I’m not the only one</span></li>
<li><span style="font-weight:400;">May the skype shut down</span></li>
<li><span style="font-weight:400;">Q can tell me that my python skills are bad</span></li>
<li><span style="font-weight:400;">How many free code assistance does Ryan need to be a good developer: ALL OF </span><span style="font-weight:400;">THEM</span></li>
<li><span style="font-weight:400;">Oops honey I spent 1M dollars on oracle</span></li>
<li><span style="font-weight:400;">Latest Cloud Pod Reviews: “It’s a Lemon”</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>General News </b></h2>
<p><b>01:04 </b><a href="https://arstechnica.com/gadgets/2025/02/on-may-5-microsofts-skype-will-shut-down-for-good/" target="_blank" rel="noreferrer noopener"><b>On May 5, Microsoft’s Skype will shut down for good</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">In what we swear is the 9th death for Skype, Microsoft has announced that after 21 years (with 13 of those years under MS Control,) Skype will be no more. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">For real this time. Really. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">May 5th is the official last day of Skype, and they’ve indicated you can continue your calls and chats in Teams. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Starting now, you should be able to use your Skype login to get into Teams. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">For those of you who do this, you’ll see all your existing contacts and chats in Teams. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Alternatively, you can export your Skype data, specifically contacts, call history and chats. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Current subscribers to Skype Premium services will remain active until the end, but you will not be able to sign up for Skype at this time. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Skype dial pad credits will remain active in the web interface and inside Teams after May 5th so you can finish using those credits. </span></li>
</ul>
<p><i><span style="font-weight:400;">03:37  Matthew  – “</span></i><i><span style="font-weight:400;">I think there’s a lot of people and, you know, at least people I know in other countries to still use Skype, like pretty heavily for like cross country communications, things along those lines. So I think a lot of that is that there probably is still a good amount of people using it. And this is just, Hey, they’re trying to make it nicely. So how, you know, nice and clean cut over for people versus, you know, the Apple method of it just doesn’t work anymore. Good luck.”</span></i></p>
<p><b>04:41</b> <a href="https://www.hashicorp.com/en/blog/hashicorp-officially-joins-the-ibm-family" target="_blank" rel="noreferrer noopener"><b>HashiCorp officially joins the IBM family</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">IBM has finished the acquisition of </span><a href="https://www.hashicorp.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">HashiCorp</span></a><span style="font-weight:400;">, which they had announced last year.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Armon Dadgar wrote a blog post reflecting on the journey that Hashicorp has been on; he talks about the future and that his goal is to have Hashicorp in every datacenter. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">He says while they have made strides towards that goal, he feels incredibly optimistic with IBM, since they gain access to their global scale and increased R&amp;D resources. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">There are also integration opportunities of IBM and the </span><a href="https://www.redhat.com/en/technologies/all-products" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">RedHat Portfolio</span></a><span style="font-weight:400;">. Integrating </span><a href="https://developer.hashicorp.com/terraform/language/resources/provisioners/syntax" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Terraform for provisionin</span></a><span style="font-weight:400;">g with </span><a href="https://docs.ansible.com/ansible/latest/index.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Ansible</span></a><span style="font-weight:400;"> for configuration management will enable an end to end approach to infrastructure automation as code, while integrating terraform with cloudability will provide native </span><a href="https://www.finops.org/introduction/what-is-finops/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Finops</span></a><span style="font-weight:400;"> capabilities to manage and optimize costs at scale.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Vault integration with </span><a href="https://www.bing.com/aclk?ld=e8C6XOkQiYi6YTBvAPR8mlDTVUCUxWqjOJRMlJ8A8v0bw4XJGj6MF7RlDzEL19SOmucWpIKXzz2Cnu4GS2P_abix_zXb5Vk1xx-3OAOgnZshfuZHjsFmr89SWn8ZLFeg8Wt1SuczqQe72wJb2bQPjeoMqBnIH4O8JootUbZijfgCNU1rDkbE12-gfYd-uLT1p_NsVkyg&amp;u=aHR0cHMlM2ElMmYlMmZ3d3cuY2R3LmNvbSUyZmNvbnRlbnQlMmZjZHclMmZlbiUyZmJyYW5kJTJmcmVkaGF0Lmh0bWwlM2ZjbV92ZW4lM2RhY3F1aXJneSUyNmNtX2NhdCUzZGJpbmclMjZjbV9wbGElM2RTMyUyYlJlZCUyYkhhdCUyNmNtX2l0ZSUzZFJlZCUyYkhhdCUyYk9wZW5TaGlmdCUyYkUlMjZzX2t3Y2lkJTNkQUwhNDIyMyExMCE3MzY2NzU1MDYwMzcwMiEhISE3MzY2NzQ4MDk1NDAxMCEhMzYxNDY1MjM0ITExNzg2NzczNDc5ODY2NzglMjZlZl9pZCUzZGEzMTVlMDNjOTBlMzE1NmQzN2MyODdlYzFmZjI2NjEzJTNhRyUzYXMlMjZtc2Nsa2lkJTNkYTMxNWUwM2M5MGUzMTU2ZDM3YzI4N2VjMWZmMjY2MTMlMjNvcGVuc2hpZnQ&amp;rlid=a315e03c90e3156d37c287ec1ff26613" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OpenShift</span></a><span style="font-weight:400;">, Ansible and </span><a href="https://www.bing.com/aclk?ld=e8xb8qoQM-0_jA8uAWXrU2sTVUCUz03N5zuVDuL3vqf68xl8JxQEVeYjc8Q_HrAEOcCUmdBmoSSV3GvTTA3cUHNybyks_R7dovSTQiInXmRHk3vwZ1nHgjZ7BpSalZmQDv6ynWfFxyAE8irtsHsLDBxVxUqPQ9VXMblDbEwOBuwoX6rVy_KgxcoePF139Az2wuVXJbCg&amp;u=aHR0cHMlM2ElMmYlMmZhZC5kb3VibGVjbGljay5uZXQlMmZzZWFyY2hhZHMlMmZsaW5rJTJmY2xpY2slM2ZsaWQlM2Q0MzcwMDA4MTIwMDE0OTc1MSUyNmRzX3Nfa3dnaWQlM2Q1ODcwMDAwODgyMDYxNTU4OCUyNmRzX2FfY2lkJTNkNDA1NTA3ODI5JTI2ZHNfYV9jYWlkJTNkMjE5OTc1NjQ5NDUlMjZkc19hX2FnaWQlM2QxNzIyODk0NzM1OTIlMjZkc19hX2xpZCUzZGt3ZC0zNjE0NzA1OTYlMjYlMjZkc19lX2FkaWQlM2Q4MTM2Mzk4OTUwMzI5MCUyNmRzX2VfdGFyZ2V0X2lkJTNka3dkLTgxMzY0MjAwMzY4MzE1JTNhbG9jLTE5MCUyNiUyNmRzX2VfbmV0d29yayUzZG8lMjZkc191cmxfdiUzZDIlMjZkc19kZXN0X3VybCUzZGh0dHBzJTNhJTJmJTJmd3d3LmlibS5jb20lMmZndWFyZGl1bSUzZnV0bV9jb250ZW50JTNkU1JDV1clMjZwMSUzZFNlYXJjaCUyNnA0JTNkNDM3MDAwODEyMDAxNDk3NTElMjZwNSUzZGUlMjZwOSUzZDU4NzAwMDA4ODIwNjE1NTg4JTI2Z2NsaWQlM2QxZDczM2I5ODNmMGUxYWMwYjE1MTVjNDhiMWI4NTRlMSUyNmdjbHNyYyUzZDNwLmRzJTI2JTI2bXNjbGtpZCUzZDFkNzMzYjk4M2YwZTFhYzBiMTUxNWM0OGIxYjg1NGUx&amp;rlid=1d733b983f0e1ac0b1515c48b1b854e1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Guardium</span></a><span style="font-weight:400;"> will bring world-class secrets management to those platforms and reduce the integration burden on end users. </span></li>
</ul>
<p><i><span style="font-weight:400;">05:44  Justin – “</span></i><i><span style="font-weight:400;">BM is gonna make a bunch of money if they force me to use Vault and Terraform Enterprise for all those capabilities. you know, HashiCorp was never shy to charge you at least $400,000. That was the starting price for pretty much everything.”</span></i></p>
<h2><b>AI Is Going Great, Or How ML Makes Money </b></h2>
<p><b>06:34</b> <a href="https://openai.com/index/introducing-gpt-4-5/" target="_blank" rel="noreferrer noopener"><b>Introducing GPT-4.5</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OpenAI</span></a><span style="font-weight:400;"> has launched </span><a href="https://openai.com/index/introducing-gpt-4-5/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">GPT 4.5</span></a><span style="font-weight:400;">, their largest and best model for chat yet. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">GPT 4.5 is a step forward in scaling up pre-training and post-training. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Early testing shows that GTP 4.5 feels more natural. With a broader knowledge base, improved ability to follow user intent and greater “EQ” make it useful for tasks like improving writing, programming and solving practical problems. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They expect it to hallucinate less. </span></li>
</ul>
<p><span style="font-weight:400;">And on that note….</span></p>
<p><b>08:08</b> <a href="https://garymarcus.substack.com/p/hot-take-gpt-45-is-a-nothing-burger" target="_blank" rel="noreferrer noopener"><b>Hot take: GPT 4.5 is a nothing burger</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Gary Marcus, author of rebooting AI, and founder and CEO of geometric intelligence (</span><a href="https://techcrunch.com/2016/12/05/uber-acquires-geometric-intelligence-to-create-an-ai-lab/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">acquired by Uber</span></a><span style="font-weight:400;">) called Chat GPT 4.5 a </span><a href="https://en.wikipedia.org/wiki/Nothingburger" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">nothing burger</span></a><span style="font-weight:400;">.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">He previously predicted that GPT 4.5 wouldn’t be that impressive, and that the pure scaling of LLMs (adding more data and compute) has hit the wall. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">He claims he was right. Hallucinations didn’t disappear, and nor did stupid errors. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">He points out both </span><a href="https://x.ai/news/grok-3" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Grok 3</span></a><span style="font-weight:400;"> and </span><a href="https://openai.com/index/introducing-gpt-4-5/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">GPT4.5</span></a><span style="font-weight:400;"> didn’t fundamentally change anything, and both are barely better than </span><a href="https://www.anthropic.com/news/claude-3-5-sonnet" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Claude 3.5</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">He quotes other AI forecasters who moved projections for AGI to later, and even pointed to Sam Atman’s rather tepid tweet regarding GPT 4.5. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Sam Altman also says they didn’t drop plus and pro at the same time because it’s a giant, expensive model and they need tens of thousands of GPU’s to roll it out to plus tier.  He also says its not a reasoning model and it won’t crush benchmarks</span></li>
</ul>
<p><i><span style="font-weight:400;">09:13  Ryan – “</span></i><i><span style="font-weight:400;">It’s interesting because it’s in the consumer space, like you got to have flashy changes that dramatically change the user experience, right? So it’s like you always want to do incremental improvements. But if you’re announcing large bottle stuff, you know, it’s going to have a huge effect on your stock value. If the new stuff is just more expensive and more of the same. So it’ll be fun to see as they navigate this because it’s a new business model and uncharted territory.”</span></i></p>
<p><b>09:15</b> <a href="https://arstechnica.com/ai/2025/02/its-a-lemon-openais-largest-ai-model-ever-arrives-to-mixed-reviews/" target="_blank" rel="noreferrer noopener"><b>“It’s a lemon”—OpenAI’s largest AI model ever arrives to mixed reviews</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">The bad reviews for 4.5 weren’t just from Gary Marcus.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Ars Technica reported that it’s a “lemon”.  Ouch. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Big, expensive and slow, providing only marginally better performance than </span><a href="https://arstechnica.com/information-technology/2024/05/chatgpt-4o-lets-you-have-real-time-audio-video-conversations-with-emotional-chatbot/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">GPT 4o</span></a><span style="font-weight:400;"> at 30x the cost for input and 15x the cost for outputs. </span></li>
</ul>
<p><b>10:16</b> <a href="https://www.reuters.com/technology/microsoft-will-urge-trump-overhaul-curbs-ai-chip-exports-wsj-reports-2025-02-27/" target="_blank" rel="noreferrer noopener"><b>Microsoft urges Trump to overhaul Biden’s last AI-chip export curbs</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">MSFT is urging the Trump Administration to ease export restrictions imposed on AI chips.  Microsoft says the rules disadvantage allies, including India, Switzerland and Israel, and limit the ability for US tech companies to build and expand AI data centers in those countries. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Tighter US restrictions on the exports of advanced AI chips to Beijing are keeping American chipmakers and big tech from serving one of the largest markets for semiconductors, accelerating a global race for AI infrastructure dominance.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft says this will force some allies to turn to the Chinese market in the absence of sufficient supply of US tech.  Left unchanged, the rule will give China strategic advantage in spreading over time its own AI technology, echoing its rapid ascent in 5G telecommunications a decade ago. </span></li>
</ul>
<p><i><span style="font-weight:400;">12:21  Ryan – “</span></i><i><span style="font-weight:400;">Which is basically what we saw with DeepSeek. They basically said, well, we can’t get these chips, so we’re going to figure out a cheaper way to build a model and then cause everyone to have pain. But the other reality is that I’m sure China is getting access to all these chips through some other country who doesn’t have quite the same restriction controls. They buy all the chips from the US, then they sell them on the dark market to China, I’m sure, if they really wanted them.”</span></i></p>
<h2><b>AWS</b></h2>
<p><b>13:16</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/02/aws-chatbot-named-amazon-q-developer/" target="_blank" rel="noreferrer noopener"><b>AWS Chatbot is now named Amazon Q Developer</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS Chatbot is now called </span><a href="https://console.aws.amazon.com/chatbot" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Q Developer</span></a><span style="font-weight:400;">.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The new name recognizes the integration of Amazon Q Developer, the most capable generative AI powered assistant for software development, in Microsoft Teams and Slack to manage and optimize AWS resources.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With Q Developer, customers can monitor, operate and troubleshoot AWS resources in chat channels faster. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Customers can quickly retrieve telemetry and ask questions to understand the state of their resources. </span></li>
</ul>
<p><i><span style="font-weight:400;">14:03  Justin – “</span></i><i><span style="font-weight:400;">So AWS Chatbot is a very simple, I’m going to make a request and I have to use a certain syntax in the AWS chatbot to Slack. And then it calls the API and it returns data from the API that Amazon provides that I’ve synchronized and I have authorized. And it provides accurate data back to me. Amazon Q does not provide reliable data ever. It provides hallucinations. So if I ask it like how many Graviton based computers am I running in this region? And it comes back and says 32. Can I trust that there’s 32 boxes running or do I have to go double check it now because you’re using an LLM in the middle of this thing that doesn’t know what the hell it’s doing.”</span></i></p>
<p><b>21:06</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/02/amazon-ecs-additional-iam-condition-keys/" target="_blank" rel="noreferrer noopener"><b>Amazon ECS adds support for additional IAM condition keys</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/ecs" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">ECS</span></a><span style="font-weight:400;"> has launched 8 new service-specific condition keys for IAM. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">These new condition keys let you create IAM policies and SCPs to better enforce your organizational policies in containerized environments. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">IAM condition keys allow you to author policies that enforce access control based on API request context. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With today’s release ECS has added condition keys that allow you to enforce policies related to resource configuration (ecs:task-cpu, ecs:task:memory and ecs:compute-compatibility), container privileges (ecs:privileged), network configuration (ecs:auto-assign-public-ip and ecs:subnet) and tag propagation (ecs:propagate tags and ecs:enable-ecs-managed-tags) for your applications deployed on ECS. </span></li>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/AmazonECS/latest/APIReference/API_CreateService.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">https://docs.aws.amazon.com/AmazonECS/latest/APIReference/API_CreateService.html</span></a><span style="font-weight:400;"> </span></li>
</ul>
<p><i><span style="font-weight:400;">23:44  Matthew – “</span></i><i><span style="font-weight:400;">It’s a subset of the create service, which has grant permission to run and maintain the desired number of tasks from a specified task definition via service. So I think I might be right with the CPU task in there, where you could say you can’t create a CPU of a certain thing.”</span></i></p>
<p><b>26:55</b></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2025/02/kubernetes-versions-amazon-eks-anywhere/" target="_blank" rel="noreferrer noopener"><b>Announcing extended support for Kubernetes versions for Amazon EKS </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2025/02/kubernetes-versions-amazon-eks-anywhere/" target="_blank" rel="noreferrer noopener"><b>Anywhere</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS is announcing extended support for </span><a href="https://kubernetes.io/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">K8</span></a><span style="font-weight:400;"> versions of </span><a href="https://anywhere.eks.amazonaws.com/docs/concepts/support-versions/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">EKS Anywhere</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With extended support for K8 versions for EKS Anywhere, you continue to receive security patches for clusters on any K8 version for up to 26 months after the version is released in EKS anywhere. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Extended support for the K8 version for EKS anywhere is available for K8 1.28 and above. </span></li>
</ul>
<p><i><span style="font-weight:400;">27:20  Justin – “So, if you’re worried about </span></i><i><span style="font-weight:400;">the long-term supportability of Kubernetes and you don’t want to upgrade it every month, as you probably should, you can now get 26 months of support.”</span></i></p>
<p><b>27:55</b> <a href="https://aws.amazon.com/blogs/aws/get-insights-from-multimodal-content-with-amazon-bedrock-data-automation-now-generally-available/" target="_blank" rel="noreferrer noopener"><b>Get insights from multimodal content with Amazon Bedrock Data </b></a><a href="https://aws.amazon.com/blogs/aws/get-insights-from-multimodal-content-with-amazon-bedrock-data-automation-now-generally-available/" target="_blank" rel="noreferrer noopener"><b>Automation, now generally available</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Announced at Re:Invent, </span><a href="https://aws.amazon.com/bedrock/bda/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Bedrock Automation</span></a><span style="font-weight:400;"> is a feature to streamline the generation of valuable insights from unstructured, multi-modal content such as docs, images, audio and video. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The takeaway here is reducing the development time and effort to build intelligent document processing, media analysis, and other multimodal data-centric automation solutions.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Now, this capability is generally available with support for cross region inference endpoints to be available in more regions and seamlessly use compute across different locations. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Based on feedback during the previous, they have also improved accuracy and added support for logo recognition from images and videos.  </span></li>
</ul>
<h2><b>GCP</b></h2>
<p><b>29:24</b> <a href="https://blog.google/technology/developers/gemini-code-assist-free/" target="_blank" rel="noreferrer noopener"><b>Get coding help from Gemini Code Assist — now for free</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is giving you </span><a href="https://codeassist.google/products/individual" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Gemini Code Assist for individuals</span></a><span style="font-weight:400;"> for free.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">If you can’t sell it – giving it to engineers and then going after them for licensing violations is always a great move. </span></li>
</ul>
<p><b>31:47</b> <a href="https://cloud.google.com/blog/topics/training-certifications/career-dreamer-tool-shows-google-cloud-careers-and-credentials/" target="_blank" rel="noreferrer noopener"><b>Discover Google Cloud careers and credentials in our new Career Dreamer</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google says if you have never worked in the cloud, it can be hard to know where to start. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Even if you’re a seasoned cloud architect, how do you pivot to your next big thing? And once you find it, once you’ve pinpointed the career of your dreams, the biggest hurdle of all is knowing the skills and training that will help you get there. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">If you are dreaming of a new direction in their careers, or a new one entirely, Google is here to help with </span><a href="https://grow.google/career-dreamer/home/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Career Dreamer</span></a><span style="font-weight:400;">.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google gives you an AI powered career solution, where you can go and determine the skills and things you need to learn for your next dream role – all personalized to you. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The first step is going through the questionnaire, and then creating a custom prompt for you to use in Gemini to act as your career coach. (Copywriter note: Just don’t let it coach you into copywriting.)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It will even point you to the training sources you need, like </span><a href="https://www.cloudskillsboost.google/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Google Cloud Skills Boost</span></a><span style="font-weight:400;"> and </span><a href="https://grow.google/enroll-certificates" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Google Career Certificates</span></a><span style="font-weight:400;">. Betcha can’t wait to put those on your LinkedIn profile! </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Interested in learning more? Sure you are. </span><a href="https://cloud.google.com/blog/topics/training-certifications/certification-research-roi-value-of-training-ipsos-report-new-certs/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Get Google Cloud certified in 2025—and see why the latest research says it matters</span></a><span style="font-weight:400;">. </span></li>
</ul>
<p><i><span style="font-weight:400;">32:27  Ryan – “</span></i><i><span style="font-weight:400;">This is way better than my usual method, which is complaining about something until they just give you that responsibility to make it your job to fix it, which is how I’ve advanced through my career.”</span></i></p>
<p><b>34:52</b> <a href="https://cloud.google.com/blog/products/databases/enhancing-alloydb-vector-search-with-inline-filtering-and-enterprise-observability/" target="_blank" rel="noreferrer noopener"><b>Enhancing AlloyDB vector search with inline filtering and enterprise </b></a><a href="https://cloud.google.com/blog/products/databases/enhancing-alloydb-vector-search-with-inline-filtering-and-enterprise-observability/" target="_blank" rel="noreferrer noopener"><b>observability</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is introducing a new enhancement to help you get even more out of vector search in </span><a href="https://cloud.google.com/products/alloydb" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AlloyDB</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">First, we are launching inline filtering, a major performance enhancement to filter vector search in AlloyDB.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Being able to perform vector search directly in the database, instead of post-processing on the application side, inline filtering helps ensure that searches are fast, accurate and efficient, automatically combining the best of </span><a href="https://cloud.google.com/alloydb/docs/ai/store-index-query-vectors?resource=scann" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">vector indexes</span></a><span style="font-weight:400;"> and traditional indexes on metadata columns to achieve better query performance. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Second, we are launching enterprise grade observability and management tooling for vector indexes to help ensure stable performance and the highest quality search results. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This includes a new recall evaluator, or built in tooling for evaluating </span><a href="https://en.wikipedia.org/wiki/Precision_and_recall" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">recall</span></a><span style="font-weight:400;">, a key metric for vector search quality. You no longer have to build your own measurement pipeline and process for your apps to deliver good results. </span></li>
</ul>
<p><b>38:30</b> <a href="https://cloud.google.com/blog/products/databases/new-terraform-provider-for-oracle-database-at-google-cloud/" target="_blank" rel="noreferrer noopener"><b>Announcing Terraform providers for Oracle Database@Google Cloud</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is sharing the GA of Terraform Providers for Oracle Database@Google CLoud. You can now deploy and manage Oracle Autonomous Database and Oracle Exadata Database Service resources using the Google Terraform provider.  The release compliments the existing gcloud and google cloud console capabilities. </span></li>
</ul>
<p><i><span style="font-weight:400;">38:44  Justin – “</span></i><i><span style="font-weight:400;">I’ve always dreamed of being able to bankrupt a company with Terraform apply for my Oracle Exadata use cases. So thank you for that, Google. I really appreciate it.”</span></i></p>
<h2><b>Azure</b></h2>
<p><b>41:10 </b><a href="https://azure.microsoft.com/en-us/blog/announcing-new-models-customization-tools-and-enterprise-agent-upgrades-in-azure-ai-foundry/" target="_blank" rel="noreferrer noopener"><b>Announcing new models, customization tools, and enterprise agent </b></a><a href="https://azure.microsoft.com/en-us/blog/announcing-new-models-customization-tools-and-enterprise-agent-upgrades-in-azure-ai-foundry/" target="_blank" rel="noreferrer noopener"><b>upgrades in Azure AI Foundry</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><a href="https://azure.microsoft.com/en-us/products/ai-foundry" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Azure AI Foundry</span></a><span style="font-weight:400;"> is getting support for </span><a href="https://openai.com/index/introducing-gpt-4-5/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Open AI’s GPT 4.5</span></a><span style="font-weight:400;"> in preview on </span><a href="https://www.bing.com/aclk?ld=e8QEyCATce1dHXQ8EEcETm3DVUCUyrARI1Hs9MSIXdO1mwqUj3rgFltvnU2C6wMwccOxYDibdwjLg8j-ZCGTBp9fQfd8nIO76AETgEuX_J1rmAjuR5Rfw458KbZ7YT4IWVpxqPzVgCKAVxNTC7viKtGOPOKqCk_ItJdXDBDEQv8wepUbEDcWMtD5cTRI3HWKXQexSWYA&amp;u=aHR0cHMlM2ElMmYlMmZhenVyZS5taWNyb3NvZnQuY29tJTJmZW4tdXMlMmZwcmljaW5nJTJmcHVyY2hhc2Utb3B0aW9ucyUyZmF6dXJlLWFjY291bnQlMmZzZWFyY2glM2ZlZl9pZCUzZF9rX2Q3OTU2OTNjMzE4NDE1ZmU2ZTFmMGVlZDI1OGU3MTI4X2tfJTI2T0NJRCUzZEFJRGNtbTVlZHN3ZHV1X1NFTV9fa19kNzk1NjkzYzMxODQxNWZlNmUxZjBlZWQyNThlNzEyOF9rXyUyNm1zY2xraWQlM2RkNzk1NjkzYzMxODQxNWZlNmUxZjBlZWQyNThlNzEyOA&amp;rlid=d795693c318415fe6e1f0eed258e7128" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Azure</span></a><span style="font-weight:400;"> Open AI. The research preview demonstrates improvements from scaling pre and post-training a step forward in unsupervised learning techniques. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Natural integrations with broader knowledge, higher “EQ” can help to improve coding, writing and problem-solving tasks</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Accuracy and hallucinations: with lower hallucination rates (37.1% vs 61.8%) and higher accuracy 62.5% vs 3.8% compared to GPT-4o</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Stronger human alignment improves the ability to follow instructions, understand nuance and engage in natural languages. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">The latest wave of AI models from </span><a href="https://azure.microsoft.com/en-us/blog/empowering-innovation-the-next-generation-of-the-phi-family/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Microsoft Phi</span></a><span style="font-weight:400;"> continue to push boundaries of what’s possible with smaller and more efficient architectures:</span>
<ul>
<li style="font-weight:400;"><a href="https://ai.azure.com/explore/models/Phi-4-multimodal-instruct/version/1/registry/azureml?tid=72f988bf-86f1-41af-91ab-2d7cd011db47" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Phi-4-multimodal</span></a><span style="font-weight:400;"> unifies text, speech, and vision for context aware interactions. Retail kiosks can now diagnose product issues via camera and voice inputs, eliminating the need for complex manual descriptions. </span></li>
<li style="font-weight:400;"><a href="https://ai.azure.com/explore/models/Phi-4-mini-instruct/version/1/registry/azureml" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Phi-4-mini</span></a><span style="font-weight:400;"> packs impressive performance in just 3.8 billion parameters with a 128k context window. Outperforming larger models on math and coding, and increased inference speed by 30%</span></li>
<li style="font-weight:400;"><a href="https://azure.microsoft.com/en-us/blog/empowering-innovation-the-next-generation-of-the-phi-family/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Empowering innovation: The next generation of the Phi family</span></a></li>
</ul>
</li>
<li style="font-weight:400;"><a href="https://techcommunity.microsoft.com/blog/machinelearningblog/introducing-stability-ai-generative-visual-models-to-azure-ai-foundry/4377271" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Stability AI</span></a><span style="font-weight:400;"> Models with advanced generating techniques:</span>
<ul>
<li style="font-weight:400;"><a href="https://ai.azure.com/explore/models/Stable-Diffusion-3.5-Large/version/1/registry/azureml-stabilityai" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Stable Diffusion 3.5 Large</span></a></li>
<li style="font-weight:400;"><a href="https://ai.azure.com/explore/models/Stable-Image-Ultra/version/1/registry/azureml-stabilityai" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Stable Image Ultra</span></a></li>
<li style="font-weight:400;"><a href="https://ai.azure.com/explore/models/Stable-Image-Core/version/1/registry/azureml-stabilityai" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Stable Image Core</span></a></li>
</ul>
</li>
<li style="font-weight:400;"><a href="https://techcommunity.microsoft.com/blog/machinelearningblog/better-search-smarter-ai-cohere-rerank-v3-5-launches-on-azure-ai-foundry/4386392" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Cohere</span></a><span style="font-weight:400;"> enhanced retrieval capabilities with </span><a href="https://ai.azure.com/explore/models/Cohere-rerank-v3.5/version/1/registry/azureml-cohere" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Cohere ReRank 3.5</span></a></li>
<li style="font-weight:400;"><span style="font-weight:400;">GPT 4.0 family expansion with </span><a href="https://ai.azure.com/explore/models/gpt-4o-audio-preview/version/2024-12-17/registry/azure-openai" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Audio</span></a><span style="font-weight:400;"> and </span><a href="https://ai.azure.com/explore/models/gpt-4o-realtime-preview/version/2024-12-17/registry/azure-openai" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Real Time preview</span></a></li>
<li style="font-weight:400;"><span style="font-weight:400;">Plus you get all new customization tools</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Distillation workflows</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Reinforcement fine-tuning</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Fine Tuning for </span><a href="https://ai.azure.com/explore/models?selectedCollection=mistral" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Mistral</span></a></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">As well as support for bringing </span><a href="https://aka.ms/AgentService_Enterprise" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">your own Vnet</span></a><span style="font-weight:400;"> for AI Agent interaction and </span><a href="https://ai.azure.com/labs/projects/magma" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Magma (Multi-Agent Goal Management architecture)</span></a><span style="font-weight:400;"> via </span><a href="https://ai.azure.com/labs" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Foundry Labs</span></a><span style="font-weight:400;">. </span></li>
</ul>
<p><i><span style="font-weight:400;">43:06  Ryan – “</span></i><i><span style="font-weight:400;">I do like the idea of those mini packs because I think that that’s that I’m more interested in that side versus the GPT 4.5 model. Like, cause I think that, you know, can have these giant mega models with all the information in them. But I mean, maybe it’s just my usage of AI is pretty simplistic too, but you know, their example of, know, being able to sort of take a, you know, different sets of information where it’d be visual text and then come up with a, like a repair program. Like that is, you know, like that’s the use case I’m more interested in versus just giant things. So that’s kind of neat.”</span></i></p>
<p><b>44:20</b> <a href="https://techcommunity.microsoft.com/blog/azure-ai-services-blog/announcing-provisioned-deployment-for-azure-openai-service-fine-tuning/4385752" target="_blank" rel="noreferrer noopener"><b>Announcing Provisioned Deployment for Azure OpenAI Service Fine-tuning</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">After fine tuning your models to make your agents behave and speak the way you like, you’ve scaled up your RAG apps – and now customers want it to be snappier and more responsive.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Luckily, Azure OpenAI service is offering (in preview) provisioned deployments for fine-tuned models, giving your applications predictable performance and predictable cost. </span></li>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/provisioned-throughput" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Provision throughput</span></a><span style="font-weight:400;"> allows you to purchase capacity in terms of performance needs instead of per token.  With fine-tuned deployments, it replaces both the hosting fee and token based billing of standard and global standard with a throughput based capacity unit called PTUs. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">If you’re already using Provisioned Throughput units with base models, they work identically in fine tuned models and are completely </span><a href="https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/provisioned-migration#model-independent-quota" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">interchangeable</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The two models you can provision deployments for are gpt-4o and gpt-4o-mini in North Central US or Switzerland with more regions coming in the future.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Note: if you want another region, click </span><a href="https://learn.microsoft.com/en-us/troubleshoot/azure/general/region-access-request-process" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">here</span></a><span style="font-weight:400;"> and hit the “submit a request” button to get it considered for GA.  </span></li>
</ul>
<p><i><span style="font-weight:400;">45:40  Matthew – “</span></i><i><span style="font-weight:400;">Well, that’s the problem; when you deploy your new app with a new thing, you’re like, OK, do I do provision? Do I hit my limits? And in Azure, and definitely some of the smaller regions or other regions than the primary ones like North Central, East US to those ones. You can hit those limits pretty easily and all of sudden then you get token limits or other errors that occur. So it’s like, you know, do you provision it or pay upfront, or do you risk a new feature of your app having an issue? Do you want your CFO yelling at you, or your customer?”</span></i></p>
<p><b>48:25</b> <a href="https://blog.fabric.microsoft.com/en-GB/blog/announcing-the-launch-of-microsoft-fabric-quotas/" target="_blank" rel="noreferrer noopener"><b>Announcing the launch of Microsoft Fabric Quotas</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft has launched </span><a href="https://learn.microsoft.com/fabric/fundamentals/microsoft-fabric-overview" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Microsoft Fabric</span></a> <a href="https://learn.microsoft.com/azure/quotas/quotas-overview" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Quotas</span></a><span style="font-weight:400;">, a new feature designed to control resource governance for the acquisition of your Microsoft Fabric Capacities. Fabric Quotas aims to help customers ensure that Fabric resources are used efficiently and help manage the overall performance and reliability of the Azure platform while preventing misuse. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft Fabric is a comprehensive service that offers advanced analytics solutions through multiple workloads, all available in a Single SaaS capacity model. Fabric is available via three skus:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Fabric Free trial: a time-bound per-user trial providing a capacity with a given size to every trial user</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Power BI Premium: office-sold offers available as 12 month subscriptions </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Fabric capacities: </span><a href="https://www.bing.com/aclk?ld=e8Qf0dW_WgqY36yBGBjgVzbDVUCUyHca5B_t_mr5OrlSDfDrbyn48gqkErNdndzD3V8r4cUzOpKEwVK5MbpAUVHeXMdUgq_CuSwRIzcQNe_FTetIpEYxiMmplWQkQxUtiythahW6B7AB_zuoOUOYrcWckYIf_hDsgF2DaTCWSspDLHHGzMp92fH_ag5IbxE4bIhwa3Bg&amp;u=aHR0cHMlM2ElMmYlMmZhenVyZS5taWNyb3NvZnQuY29tJTJmZW4tdXMlMmZwcmljaW5nJTJmcHVyY2hhc2Utb3B0aW9ucyUyZnBheS1hcy15b3UtZ28lMmZzZWFyY2glMmYlM2ZlZl9pZCUzZF9rXzg1NzM5YzdjOTFmMDE3MWMxNGJjZmJhZGI0MDk5NmU3X2tfJTI2T0NJRCUzZEFJRGNtbTVlZHN3ZHV1X1NFTV9fa184NTczOWM3YzkxZjAxNzFjMTRiY2ZiYWRiNDA5OTZlN19rXyUyNm1zY2xraWQlM2Q4NTczOWM3YzkxZjAxNzFjMTRiY2ZiYWRiNDA5OTZlNw&amp;rlid=85739c7c91f0171c14bcfbadb40996e7" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Azure PAYG</span></a><span style="font-weight:400;"> offers available in multiple SKUs</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Fabric Quotas limit the number of capacity units a customer can provision across multiple capacities in a subscription. The quota is calculated based on the subscription plan type and Azure region. </span></li>
</ul>
<p><b>53:31</b> <a href="https://techcommunity.microsoft.com/blog/azuresqlblog/availability-metric-for-azure-sql-db-is-now-generally-available/4379174" target="_blank" rel="noreferrer noopener"><b>Availability metric for Azure SQL DB is now generally availabl</b><span style="font-weight:400;">e</span></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Azure SQL Database the modern cloud-based relational databases service is announcing the GA of Availability metrics for Azure SQL DBA enabling you to monitor SLA-compliant availability. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This azure monitor metric is at a 1-minute frequency </span><a href="https://learn.microsoft.com/en-us/azure/azure-monitor/essentials/data-platform-metrics#retention-of-metrics" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">storing up to 93 days</span></a><span style="font-weight:400;">. Typically, the latency to display availability is less than three minutes. You can visualize the metric in Azure monitor and set up </span><a href="https://learn.microsoft.com/en-us/azure/azure-monitor/alerts/alerts-overview" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">alerts</span></a><span style="font-weight:400;"> too.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Availability is determined based on the database being operational for connections. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">A minute is considered downtime or unavailable for a given database if all continuous attempts by the customer to establish a connection to the database within the minute fail.</span></li>
</ul>
<p><i><span style="font-weight:400;">53:59  Justin – “</span></i><i><span style="font-weight:400;">If my database is down because I can’t connect to it for a minute, all of my app has failed. So I don’t, I don’t know that I need you to tell me that your availability was a miss. Cause I think I know from other reasons personally, but, like some customer somewhere must’ve just been like Microsoft, you have to tell us how available your database is. You promised this SLA and you don’t give us a way to measure it. And that’s BS. And that’s why this feature exists. And that’s the only reason why this feature exists because no one needs this unless you are being super pedantic.”</span></i></p>
<p><b>57:18</b> <a href="https://techcommunity.microsoft.com/blog/azuresqlblog/native-windows-principals-for-azure-sql-managed-instance-are-now-generally-avail/4374871" target="_blank" rel="noreferrer noopener"><b>Native Windows principals for Azure SQL Managed Instance are now </b></a><a href="https://techcommunity.microsoft.com/blog/azuresqlblog/native-windows-principals-for-azure-sql-managed-instance-are-now-generally-avail/4374871" target="_blank" rel="noreferrer noopener"><b>generally available</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Azure is announcing the GA of Native </span><a href="https://review.learn.microsoft.com/en-us/azure/azure-sql/managed-instance/winauth-azuread-overview?view=azuresql-mi" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Windows Principals in Azure SQL managed Instances</span></a><span style="font-weight:400;">.  This capability allows the migration of Azure SQL Managed instances and unblocks the migration of legacy applications tied to windows login.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This feature is crucial for the SQL Managed instance link. While the managed instance link facilitates near-real time data replication between SQL Server and Azure SQL Managed instances, the read only replica in the cloud restricts the creation of Microsoft Entra principals. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With this new feature you have 3 authentication modes for SQL managed instances:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft Entra (default) this mode allows authenticating Entra users using Microsoft Entra user metadata. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Paired (SQL server default) the default mode for SQL Server Auth…. SA</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Windows (New Mode): this mode allows authenticating Microsoft Entra users using the windows user metadata within sql managed instance. </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">59:02  Matthew – “</span></i><i><span style="font-weight:400;">I have feelings about this that I will not share because this podcast would never end.”</span></i></p>
<p><b>1:01:53 </b><a href="https://devblogs.microsoft.com/visualstudio/claude-3-7-now-available-in-github-copilot-for-visual-studio/" target="_blank" rel="noreferrer noopener"><b>February 24th, 2025 Claude 3.7 Now Available in GitHub Copilot for Visual </b></a><a href="https://devblogs.microsoft.com/visualstudio/claude-3-7-now-available-in-github-copilot-for-visual-studio/" target="_blank" rel="noreferrer noopener"><b>Studio</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Last week we talked about </span><a href="https://www.anthropic.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Claude 3.7</span></a><span style="font-weight:400;"> shipping. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Well, it’s **good news**! </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It’s available in Github Copilot now. </span></li>
</ul>
<h2><span style="font-weight:400;">Closing</span></h2>
<p><span style="font-weight:400;">And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</span></p>
]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/1992506/c1e-33g3ckr05xcm1k3z-qdw9v248fngw-lgomnh.mp3" length="76782675"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 295 of The Cloud Pod – where the forecast is always cloudy! 
Welp, it’s sayonara to Skype – and time to finally make the move to Teams. Hashi has officially moved to IBM, GPT 4.5 is out and people have…thoughts. Plus, Google has the career coach you need to make all your dreams come true.*
*Assuming those dreams are reasonable in a volatile economy. 
Titles we almost went with this week:

Someday we’ll find it, the rainbow connection, the lovers, the cloud dreamers, and Me 
Dreamer, you know you are a dreamer
☁️You may say I’m a cloud dreamer, but I’m not the only one
May the skype shut down
Q can tell me that my python skills are bad
How many free code assistance does Ryan need to be a good developer: ALL OF THEM
Oops honey I spent 1M dollars on oracle
Latest Cloud Pod Reviews: “It’s a Lemon”

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
General News 
01:04 On May 5, Microsoft’s Skype will shut down for good 

In what we swear is the 9th death for Skype, Microsoft has announced that after 21 years (with 13 of those years under MS Control,) Skype will be no more. 
For real this time. Really. 
May 5th is the official last day of Skype, and they’ve indicated you can continue your calls and chats in Teams. 
Starting now, you should be able to use your Skype login to get into Teams. 
For those of you who do this, you’ll see all your existing contacts and chats in Teams. 
Alternatively, you can export your Skype data, specifically contacts, call history and chats. 
Current subscribers to Skype Premium services will remain active until the end, but you will not be able to sign up for Skype at this time. 
Skype dial pad credits will remain active in the web interface and inside Teams after May 5th so you can finish using those credits. 

03:37  Matthew  – “I think there’s a lot of people and, you know, at least people I know in other countries to still use Skype, like pretty heavily for like cross country communications, things along those lines. So I think a lot of that is that there probably is still a good amount of people using it. And this is just, Hey, they’re trying to make it nicely...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/1992506/c1a-k5d5-xxw2g50xhqw1-6ruwqj.jpg"></itunes:image>
                                                                            <itunes:duration>01:03:59</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                            </item>
                    <item>
                <title>
                    <![CDATA[294: Ding: Chime is Dead]]>
                </title>
                <pubDate>Fri, 07 Mar 2025 14:34:58 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/1988555</guid>
                                    <link>https://tcpfm.castos.com/episodes/294-ding-chime-is-dead</link>
                                <description>
                                            <![CDATA[<p><span style="font-weight:400;">Welcome to episode 294 of The Cloud Pod – where the forecast is always cloudy!Ilya Boy, do we have a news packed week for you! Sutskever raised $30B without a product, Mira Murati launched her own AI lab, and Claude 3.7 now thinks before it speaks. Meanwhile, Microsoft casually invented new matter for quantum computing, Google built an AI scientist, and AWS killed Chime (RIP). At this rate, AI is either going to save the world or speedrun becoming Ultron. Let’s all find out together – today on The Cloud Pod! </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">☠️Ding – Chime is Dead</span></li>
<li><span style="font-weight:400;">Does your container really need 192 cores</span></li>
<li><span style="font-weight:400;">Quantum is the new AI</span></li>
<li><span style="font-weight:400;">AI is now IN the robots</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>AI Is Going Great – Or How ML Makes All It’s Money </b></h2>
<p><b>02:41</b> <a href="https://www.theinformation.com/briefings/ilya-sutskevers-startup-in-talks-to-raise-financing-at-30-billion-valuation?rc=3t8xtd" target="_blank" rel="noreferrer noopener"><b>Ilya Sutskever’s Startup in Talks to Raise Financing at $30 Billion Valuation</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">It’s been a minute since we talked about former OpenAI executives and what they’re up to. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Let’s start with Ilya Sutskever and Mira Murati, post Open AI career</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The Information reports that Ilya Suskevers’ startup “</span><a href="https://ssi.inc/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Safe Superintelligence</span></a><span style="font-weight:400;">” is in talks to </span><a href="https://www.reuters.com/technology/artificial-intelligence/openai-co-founder-sutskevers-new-safety-focused-ai-startup-ssi-raises-1-billion-2024-09-04/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">raise $1Billion</span></a><span style="font-weight:400;"> in a round that would value the startup at $30 Billion.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The company has yet to release a product, but based on the name we can guess what they’re working on…</span></li>
</ul>
<p><i><span style="font-weight:400;">03:22  Ryan – “It’s so nuts to me that they can raise that much without – really just an idea. Doesn’t have to have any proof or POC…”</span></i></p>
<p><b>07:07</b> <a href="https://www.theinformation.com/articles/murati-joins-crowded-ai-startup-sector?rc=3t8xtd" target="_blank" rel="noreferrer noopener"><b>Murati Joins Crowded AI Startup Sector</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Mira Murati confirmed one of the worst kept secrets in AI, by revealing her lab </span><a href="https://thinkingmachines.ai/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Thinking Machine Labs</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Murati has lured away two thirds of her team from OpenAI. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">We’ll be waiting to see how the funding goes for this one. </span></li>
</ul>
<p><b>08:02</b> <a href="https://www.anthropic.com/news/claude-3-7-sonnet" target="_blank" rel="noreferrer noopener"><b>Claude 3.7 Sonnet and Claude Code</b></a></p>
<ul>
<li style="font-weight:400;"><a></a></li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 294 of The Cloud Pod – where the forecast is always cloudy!Ilya Boy, do we have a news packed week for you! Sutskever raised $30B without a product, Mira Murati launched her own AI lab, and Claude 3.7 now thinks before it speaks. Meanwhile, Microsoft casually invented new matter for quantum computing, Google built an AI scientist, and AWS killed Chime (RIP). At this rate, AI is either going to save the world or speedrun becoming Ultron. Let’s all find out together – today on The Cloud Pod! 
Titles we almost went with this week:

☠️Ding – Chime is Dead
Does your container really need 192 cores
Quantum is the new AI
AI is now IN the robots

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
AI Is Going Great – Or How ML Makes All It’s Money 
02:41 Ilya Sutskever’s Startup in Talks to Raise Financing at $30 Billion Valuation

It’s been a minute since we talked about former OpenAI executives and what they’re up to. 
Let’s start with Ilya Sutskever and Mira Murati, post Open AI career
The Information reports that Ilya Suskevers’ startup “Safe Superintelligence” is in talks to raise $1Billion in a round that would value the startup at $30 Billion.  
The company has yet to release a product, but based on the name we can guess what they’re working on…

03:22  Ryan – “It’s so nuts to me that they can raise that much without – really just an idea. Doesn’t have to have any proof or POC…”
07:07 Murati Joins Crowded AI Startup Sector

Mira Murati confirmed one of the worst kept secrets in AI, by revealing her lab Thinking Machine Labs. 
Murati has lured away two thirds of her team from OpenAI. 
We’ll be waiting to see how the funding goes for this one. 

08:02 Claude 3.7 Sonnet and Claude Code

]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[294: Ding: Chime is Dead]]>
                </itunes:title>
                                    <itunes:episode>294</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p><span style="font-weight:400;">Welcome to episode 294 of The Cloud Pod – where the forecast is always cloudy!Ilya Boy, do we have a news packed week for you! Sutskever raised $30B without a product, Mira Murati launched her own AI lab, and Claude 3.7 now thinks before it speaks. Meanwhile, Microsoft casually invented new matter for quantum computing, Google built an AI scientist, and AWS killed Chime (RIP). At this rate, AI is either going to save the world or speedrun becoming Ultron. Let’s all find out together – today on The Cloud Pod! </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">☠️Ding – Chime is Dead</span></li>
<li><span style="font-weight:400;">Does your container really need 192 cores</span></li>
<li><span style="font-weight:400;">Quantum is the new AI</span></li>
<li><span style="font-weight:400;">AI is now IN the robots</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>AI Is Going Great – Or How ML Makes All It’s Money </b></h2>
<p><b>02:41</b> <a href="https://www.theinformation.com/briefings/ilya-sutskevers-startup-in-talks-to-raise-financing-at-30-billion-valuation?rc=3t8xtd" target="_blank" rel="noreferrer noopener"><b>Ilya Sutskever’s Startup in Talks to Raise Financing at $30 Billion Valuation</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">It’s been a minute since we talked about former OpenAI executives and what they’re up to. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Let’s start with Ilya Sutskever and Mira Murati, post Open AI career</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The Information reports that Ilya Suskevers’ startup “</span><a href="https://ssi.inc/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Safe Superintelligence</span></a><span style="font-weight:400;">” is in talks to </span><a href="https://www.reuters.com/technology/artificial-intelligence/openai-co-founder-sutskevers-new-safety-focused-ai-startup-ssi-raises-1-billion-2024-09-04/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">raise $1Billion</span></a><span style="font-weight:400;"> in a round that would value the startup at $30 Billion.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The company has yet to release a product, but based on the name we can guess what they’re working on…</span></li>
</ul>
<p><i><span style="font-weight:400;">03:22  Ryan – “It’s so nuts to me that they can raise that much without – really just an idea. Doesn’t have to have any proof or POC…”</span></i></p>
<p><b>07:07</b> <a href="https://www.theinformation.com/articles/murati-joins-crowded-ai-startup-sector?rc=3t8xtd" target="_blank" rel="noreferrer noopener"><b>Murati Joins Crowded AI Startup Sector</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Mira Murati confirmed one of the worst kept secrets in AI, by revealing her lab </span><a href="https://thinkingmachines.ai/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Thinking Machine Labs</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Murati has lured away two thirds of her team from OpenAI. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">We’ll be waiting to see how the funding goes for this one. </span></li>
</ul>
<p><b>08:02</b> <a href="https://www.anthropic.com/news/claude-3-7-sonnet" target="_blank" rel="noreferrer noopener"><b>Claude 3.7 Sonnet and Claude Code</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.anthropic.com/en/docs/about-claude/models" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Anthropic</span></a><span style="font-weight:400;"> is releasing their latest model </span><a href="https://claude.ai/new" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Claude</span></a><span style="font-weight:400;"> 3.7 Sonnet, their most intelligent model to date and the first hybrid reasoning model on the market.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Claude 3.7 sonnet can produce near instant responses or extended, step by step thinning that is made </span><a href="https://youtu.be/t3nnDXa81Hs" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">visible to the user</span></a><span style="font-weight:400;">.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">API users also have fine grained control over how long the model can think for. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Claude 3.7 shows particularly strong improvements in coding and front-end web development.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In addition to the new model they have introduced a command line tool for Agentic Coding, </span><a href="https://docs.anthropic.com/en/docs/agents-and-tools/claude-code" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Claude Code</span></a><span style="font-weight:400;">.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Claude code is available as a limited research preview and enables developers to delegate substantial engineering tasks directly from the terminal (Justin really wants a native VS code integration… come on Anthropic!)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The extended thinking model is not available in the free tier, but all other paid plans are covered as well as through our various cloud providers. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Want to join the preview? You can do that </span><a href="https://docs.anthropic.com/en/docs/agents-and-tools/claude-code/overview#install-and-authenticate" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">here</span></a><span style="font-weight:400;">. </span></li>
</ul>
<p><i><span style="font-weight:400;">12:44  Justin – “</span></i><i><span style="font-weight:400;">AI is great. I can see how it makes good coders even better, and bad coders worse, and your ability to be a debugger is gonna be the make or break for you in the AI coding world.”</span></i></p>
<h2><b>AWS</b></h2>
<p><b>14:58</b> <a href="https://aws.amazon.com/blogs/messaging-and-targeting/update-on-support-for-amazon-chime/" target="_blank" rel="noreferrer noopener"><b>Update on Support for Amazon Chime</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon has decided to end support for their </span><a href="https://aws.amazon.com/chime" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Chime</span></a><span style="font-weight:400;"> service, including business calling features, effective February 20th, 2026. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon Chime will no longer accept new customers starting February 19th, 2025. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can continue to use it as an existing customer for meetings through February 20, 2026, and you can delete your data prior to that day. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">For those of you using the </span><a href="https://aws.amazon.com/chime/chime-sdk/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Chime SDK</span></a><span style="font-weight:400;">, this service will not change (it powers Slack Huddles.)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon provides you with a few options to replace Chime including their own AWS Wikr service, or from AWS partners such as Zoom, Webex and Slack. </span></li>
</ul>
<p><i><span style="font-weight:400;">16:38  Matthew – “</span></i><i><span style="font-weight:400;">I was surprised at how short of a timeline this was, because I feel like code command and some of the other ones are multi-year, and maybe that’s just memory. But one year, if you’re fully integrated into the solution, doesn’t feel like a long time to migrate as a business. Or no one actually uses it, so who cares? One of the two.”</span></i></p>
<p><b>19:07</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/02/amazon-ecs-increases-cpu-ecs-tasks/" target="_blank" rel="noreferrer noopener"><b>Amazon ECS increases the CPU limit for ECS tasks to 192 vCPUs</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">In today’s “Are you sure containers are the right solution,” ECS now supports CPU limits of 192 vCPU’s for ECS tasks deployed on EC2 instances, an increase from the previous 10 vCPU limit.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This enhancement allows customers to more effectively manage resource allocation on larger Amazon EC2 instances. </span></li>
</ul>
<p><b>21:09</b> <a href="https://aws.amazon.com/blogs/aws/anthropics-claude-3-7-sonnet-the-first-hybrid-reasoning-model-is-now-available-in-amazon-bedrock/" target="_blank" rel="noreferrer noopener"><b>Anthropic’s Claude 3.7 Sonnet hybrid reasoning model is now available in Amazon Bedrock</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Sonnet available in Bedrock and Q Developer.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Finally. </span></li>
</ul>
<p><b>22:50</b> <a href="https://aws.amazon.com/es/about-aws/whats-new/2025/02/aws-network-firewall-automated-domain-lists/" target="_blank" rel="noreferrer noopener"><b>AWS Network Firewall introduces automated domain lists and insights</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">One less thing we have to use Athena for… is a victory in our book! </span></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/network-firewall/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Network Firewall</span></a><span style="font-weight:400;"> now offers automated domain lists and insights, a feature that enhances visibility into network traffic and simplifies rule configurations.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The capability analyzes HTTP and HTTPS traffic from the last 30 days, and provides insights into the frequency of access to domains, allowing a quick rule creation based on observed network traffic patterns. </span></li>
</ul>
<p><i><span style="font-weight:400;">23:10  Ryan – “It’s </span></i><i><span style="font-weight:400;">funny because I, when they rolled out this, this feature or the network firewall together, I’ve become real spoiled. And so like, when it didn’t have this, was like, how am I supposed to use this? I gotta compile all my traffic to figure out what’s going on. Like, boo. And so, yeah, this is, this is great because compiling these data sets and running your queries is a chore. And typically that’s all you want, right? You just want to be able to very quickly sort of say this is what’s coming in and answer a question and move on.”</span></i></p>
<h2><b>GCP</b></h2>
<p><b>26:38</b> <a href="https://cloud.google.com/blog/products/networking/public-ip-health-checks-in-cloud-dns-now-ga/" target="_blank" rel="noreferrer noopener"><b>Introducing Cloud DNS public IP health checks, for more resilient </b></a><a href="https://cloud.google.com/blog/products/networking/public-ip-health-checks-in-cloud-dns-now-ga/" target="_blank" rel="noreferrer noopener"><b>multicloud deployments</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is announcing the General Availability of </span><a href="https://cloud.google.com/blog/products/networking/dns-routing-policies-for-geo-location--weighted-round-robin" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Cloud DNS routing policies</span></a><span style="font-weight:400;"> with </span><a href="https://cloud.google.com/dns/docs/routing-policies-overview" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">public health IP checking</span></a><span style="font-weight:400;">, which provides the automated, health-aware traffic management that you need to build resilient applications, no matter where your workloads reside. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Running on multiple cloud providers can often lead to fragmented traffic management strategies. </span><a href="https://cloud.google.com/dns" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Cloud DNS</span></a><span style="font-weight:400;"> now lets you intelligently route traffic across multiple cloud providers based on application health from a single Interface.  Cloud DNS supports a variety of routing policies, including weighted round robin, geolocation, and failover, giving you the flexibility to tailor your traffic management strategy to your specific needs. </span></li>
</ul>
<p><i><span style="font-weight:400;">27:03  Ryan – “</span></i><i><span style="font-weight:400;">I mean, so maybe you can take your Kubernetes workload and actually spread it across multiple clouds and serve from all clouds with a solution like this. it’s always that sort of edge case where it’s sort of the rubber meets the road and you run into these weird things trying to serve from multi-cloud. But this is a big step towards that. I’m sure there’s other edge cases that I’m not thinking about. I know there’s a ton of operability concerns, but this is kind of neat.”</span></i></p>
<p><b>30:13</b> <a href="https://cloud.google.com/blog/products/identity-security/announcing-quantum-safe-digital-signatures-in-cloud-kms/" target="_blank" rel="noreferrer noopener"><b>Announcing quantum-safe digital signatures in Cloud KMS</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is releasing quantum-safe digital signatures (</span><a href="https://csrc.nist.gov/pubs/fips/204/final" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">FIPS 204</span></a><span style="font-weight:400;">/</span><a href="https://csrc.nist.gov/pubs/fips/205/final" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">FIPS 205</span></a><span style="font-weight:400;"> compliant) in KMS for software-based keys, available in preview. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They are also sharing their high-level view in their post quantum strategy for google cloud encryption products, including for </span><a href="https://cloud.google.com/security/products/security-key-management" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Cloud KMS</span></a><span style="font-weight:400;"> and </span><a href="https://cloud.google.com/kms/docs/hsm" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Cloud HSM</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Their goal is to ensure that Google Cloud KMS is quantum safe</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Offering software and hardware support for standardized quantum-safe algorithms</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Supporting migration paths for existing keys, protocols and customer workloads to adopt </span><a href="https://csrc.nist.gov/pqc-standardization" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">PQC</span></a><span style="font-weight:400;">.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Quantum-proofing Google’s underlying core infrastructure</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Analyzing the security and performance of PQC algorithms and implementations. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">And contributing technical comments to PQC advocacy efforts in standard bodies and government organizations. </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">33:04  Justin – “</span></i><i><span style="font-weight:400;">They implemented hybrid plus quantum key exchange that provides traditional and quantum resistant algorithms, and they provided protection against both current threats and potential future quantum computer-based attacks. And then it goes on to talk about the implementation use of Kyber. And I do remember us talking about Kyber because I think we talked, we talked about Kyber crystals at one point. Yeah. So we did talk about this at one point. So yes, you’re good already. So yes, Google is coming in either, maybe behind Azure. I don’t know. I don’t know if they have anything.”</span></i></p>
<p><b>34:47</b> <a href="https://cloud.google.com/blog/products/compute/new-a4x-vms-powered-by-nvidia-gb200-gpus/" target="_blank" rel="noreferrer noopener"><b>Introducing A4X VMs powered by NVIDIA GB200 — now in preview</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is announcing the preview of A4X VMs, powered by </span><a href="https://cloud.google.com/blog/products/compute/introducing-a4-vms-powered-by-nvidia-b200-gpu-aka-blackwell" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">NVIDIA GB200 NVL72</span></a><span style="font-weight:400;">, a system consisting of 72 NVIDIA Blackwell GPUs and 36 arm-based NVIDIA Grace CPus connected via fifth generation NVIDIA NVLink.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With this integrated system, A4X VMs directly address the significant compute and memory demands of reasoning models that use chain-of-thought, unlocking new levels of AI performance and accuracy.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google Cloud is the first and only provider today to offer both the A4 Vm powered by NVIDIA B200 and A4x VMs powered by NVIDIA GB200 NVL72.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google says to help you choose the best one for your workload:</span>
<ul>
<li style="font-weight:400;"><b>A4X VMs (powered by NVIDIA GB200 NVL72 GPUs): </b><span style="font-weight:400;">Purpose-built for training and serving the most demanding, extra large-scale AI workloads, particularly those involving reasoning models, large language models with long context windows, and scenarios that require massive concurrency. This is enabled by the unified memory across a large GPU domain.</span></li>
<li style="font-weight:400;"><b>A4 VMs (powered by NVIDIA B200 GPUs):</b><span style="font-weight:400;"> A4 provides excellent performance and versatility for diverse AI model architectures and workloads, including training, fine-tuning, and serving. A4 offers easy portability from prior generations of Cloud GPUs and optimized performance benefits for varying scaled training jobs.</span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">36:07  Justin – “</span></i><i><span style="font-weight:400;">So basically, if you need big, expensive hardware, use the A4X. And if you want to do some inference and basic things, you have a model you’re already happy with, the A4 VM is probably the right choice for you.”</span></i></p>
<p><b>37:40</b> <a href="https://blog.google/feed/google-research-ai-co-scientist/" target="_blank" rel="noreferrer noopener"><b>We’re launching a new AI system for scientists</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google has launched an AI co-scientist, a new AI system built on Gemini 2.0 designed to aid scientists in creating novel hypotheses and research plans.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Researchers can specify a research goal for example, to better understand the spread of disease-causing microbes using natural language and the AI co-scientist will propose testable hypotheses, along with a summary of relevant published literature and a possible experimental approach.  </span></li>
</ul>
<p><i><span style="font-weight:400;">38:02  Ryan – “That’s wild – it’s such a specific use case!” </span></i></p>
<p><b>39:59</b> <a href="https://cloud.google.com/blog/products/ai-machine-learning/anthropics-claude-3-7-sonnet-is-available-on-vertex-ai/" target="_blank" rel="noreferrer noopener"><b>Announcing Claude 3.7 Sonnet, Anthropic’s first hybrid reasoning model, </b></a><a href="https://cloud.google.com/blog/products/ai-machine-learning/anthropics-claude-3-7-sonnet-is-available-on-vertex-ai/" target="_blank" rel="noreferrer noopener"><b>is available on Vertex AI</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Claude 3.7 is available on Vertex AI</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Vertex apparently also supports Claude Code.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Shocking. </span></li>
</ul>
<h2><b>Azure</b></h2>
<p><b>41:33</b> <a href="https://www.geekwire.com/2025/microsoft-quantum-breakthrough-promises-to-usher-in-the-next-era-of-computing-in-years-not-decades/?utm_source=GeekWire+Newsletters&amp;utm_campaign=ab4daffe34-daily-digest-email&amp;utm_medium=email&amp;utm_term=0_4e93fc7dfd-ab4daffe34-233353605&amp;mc_cid=ab4daffe34&amp;mc_eid=04fad859c0" target="_blank" rel="noreferrer noopener"><b>Microsoft quantum breakthrough promises to usher in the next era of </b></a><a href="https://www.geekwire.com/2025/microsoft-quantum-breakthrough-promises-to-usher-in-the-next-era-of-computing-in-years-not-decades/?utm_source=GeekWire+Newsletters&amp;utm_campaign=ab4daffe34-daily-digest-email&amp;utm_medium=email&amp;utm_term=0_4e93fc7dfd-ab4daffe34-233353605&amp;mc_cid=ab4daffe34&amp;mc_eid=04fad859c0" target="_blank" rel="noreferrer noopener"><b>computing in ‘years, not decades’</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft has announced they have created a new type of matter, growing up in sciences, you would learn that there are three main types of Matter including Solids, Liquids and Gas. But now Microsoft has turned this on its head. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They have created an entirely new state of matter, unlocked by a new class of materials, topoconductors that enable the fundamental leap in computing.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">All of this powers Majorana 1, the first quantum processing until built on topological core. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Satya believes this breakthrough will allow them to create a truly meaningful quantum computer not in decades but in years. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The qubits created with topoconductors are faster, more reliable and smaller. They are 1/100th of a millimeter, meaning we now have a clear path to a million-qubit processor. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">A chip the size of the palm of your hand yet is capable of solving problems that even all the computers on earth today could not. </span></li>
</ul>
<p><i><span style="font-weight:400;">42:37  Ryan – “W</span></i><i><span style="font-weight:400;">ow. I mean, that last bullet point is what my head explodes. Like I know I don’t understand quantum computers and I, you know, like from any kind of way. Now, you know, they’re introducing new states of matter, like in order to power some of those things, it’s gonna, it just feels like tomorrow world is gonna be completely unrecognizable.”</span></i></p>
<p><b>45:28</b> <a href="https://news.microsoft.com/source/features/ai/microsofts-majorana-1-chip-carves-new-path-for-quantum-computing/" target="_blank" rel="noreferrer noopener"><b>Microsoft’s Majorana 1 chip carves new path for quantum computing</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft says they took a step back, and said OK if we invent the transistor for the quantum age. What properties does it need to have? And that’s apparently how they got there. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Being able to fit a million qubits in the palm of the hand, unlocks the path to meet the threshold for quantum computers to deliver transformative, real-world solutions such as breaking down microplastics into harmless byproducts or inventing self-healing material for construction, manufacturing and healthcare. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">All of the current computer’s operating together can’t do what a one-million-qubit quantum computer will be able to do. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The first topological core powering the </span><a href="https://news.microsoft.com/azure-quantum/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Morjana 1</span></a><span style="font-weight:400;"> is reliable by design, incorporating error resistance at the hardware level making it more stable. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Commercially important applications will require trillions of operations on a million qubits, which would be prohibitive with current approaches that rely on fine-tuned analog control of each qubit.  </span></li>
<li style="font-weight:400;"><a href="https://aka.ms/MSQuantumAQblog" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">The new chip</span></a><span style="font-weight:400;"> allows them to be controlled digitally, redefining and vastly simplifying how quantum computing works.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft is now one of two companies to be invited to the final phase of DARPA’s underexplored systems for utility quantum computing program.  </span></li>
</ul>
<p><b>47:52</b> <a href="https://arstechnica.com/ai/2025/02/microsofts-new-ai-agent-can-control-software-and-robots/" target="_blank" rel="noreferrer noopener"><b>Microsoft’s new AI agent can control software and robots</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft research unveiled </span><a href="https://microsoft.github.io/Magma/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Magma</span></a><span style="font-weight:400;">, an integrated AI foundational model that combines visual and language processing to control software interfaces and robotic systems. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">If the results hold up outside of MS, it could be a meaningful step forward for an all purpose multi-model AI that can operate interactively in both real and digital spaces. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">MS claims its the first multi-modal AI model that can not only process but also actively act on the data from navigating user interfaces to manipulating physical objects.  </span></li>
</ul>
<p><i><span style="font-weight:400;">49:30  Justin – “</span></i><i><span style="font-weight:400;">Well, good, I look forward to our future robot overlords.”</span></i></p>
<p><b>49:36</b> <a href="https://azure.microsoft.com/en-us/blog/introducing-azure-ai-foundry-labs-a-hub-for-the-latest-ai-research-and-experiments-at-microsoft/" target="_blank" rel="noreferrer noopener"><b>Introducing Azure AI Foundry Labs: A hub for the latest AI research and </b></a><a href="https://azure.microsoft.com/en-us/blog/introducing-azure-ai-foundry-labs-a-hub-for-the-latest-ai-research-and-experiments-at-microsoft/" target="_blank" rel="noreferrer noopener"><b>experiments at Microsoft</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Azure is launching Azure AI Foundry Labs, a hub for developers, startups and enterprises to explore groundbreaking innovations from research to Microsoft. (until Visual Studio kills it…. I’m watching you VS studio)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft’s newest AI breakthrough Muse, is a first of its kind world and human action model (WHAM), available today in Azure AI Foundry, is the latest example of bringing cutting-edge research innovation to their AI platform for customers to use. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With Azure AI foundry labs they are excited to unveil new assets for their latest research driven projects that empower developers to explore, engage and experiment.  Projects across models and agentic frameworks include:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Aurora – A large-scale atmosphere model providing high-resolution weather forecasts and air pollution predictions, outperforming traditional tools. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">ExACT: an open source project enabling agents to learn from past interactions and improve search efficiency dynamically</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Magnetic-One: a multi-agent system involving complex problems by orchestrating multiple agents, built on the autogen framework.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Mattersim: a deep learning model for atomistic simulations, predicting material properties with high precision</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">OmniParserver v2: a vision-based module converting UI screenshots into structured elements, enhancing agents’ action generation.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">TamGen: a generative AI model for drug decision, suing a GPT-like chemical language model for target-aware molecule generations and refinement</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft points out that the speed of innovation is crucial, and points to the slow adoption of GPS and the decade it took from military applications to consumer use.  But AI innovations are moving much faster than that.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The pace of AI advancement has accelerated dramatically. </span></li>
</ul>
<p><i><span style="font-weight:400;">50:55  Ryan – “</span></i><i><span style="font-weight:400;">I mean, I can’t agree more with the speed of innovations blinding. You know, we started this podcast to keep up with cloud news as the hyperscalers got to a certain scale where they were announcing enough stuff that we couldn’t keep up to date. Now I feel even with this, it’s, I really struggle to, you know, understand half of these use cases and how it’s applied and the whole thing. Like it’s crazy to me how fast things are moving.”</span></i></p>
<p><b>51:46</b> <a href="https://blogs.microsoft.com/blog/2025/02/19/a-new-level-unlocked/" target="_blank" rel="noreferrer noopener"><b>A new level unlocked</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft is also releasing Muse, a first of its kind generative AI model that they are applying to gaming. It’s a huge step forward in gameplay ideation. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Muse, just from observing human gameplay, has developed a deep understanding of the environment, including its dynamics and how it evolves over time in response to actions.  This unlocks the ability to rapidly iterate, remix and create in video games so developers can eventually create immersive environments and unleash their full creativity. </span></li>
</ul>
<h2><b>Aftershow</b></h2>
<p><b>1:03:02 </b><a href="https://www.theinformation.com/briefings/amazon-paid-1-billion-for-control-of-bond-franchise?rc=3t8xtd" target="_blank" rel="noreferrer noopener"><b>Amazon Paid $1 Billion for Control of Bond Franchise</b></a><i><span style="font-weight:400;"> </span></i></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon paid around $1billion to secure creative control of the James Bond franchise, according to a person familiar with the matter. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This deal is a joint venture with Barbara Broccoli and Michael G Wilson, both of whom have been long stewards of the bond franchise.  Amazon bought MGM studios in 2022, it gave Amazon the right to distribute bond films, while Broccoli and Wilson retained creative control. But they have been at odds with Amazon since the tech giant bought MGM studios in 2022, delaying the production of a new bond film. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This new deal allows them to create television and movies based on the bond character. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Rumors are they would like to create an expanded Bond Universe, similar to the MCU.</span></li>
</ul>
<h2><span style="font-weight:400;">Closing</span></h2>
<p><span style="font-weight:400;">And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</span></p>
]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/1988555/c1e-5rkrbmw4ovfr80gd-dm400m48avmm-m42ox5.mp3" length="73690299"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 294 of The Cloud Pod – where the forecast is always cloudy!Ilya Boy, do we have a news packed week for you! Sutskever raised $30B without a product, Mira Murati launched her own AI lab, and Claude 3.7 now thinks before it speaks. Meanwhile, Microsoft casually invented new matter for quantum computing, Google built an AI scientist, and AWS killed Chime (RIP). At this rate, AI is either going to save the world or speedrun becoming Ultron. Let’s all find out together – today on The Cloud Pod! 
Titles we almost went with this week:

☠️Ding – Chime is Dead
Does your container really need 192 cores
Quantum is the new AI
AI is now IN the robots

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
AI Is Going Great – Or How ML Makes All It’s Money 
02:41 Ilya Sutskever’s Startup in Talks to Raise Financing at $30 Billion Valuation

It’s been a minute since we talked about former OpenAI executives and what they’re up to. 
Let’s start with Ilya Sutskever and Mira Murati, post Open AI career
The Information reports that Ilya Suskevers’ startup “Safe Superintelligence” is in talks to raise $1Billion in a round that would value the startup at $30 Billion.  
The company has yet to release a product, but based on the name we can guess what they’re working on…

03:22  Ryan – “It’s so nuts to me that they can raise that much without – really just an idea. Doesn’t have to have any proof or POC…”
07:07 Murati Joins Crowded AI Startup Sector

Mira Murati confirmed one of the worst kept secrets in AI, by revealing her lab Thinking Machine Labs. 
Murati has lured away two thirds of her team from OpenAI. 
We’ll be waiting to see how the funding goes for this one. 

08:02 Claude 3.7 Sonnet and Claude Code

]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/1988555/c1a-k5d5-47d007k3tg68-vrdvhi.jpg"></itunes:image>
                                                                            <itunes:duration>01:01:25</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                            </item>
                    <item>
                <title>
                    <![CDATA[293: Terraform Apply – Output Pizza]]>
                </title>
                <pubDate>Wed, 26 Feb 2025 03:47:03 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/1982248</guid>
                                    <link>https://tcpfm.castos.com/episodes/293-terraform-apply-output-pizza</link>
                                <description>
                                            <![CDATA[<p><span style="font-weight:400;">Welcome to episode 293 of The Cloud Pod – where the forecast is always cloudy! This week we’ve got a lot of new and, surprise, a new installment of Cloud Journey AND and aftershow – so make sure to stay tuned for that! We’ve got undersea cables, Go 1.24, Wasm, Anthropic and more. </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">️Lets Go!</span></li>
<li><span style="font-weight:400;">Under Sea cables make AI go BRRRRRR</span></li>
<li><span style="font-weight:400;">The CloudPod says it will grow the listeners by 10x by 2027</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<p><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></p>
<h2><b>General News</b></h2>
<p><b>01:30</b> <a href="https://go.dev/blog/go1.24" target="_blank" rel="noreferrer noopener"><b>Go 1.24 is released!</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Go 1.24 has been released with a bunch of improvements! </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Go now fully supports </span><a href="https://go.dev/issue/46477" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">generic type aliases</span></a><span style="font-weight:400;">.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It also includes several performance improvements to the runtime that have reduced CPU overhead by 2-3% on average across a suite of representative benchmarks. (Say that 5 times fast.)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Tool improvements around tool dependencies for a module. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The standard library now includes </span><a href="https://go.dev/doc/security/fips140" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">new mechanisms to facilitate FIPS-140-3 compliance</span></a><span style="font-weight:400;">. And you know we love some good FIPS-140-3 compliance. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Lastly, it includes some improved </span><a href="https://go.dev/doc/go1.24#wasm" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">WebAssembly</span></a><span style="font-weight:400;"> support – which we’ll talk about later. </span></li>
</ul>
<p><b>04:46</b> <a href="https://engineering.fb.com/2025/02/14/connectivity/project-waterworth-ai-subsea-infrastructure/" target="_blank" rel="noreferrer noopener"><b>Unlocking global AI potential with next-generation subsea infrastructure</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Meta announced their most ambitious subsea cable endeavor: Project Waterworth. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Once the cable is completed, the project will reach five major continents and span over 50,000 KM (longer than the earth’s circumference) making it the world’s longest </span><a href="https://engineering.fb.com/2021/09/28/connectivity/2africa-pearls/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">subsea cable</span></a><span style="font-weight:400;"> project using the highest-capacity technology available. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It will bring connectivity to the US, India, Brazil, South Africa, as well as other key regions. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Waterworth will be a multi-billion dollar, multi-year investment to strengthen the scale and reliability of the world’s digital highways by opening three new oceanic corridors with the abundant, high-speed connectivit...</span></li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 293 of The Cloud Pod – where the forecast is always cloudy! This week we’ve got a lot of new and, surprise, a new installment of Cloud Journey AND and aftershow – so make sure to stay tuned for that! We’ve got undersea cables, Go 1.24, Wasm, Anthropic and more. 
Titles we almost went with this week:

️Lets Go!
Under Sea cables make AI go BRRRRRR
The CloudPod says it will grow the listeners by 10x by 2027

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
General News
01:30 Go 1.24 is released! 

Go 1.24 has been released with a bunch of improvements! 
Go now fully supports generic type aliases.
It also includes several performance improvements to the runtime that have reduced CPU overhead by 2-3% on average across a suite of representative benchmarks. (Say that 5 times fast.)
Tool improvements around tool dependencies for a module. 
The standard library now includes new mechanisms to facilitate FIPS-140-3 compliance. And you know we love some good FIPS-140-3 compliance. 
Lastly, it includes some improved WebAssembly support – which we’ll talk about later. 

04:46 Unlocking global AI potential with next-generation subsea infrastructure

Meta announced their most ambitious subsea cable endeavor: Project Waterworth. 
Once the cable is completed, the project will reach five major continents and span over 50,000 KM (longer than the earth’s circumference) making it the world’s longest subsea cable project using the highest-capacity technology available. 
It will bring connectivity to the US, India, Brazil, South Africa, as well as other key regions. 
Waterworth will be a multi-billion dollar, multi-year investment to strengthen the scale and reliability of the world’s digital highways by opening three new oceanic corridors with the abundant, high-speed connectivit...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[293: Terraform Apply – Output Pizza]]>
                </itunes:title>
                                    <itunes:episode>293</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p><span style="font-weight:400;">Welcome to episode 293 of The Cloud Pod – where the forecast is always cloudy! This week we’ve got a lot of new and, surprise, a new installment of Cloud Journey AND and aftershow – so make sure to stay tuned for that! We’ve got undersea cables, Go 1.24, Wasm, Anthropic and more. </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">️Lets Go!</span></li>
<li><span style="font-weight:400;">Under Sea cables make AI go BRRRRRR</span></li>
<li><span style="font-weight:400;">The CloudPod says it will grow the listeners by 10x by 2027</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<p><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></p>
<h2><b>General News</b></h2>
<p><b>01:30</b> <a href="https://go.dev/blog/go1.24" target="_blank" rel="noreferrer noopener"><b>Go 1.24 is released!</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Go 1.24 has been released with a bunch of improvements! </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Go now fully supports </span><a href="https://go.dev/issue/46477" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">generic type aliases</span></a><span style="font-weight:400;">.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It also includes several performance improvements to the runtime that have reduced CPU overhead by 2-3% on average across a suite of representative benchmarks. (Say that 5 times fast.)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Tool improvements around tool dependencies for a module. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The standard library now includes </span><a href="https://go.dev/doc/security/fips140" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">new mechanisms to facilitate FIPS-140-3 compliance</span></a><span style="font-weight:400;">. And you know we love some good FIPS-140-3 compliance. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Lastly, it includes some improved </span><a href="https://go.dev/doc/go1.24#wasm" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">WebAssembly</span></a><span style="font-weight:400;"> support – which we’ll talk about later. </span></li>
</ul>
<p><b>04:46</b> <a href="https://engineering.fb.com/2025/02/14/connectivity/project-waterworth-ai-subsea-infrastructure/" target="_blank" rel="noreferrer noopener"><b>Unlocking global AI potential with next-generation subsea infrastructure</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Meta announced their most ambitious subsea cable endeavor: Project Waterworth. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Once the cable is completed, the project will reach five major continents and span over 50,000 KM (longer than the earth’s circumference) making it the world’s longest </span><a href="https://engineering.fb.com/2021/09/28/connectivity/2africa-pearls/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">subsea cable</span></a><span style="font-weight:400;"> project using the highest-capacity technology available. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It will bring connectivity to the US, India, Brazil, South Africa, as well as other key regions. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Waterworth will be a multi-billion dollar, multi-year investment to strengthen the scale and reliability of the world’s digital highways by opening three new oceanic corridors with the abundant, high-speed connectivity needed to drive AI innovation around the world.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Meta has apparently developed 20 subsea cables over the last decade, including multiple deployments of industry leading subsea cables of 24 fiber pairs, compared to the typical 8 to 16 pairs of other new systems .</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They are also deploying a first of its kind routing system, maximizing the cable load in deep waters at depths up to 7,000 meters and using enhanced burial techniques in high-risk fault areas, such as shallow waters near the coast, to avoid damage from ship anchors and other hazards. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They wrap up the article by basically saying they’re doing this for AI. Color us surprised. </span></li>
</ul>
<p><i><span style="font-weight:400;">06:25  Ryan – “</span></i><i><span style="font-weight:400;">I was sort of surprised that this is where Meta is investing. I don’t think of them in that space, like I do internet providers and cloud hyperscalers.”</span></i></p>
<h2><b>AI Is Going Great – Or How ML Makes All Its Money  </b></h2>
<p><b>07:50</b> <a href="https://arstechnica.com/ai/2025/02/sam-altman-lays-out-roadmap-for-openais-long-awaited-gpt-5-model/" target="_blank" rel="noreferrer noopener"><b>Sam Altman lays out roadmap for OpenAI’s long-awaited GPT-5 model</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Sam Altman </span><a href="https://x.com/sama/status/1889755723078443244" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">announced</span></a><span style="font-weight:400;"> a roadmap for how Open AI plans to release GPT-5, the long awaited followup to </span><a href="https://arstechnica.com/information-technology/2023/03/openai-announces-gpt-4-its-next-generation-ai-language-model/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">GPT 4</span></a><span style="font-weight:400;">.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Altman said it would be coming in “</span><a href="https://x.com/sama/status/1889757267425370415" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">months</span></a><span style="font-weight:400;">,” suggesting a release later this year.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">He further </span><a href="https://x.com/sama/status/1889755723078443244" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">explained</span></a><span style="font-weight:400;"> on X that they plan to ship GPT 4.5 – previously known as Orion – in “</span><a href="https://x.com/sama/status/1889757267425370415" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">weeks</span></a><span style="font-weight:400;">” as their last non-simulated reasoning model.  Simulated reasoning like </span><a href="https://arstechnica.com/information-technology/2024/12/openai-announces-o3-and-o3-mini-its-next-simulated-reasoning-models/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">o3</span></a><span style="font-weight:400;"> uses a special technique to iteratively process problems posed by users more deeply, but they are slower than conventional LLM like GPT-4o, and not ideal for all tasks. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">After 4.5, GPT 5 will be a system that brings together features from across the current AI Model lineup, including conventional AI models, SR models, and specialized models that do tasks like web search and research. </span></li>
</ul>
<p><i><span style="font-weight:400;">08:54  Justin – “</span></i><i><span style="font-weight:400;">I’m definitely very interested in how, you know, like where does AGI come into their roadmap? Like I know they keep talking about it soon. Like, is that this year’s problem? Is that a problem next year? Is that a next decade problem? Like I don’t really know when AGI is going to be real on what their timeline looks like.”</span></i></p>
<p><b>09:31</b> <a href="https://www.theinformation.com/articles/anthropic-strikes-back?rc=3t8xtd" target="_blank" rel="noreferrer noopener"><b>Anthropic Strikes Back</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Everyone has been waiting for </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=4a3b5036cf79f56971106532de93d3ec24aa0e5e7430c3125865d872860ae04dJmltdHM9MTc0MDQ0MTYwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=111326c2-38f8-6e94-3a5e-34a8396a6f52&amp;psq=anthropic&amp;u=a1aHR0cHM6Ly93d3cuYW50aHJvcGljLmNvbS8&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Anthropic</span></a><span style="font-weight:400;"> to produce a reasoning model.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">From reporting on The Information, they say Anthropic is taking a different approach to reasoning. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It developed a hybrid AI model that includes reasoning capabilities, which basically means the model uses more computation resources to calculate answers to hard questions, but the model can also handle simpler tasks quickly, without the extra work by acting like a normal LLM.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The company reportedly plans to release it in the next few weeks. </span></li>
</ul>
<p><b>10:31</b> <a href="https://www.theinformation.com/articles/anthropic-projects-soaring-growth-to-34-5-billion-in-2027-revenue?rc=3t8xtd" target="_blank" rel="noreferrer noopener"><b>Anthropic Projects Soaring Growth to $34.5 Billion in 2027 Revenue</b></a><span style="font-weight:400;">  </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">More reporting from The Information also alleges that current revenue for Anthropic is $3.7 Billion, with a projection that revenue could grow to $34.5 billion in 2027. </span></li>
</ul>
<p><i><span style="font-weight:400;">11:08  Ryan – “I don’t recommend anyone take investment advice from The Cloud Pod…”</span></i></p>
<h2><b>Cloud Tools</b></h2>
<p><b>11:37</b> <a href="https://nat-henderson.github.io/terraform-provider-dominos/" target="_blank" rel="noreferrer noopener"><b>The Terraform plugin for the Dominos Pizza provider</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">When you’re been writing a lot of Terraform code, it can sometimes make you hungry for some pizza. This provider can help you out! </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The </span><a href="https://github.com/nat-henderson/terraform-provider-dominos" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Domino Terraform provider</span></a><span style="font-weight:400;"> exists to ensure that while you’re waiting for your cloud infrastructure to spin up, you can get a hot pizza delivered.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This is powered by the expansion of the Terraform resource model into the physical world, inspired by the Google Rest API for interconnects.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The provider configuration is straightforward (although we’re disappointed that the credit card isn’t “sensitive.”</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">We truly are living in advanced times. </span></li>
</ul>
<p><i><span style="font-weight:400;">12:55  Matthew – “</span></i><i><span style="font-weight:400;">There is a feature for hash card vault support for credit card data. And you know, another one which blocks the addition of pineapple as a topping.”</span></i></p>
<p><span style="font-weight:400;">*Listener note: If anyone tries this, let us know how it goes! </span></p>
<h2><b>AWS </b></h2>
<p><b>14:30</b> <a href="https://aws.amazon.com/blogs/aws/aws-cloudtrail-network-activity-events-for-vpc-endpoints-now-generally-available/" target="_blank" rel="noreferrer noopener"><b>AWS CloudTrail network activity events for VPC endpoints now generally </b></a><a href="https://aws.amazon.com/blogs/aws/aws-cloudtrail-network-activity-events-for-vpc-endpoints-now-generally-available/" target="_blank" rel="noreferrer noopener"><b>available</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS is announcing the GA of network activity events for </span><a href="https://aws.amazon.com/vpc/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">VPC</span></a><span style="font-weight:400;"> endpoints in </span><a href="https://aws.amazon.com/cloudtrail/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">CloudTrail</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This feature helps you to record and monitor AWS API activity traversing your VPC endpoints, helping you strengthen your data perimeter and implement better detective controls. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Previously, it was hard to detect potential data exfiltration attempts and unauthorized access to the resources within your network through VPC endpoints.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">While VPC endpoint policies could be configured to prevent access from external accounts, there was no built in mechanism to log a denied action or detect when external credentials were used at a VPC endpoint. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Now you can opt in to log all AWS API activity passing through your VPC endpoints. Cloudtrail records these events as a new event type called network activity events, which capture both control plane and data plane actions passing through a VPC endpoint. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Network activity events in CloudTrail provide several key benefits:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Comprehensive Visibility</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">External credential detection</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Data exfiltration prevention</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Enhanced security monitoring </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Visibility for regulatory compliance</span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">15:21  Ryan – “</span></i><i><span style="font-weight:400;">Yeah, this is a neat feature. As someone who remembers, I guess remembers or dreads, can’t, I’m not sure what’s the right word, trying to troubleshoot connectivity to a private endpoint from a data center connectivity. There really is just no visibility or was until this feature was announced. So this is, I think, a fantastic addition and being able to log that information and act on that information for security purposes.”</span></i></p>
<p><b>20:03</b> <a href="https://aws.amazon.com/blogs/security/introducing-the-aws-trust-center/" target="_blank" rel="noreferrer noopener"><b>Introducing the AWS Trust Center</b></a> <span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS</span></a><span style="font-weight:400;"> is working to earn your trust as it is one of their core leadership principles with the launch of </span><a href="https://aws.amazon.com/trust-center/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Trust Center</span></a><span style="font-weight:400;">, a new online resource that shares how AWS approaches securing your assets in the cloud. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The AWS Trust Center is a window into their security practices, compliance programs and data protection controls that demonstrate how they work to earn your trust every day. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">AWS artifact? </span></li>
</ul>
<p><i><span style="font-weight:400;">20:45  Ryan – “</span></i><i><span style="font-weight:400;">I know that the artifacts was seemingly very hard for non-technical auditors to navigate. And I’ve had to spend a lot of time walking people through that. So anything that makes this easier. I haven’t looked at this landing page, but I’m hoping that it’s sort of geared towards that audience of compliance people who are building reports for very specific frameworks. And it sort of lays it all out in an easy to find manner.”</span></i></p>
<p><b>22:57</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/02/amazon-inspector-security-engine-container-images-scanning/" target="_blank" rel="noreferrer noopener"><b>Amazon Inspector enhances the security engine for container images </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2025/02/amazon-inspector-security-engine-container-images-scanning/" target="_blank" rel="noreferrer noopener"><b>scanning</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/inspector/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Inspector</span></a><span style="font-weight:400;"> has updated its engine powering container image scanning for ECR. This upgrade will give you a more comprehensive view of the vulnerabilities in third party dependencies used in container images. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This will not disrupt any of your existing workflows. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Our big question: didn’t’ this already exist? </span></li>
</ul>
<p><b>25:12</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/02/aws-secrets-configuration-provider-pod-identity-eks/" target="_blank" rel="noreferrer noopener"><b>AWS Secrets and Configuration Provider now integrates with Pod Identity </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2025/02/aws-secrets-configuration-provider-pod-identity-eks/" target="_blank" rel="noreferrer noopener"><b>for Amazon EKS</b></a><b>  </b></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/secrets-manager/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Secrets Manager</span></a><span style="font-weight:400;"> Secrets and Configuration Provider now integrates with EKS pod identity. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This integration simplifies IAM authentication for Amazon EKS when retrieving secrets from AWS Secrets Manager or parameters from AWS Systems Manager Parameter Store. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With the new features, you can manage IAM permissions for K8 apps more efficiently and securely, enabling granular access control through role session tags on secrets. </span></li>
</ul>
<p><i><span style="font-weight:400;">25:29  Ryan – “</span></i><i><span style="font-weight:400;">This has been a, like a clear area where EKS was not the same offering as in Google or, you know, being able to sort of leverage these identities directly from your pod configuration and your secure, your namespace configuration and be able to tie that to sort of a distributed role identity. So this is something that’s pretty great in terms of being able to provide that. It’s at least one step closer to full workload identity.</span></i></p>
<p><b>26:21</b> <a href="https://reinforce.awsevents.com/" target="_blank" rel="noreferrer noopener"><b>AWS Re:inforce Dates announced</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Dates just dropped; it’s going to be June 16-18th in Philadelphia. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Registration opens in March. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Chris Betz CISO will keynote. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">At least it’s not in Houston in July. </span></li>
</ul>
<p><b>28:30</b> <a href="https://aws.amazon.com/blogs/networking-and-content-delivery/exploring-new-subnet-management-capabilities-of-network-load-balancer/" target="_blank" rel="noreferrer noopener"><b>Exploring new subnet management capabilities of Network Load Balancer</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">You can now remove subnets from </span><a href="https://aws.amazon.com/elasticloadbalancing/network-load-balancer/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">NLBs</span></a><span style="font-weight:400;"> without destroying the entire NLB, matching the capabilities of </span><a href="https://aws.amazon.com/elasticloadbalancing/application-load-balancer/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">ALBs.</span></a></li>
<li style="font-weight:400;"><span style="font-weight:400;">It’s one of those things you only find out the hard way, It’s nice to have the flexibility now. </span></li>
</ul>
<h2><b>GCP</b></h2>
<p><b>31:17</b> <a href="https://cloud.google.com/blog/topics/developers-practitioners/attend-the-google-cloud-genai-roadshow/" target="_blank" rel="noreferrer noopener"><b>Deep dive into AI with Google Cloud’s global generative AI roadshow</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is on the road with their Generative AI roadshow!  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This event provides practical code-level engagement with Google’s most advanced AI technologies. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">These events will show you how to leverage everything from Google Cloud Infrastructure to the latest Gemini 2.0 models. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They started in India and then moved on to Europe and APAC, with the Bay, Seattle and Austin all getting visits in March 2025. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Ryan’s take: It’s worth your time if there’s an event near you. </span></li>
</ul>
<p><b>36:31 </b> <a href="https://cloud.google.com/blog/products/containers-kubernetes/using-multikueue-to-provision-global-gpu-resources/" target="_blank" rel="noreferrer noopener"><b>With MultiKueue, grab GPUs for your GKE cluster, wherever they may be</b></a><span style="font-weight:400;">   </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AI and LLM’s are experiencing explosive growth powering applications like machine translation to artistic creations. These technologies rely on intensive computations that require specialized hardware resources, like GPUs. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">To address scarcity in GPU’s, Google introduced the dynamic workload scheduler, and it transformed how you access and use GPU resources, particularly within a GKE cluster.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In addition, </span><a href="https://cloud.google.com/blog/products/compute/introducing-dynamic-workload-scheduler?e=4875480" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">DWS</span></a><span style="font-weight:400;"> offered an easy and straightforward </span><a href="https://cloud.google.com/kubernetes-engine/docs/tutorials/kueue-intro" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">integration between GKE and Kueue</span></a><span style="font-weight:400;">, a cloud-native job scheduler making it easier than ever to access GPUs quickly in a given region for a given GKE cluster.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">But what if you can use multiple regions, so you can get it done ASAP. This is what today’s announcement is all about with </span><a href="https://kueue.sigs.k8s.io/docs/concepts/multikueue/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">MultiKueue</span></a><span style="font-weight:400;">, a Kueue feature. With MK GKE and DWS can wait for accelerators in multiple regions.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">DWS automatically provisions resources in the best GKE clusters as soon as they are available. By submitting workloads to the global queue, MK executes them in the region with available GPU resources helping to optimize global resource usage.  </span></li>
</ul>
<p><i><span style="font-weight:400;">25:29  Matthew – “</span></i><i><span style="font-weight:400;">What I found interesting about this is that this is something that Amazon and Microsoft really can’t do because of the way Google is built at a global VNet level or VPC level, where each of the other ones have isolated regions. So this is something that because of the way Google is instructed with that global VPC, you have the ability to more easily burst into other regions, versus on AWS or Microsoft, you have to build a VPC or VNet and then launch your workloads in there and then connect it all back. So it’s actually an interesting win that, you know, win or loss, depending on how you want to view it, that Google has, and that they are able to say, just go use the access capacity here. Don’t really worry about data, you know, laws or anything else that you might have to worry about. But, you know, you have this ability to go grab these things in these other places that could be cheaper or more expensive depending on where your origin of everything is.”</span></i></p>
<p><b>41:27</b> <a href="https://cloud.google.com/blog/products/application-development/go-1-24-expands-support-for-wasm/" target="_blank" rel="noreferrer noopener"><b>Announcing Wasm support in Go 1.24</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">As we talked earlier, Google has released </span><a href="https://go.dev/dl/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Go 1.24</span></a><span style="font-weight:400;">, the latest version of Google’s OS programming language. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">There is a lot to love that we covered earlier, but it also significantly expands its capabilities for </span><a href="https://webassembly.org/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">WebAssembly</span></a><span style="font-weight:400;"> (Wasm) a binary instruction format that provides for the execution of high-performance, low-level code at speeds approaching native performance. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With a new go:wasmexport compiler directive and the ability to build a reactor for WebAssembly System Interface (WASI), developers now export functions from their Go code to Wasm, including long-running apps. </span></li>
</ul>
<p><i><span style="font-weight:400;">42:01  Justin – “…</span></i><i><span style="font-weight:400;">if you can just natively go into WebAssembly from Go, I think that’s a nice feature. Yeah, one more reason why I should learn more Go. Yeah, I keep working on Python, but I could also learn Go. Maybe I could get some more utility out of Go, I think.”</span></i></p>
<h2><b>Azure</b></h2>
<p><b>43:02 </b><a href="https://www.microsoft.com/en-us/security/blog/2025/02/13/securing-deepseek-and-other-ai-systems-with-microsoft-security/" target="_blank" rel="noreferrer noopener"><b>Securing DeepSeek and other AI systems with Microsoft Security</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">With recent concerns around security and deepseek, Microsoft is capitalizing with this helpful article on securing DeepSeek and others with Microsoft Security</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They highlight several things for security around your AI estate</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Azure AI Foundry’s Azure AI Content Safety, built in content filtering available by default to help detect and block malicious, harmful, or ungrounded content, with opt out options for flexibility.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Security Posture Management with Microsoft Defender for Cloud AI security posture management capabilities</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">See all the data via cyberthreat protection with Microsoft Defender for cloud allowing your SOC to review logs and telemetry to block real time attacks against the AI as well as XDR capabilities to further analyze threats. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Integrations with Purview DLP and Purview Data Security Posture Management. </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">44:03  Ryan – “…</span></i><i><span style="font-weight:400;">the reaction to DeepSeek I find hilarious more than the tool itself, you know, because it is just sort of like, wait, China, no, we have to secure this stuff. And, you know, everyone knew about the security concerns of sending data to AI and sort of, you know, like, yeah, no, this is a thing to be aware of. then immediately forgot it. But the minute it was being sent to a Chinese company, was a different reaction in the industry. And so I definitely think that Azure is capitalizing on this for sure.”</span></i></p>
<p><b>46:39</b> <a href="https://azure.microsoft.com/en-us/blog/microsoft-cost-management-updates-february-2025/" target="_blank" rel="noreferrer noopener"><b>Microsoft Cost Management updates—February 2025</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft is rolling out a bunch of cool things in the world of finops this week. Woo!</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">For those of you with an EA agreement, you can now use the Cost allocation field so you can support cost allocations based on hierarchy based on departments and accounts. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Copilot has been a good way to get your cost queries answered using natural language.  With view in cost analysis functionality you can also directly navigate to cost analysis to a custom view based on your prompts.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Now to that powerful capability they are giving sample prompts (</span><a href="https://azure.microsoft.com/en-us/blog/microsoft-cost-management-updates-february-2025/#_Copilot_nudges" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">nudges</span></a><span style="font-weight:400;">) to the overview page to encourage and guide users to interact with copilot more effectively.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Azure has built out some </span><a href="https://azure.microsoft.com/en-us/blog/microsoft-cost-management-updates-february-2025/_Introducing" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">FOCUS</span></a><span style="font-weight:400;"> introduction lessons for use with Azure to help you apply Finops Focus best practices directly to your environment. </span></li>
</ul>
<p><i><span style="font-weight:400;">47:27  Matthew – “</span></i><i><span style="font-weight:400;">The nudges are kind of useful and they’ve been adding copilot into the console. And then I have fun with it when it’s like, you know, internal server errors, why my instance didn’t scale up properly. And then I just say, copilot, tell me what’s wrong. And it goes, yo, open a support ticket or like try turning it back on and off again.”</span></i></p>
<p><b>49:45</b> <a href="https://azure.microsoft.com/en-us/updates?id=480458" target="_blank" rel="noreferrer noopener"><b>Generally Available: Scheduled Load Tests in Azure Load Testing</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Scheduling tests allows you to run tests at a later time or run at a regular cadence. Azure Load Testing supports adding one schedule to a test. You can add a schedule to a test after creating it.</span></li>
</ul>
<p><b>51:27</b> <a href="https://azure.microsoft.com/en-us/updates?id=478996" target="_blank" rel="noreferrer noopener"><b>GA: 6th Generation Intel-Based VMS – DV6-EV6</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">New 5th Gen Intel® Xeon® Platinum 8537C (Emerald Rapids) processor</span></li>
</ul>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Up to 27% higher vCPU performance and 3x larger L3 cache than the previous generation Intel Dl/D/Ev5 VMs </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Up to 192vCPU and &gt;18GiB of memory </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Azure Boost which enables: </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Up to 400k IOPS and 12 GB/s remote storage throughput </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Up to 200 Gbps VM network bandwidth </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">46% larger local SSD capacity and &gt;3X read IOPS </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">NVMe interface for local and remote disks </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Enhanced security through Total Memory Encryption (TME) technology.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Woo.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">We still hate their naming conventions. </span></li>
</ul>
<h2><b>Cloud Journey Series</b></h2>
<p><span style="font-weight:400;">Yes – It’s back! </span></p>
<p><b>53:10</b> <a href="https://itnext.io/should-all-developers-learn-infrastructure-as-code-a77e7feefbc8" target="_blank" rel="noreferrer noopener"><b>Should all developers learn Infrastructure as Code?</b></a><span style="font-weight:400;"> </span></p>
<h2><b>Aftershow</b></h2>
<p><span style="font-weight:400;">Yes, This is back too! </span></p>
<p><b>1:03:02 </b><a href="https://arstechnica.com/tech-policy/2025/02/man-offers-to-buy-city-dump-in-last-ditch-effort-to-recover-800m-in-bitcoins/" target="_blank" rel="noreferrer noopener"><b>Man offers to buy city dump in last-ditch effort to recover $800M in </b></a><span style="font-weight:400;"> </span><a href="https://arstechnica.com/tech-policy/2025/02/man-offers-to-buy-city-dump-in-last-ditch-effort-to-recover-800m-in-bitcoins/" target="_blank" rel="noreferrer noopener"><b>bitcoins</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">James Howell is back in the news, the IT Pro who lost 8,000 bitcoins in a landfill more than a decade ago, thinks he has one last chance to dig up his buried treasure before it’s lost forever – by buying the landfill itself. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This has been an ongoing legal battle for a while, with the latest curve being the Newport city council in Wales, has decided to close the landfill.  He has offered to buy it, if approved he would remove every piece of trash — clearing out thousands of tons and potentially sparing the city the cost of cleaning the site. He would use “a scanner with AI-trained detection technology” and a magnetic belt to surface his long lost hard drive containing the only copy of the 51-character private key he needs to get back into his crypto wallet. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">But the Newport council appears unlikely to accept Howells offer. The city has already secured permission to develop a solar farm on a portion of the landfill property. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Howell would rather clean it up and turn it into a park, but the council believes the solar project is a better use. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They have regularly ignored his advances, including his offer to share the money with them. </span></li>
</ul>
</li>
<li style="font-weight:400;"><i><span style="font-weight:400;">“This needle is very, very, very valuable—$800 million,” </span></i><span style="font-weight:400;">Howells told The Times</span><i><span style="font-weight:400;">. “Which means I’m willing to search every piece of hay in order to find the needle.”</span></i></li>
</ul>
<h2><span style="font-weight:400;">Closing</span></h2>
<p><span style="font-weight:400;">And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</span></p>
]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/1982248/c1e-dd5df60jjxu3np1o-257wm0g5t4r2-grmpzl.mp3" length="83849320"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 293 of The Cloud Pod – where the forecast is always cloudy! This week we’ve got a lot of new and, surprise, a new installment of Cloud Journey AND and aftershow – so make sure to stay tuned for that! We’ve got undersea cables, Go 1.24, Wasm, Anthropic and more. 
Titles we almost went with this week:

️Lets Go!
Under Sea cables make AI go BRRRRRR
The CloudPod says it will grow the listeners by 10x by 2027

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
General News
01:30 Go 1.24 is released! 

Go 1.24 has been released with a bunch of improvements! 
Go now fully supports generic type aliases.
It also includes several performance improvements to the runtime that have reduced CPU overhead by 2-3% on average across a suite of representative benchmarks. (Say that 5 times fast.)
Tool improvements around tool dependencies for a module. 
The standard library now includes new mechanisms to facilitate FIPS-140-3 compliance. And you know we love some good FIPS-140-3 compliance. 
Lastly, it includes some improved WebAssembly support – which we’ll talk about later. 

04:46 Unlocking global AI potential with next-generation subsea infrastructure

Meta announced their most ambitious subsea cable endeavor: Project Waterworth. 
Once the cable is completed, the project will reach five major continents and span over 50,000 KM (longer than the earth’s circumference) making it the world’s longest subsea cable project using the highest-capacity technology available. 
It will bring connectivity to the US, India, Brazil, South Africa, as well as other key regions. 
Waterworth will be a multi-billion dollar, multi-year investment to strengthen the scale and reliability of the world’s digital highways by opening three new oceanic corridors with the abundant, high-speed connectivit...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/1982248/c1a-k5d5-ww6qp43os43r-q0zt1i.jpg"></itunes:image>
                                                                            <itunes:duration>01:09:53</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                            </item>
                    <item>
                <title>
                    <![CDATA[292: VS Code Friend or Foe… Azure Data Studio Murdered]]>
                </title>
                <pubDate>Sat, 22 Feb 2025 00:48:49 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/1979875</guid>
                                    <link>https://tcpfm.castos.com/episodes/292-vs-code-azure-data-studio</link>
                                <description>
                                            <![CDATA[<p><span style="font-weight:400;">Welcome to episode 292 of The Cloud Pod – where the forecast is always cloudy! This week Justin and Jonathan are a dynamic duo, bringing you all the latest in news – and sound effects – because it’s earnings time! Plus we’ve got new from VS Code, Azure Data Studio, CodeBuild and more. </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">☁️The Cloud Pod Renames Cloud Earnings to ‘The Gulf of Capex’</span></li>
<li><span style="font-weight:400;">Sorry Elon, OpenAI Doesn’t Want Your Pocket Change</span></li>
<li><span style="font-weight:400;">MacOS gets into the Fastlane for Oil Changes</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>General News</b></h2>
<p><b>It’s earnings time! </b></p>
<p><b>01:29</b> <a href="https://www.businessinsider.com/alphabet-to-spend-big-ai-capex-q4-earnings-2025-2" target="_blank" rel="noreferrer noopener"><b>Alphabet is planning to spend big on AI again this year, sending shares </b></a><a href="https://www.businessinsider.com/alphabet-to-spend-big-ai-capex-q4-earnings-2025-2" target="_blank" rel="noreferrer noopener"><b>down</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Alphabet earnings were a bit of a let down with cloud revenue missing and their announcement of spending $75 Billion in CapEx (DeepSeek who?)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Consolidated revenue rose 12% in the period to 96.5 billion. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Capex investments of $75b shocked analysts who expected $57.9 billion.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">EPS was 2.15 vs 2.13.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Revenue of 96.5 billion vs 96.62 expected.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Ad revenue rose to 72.46 billion vs 71.3, Youtube advertising revenue was 10.47 billion vs 10.22 billion. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google Cloud was 12.0 billion vs expectation of 12.19 billion.</span></li>
</ul>
<p><i><span style="font-weight:400;">02:09  Jonathan – “I’m guessing ad revenue is gonna be down again, </span></i><i><span style="font-weight:400;">Q1, Q2 because I think a lot of ad revenue is driven by the election season. So that’s not looking too good for them.”</span></i></p>
<p><b>03:13</b> <a href="https://seekingalpha.com/news/4400293-microsoft-gaap-eps-of-3_23-beats-by-0_13-revenue-of-69_6b-beats-by-790m?utm_source=businessinsider&amp;utm_medium=referral&amp;feed_item_type=news" target="_blank" rel="noreferrer noopener"><b>Microsoft GAAP EPS of $3.23 beats by $0.13, revenue of $69.6B beats by </b></a><a href="https://seekingalpha.com/news/4400293-microsoft-gaap-eps-of-3_23-beats-by-0_13-revenue-of-69_6b-beats-by-790m?utm_source=businessinsider&amp;utm_medium=referral&amp;feed_item_type=news" target="_blank" rel="noreferrer noopener"><b>$790M</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.microsoft.com/en-us/investor/earnings/fy-2025-q2/press-release-webcast" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Microsoft</span></a><span style="font-weight:400;"> followed up with also weak growth in its Azure cloud computing unit. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">EPS was 3.23 beating expectations by 0.13</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Revenue of 69.6B beating by 780M</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Intelligent clo...</span></li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 292 of The Cloud Pod – where the forecast is always cloudy! This week Justin and Jonathan are a dynamic duo, bringing you all the latest in news – and sound effects – because it’s earnings time! Plus we’ve got new from VS Code, Azure Data Studio, CodeBuild and more. 
Titles we almost went with this week:

☁️The Cloud Pod Renames Cloud Earnings to ‘The Gulf of Capex’
Sorry Elon, OpenAI Doesn’t Want Your Pocket Change
MacOS gets into the Fastlane for Oil Changes

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
General News
It’s earnings time! 
01:29 Alphabet is planning to spend big on AI again this year, sending shares down

Alphabet earnings were a bit of a let down with cloud revenue missing and their announcement of spending $75 Billion in CapEx (DeepSeek who?)
Consolidated revenue rose 12% in the period to 96.5 billion. 
Capex investments of $75b shocked analysts who expected $57.9 billion.
EPS was 2.15 vs 2.13.
Revenue of 96.5 billion vs 96.62 expected.
Ad revenue rose to 72.46 billion vs 71.3, Youtube advertising revenue was 10.47 billion vs 10.22 billion. 
Google Cloud was 12.0 billion vs expectation of 12.19 billion.

02:09  Jonathan – “I’m guessing ad revenue is gonna be down again, Q1, Q2 because I think a lot of ad revenue is driven by the election season. So that’s not looking too good for them.”
03:13 Microsoft GAAP EPS of $3.23 beats by $0.13, revenue of $69.6B beats by $790M

Microsoft followed up with also weak growth in its Azure cloud computing unit. 
EPS was 3.23 beating expectations by 0.13
Revenue of 69.6B beating by 780M
Intelligent clo...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[292: VS Code Friend or Foe… Azure Data Studio Murdered]]>
                </itunes:title>
                                    <itunes:episode>292</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p><span style="font-weight:400;">Welcome to episode 292 of The Cloud Pod – where the forecast is always cloudy! This week Justin and Jonathan are a dynamic duo, bringing you all the latest in news – and sound effects – because it’s earnings time! Plus we’ve got new from VS Code, Azure Data Studio, CodeBuild and more. </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">☁️The Cloud Pod Renames Cloud Earnings to ‘The Gulf of Capex’</span></li>
<li><span style="font-weight:400;">Sorry Elon, OpenAI Doesn’t Want Your Pocket Change</span></li>
<li><span style="font-weight:400;">MacOS gets into the Fastlane for Oil Changes</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>General News</b></h2>
<p><b>It’s earnings time! </b></p>
<p><b>01:29</b> <a href="https://www.businessinsider.com/alphabet-to-spend-big-ai-capex-q4-earnings-2025-2" target="_blank" rel="noreferrer noopener"><b>Alphabet is planning to spend big on AI again this year, sending shares </b></a><a href="https://www.businessinsider.com/alphabet-to-spend-big-ai-capex-q4-earnings-2025-2" target="_blank" rel="noreferrer noopener"><b>down</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Alphabet earnings were a bit of a let down with cloud revenue missing and their announcement of spending $75 Billion in CapEx (DeepSeek who?)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Consolidated revenue rose 12% in the period to 96.5 billion. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Capex investments of $75b shocked analysts who expected $57.9 billion.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">EPS was 2.15 vs 2.13.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Revenue of 96.5 billion vs 96.62 expected.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Ad revenue rose to 72.46 billion vs 71.3, Youtube advertising revenue was 10.47 billion vs 10.22 billion. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google Cloud was 12.0 billion vs expectation of 12.19 billion.</span></li>
</ul>
<p><i><span style="font-weight:400;">02:09  Jonathan – “I’m guessing ad revenue is gonna be down again, </span></i><i><span style="font-weight:400;">Q1, Q2 because I think a lot of ad revenue is driven by the election season. So that’s not looking too good for them.”</span></i></p>
<p><b>03:13</b> <a href="https://seekingalpha.com/news/4400293-microsoft-gaap-eps-of-3_23-beats-by-0_13-revenue-of-69_6b-beats-by-790m?utm_source=businessinsider&amp;utm_medium=referral&amp;feed_item_type=news" target="_blank" rel="noreferrer noopener"><b>Microsoft GAAP EPS of $3.23 beats by $0.13, revenue of $69.6B beats by </b></a><a href="https://seekingalpha.com/news/4400293-microsoft-gaap-eps-of-3_23-beats-by-0_13-revenue-of-69_6b-beats-by-790m?utm_source=businessinsider&amp;utm_medium=referral&amp;feed_item_type=news" target="_blank" rel="noreferrer noopener"><b>$790M</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.microsoft.com/en-us/investor/earnings/fy-2025-q2/press-release-webcast" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Microsoft</span></a><span style="font-weight:400;"> followed up with also weak growth in its Azure cloud computing unit. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">EPS was 3.23 beating expectations by 0.13</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Revenue of 69.6B beating by 780M</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Intelligent cloud revenue was 25.5 billion an increase of 19%</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft indicated they plan to spend 80 Billion in CapEx for AI and data center growth. </span></li>
</ul>
<p><i><span style="font-weight:400;">04:02  Justin- “</span></i><i><span style="font-weight:400;">Also international expansion still, I think a big area too, particularly for Azure and Google and even Amazon. Like they’re all announcing more and more regions, more expansion of data centers, lots of laws that are going to pass for data sovereignty that they have to deal with. there’s, there’s spend everywhere.”</span></i></p>
<p><b>04:23</b> <a href="https://www.businessinsider.com/amazon-earnings-call-report-amzn-stock-live-updates-2025-2" target="_blank" rel="noreferrer noopener"><b>Amazon earnings recap: Stock falls as guidance falls short, CFO indicates </b></a><a href="https://www.businessinsider.com/amazon-earnings-call-report-amzn-stock-live-updates-2025-2" target="_blank" rel="noreferrer noopener"><b>capex of more than $100 billion in 2025</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><a href="https://www.businessinsider.com/amazon" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon</span></a><span style="font-weight:400;"> followed its peers by indicating they will invest $100B in CapEx for Amazon’s AI efforts on AWS</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">CEO Andy Jassy said that AWS could grow faster if they were not hindered by datacenter capacity…which is really interesting. We’re assuming GPU capacity. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon reported sales of 187.79B, beating estimates of 187.32 billion, EPS was 1.86 compared to $1.50 expected. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">AWS was a little light compared to estimates at 28.79B compared to expectations of 28.82 billion, but what’s 300 million between friends? </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon guided lighter than analysts expected at 151b to 155.5 billion, vs expectations 158.64 billion. Also penalized in after hours markets. </span></li>
</ul>
<p><i><span style="font-weight:400;">06:04  Justin- “</span></i><i><span style="font-weight:400;">I would assume inference, you know, becomes the bigger area of investment long-term, but short-term, you know, you need to train. they, I think a lot of their stuff, they’ve like training them and those things were really focused primarily at training first. So inference seems to be where everyone’s spending most of their money these days.”</span></i></p>
<h2><b>AI Is Going Great – Or How ML Makes All Its Money  </b></h2>
<p><b>06:39</b> <a href="https://www.theinformation.com/briefings/openai-ceo-appears-to-reject-elon-musks-97-billion-takeover-bid?rc=3t8xtd" target="_blank" rel="noreferrer noopener"><b>OpenAI CEO Appears to Reject Elon Musk’s $97 Billion Takeover Bid</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Elon recently made an unsolicited bid to buy </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=096d6fe2921446ee30d769b05371213a1a9e07fc1f6f9ea10dd14b55bd994019JmltdHM9MTczOTgzNjgwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=111326c2-38f8-6e94-3a5e-34a8396a6f52&amp;psq=open+AI&amp;u=a1aHR0cHM6Ly9vcGVuYWkuY29tLw&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Open AI</span></a><span style="font-weight:400;"> for 97.4 Billion.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">On Monday, Sam Altman rejected the offer. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Altman told his staff that Musk’s effort was “embarrassing,” and not in the best interest of the OpenAI mission to develop artificial general intelligence to benefit humanity. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Altman also declared that this is Musk’s attempt to slow down a competitor.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This does cause some complications, as OpenAI continues to plan to shift away from its non-profit roots. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">If the plan is for the non-profit to sell the for profit business, this bid makes it more expensive for the internal sale of the assets. </span></li>
</ul>
<p><i><span style="font-weight:400;">07:42  Jonathan – “</span></i><i><span style="font-weight:400;">It’s interesting that he made a bid. I mean, I don’t think he would have actually filed through on it personally. Now he’s got XAI and Grok 3 coming out soon and those other things. I agree with Sam Altman that it was probably just a distraction to mess with things. But he has drawn a line in the sand at $97.4 billion though.”</span></i></p>
<p><b>08:41</b> <a href="https://openai.com/global-affairs/introducing-the-intelligence-age/" target="_blank" rel="noreferrer noopener"><b>Introducing the intelligence age</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Super Bowl Ads were all the rage over the weekend, during the drumming of the KC chiefs by the Philadelphia Eagles 40-22.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Justin was really hoping for cloud commercials to talk about, but they didn’t materialize (and we DO NOT count Google’s android ads). </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">*But* OpenAI debuted their first ever ad. View it </span><a href="https://youtu.be/kIhb5pEo_j0?si=MMIaARTbQAILDNxK" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">here</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">We’re interested in you think! Let us know on social or via our Slack channel what you thought of the ad. </span></li>
</ul>
<p><i><span style="font-weight:400;">10:03  Jonathan – “</span></i><i><span style="font-weight:400;">I actually liked the look of it. The first time I saw it, I was like, this is a bit strange. But I liked the halftone look. reminds me of newspaper print and news unfolding over the years. It was kind of neat. I’m glad I didn’t spend the extra 8 million on another 30 seconds, though, and showing the doom that’s going to come out and the poverty. Yeah, like the desolate wasteland of Earth after nobody’s got a job anymore.”</span></i></p>
<p><b>12:32</b> <a href="https://arstechnica.com/ai/2025/02/openais-secret-weapon-against-nvidia-dependence-takes-shape/" target="_blank" rel="noreferrer noopener"><b>OpenAI’s secret weapon against Nvidia dependence takes shape</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Open AI is in the final stages of designing its long rumored AI processor with the aim of decreasing the company’s dependence on Nvidia hardware, per Reuters. </span></li>
<li style="font-weight:400;"><a href="https://arstechnica.com/information-technology/2023/11/chatgpt-was-the-spark-that-lit-the-fire-under-generative-ai-one-year-ago-today/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">ChatGPT</span></a><span style="font-weight:400;"> plans to leverage TSMC (Taiwan Semiconductor Manufacturing Co.) for fabrication within the next few months, but the chip has not yet formally been announced. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The first chip will use TSMCs’ 3-nanometer process. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The chips will incorporate high-bandwidth memory and networking features similar to those found in NVIDIA processors. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Initially the first chips will focus on running models (inference) rather than training them, with limited deployment across OpenAI.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;"> The goal is for mass production to start in 2026. The hardware will likely end up in </span><a href="https://arstechnica.com/ai/2025/01/trump-announces-500b-stargate-ai-infrastructure-project-with-agi-aims/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Stargate</span></a><span style="font-weight:400;"> and/or Microsoft data centers.</span></li>
</ul>
<p><i><span style="font-weight:400;">13:57  Justin – “</span></i><i><span style="font-weight:400;">I’m actually shocked just this long for them to announce that they were doing their own chip and to, you know, they actually haven’t announced it technically, but you know, rumors come out that they’re doing one. There’s been some scuttlebutt about it, but this is a pretty firm, you know, research paper by the, or news article by the Reuters. So yeah, very interesting.”</span></i></p>
<h2><b>AWS  </b></h2>
<p><b>15:04</b> <a href="https://aws.amazon.com/blogs/aws/codebuild-for-macos-adds-support-for-fastlane/" target="_blank" rel="noreferrer noopener"><b>AWS CodeBuild for macOS adds support for Fastlane</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><a href="https://fastlane.tools/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Fastlane</span></a><span style="font-weight:400;"> for </span><a href="https://aws.amazon.com/codebuild/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS CodeBuild</span></a><span style="font-weight:400;"> has now come to the Mac OS environments. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Fastlane is an open source tool suite designed to automate various aspects of mobile app development.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It provides mobile app developers with a centralized set of tools to manage tasks such as code signing, screenshot generation, beta distribution and app store submissions. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Fully integrated with popular CI and CD platforms, it supports IOS and Android development workflows. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Previously you could install Fastlane on your codebuild for MacOS installs, but it was undifferentiated heavy lifting and now you get it installed by default. </span></li>
</ul>
<p><b>16:16</b> <a href="https://aws.amazon.com/blogs/compute/introducing-jsonl-support-with-step-functions-distributed-map/" target="_blank" rel="noreferrer noopener"><b>Introducing JSONL support with Step Functions Distributed Map</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/step-functions/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Step Functions</span></a><span style="font-weight:400;"> is expanding its capabilities of </span><a href="https://docs.aws.amazon.com/step-functions/latest/dg/state-map-distributed.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Distributed Map</span></a><span style="font-weight:400;"> by adding support for </span><a href="https://jsonlines.org/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">JSONL</span></a><span style="font-weight:400;"> (JSON Lines)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">JSONL, a highly efficient text-based format, stores structured data as individual JSON objects separated by newlines, making it particularly suitable for large datasets. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This allows you to process large collection of items stored in JSONL format directly through distributed map and optionally exports the output of the Distributed Map as JSONL file. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The enhancement also introduces support for additional delimited file formats, including semicolon and tab-delimited files, providing greater flexibility in data source options. </span></li>
</ul>
<p><i><span style="font-weight:400;">16:51  Jonathan – “</span></i><i><span style="font-weight:400;">That’s really cool, actually, because thinking about streaming data, like log data, everyone’s moved to JSON logs, except now we just emit a text event with valid JSON, but it goes into the same file. So JSON lines are very much, I think, designed for log handling, log scanning, looking for patterns there. So this is really nice. It means we don’t have to have a separate Lambda function that reads in a 50 gigabyte file and breaks it into pieces first.”</span></i></p>
<h2><b>GCP</b></h2>
<p><b>19:08</b> <a href="https://cloud.google.com/blog/topics/partners/get-bigquery-datasets-on-google-cloud-marketplace/" target="_blank" rel="noreferrer noopener"><b>BigQuery datasets now available on Google Cloud Marketplace</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is announcing datasets on the </span><a href="https://cloud.google.com/marketplace%5C" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Google Cloud Marketplace</span></a><span style="font-weight:400;"> through </span><a href="https://cloud.google.com/analytics-hub" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">BIgQuery Analytics Hub</span></a><span style="font-weight:400;">, opening up new avenues for organizations to power innovative analytics use cases and procure data for enterprise business needs. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Using Google Cloud Marketplace offers access as a centralized procurement tool to a wide array of enterprise apps, foundational AI models, LLMs, and now commercial and free datasets from third-party data providers and Google. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Combined with </span><a href="https://cloud.google.com/bigquery" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">BigQuery</span></a><span style="font-weight:400;"> Analytics hub you can enable cross-organizational zero-copy sharing at scale, with governance, security and encryption all built in natively. </span></li>
</ul>
<p><i><span style="font-weight:400;">19:57  Jonathan – “</span></i><i><span style="font-weight:400;">I think they’re slowly putting them back again by court order. yeah, I guess Google has the advantage here though, because they don’t have to copy the data. They make it, they keep one copy and everyone has access to it. Whereas Amazon, I don’t think quite got there yet, did they?”</span></i></p>
<p><b>20:51</b> <a href="https://cloud.google.com/blog/products/ai-machine-learning/announcing-gen-ai-toolbox-for-databases-get-started-today/" target="_blank" rel="noreferrer noopener"><b>Announcing public beta of Gen AI Toolbox for Databases</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is launching the public beta of </span><a href="https://github.com/googleapis/genai-toolbox" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Gen AI Toolbox for Databases</span></a><span style="font-weight:400;"> in partnership with </span><a href="https://www.langchain.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">LangChain</span></a><span style="font-weight:400;">, the leading orchestration framework for developers building large language models. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Gen AI Toolbox is an open-source server that empowers application developers to connect production-grade, agent based generative AI applications to databases.  Streamlining the creation, deployment and management of sophisticated gen AI tools capable of querying databases with secure access, robust observability, scalability and comprehensive manageability. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It can currently connect to self managed PostGreSQL, MySQL, as well as managed offerings like AlloyDB, spanner, and CloudSQL for Postgres, Mysql and SQL server. </span></li>
</ul>
<p><b>22:32</b> <a href="https://cloud.google.com/blog/products/databases/memorystore-cluster-autoscaler-now-on-github/" target="_blank" rel="noreferrer noopener"><b>Rightsize your Memorystore for Redis Clusters with open-source </b></a><a href="https://cloud.google.com/blog/products/databases/memorystore-cluster-autoscaler-now-on-github/" target="_blank" rel="noreferrer noopener"><b>Autoscaler</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Last year google gave us </span><a href="https://cloud.google.com/memorystore/docs/cluster" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Memorystore for Redis Clusters</span></a><span style="font-weight:400;"> with the ability to manually trigger scale out and down.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Now, to meet the elastic nature of modern Memorystore workloads, they are excited to announce the </span><a href="https://github.com/GoogleCloudPlatform/memorystore-cluster-autoscaler" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">open-source Memorystore Cluster Autoscaler available on Github</span></a><span style="font-weight:400;">, which builds on the open source </span><a href="https://github.com/cloudspannerecosystem/autoscaler" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">panner autoscaler</span></a><span style="font-weight:400;"> from 2020. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The autoscaler consists of two components the Poller and the Scaler, which monitors via cloud monitoring the health and performance of the memorystore cluster instances. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Justin specifically appreciates this, but it’s a hack, and should be something they build into the service long term. But we remember AWS had this moment too at one point where they would give you automation solutions and then deliver full automation in the service a year or two later. </span></li>
</ul>
<p><i><span style="font-weight:400;">23:05 Justin  – “</span></i><i><span style="font-weight:400;">I’d really like you to just build this into the product. Like why is this an open source thing that I have to run on my own server or infrastructure. But yeah, in fairness to Google, Amazon used to do this too. They would build like these custom solutions that they put on their GitHub thing. And then eventually a lot of people downloaded those things. Those eventually became future products within a couple of years.”</span></i></p>
<p><b>24:08</b> <a href="https://blog.google/technology/google-deepmind/gemini-model-updates-february-2025/" target="_blank" rel="noreferrer noopener"><b>Gemini 2.0 is now available to everyone</b></a><span style="font-weight:400;">       </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google has made </span><a href="https://blog.google/feed/gemini-app-model-update-january-2025" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">2.0 Flash available to all users of Gemini App</span></a><span style="font-weight:400;"> on desktop and mobile, helping everyone discover new ways to create, interact and collaborate with Gemini. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Today, we’re making the updated Gemini 2.0 flash generally available via the Gemini API in </span><a href="https://aistudio.google.com/prompts/new_chat?model=gemini-2.0-flash" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Google AI Studio</span></a><span style="font-weight:400;"> and </span><a href="https://console.cloud.google.com/freetrial?redirectPath=/vertex-ai/studio" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Vertex AI</span></a><span style="font-weight:400;">.  Developers can now build production applications with 2.0 flash. </span></li>
</ul>
<p><i><span style="font-weight:400;">24:32  Jonathan – “</span></i><i><span style="font-weight:400;">It’s quite a stretch to say build production applications. I mean, I guess you can build applications, maybe if you’re lucky. I played with Gemini 2, and I played with their deep research. Gemini’s 1.5 deep research offering a few days ago. I think it’s got a way to go. I don’t think it’s quite there with OpenAI’s version of the same thing just yet.”</span></i></p>
<h2><b>Azure</b></h2>
<p><b>27:24</b> <a href="https://techcommunity.microsoft.com/blog/azuresqlblog/azure-data-studio-retirement/4371009" target="_blank" rel="noreferrer noopener"><b>Azure Data Studio Retirement</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Azure is announcing the upcoming retirement of Azure Data Studio (ADS) on February 6th, 2025 as they focus on delivering a modern, streamlined SQL development experience. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">ADS will remain supported until February 28th, 2026, giving developers ample time to transition. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This decision aligns with their commitment to simplifying SQL development by consolidation efforts on VS code with </span><a href="http://aka.ms/vscode-mssql-docs" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">MSSQL extension</span></a><span style="font-weight:400;">, a powerful and versatile tool designed for modern developers</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">But why… Well:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">They want to focus on innovation, and VS code provides a robust platform.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Streamlined Tools eliminates duplication, reduces engineering, maintenance overhead, and accelerates feature delivery, ensuring developers have access to the latest innovations. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Transition to VS Code gets you a modern development environment and a comprehensive set of MSSQL Extensions. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Execute queries faster with filtering, sorting and export options JSON, Excel and CSV.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Manage schemas visually with Table Designer, Object Explorer and support for keys, indexes and constraints. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Connect to SQL Server, Azure SQL (all offerings), and SQL database in Fabric using an improved Connection Dialog</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Streamline development with scripting, object modifications, and a unified SQL experience</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Optimize performance with an enhanced Query Results Pane and execution plans. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Integrate with DevOps and CI/CD pipelines using SQL Database projects. </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">29:30  Justin – “</span></i><i><span style="font-weight:400;">Visual Studio is an anchor. It’s so big. It’s so complicated. And if you’re trying to get people to do modern.net development with C sharp, you don’t need all that bloat. Like that, they’re still supporting WCF frameworks which are 20 years old at this point. You don’t need that in modern .NET web development. So it makes sense to me that they’re divorcing themselves from Visual Studio.”</span></i></p>
<h2><b>Off Topic </b></h2>
<p><b>35:55 </b><a href="https://blog.google/products/maps/united-states-geographic-name-change-feb-2025/" target="_blank" rel="noreferrer noopener"><b>Gulf of America name change in the U.S. — what you’ll see in Maps</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">If anyone knows of a plugin that will put it back for Chrome… we’re all ears.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google has updated the Gulf of Mexico to Gulf of America for those in the US. If you are in Mexico you’ll still see the Gulf of Mexico, and if you’re in the rest of the world you’ll see the Gulf of Mexico (Gulf of America). </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This is fine. </span></li>
</ul>
<p><b>04:38</b> <a href="https://blog.google/feed/notebooklm-google-one/" target="_blank" rel="noreferrer noopener"><b>NotebookLM Plus is now available in the Google One AI Premium </b></a></p>
<p><a href="https://blog.google/feed/notebooklm-google-one/" target="_blank" rel="noreferrer noopener"><b>subscription</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">NotebookLM is a research and thinking companion designed to help you make the most of your information.  You can upload material, summarize it, ask questions and transform it into something engaging, like a podcast-style audio discussion.  NotebookLM can help you ace a career certification, generate ideas or synthesize data for a project.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">NotebookLM plus to the google one AI premium plan, a version with higher usage limits and premium features for even more customized research. </span></li>
</ul>
<h2><span style="font-weight:400;">Closing</span></h2>
<p><span style="font-weight:400;">And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</span></p>
]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/1979875/c1e-8m9mb959o6i4gp59-rkzj457nfkk6-qnycsw.mp3" length="59603342"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 292 of The Cloud Pod – where the forecast is always cloudy! This week Justin and Jonathan are a dynamic duo, bringing you all the latest in news – and sound effects – because it’s earnings time! Plus we’ve got new from VS Code, Azure Data Studio, CodeBuild and more. 
Titles we almost went with this week:

☁️The Cloud Pod Renames Cloud Earnings to ‘The Gulf of Capex’
Sorry Elon, OpenAI Doesn’t Want Your Pocket Change
MacOS gets into the Fastlane for Oil Changes

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
General News
It’s earnings time! 
01:29 Alphabet is planning to spend big on AI again this year, sending shares down

Alphabet earnings were a bit of a let down with cloud revenue missing and their announcement of spending $75 Billion in CapEx (DeepSeek who?)
Consolidated revenue rose 12% in the period to 96.5 billion. 
Capex investments of $75b shocked analysts who expected $57.9 billion.
EPS was 2.15 vs 2.13.
Revenue of 96.5 billion vs 96.62 expected.
Ad revenue rose to 72.46 billion vs 71.3, Youtube advertising revenue was 10.47 billion vs 10.22 billion. 
Google Cloud was 12.0 billion vs expectation of 12.19 billion.

02:09  Jonathan – “I’m guessing ad revenue is gonna be down again, Q1, Q2 because I think a lot of ad revenue is driven by the election season. So that’s not looking too good for them.”
03:13 Microsoft GAAP EPS of $3.23 beats by $0.13, revenue of $69.6B beats by $790M

Microsoft followed up with also weak growth in its Azure cloud computing unit. 
EPS was 3.23 beating expectations by 0.13
Revenue of 69.6B beating by 780M
Intelligent clo...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/1979875/c1a-k5d5-8dwnrnwohk-yxzu5r.jpg"></itunes:image>
                                                                            <itunes:duration>00:41:23</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                            </item>
                    <item>
                <title>
                    <![CDATA[291: AWS, GCP and Azure eat KRO]]>
                </title>
                <pubDate>Thu, 13 Feb 2025 17:28:17 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/1974149</guid>
                                    <link>https://tcpfm.castos.com/episodes/291-aws-gcp-and-azure-eat-kro</link>
                                <description>
                                            <![CDATA[<p><span style="font-weight:400;">Welcome to episode 291 of The Cloud Pod – where the forecast is always cloudy! Justin, Jonathan, and Ryan have battled through the various plagues and have come together to bring you all the latest in cloud news, including Kro, DeepSeek, and CoPilot. </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">In Shocking News China Steals US IP</span></li>
<li><span style="font-weight:400;">️The Cloud Pod is Now Supported in Gov Cloud </span></li>
<li><span style="font-weight:400;">Microsoft Goes Open Source No SQL… and Hell Hasn’t Frozen Over</span></li>
<li><span style="font-weight:400;">Zombie Buckets Receive How Much Traffic?!?</span></li>
<li><span style="font-weight:400;">️AWS, GCP and Azure eat KRO</span></li>
<li><span style="font-weight:400;">‍✈️Github Copilot for Free, so You Can Win at Coding Interviews</span></li>
<li><span style="font-weight:400;">Customized Best Practices… I don’t think you know what best practices are</span></li>
<li><span style="font-weight:400;">☁️TheCloudPod Leverages Deep Understanding to Make a Nuanced Decision on adopting Copilot</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>Follow Up</b></h2>
<p><b>01:23</b> <a href="https://venturebeat.com/ai/is-deepseek-really-sending-data-to-china-lets-decode/" target="_blank" rel="noreferrer noopener"><b>Is DeepSeek really sending data to China? Let’s decode</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">One of the early concerns about </span><a href="https://deepseek.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">DeepSeek</span></a><span style="font-weight:400;"> was its privacy implications, starting with their </span><a href="https://platform.deepseek.com/downloads/DeepSeek%20Privacy%20Policy.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">privacy policy</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Allegations are significant but reality is if the open source model is hosted locally or orchestrated via GPUs in the US the data does not go to China.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">But if you’re using the DeepSeek app it clearly states in the privacy policy that the data will be stored in China. Data hosted on Chinese servers can be seized by the Government at any time. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Maybe rethink using the native DeepSeek websites and mobile apps and just host them locally in LM studio. </span></li>
</ul>
<p><i><span style="font-weight:400;">02:21  Jonathan – “They’re collecting some weird data. I get collecting conversational data, because that is the business they’re in, but they’re also doing some weird stuff, like they fingerprint users by looking at the patterns of the way that they type. Not just what they type, but how they type, like the timing between hitting different letters – things like that.”</span></i></p>
<p><b>8:06</b> <a href="https://www.theinformation.com/briefings/openai-believes-deepseek-was-developed-using-openai-models?rc=3t8xtd" target="_blank" rel="noreferrer noopener"><b>OpenAI Believes DeepSeek Was Developed Using OpenAI Models</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Listener Note: paywall article </span></li>
<li style="font-weight:400;"><a href="https://www.bing.com/ck/a?!&amp;&amp;p=fc810c778abb8d35017b18f340bf2d23ffb04ac0039d452e9badff0dfd548dd3JmltdHM9MTczOTIzMjAwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=111326c2-38f8-6e94-3a5e-34a..."></a></li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 291 of The Cloud Pod – where the forecast is always cloudy! Justin, Jonathan, and Ryan have battled through the various plagues and have come together to bring you all the latest in cloud news, including Kro, DeepSeek, and CoPilot. 
Titles we almost went with this week:

In Shocking News China Steals US IP
️The Cloud Pod is Now Supported in Gov Cloud 
Microsoft Goes Open Source No SQL… and Hell Hasn’t Frozen Over
Zombie Buckets Receive How Much Traffic?!?
️AWS, GCP and Azure eat KRO
‍✈️Github Copilot for Free, so You Can Win at Coding Interviews
Customized Best Practices… I don’t think you know what best practices are
☁️TheCloudPod Leverages Deep Understanding to Make a Nuanced Decision on adopting Copilot

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
Follow Up
01:23 Is DeepSeek really sending data to China? Let’s decode 

One of the early concerns about DeepSeek was its privacy implications, starting with their privacy policy. 
Allegations are significant but reality is if the open source model is hosted locally or orchestrated via GPUs in the US the data does not go to China.
But if you’re using the DeepSeek app it clearly states in the privacy policy that the data will be stored in China. Data hosted on Chinese servers can be seized by the Government at any time. 
Maybe rethink using the native DeepSeek websites and mobile apps and just host them locally in LM studio. 

02:21  Jonathan – “They’re collecting some weird data. I get collecting conversational data, because that is the business they’re in, but they’re also doing some weird stuff, like they fingerprint users by looking at the patterns of the way that they type. Not just what they type, but how they type, like the timing between hitting different letters – things like that.”
8:06 OpenAI Believes DeepSeek Was Developed Using OpenAI Models 

Listener Note: paywall article 
]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[291: AWS, GCP and Azure eat KRO]]>
                </itunes:title>
                                    <itunes:episode>291</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p><span style="font-weight:400;">Welcome to episode 291 of The Cloud Pod – where the forecast is always cloudy! Justin, Jonathan, and Ryan have battled through the various plagues and have come together to bring you all the latest in cloud news, including Kro, DeepSeek, and CoPilot. </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">In Shocking News China Steals US IP</span></li>
<li><span style="font-weight:400;">️The Cloud Pod is Now Supported in Gov Cloud </span></li>
<li><span style="font-weight:400;">Microsoft Goes Open Source No SQL… and Hell Hasn’t Frozen Over</span></li>
<li><span style="font-weight:400;">Zombie Buckets Receive How Much Traffic?!?</span></li>
<li><span style="font-weight:400;">️AWS, GCP and Azure eat KRO</span></li>
<li><span style="font-weight:400;">‍✈️Github Copilot for Free, so You Can Win at Coding Interviews</span></li>
<li><span style="font-weight:400;">Customized Best Practices… I don’t think you know what best practices are</span></li>
<li><span style="font-weight:400;">☁️TheCloudPod Leverages Deep Understanding to Make a Nuanced Decision on adopting Copilot</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>Follow Up</b></h2>
<p><b>01:23</b> <a href="https://venturebeat.com/ai/is-deepseek-really-sending-data-to-china-lets-decode/" target="_blank" rel="noreferrer noopener"><b>Is DeepSeek really sending data to China? Let’s decode</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">One of the early concerns about </span><a href="https://deepseek.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">DeepSeek</span></a><span style="font-weight:400;"> was its privacy implications, starting with their </span><a href="https://platform.deepseek.com/downloads/DeepSeek%20Privacy%20Policy.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">privacy policy</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Allegations are significant but reality is if the open source model is hosted locally or orchestrated via GPUs in the US the data does not go to China.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">But if you’re using the DeepSeek app it clearly states in the privacy policy that the data will be stored in China. Data hosted on Chinese servers can be seized by the Government at any time. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Maybe rethink using the native DeepSeek websites and mobile apps and just host them locally in LM studio. </span></li>
</ul>
<p><i><span style="font-weight:400;">02:21  Jonathan – “They’re collecting some weird data. I get collecting conversational data, because that is the business they’re in, but they’re also doing some weird stuff, like they fingerprint users by looking at the patterns of the way that they type. Not just what they type, but how they type, like the timing between hitting different letters – things like that.”</span></i></p>
<p><b>8:06</b> <a href="https://www.theinformation.com/briefings/openai-believes-deepseek-was-developed-using-openai-models?rc=3t8xtd" target="_blank" rel="noreferrer noopener"><b>OpenAI Believes DeepSeek Was Developed Using OpenAI Models</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Listener Note: paywall article </span></li>
<li style="font-weight:400;"><a href="https://www.bing.com/ck/a?!&amp;&amp;p=fc810c778abb8d35017b18f340bf2d23ffb04ac0039d452e9badff0dfd548dd3JmltdHM9MTczOTIzMjAwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=111326c2-38f8-6e94-3a5e-34a8396a6f52&amp;psq=openai&amp;u=a1aHR0cHM6Ly9vcGVuYWkuY29tLw&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OpenAI</span></a><span style="font-weight:400;"> says they have found evidence that the Chinese firm behind DeepSeek developed the AI using information generated by OpenAI’s models. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This is prohibited by the OpenAI terms of service, and is a practice known as AI model distillation.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With distillation, the developer asks existing AI models lots of questions and uses the answers to develop new models that mimic their performance.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This shortcut results in models that roughly approximate state-of-the-art models but don’t cost a lot to produce</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">OpenAI said last year it would sell access to its models directly to customers based in China, while MS has continued to resell OpenAI models through its Azure cloud service to Chinese customers.</span><span style="font-weight:400;"> </span></li>
</ul>
<p><i><span style="font-weight:400;">09:15  Justin- “Oh, you mean the company that stole all the internet data in the world to create a model is complaining about another company stealing their data?”</span></i></p>
<h2><b>General News </b></h2>
<p><b>11:42</b> <a href="https://www.theregister.com/2025/02/04/abandoned_aws_s3/" target="_blank" rel="noreferrer noopener"><b>Abandoned AWS S3 buckets can be reused in supply-chain attacks that </b></a><a href="https://www.theregister.com/2025/02/04/abandoned_aws_s3/" target="_blank" rel="noreferrer noopener"><b>would make SolarWinds look ‘insignificant’</b></a><b> </b></p>
<p><a href="https://labs.watchtowr.com/8-million-requests-later-we-made-the-solarwinds-supply-chain-attack-look-amateur/" target="_blank" rel="noreferrer noopener"><b>8 Million Requests Later, We Made The SolarWinds Supply Chain Attack </b></a><a href="https://labs.watchtowr.com/8-million-requests-later-we-made-the-solarwinds-supply-chain-attack-look-amateur/" target="_blank" rel="noreferrer noopener"><b>Look Amateur</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><a href="https://labs.watchtowr.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">watchTowr Labs</span></a><span style="font-weight:400;"> security researchers are claiming that Abandoned AWS S3 buckets could be reused to hijack the global software supply chain in an attack that would make “</span><a href="https://www.cisecurity.org/solarwinds" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Solarwinds</span></a><span style="font-weight:400;"> look amateurish and insignificant.”</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The researchers report that they have identified 150 buckets that were long gone, yet applications and websites are still trying to pull software updates and other code from them. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">If someone were to take over those buckets, they could be used to feed malicious software updates into peoples devices. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The buckets were previously owned by governments, fortune 500 firms, technology and cybersecurity firms and major open source projects. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The watchTowr team spent &lt;500 dollars to re-register 150 S3 buckets with the same names and enabled logging to determine what files were still being requested and by what.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Then, they spent 2 months watching the requests. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">During the 2 months, the S3 budget received more than eight million requests for resources including Windows, Linux, and macOS executables, virtual machine images, javascript files, cloud formation templates and SSLVPN server configurations. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Coming from all over includes Nasa and US government networks, along with government organizations in the UK and other countries. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Watchtower CEO Benjamin Harris said that it would be terrifyingly simple to pull off an exploit in this way. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">BTW, Justin super approves of this company as they use a lot of Memes in their article.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">AWS took the S3 buckets off Watchtower’s hands and sinkhole-d them, so these 150 are no longer being used… but how many more exist out there?</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They didn’t really break down how they found them, but it’s probably not very hard to find.</span></li>
</ul>
<p><i><span style="font-weight:400;">13:55  Jonathan – “It’s no different than domain registrations expiring, or getting somebody’s phone number after it’s been advertised…I feel like they’re pointing the finger at Amazon a little more than they should. To say that it’s a supply chain attack is kind of a stretch because these companies don’t exist anymore, that’s why the buckets are gone – so it’s a dead supply chain attack</span></i></p>
<h2><b>AI is Going Great – or How ML Makes All It’s Money </b></h2>
<p><b>20:19</b> <a href="https://openai.com/global-affairs/introducing-chatgpt-gov/" target="_blank" rel="noreferrer noopener"><b>Introducing ChatGPT Gov</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OpenAI</span></a><span style="font-weight:400;"> is releasing a version of openAI that is targeted at the public sector.  They believe the US Government’s adoption of AI can boost efficiency and productivity and is crucial for maintaining and enhancing America’s global leadership. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">By making the products available to the US government, they aim to ensure AI serves the national interest and the public good, aligned with democratic values, while empowering policymakers to responsibly integrate capabilities to deliver better services to the American people. (Side note, did anyone else lol at this?)</span></li>
<li style="font-weight:400;"><a href="https://openai.com/global-affairs/introducing-chatgpt-gov/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">ChatGPT Gov</span></a><span style="font-weight:400;">, a new tailored version of ChatGPT designed to provide US government agencies with an additional way to access OpenAI’s frontier models. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Agencies can deploy ChatGPT Gov in their own MS Azure commercial cloud or Azure Government cloud on top of the Microsoft Azure OpenAI service. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Self-hosting ChatGPT Gov enables agencies to more easily manage their own security, privacy and compliance requirements, such as stringent cybersecurity frameworks (IL5, CJIS, ITAR and FEDRAMP) high.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Additionally, they believe the infrastructure will expedite internal authorization of OpenAI’s tools for the handling of non-public sensitive data. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">ChatGPT Gov reflects their commitment to helping the US Government agencies leverage OpenAI’s technology today.  While they continue to work towards FedRAMP moderate and high accreditations for their SaaS product, ChatGPT enterprise. They are also evaluating expanding ChatGPT Gov to Azure’s classified regions. </span></li>
</ul>
<p><i><span style="font-weight:400;">22:13  Justin – “Remember back in the early days of Cloud Pod when we were talking about all the engineers protesting at the companies about the machine learning being used on video content for police forces, and I was thinking about that compared to this…I don’t know if people are going to protest this. They should. They probably should.”</span></i></p>
<p><b>23:23</b> <a href="https://www.theinformation.com/briefings/openai-revenue-surged-from-200-a-month-chatgpt-subscriptions?rc=3t8xtd" target="_blank" rel="noreferrer noopener"><b>OpenAI Revenue Surged From $200-a-Month ChatGPT Subscriptions</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Reportedly the $200 dollar ChatGPT Pro subscriptions have raised OpenAI revenue by $25M a month or at least $300M on an annual basis.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">I guess we don’t know what we are talking about… I’m still unclear what they’re buying with this other than the Vision capability they just launched. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Interested in checking out the pricing models for yourself? You can do that – </span><a href="https://openai.com/chatgpt/pricing" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">here</span></a><span style="font-weight:400;">! </span></li>
</ul>
<p><i><span style="font-weight:400;">25:04  Ryan – “I do love that the rabbit holes that I fall into for internet research have now been outsourced to AI, so I can just have the robot do the rabbit hole.”</span></i></p>
<p><b>27:32</b> <a href="https://openai.com/index/introducing-deep-research/" target="_blank" rel="noreferrer noopener"><b>Introducing deep research</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><a href="https://chatgpt.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">ChatGPT</span></a><span style="font-weight:400;"> has released Deep research in ChatGPT, a new agentic capability that conducts multi-step research on the internet for complex tasks. It accomplishes in tens of minutes what would take a human many hours.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Deep Research, when prompted, will find, analyze and synthesize hundreds of online sources to create a comprehensive report at the level of a research analyst. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Leveraging the </span><a href="https://openai.com/index/openai-o3-mini/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OpenAI o3</span></a><span style="font-weight:400;"> model that is optimized for web browsing and data analysis, it leverages reasoning to search, interpret and analyze massive amounts of text, images and PDFs on the internet, pivoting as needed in reaction to information it encounters. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Deep research was built for areas like finance, science, policy and engineering and needs thorough, precise, and reliable research. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">To use it, select Deep research in the message composer and enter your query. Tell Chat GPT what you need, and whether it’s a competitive analysis on streaming platforms or a personalized report on the best commuter bike. You can attach files and spreadsheets to add context to your question. Once it starts running, a sidebar appears with a summary of the steps taken and sources used.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Deep research may take anywhere from 5 to 30 minutes to complete its work, taking the time needed to dive deep into the web</span></li>
</ul>
<p><b>30:05</b> <a href="https://www.snowflake.com/en/blog/deepseek-preview-snowflake-cortex-ai/" target="_blank" rel="noreferrer noopener"><b>Announcing DeepSeek-R1 in Preview on Snowflake Cortex AI</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">All the cloud providers are starting to offer DeepSeek, with the first up this week being </span><a href="https://www.snowflake.com/en/data-cloud/cortex/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Snowflake Cortex AI</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The model is available in private preview for serverless inference for batch and interactive.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The model is hosted in the US with no data shared with the model provider.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Once GA, you’ll be able to manage access to the model via role-based access control (RBAC). </span></li>
</ul>
<p><i><span style="font-weight:400;">30:31  Justin – “</span></i><i><span style="font-weight:400;">So if you want to try Deep Seek in a safer environment, Snowflake is your friend.”</span></i></p>
<h2><b>Cloud Tools</b></h2>
<p><b>31:02</b> <a href="https://aws.amazon.com/blogs/opensource/introducing-qontos-prometheus-rds-exporter-an-open-source-solution-to-enhance-monitoring-amazon-rds/" target="_blank" rel="noreferrer noopener"><b>Introducing Qonto’s Prometheus RDS Exporter – An Open Source Solution </b></a><a href="https://aws.amazon.com/blogs/opensource/introducing-qontos-prometheus-rds-exporter-an-open-source-solution-to-enhance-monitoring-amazon-rds/" target="_blank" rel="noreferrer noopener"><b>to Enhance Monitoring Amazon RDS</b></a><b>  </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Databases are a critical part of your infrastructure, and if you’re using AWS RDS, the ability to get metrics like CPU, RAM, IOPS, Storage or service quotas is critical, but challenging when the number of RDS instances increases to the 10s, hundreds or thousands of databases to monitor. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This is why a standardized approach to database monitoring can help administrators save time and help scale their business with lower risk. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Qonto, a leading payment institution that offers a panel of banking services to small businesses with simplicity, has published a unified framework for Amazon RDS monitoring which helps them deploy best practices at scale and monitor hundreds of databases with limited effort. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This automation comes as the </span><a href="https://github.com/qonto/prometheus-rds-exporter" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Prometheus RDS Exporter</span></a><span style="font-weight:400;"> for Amazon RDS monitoring, and they have open sourced it under an MIT license. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Qonto wanted to aggregate key RDS metrics and push them into prometheus for monitoring and alerting purposes. </span></li>
</ul>
<p><i><span style="font-weight:400;">32:01  Ryan – “I do like the sort of standardization that Prometheus has brought. I get a little frustrated sometimes with some of the use cases, because it’s a big, big hammer that can be set up to solve little problems. But something like this, if you’ve got enough scale, where you’re struggling to visualize and see metrics across hundred of Amazon accounts, and then maybe you’ve got other applications that’s using OpenTelemetry – I think this is pretty cool that you can standardize it and put it all in one place.”</span></i></p>
<h2><b>AWS</b></h2>
<p><b>35:38</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/01/amazon-redshift-default-security-configurations-new-warehouses/" target="_blank" rel="noreferrer noopener"><b>Amazon Redshift announces enhanced default security configurations for </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2025/01/amazon-redshift-default-security-configurations-new-warehouses/" target="_blank" rel="noreferrer noopener"><b>new warehouses</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/pm/redshift/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Redshift</span></a><span style="font-weight:400;"> announces enhanced security defaults to help you adhere to best practices in data security and reduce the risk of potential misconfigurations.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">These changes include disabling </span><a href="https://docs.aws.amazon.com/redshift/latest/mgmt/rs-security-group-public-private.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">public access</span></a><span style="font-weight:400;">, enabling </span><a href="https://docs.aws.amazon.com/redshift/latest/mgmt/working-with-db-encryption.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">database encryption</span></a><span style="font-weight:400;">, and enforcing </span><a href="https://docs.aws.amazon.com/redshift/latest/mgmt/working-with-parameter-groups.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">secure connection</span></a><span style="font-weight:400;"> by default when creating a new data warehouse.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">AMEN.</span></li>
</ul>
<p><b>39:18</b> <a href="https://aws.amazon.com/blogs/aws/deepseek-r1-models-now-available-on-aws/" target="_blank" rel="noreferrer noopener"><b>DeepSeek-R1 models now available on AWS</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon is also providing you access to DeepSeek R1 models in </span><a href="https://aws.amazon.com/blogs/aws/category/artificial-intelligence/amazon-machine-learning/amazon-bedrock/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Bedrock</span></a><span style="font-weight:400;"> and </span><a href="https://aws.amazon.com/blogs/aws/category/artificial-intelligence/sagemaker/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Sagemaker AI</span></a><span style="font-weight:400;">.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">As this is a publicly available model you only pay for the infrastructure price based on the inference instance hours you select for Bedrock, Sagemaker jumpstart and Ec2. </span></li>
</ul>
<p><b>40:06</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/02/amazon-ec2-automated-recovery-microsoft-sql-server-vss/" target="_blank" rel="noreferrer noopener"><b>Amazon EC2 now supports automated recovery of Microsoft SQL Server </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2025/02/amazon-ec2-automated-recovery-microsoft-sql-server-vss/" target="_blank" rel="noreferrer noopener"><b>with VSS</b></a><b>  </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">In horrible ideas, you can now make automated recovery for MSSQL Server databases from VSS-based EBS snapshots.  Customers can use an AWS Systems Manager runbook and specify a restore point to automate recovery without stopping a running MSSQL Database. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">VSS allows application data to be backed up while applications are running. This new feature will enable customers to automate the recovery from VSS-based EBS snapshots and ensure rapid recovery of large databases within minutes. </span></li>
</ul>
<p><i><span style="font-weight:400;">40:38  Justin – “Just use SQL backup natively please.”</span></i></p>
<h2><b>GCP</b></h2>
<p><b>04:38 </b><a href="https://cloud.google.com/blog/products/compute/introducing-workload-manager-custom-rules/" target="_blank" rel="noreferrer noopener"><b>Introducing custom rules in Workload Manager: Evaluate workloads </b></a><a href="https://cloud.google.com/blog/products/compute/introducing-workload-manager-custom-rules/" target="_blank" rel="noreferrer noopener"><b>against customized best practices</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/workload-manager/docs/overview" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Workload Manager</span></a><span style="font-weight:400;"> provides a rule-based validation service for evaluating your workloads on Google cloud. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Workload Manager scans your workloads, including SAP and MSSQL to detect deviations from standards, rules and best practices to improve system quality, reliability and performance.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Now you can extend workload manager with custom rules (GA), a detective-based service that helps ensure your validations are not blocking any deployments, but that allows you to easily detect compliance issues across different architectural intents. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This can be used against projects, folders and orgs against best practices and custom standards. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">To get started you codify best practices in </span><a href="https://www.openpolicyagent.org/docs/latest/policy-language/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Rego</span></a><span style="font-weight:400;">, a declarative policy language that’s used to define rules and express policies over complex data structures, and run or schedule evaluation scans across your deployments. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Than you export the findings to bigquery dataset and visualize them using looker</span></li>
</ul>
<p><i><span style="font-weight:400;">43:44  Ryan – “I mean, I do like these types of workflows, and the reason I like them is so you can practice security without everything being in force mode. And if you’re allowing direct access to clouds, then you are allowing the users in the company to not have to through a centralized team, or an infrastructure team…and you’re going to end up with insecure configurations, because random people are clicking through defaults.”</span></i></p>
<p><b>45:22</b> <a href="https://cloud.google.com/blog/products/compute/introducing-a4-vms-powered-by-nvidia-b200-gpu-aka-blackwell/" target="_blank" rel="noreferrer noopener"><b>Blackwell is here — new A4 VMs powered by NVIDIA B200 now in preview</b></a></p>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is bringin the NVIDIA Blackwell GPU to google cloud with the preview of the A4 VMs, powered by NVIDIA HGX B200. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The A4 VM features eight of the Blackwell GPU’s interconnected by fifth-generation NVIDIA NVLink, and offers a significant performance boost over the previous generation of A3 High VM.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Each GPU delivers 2.25 times the peak compute and 2.25 times the HBM capacity, making A4 VMs a versatile option for training and fine-tuning for a wide range of model architectures, while increasing the compute and HBM capacity.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The A4 VM integrates Google’s infrastructure with Blackwell GPUs to bring the best cloud experience for Google Cloud customers, from scale and performance, to ease-of-use and cost optimizations</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Enhanced Networking with the </span><a href="https://cloud.google.com/blog/products/compute/trillium-sixth-generation-tpu-is-in-preview?e=48754805" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Titanium ML network adapte</span></a><span style="font-weight:400;">r, optimized to deliver a secure, high-performance cloud experience for AI workloads, building on NVIDIA connectX-7 NICs.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google K8 Engine with support of up to 65k nodes per cluster. A4 VMs are natively integrated into GKE.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Vertex AI will support the A4</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Pytorch and Cuda, work closely with NVIDIA to optimize JAX and XLA</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Hypercompute Cluster with tight GKE and SLURM integration</span></li>
</ul>
</li>
</ul>
</li>
<li style="font-weight:400;"><i><span style="font-weight:400;">“We’re excited to leverage A4, powered by NVIDIA’s Blackwell B200 GPUs. Running our workload on cutting edge AI Infrastructure is essential for enabling low-latency trading decisions and enhancing our models across markets. We’re looking forward to leveraging the innovations in Hypercompute Cluster to accelerate deployment of training our latest models that deliver quant-based algorithmic trading.” –</span></i><span style="font-weight:400;"> Gerard Bernabeu Altayo, Compute Lead, Hudson River Trading</span></li>
</ul>
<p><i><span style="font-weight:400;">47:37  Jonathan – “</span></i><i><span style="font-weight:400;">Yeah, the NVLink is really quite the performance booster here because consumer cards use PCIe very low bandwidth, relatively speaking. So I think that the real advantage in using these clusters that they put together is just because of the massive bandwidth between nodes in the cluster. And the real bottleneck in clustering GPUs is communication between nodes, which is why DeepSeek did some cool stuff with what they were doing in building their model. What they did is they, instead of using CUDA, they used low-level language, PTX, and they reassigned some of the cores to compress data and to work on optimizing network traffic between nodes, and that’s probably one of reasons they were able to do what they did with such kind of strange resources.”</span></i></p>
<p><b>49:55</b> <a href="https://cloud.google.com/blog/products/containers-kubernetes/introducing-kube-resource-orchestrator/" target="_blank" rel="noreferrer noopener"><b>Simplify the developer experience on Kubernetes with KRO</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Hell has NOT frozen over. (As far as we know.) </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google, AWs and Azure have been collaborating on </span><a href="https://github.com/kro-run" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Kube Resource Orchestrator</span></a><span style="font-weight:400;"> (Kro).  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Kro introduces a K8 native, cloud agnostic way to define groupings of K8 resources. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With Kro, you can group your applications and their dependencies as a single resource that can be easily consumed by end users. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Before Kro you had to invest in custom solutions such as building custom K8 controllers or using packaging tools like Helm, which can’t leverage the benefits of K8 CRDs.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">These approaches are costly to create, maintain, and troubleshoot and complex for non-k8 experts to consume. This is a problem many K8 users face. Rather than developing vendor-specific solutions, they have partnered with Amazon and Microsoft to make K8 APis simpler for all k8 users.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Platform and devops teams want to define standards for how application teams deploy their workloads, and they want to use K8 as the platform for creating and enforcing these standards.  Each service needs to handle everything from resource creation to security configurations, monitoring setup, defining the end-user interface and more. There are client-side templating tools that can help like Helm or Kustomize, but K8 lacked a native way for platform teams to create custom groupings of resources for consumption by end users</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Kro is a k8 native framework that lets you create a reusable API to deploy multiple resources as single units.  This can be used to encapsulate K8 deployments and dependencies into a single API that your application teams can use, even if they aren’t familiar with K8. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;"> You can use Kro to create custom end-user interfaces that expose only the parameters an end-user should see, hiding the complexity of K8 and cloud-provider APIs. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">See the article for some example use cases.</span></li>
</ul>
<p><i><span style="font-weight:400;">52:59  Ryan – “</span></i><i><span style="font-weight:400;">I can see this being easier to support within a business. But it still has all the problems that I don’t like about operators and custom resources, trying to make this the one the API for everything – on a very complex system.”</span></i></p>
<p><b>54:20</b> <a href="https://cloud.google.com/blog/products/databases/spanner-graph-is-now-ga/" target="_blank" rel="noreferrer noopener"><b>Announcing the general availability of Spanner Graph</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/products/spanner/graph" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Spanner Graph</span></a><span style="font-weight:400;"> is now Generally Available. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Graph analysis helps reveal hidden connections in data and when combined with techniques like full-text search and vector search, enables you to deliver a new class of AI-enabled application experiences.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The traditional approaches based on niche tools resulted in data silos, operational overhead and scalability challenges. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It really is the tool looking for a solution. </span></li>
</ul>
<p><b>55:58</b> <a href="https://cloud.google.com/release-notes#February_03_2025" target="_blank" rel="noreferrer noopener"><b>AlloyDB Omni K8 Operator 1.3 GA</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">This new </span><a href="https://cloud.google.com/alloydb/omni/docs/deploy-kubernetes" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">operator</span></a><span style="font-weight:400;"> has several nice features:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">K8 1.30 supports connection pooling</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can put</span><a href="https://cloud.google.com/alloydb/omni/docs/kubernetes-maintenance-mode" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;"> databases in maintenance mode</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can </span><a href="https://cloud.google.com/alloydb/omni/docs/create-replication-slots" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">create replication slots and users</span></a><span style="font-weight:400;"> for logical replication via the operator AP. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Release of K8 operator adds support for </span><a href="https://github.com/kubernetes/kube-state-metrics" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">kube-state-metrics</span></a><span style="font-weight:400;"> so that you can use Prometheus or a prometheus-compatible scraper to consume and display custom metrics</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can create a new database cluster, this version of the K8 operator creates RO and RW load balancers concurrently, which reduces the time that it takes for the database cluster to be ready</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Configurable log rotation has a default retention of seven days, and each archived file is individually compressed using Gzip.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Various bug fixes and performance improvements. </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">56:54  Justin – “This is nice, if you’re using Omni, and you want to do Kubernetes things.”</span></i></p>
<h2><b>Azure </b></h2>
<p><b>58:15</b> <a href="https://opensource.microsoft.com/blog/2025/01/23/documentdb-open-source-announcement/" target="_blank" rel="noreferrer noopener"><b>DocumentDB: Open-Source Announcement</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft is announcing the official release of </span><a href="https://github.com/microsoft/documentdb" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">DocumentDB</span></a><span style="font-weight:400;"> — an open-source document database platform and the engine powering the vCore-based Azure Cosmos DB for MongoDB, built on PostgreSQL</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The project uses the permissive MIT Licenses.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">There are two components to the project:</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Pg_document_db_core – A custom PostgreSQL extension optimizing for BSON data type support in Postgres</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Pg_documentdb_api- the data plane for implementing CRUD operations, query functionality, and index management. </span></li>
</ul>
<p><i><span style="font-weight:400;">58:50  Jonathan – “Why would they call it the same name as Amazon’s DB?”</span></i></p>
<p><b>59:50</b> <a href="https://devblogs.microsoft.com/visualstudio/announcing-a-free-github-copilot-for-visual-studio/" target="_blank" rel="noreferrer noopener"><b>Announcing a free GitHub Copilot for Visual Studio</b></a> <span style="font-weight:400;">  </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft has released a free plan for </span><a href="https://github.com/features/copilot" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Github Copilot</span></a><span style="font-weight:400;">, available for everyone using </span><a href="https://visualstudio.microsoft.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Visual Studio</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With the free version you get:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">2000 code completions per month</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">50 chat messages per month</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Access to the latest AI models with Anthropic Claude 3.5 Sonnet and Open AI’s GPT-4o.</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Thanks for not charging us twice, we guess? </span></li>
</ul>
<p><b>1:02:15 </b><a href="https://azure.microsoft.com/en-us/blog/announcing-the-availability-of-the-o3-mini-reasoning-model-in-microsoft-azure-openai-service/" target="_blank" rel="noreferrer noopener"><b>Announcing the availability of the o3-mini reasoning model in Microsoft </b></a><a href="https://azure.microsoft.com/en-us/blog/announcing-the-availability-of-the-o3-mini-reasoning-model-in-microsoft-azure-openai-service/" target="_blank" rel="noreferrer noopener"><b>Azure OpenAI Service</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">We are pleased to announce that OpenAI o3-mini is now available in </span><a href="https://azure.microsoft.com/products/ai-services/openai-service/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Microsoft Azure OpenAI</span></a><span style="font-weight:400;"> service. O3-mini adds significant cost efficiencies compared to o1-mini with enhanced reasoning, with new features like reasoning effort and tools, while providing comparable or better responsiveness. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">New features of o3-mini</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Reasoning effort parameter</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Structured output </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Function and tools support</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Developer messages</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">System Message compatibility</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Continue Strength on coding, math and scientific reasoning. </span></li>
</ul>
</li>
</ul>
<p><b>1:02:46 </b><a href="https://azure.microsoft.com/en-us/blog/deepseek-r1-is-now-available-on-azure-ai-foundry-and-github/" target="_blank" rel="noreferrer noopener"><b>DeepSeek R1 is now available on Azure AI Foundry and GitHub</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Deepseek is also now available in the model catalog on Azure AI foundry and GitHub, joining a diverse portfolio of over 1,800, models including frontier, open-source, industry-specific, and task-based based AI models. </span></li>
</ul>
<p><i><span style="font-weight:400;">1:03:14  Jonathan – “</span></i><i><span style="font-weight:400;">I’m really excited about what DeepSeek’s done. And I think it’s going to have a huge effect on the rest of the AI industry. Like they’ve completely reworked how the transformers work at a fairly fundamental level. And if we don’t see other people adopting the same changes that they’ve made, I’d be really surprised.”</span></i></p>
<h2><b>Oracle</b></h2>
<p><b>1:05:57 </b><a href="https://www.oracle.com/news/announcement/oracle-and-google-cloud-expand-regional-availability-and-add-powerful-new-capabilities-to-oracle-database-at-google-cloud-2025-01-30/" target="_blank" rel="noreferrer noopener"><b>Oracle and Google Cloud Expand Regional Availability and Add Powerful </b></a><a href="https://www.oracle.com/news/announcement/oracle-and-google-cloud-expand-regional-availability-and-add-powerful-new-capabilities-to-oracle-database-at-google-cloud-2025-01-30/" target="_blank" rel="noreferrer noopener"><b>New Capabilities to Oracle Database@Google Cloud</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Oracle and Google Cloud have announced plans to expand Oracle Database@Google Cloud by adding eight new regions over the next 12 months, including locations in the U.S., Canada, Japan, India, Brazil. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In addition they are releasing new capabilities including:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Cross-Region Disaster Recovery for Oracle Autonomous Database Serverless. Cool!</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Single-Node VM Clusters for Oracle Exadata Database Service on Dedicated Infrastructure. </span></li>
</ul>
</li>
</ul>
<h2><span style="font-weight:400;">Closing</span></h2>
<p><span style="font-weight:400;">And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</span></p>
]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/1974149/c1e-4919c4qzk4cmzon6-qdw0jq7nhv33-ucukz2.mp3" length="78877695"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 291 of The Cloud Pod – where the forecast is always cloudy! Justin, Jonathan, and Ryan have battled through the various plagues and have come together to bring you all the latest in cloud news, including Kro, DeepSeek, and CoPilot. 
Titles we almost went with this week:

In Shocking News China Steals US IP
️The Cloud Pod is Now Supported in Gov Cloud 
Microsoft Goes Open Source No SQL… and Hell Hasn’t Frozen Over
Zombie Buckets Receive How Much Traffic?!?
️AWS, GCP and Azure eat KRO
‍✈️Github Copilot for Free, so You Can Win at Coding Interviews
Customized Best Practices… I don’t think you know what best practices are
☁️TheCloudPod Leverages Deep Understanding to Make a Nuanced Decision on adopting Copilot

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
Follow Up
01:23 Is DeepSeek really sending data to China? Let’s decode 

One of the early concerns about DeepSeek was its privacy implications, starting with their privacy policy. 
Allegations are significant but reality is if the open source model is hosted locally or orchestrated via GPUs in the US the data does not go to China.
But if you’re using the DeepSeek app it clearly states in the privacy policy that the data will be stored in China. Data hosted on Chinese servers can be seized by the Government at any time. 
Maybe rethink using the native DeepSeek websites and mobile apps and just host them locally in LM studio. 

02:21  Jonathan – “They’re collecting some weird data. I get collecting conversational data, because that is the business they’re in, but they’re also doing some weird stuff, like they fingerprint users by looking at the patterns of the way that they type. Not just what they type, but how they type, like the timing between hitting different letters – things like that.”
8:06 OpenAI Believes DeepSeek Was Developed Using OpenAI Models 

Listener Note: paywall article 
]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/1974149/c1a-k5d5-rkznw63mao8k-o2qkb2.jpg"></itunes:image>
                                                                            <itunes:duration>01:05:44</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                            </item>
                    <item>
                <title>
                    <![CDATA[290: Open AI to Operator: There is a DeepSeek Outside the Door]]>
                </title>
                <pubDate>Thu, 06 Feb 2025 17:38:59 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/1968352</guid>
                                    <link>https://tcpfm.castos.com/episodes/290-deepseek-open-ai-operator</link>
                                <description>
                                            <![CDATA[<p><span style="font-weight:400;">Welcome to episode 290 of The Cloud Pod – where the forecast is always cloudy! It’s a full house this week – and a good thing too, since there’s a lot of news! Justin, Jonathan, Ryan, and Matthew are all in the house to bring you news on DeepSeek, OpenVox, CloudWatch, and more. </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">☁️The cloud pod wonders if azure is still hung over from new years</span></li>
<li><span style="font-weight:400;">Stratoshark sends the Cloud pod to the stratosphere</span></li>
<li><span style="font-weight:400;">Cutting-Edge Chinese “Reasoning” Model Rivals OpenAI… and it’s FREE?!</span></li>
<li><span style="font-weight:400;">Wireshark turns 27, Cloud Pod Hosts feel old</span></li>
<li><span style="font-weight:400;">☠️Operator: DeepSeek is here to kill OpenAI</span></li>
<li><span style="font-weight:400;">Time for a deepthink on buying all that Nvidia stock</span></li>
<li><span style="font-weight:400;">AWS Token Service finally goes cloud native</span></li>
<li><span style="font-weight:400;">The CloudPod wonders if OpenAI’s Operator can order its own $200 subscription</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>AI IS Going Great – Or How ML Makes All Its Money</b></h2>
<p><b>01:29 </b><a href="https://www.digitalocean.com/blog/introducing-generative-ai-platform" target="_blank" rel="noreferrer noopener"><b>Introducing the GenAI Platform: Simplifying AI Development for All</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">If you’re struggling to find that AI GPU capacity, </span><a href="https://www.digitalocean.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Digital Ocean</span></a><span style="font-weight:400;"> is pleased to announce their </span><a href="https://www.digitalocean.com/products/gen-ai" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">DigitalOcean GenAI Platform</span></a><span style="font-weight:400;"> is now available to everyone.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The platform aims to democratize AI development, empowering everyone – from solo developers to large teams – to leverage the transformative potential of generative AI. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">On the Gen AI platform you can:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Build Scalable AI Agents</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Seamlessly integrate with workflows</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Leverage guardrails</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Optimize Efficiency. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Some of the use cases they are highlighting are chatbots, e-commerce assistance, support automation, business insights, AI-Driven CRMs, Personalized Learning and interactive tools. </span></li>
</ul>
<p><i><span style="font-weight:400;">02:23  Jonathan – “</span></i><i><span style="font-weight:400;">Inference cost is really the big driver there. So once you once you build something that’s that’s done, but it’s nice to see somebody focusing on delivering it as a service rather than, you know, a $50 an hour compute for training models. This is right where they need to be.”</span></i></p>
<p><b>04:21</b> <a href="https://openai.com/index/introducing-operator/" target="_blank" rel="noreferrer noopener"><b>OpenAI: Introducing Operator</b></a></p>
<ul>
<li style="font-weight:400;"></li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 290 of The Cloud Pod – where the forecast is always cloudy! It’s a full house this week – and a good thing too, since there’s a lot of news! Justin, Jonathan, Ryan, and Matthew are all in the house to bring you news on DeepSeek, OpenVox, CloudWatch, and more. 
Titles we almost went with this week:

☁️The cloud pod wonders if azure is still hung over from new years
Stratoshark sends the Cloud pod to the stratosphere
Cutting-Edge Chinese “Reasoning” Model Rivals OpenAI… and it’s FREE?!
Wireshark turns 27, Cloud Pod Hosts feel old
☠️Operator: DeepSeek is here to kill OpenAI
Time for a deepthink on buying all that Nvidia stock
AWS Token Service finally goes cloud native
The CloudPod wonders if OpenAI’s Operator can order its own $200 subscription

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
AI IS Going Great – Or How ML Makes All Its Money
01:29 Introducing the GenAI Platform: Simplifying AI Development for All 

If you’re struggling to find that AI GPU capacity, Digital Ocean is pleased to announce their DigitalOcean GenAI Platform is now available to everyone.
The platform aims to democratize AI development, empowering everyone – from solo developers to large teams – to leverage the transformative potential of generative AI. 
On the Gen AI platform you can:

Build Scalable AI Agents
Seamlessly integrate with workflows
Leverage guardrails
Optimize Efficiency. 


Some of the use cases they are highlighting are chatbots, e-commerce assistance, support automation, business insights, AI-Driven CRMs, Personalized Learning and interactive tools. 

02:23  Jonathan – “Inference cost is really the big driver there. So once you once you build something that’s that’s done, but it’s nice to see somebody focusing on delivering it as a service rather than, you know, a $50 an hour compute for training models. This is right where they need to be.”
04:21 OpenAI: Introducing Operator

]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[290: Open AI to Operator: There is a DeepSeek Outside the Door]]>
                </itunes:title>
                                    <itunes:episode>290</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p><span style="font-weight:400;">Welcome to episode 290 of The Cloud Pod – where the forecast is always cloudy! It’s a full house this week – and a good thing too, since there’s a lot of news! Justin, Jonathan, Ryan, and Matthew are all in the house to bring you news on DeepSeek, OpenVox, CloudWatch, and more. </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">☁️The cloud pod wonders if azure is still hung over from new years</span></li>
<li><span style="font-weight:400;">Stratoshark sends the Cloud pod to the stratosphere</span></li>
<li><span style="font-weight:400;">Cutting-Edge Chinese “Reasoning” Model Rivals OpenAI… and it’s FREE?!</span></li>
<li><span style="font-weight:400;">Wireshark turns 27, Cloud Pod Hosts feel old</span></li>
<li><span style="font-weight:400;">☠️Operator: DeepSeek is here to kill OpenAI</span></li>
<li><span style="font-weight:400;">Time for a deepthink on buying all that Nvidia stock</span></li>
<li><span style="font-weight:400;">AWS Token Service finally goes cloud native</span></li>
<li><span style="font-weight:400;">The CloudPod wonders if OpenAI’s Operator can order its own $200 subscription</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>AI IS Going Great – Or How ML Makes All Its Money</b></h2>
<p><b>01:29 </b><a href="https://www.digitalocean.com/blog/introducing-generative-ai-platform" target="_blank" rel="noreferrer noopener"><b>Introducing the GenAI Platform: Simplifying AI Development for All</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">If you’re struggling to find that AI GPU capacity, </span><a href="https://www.digitalocean.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Digital Ocean</span></a><span style="font-weight:400;"> is pleased to announce their </span><a href="https://www.digitalocean.com/products/gen-ai" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">DigitalOcean GenAI Platform</span></a><span style="font-weight:400;"> is now available to everyone.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The platform aims to democratize AI development, empowering everyone – from solo developers to large teams – to leverage the transformative potential of generative AI. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">On the Gen AI platform you can:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Build Scalable AI Agents</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Seamlessly integrate with workflows</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Leverage guardrails</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Optimize Efficiency. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Some of the use cases they are highlighting are chatbots, e-commerce assistance, support automation, business insights, AI-Driven CRMs, Personalized Learning and interactive tools. </span></li>
</ul>
<p><i><span style="font-weight:400;">02:23  Jonathan – “</span></i><i><span style="font-weight:400;">Inference cost is really the big driver there. So once you once you build something that’s that’s done, but it’s nice to see somebody focusing on delivering it as a service rather than, you know, a $50 an hour compute for training models. This is right where they need to be.”</span></i></p>
<p><b>04:21</b> <a href="https://openai.com/index/introducing-operator/" target="_blank" rel="noreferrer noopener"><b>OpenAI: Introducing Operator</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">We have thoughts about the name of this service…</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">OpenAI is releasing the preview version of their agent that can use a web browser to perform tasks for you. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The new version is available to OpenAI pro users. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">OpenAI says it’s currently a research preview, meaning it has limitations and will evolve based on your feedback. </span></li>
<li style="font-weight:400;"><a href="https://operator.chatgpt.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Operator</span></a><span style="font-weight:400;"> can handle various browser tasks such as filling out forms, ordering groceries, and even creating memes.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The ability to use the same interfaces and tools that humans interact with on a daily basis broadens the utility of AI, helping people save time on everyday tasks while opening up a new engagement opportunity for business</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Operator is powered by a new model called </span><a href="https://openai.com/index/computer-using-agent/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Computer-Using Agent (CUA)</span></a><span style="font-weight:400;">. Combining GPT-4o’s vision capabilities with advanced reasoning through reinforcement learning, CUA is trained to interact with a GUI </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Justin was going to try it, but he forgot that the Pro plan is $200 dollars a month – so our listeners have to wait on his review of that one. </span></li>
</ul>
<p><i><span style="font-weight:400;">06:52  Jonathan – “</span></i><i><span style="font-weight:400;">I like Operator. What I really like to see though is I don’t want to have to have it open in the browser. I don’t want to watch it doing its work.”</span></i></p>
<p><b>08:09</b> <a href="https://arstechnica.com/ai/2025/01/china-is-catching-up-with-americas-best-reasoning-ai-models/" target="_blank" rel="noreferrer noopener"><b>Cutting-edge Chinese “reasoning” model rivals OpenAI o1—and it’s free to </b></a><a href="https://arstechnica.com/ai/2025/01/china-is-catching-up-with-americas-best-reasoning-ai-models/" target="_blank" rel="noreferrer noopener"><b>download</b></a><b> </b></p>
<p><a href="https://arstechnica.com/ai/2025/01/deepseek-spooks-american-tech-industry-as-it-tops-the-apple-app-store/" target="_blank" rel="noreferrer noopener"><b>DeepSeek panic triggers tech stock sell-off as Chinese AI tops App Store</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">There’s a lot of jokes here, but we’re going to keep it professional – you’re welcome or we’re sorry, depending on your maturity level. </span></li>
<li style="font-weight:400;"><a href="https://en.wikipedia.org/wiki/DeepSeek" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">DeepSeek</span></a><span style="font-weight:400;"> has turned the AI world upside down over the last week.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Last week, Chinese AI lab DeepSeek released its new </span><a href="https://github.com/deepseek-ai/DeepSeek-R1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">R1 model family</span></a><span style="font-weight:400;"> under an open </span><a href="https://huggingface.co/deepseek-ai/DeepSeek-R1-Zero/blob/main/LICENSE" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">MIT License</span></a><span style="font-weight:400;">, with its largest version containing 671 billion parameters. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The company is claiming that the model performs at the levels comparable to OpenAI’s o1 simulated reasoning model on several math and coding benchmarks.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In addition to the main deepseek-r1-main and deepseek-r1 models, they released 6 smaller distilled versions ranging from 1.6 billion to 70 billion parameters. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">These distilled models are based on existing open source architectures like </span><a href="https://qwenlm.github.io/blog/qwen/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Qwen</span></a><span style="font-weight:400;"> and </span><a href="https://www.llama.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Llama</span></a><span style="font-weight:400;">, trained using data generated from the full R1 model. The smallest version can run on a laptop, while the full model requires far more substantial computing resources. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This stunned the AI market, as most open-weight models which can often be run and fine-tuned on local hardware, have lagged behind proprietary models like </span><a href="https://openai.com/o1/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OpenAI o1</span></a><span style="font-weight:400;"> in so called reasoning-benchmarks. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Having these capabilities available in a MIT licensed model that anyone can study, modify or use commercially potentially marks a shift in what’s possible with a public model.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The stock market panicked in response, with companies like </span><a href="http://www.nvidia.com/page/home.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Nvidia</span></a><span style="font-weight:400;"> down 17% percent on Monday this week – based on the fact that DeepSeek jumped to the top of the app store free downloads, and the fact its low-cost and freely available. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The three things that have investors and researchers shocked:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">The Chinese startup that trained the model for only $6 million (reportedly 3% of the cost of training Open AI o1) as a so-called “side-project” while using less powerful NVIDIA H800 AI acceleration ships due to US export restrictions on cutting-edge GPU. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It appeared just four months after OpenAI announced o1 in September 2024. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Released them under MIT license.</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">This led investors to see that American Tech companies – which have thrived on proprietary and closed models, have “no moat,” which means that any technological lead led by cutting-edge hardware or impressive bankrolls doesn’t protect them from startup challenges. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The question is it really any good, and can they scale to continue to maintain this with limited access to future GPUs.  </span></li>
</ul>
<p><i><span style="font-weight:400;">10:57  Ryan – “</span></i><i><span style="font-weight:400;">The impact the story has had this week has been a roller coaster. Like, and I don’t know if that’s just because I’ve been busy and sort of half paying attention. And, now, it wasn’t really until we were preparing for the show that I really dove in to figure out what, what this was after seeing it. Like, you know, first it was like a Chinese app taking over the phones. I thought it was security concerns and all this stuff, especially with all the Tik Tok stuff that’s going on. And then to find out it was an AI model, I’m like, it’s just, there’s other Chinese AI models, then the impact on Nvidia stock. So it was kind of crazy to see all of this happen. And it really just proves that the AI market right now is just very volatile and very subject to change.”</span></i></p>
<h2><b>Cloud Tools</b></h2>
<p><b>20:19</b> <a href="https://www.hashicorp.com/blog/enabling-fast-safe-migration-to-hcp-terraform-with-terraform-migrate-tf-migrate" target="_blank" rel="noreferrer noopener"><b>Enabling fast, safe migration to HCP Terraform with Terraform migrate </b></a><a href="https://www.hashicorp.com/blog/enabling-fast-safe-migration-to-hcp-terraform-with-terraform-migrate-tf-migrate" target="_blank" rel="noreferrer noopener"><b>(tf-migrate)</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Migrating to </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=af6dabefc2ec73c46d7cce047c2e2fde0e8dc34e4aee3230fe5f091913d40a80JmltdHM9MTczODU0MDgwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=111326c2-38f8-6e94-3a5e-34a8396a6f52&amp;psq=HCP+terraform&amp;u=a1aHR0cHM6Ly9hcHAudGVycmFmb3JtLmlvL2FwcA&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">HCP Terraform</span></a><span style="font-weight:400;"> can be a bit of a pain, especially when it comes to factoring your state file transitions. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">When you need to migrate from CE to HCP Terraform or </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=0d011879d356a3c06b9ac886c3cdead005f7d520438b261b26e82cf565913fc8JmltdHM9MTczODU0MDgwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=111326c2-38f8-6e94-3a5e-34a8396a6f52&amp;psq=terraform+enterprise&amp;u=a1aHR0cHM6Ly9kZXZlbG9wZXIuaGFzaGljb3JwLmNvbS90ZXJyYWZvcm0vZW50ZXJwcmlzZQ&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Terraform Enterprise</span></a><span style="font-weight:400;">, state file management during that migration is the biggest challenge.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This led </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=9fb5c4811223bb7a5d4487e7f2139fc7e31b3d2c694868519658110a6c6ef676JmltdHM9MTczODU0MDgwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=111326c2-38f8-6e94-3a5e-34a8396a6f52&amp;psq=Hashicorp&amp;u=a1aHR0cHM6Ly93d3cuaGFzaGljb3JwLmNvbS8&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Hashicorp</span></a><span style="font-weight:400;"> to build </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=f794a38a969a5db36f65a822cdaedf3645b827f8b3ee03b1f70e581c5787a4d7JmltdHM9MTczODU0MDgwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=111326c2-38f8-6e94-3a5e-34a8396a6f52&amp;psq=TF+Migrate&amp;u=a1aHR0cHM6Ly9kZXZlbG9wZXIuaGFzaGljb3JwLmNvbS90ZXJyYWZvcm0vY2xvdWQtZG9jcy9taWdyYXRlL3RmLW1pZ3JhdGU&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">TF-Migrate</span></a><span style="font-weight:400;">, a utility for automating state migrations to HCP Terraform and Terraform Enterprise.  It can also be used to simplify workspace setup and supports modular refactoring. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">There are future enhancements in the works:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Integration with source code systems like </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=dc11d06d7920738324d9a52a13ea81e13ca59cd54f8de8c1aa74ddb0ccfa4d9bJmltdHM9MTczODU0MDgwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=111326c2-38f8-6e94-3a5e-34a8396a6f52&amp;psq=github&amp;u=a1aHR0cHM6Ly9naXRodWIuY29tLw&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Github</span></a><span style="font-weight:400;">, to enhance migration workflows by embedding migrations configurations directly into repositories. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Enhancing and extending the migration capabilities to support variables, modules and private registries between multiple terraform deployment options</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Improve handling of sensitive data during migrations, such as secrets or access tokens</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Further integration with Terraform Enterprise and Terraform Cloud to enhance governance by offering centralized control over migration tasks, audit trails, and policy enforcements. </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">21:44  Ryan – “</span></i><i><span style="font-weight:400;">Anytime you have state conflict due to either data recovery or just try and reconcile manual actions that have happened since or anything like that, it’s always so painful. So I’m really happy to see tools like this exist. And it’s just another example of HashiCorp building in really usable functionality, whether it’s upgrading your code to the newest Terraform version or migrating state files. I like this a whole lot.”</span></i></p>
<p><b>23:53</b> <a href="https://siliconangle.com/2025/01/22/sysdig-extends-wiresharks-legacy-stratoshark-cloud-environments/" target="_blank" rel="noreferrer noopener"><b>Sysdig extends Wireshark’s legacy with Stratoshark for cloud </b></a><a href="https://siliconangle.com/2025/01/22/sysdig-extends-wiresharks-legacy-stratoshark-cloud-environments/" target="_blank" rel="noreferrer noopener"><b>environments</b></a></p>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><a href="https://sysdig.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Sysdig Inc</span></a><span style="font-weight:400;">. announced the launch of Stratoshark, a new open source tool that extends </span><a href="https://www.wireshark.org/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Wireshark</span></a><span style="font-weight:400;"> granular network visibility into the cloud and provides users a standardized approach to cloud system analysis.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Wireshark is over 27 years old, with over 5 million daily users and has had over 160 million downloads to help you analyze network traffic and troubleshoot issues. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">However, as companies move to the cloud, analysts have lacked the same visibility as a comparable open source tool. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Stratoshark fills the gap, with features that unlock deep cloud visibility to assist in analyzing and troubleshooting cloud system calls and logs with a level of granularity and workflow familiar to longtime wireshark users. </span></li>
</ul>
</li>
</ul>
<ul>
<li style="font-weight:400;"><i><span style="font-weight:400;">“Wireshark revolutionized network analysis by democratizing packet captures, a concept that Sysdig brought to cloud-native workloads and Falco extended to cloud runtime security,” </span></i><b><i>said Gerald Combs, Stratoshark and Wireshark co-creator and Sysdig director of open-source projects.</i></b><i><span style="font-weight:400;"> “Wireshark users live by the phrase ‘pcap or it didn’t happen,’ but until now cloud packet capture hasn’t been easy or even possible. Stratoshark helps unlock this level of visibility, equipping network professionals with a familiar tool that makes system call and log analysis as accessible and transformative for the cloud as Wireshark did for network packet analysis.”</span></i></li>
<li style="font-weight:400;"><span style="font-weight:400;">Stratoshark leverages </span><a href="https://www.cncf.io/blog/2025/01/22/from-pcap-to-scap-how-falcos-libraries-registries-and-plugins-enable-cloud-native-insights/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Falco libraries</span></a><span style="font-weight:400;">, repositories and plugins to unite deep cloud visibility with familiar wireshark functionality.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Falco is an open-source runtime security tool created by Sysdig that detects and alerts on unexpected behavior in a cloud-native environment , such as K8. </span></li>
</ul>
<p><i><span style="font-weight:400;">29:30  Ryan- “</span></i><i><span style="font-weight:400;">It’s a magic trick. I’ve used Wireshark to sort out issues that people were blaming and all kinds of different things. I remember sorting through a Java heap problem because of Wireshark outputs and timing differences and a whole bunch of things. It really is something I can break out and it looks like the ancient times tool, but it really does help.”</span></i></p>
<p><b>31:02</b> <a href="https://thenewstack.io/openvox-the-community-driven-fork-of-puppet-has-arrived/?utm_source=tldrdevops" target="_blank" rel="noreferrer noopener"><b>OpenVox: The Community-Driven Fork of Puppet Has Arrived</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">The OpenSource Puppet community has forked </span><a href="https://puppet.com/?utm_content=inline+mention" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Puppet</span></a><span style="font-weight:400;"> into </span><a href="https://github.com/openvoxproject" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OpenVox</span></a><span style="font-weight:400;">.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This fork sprang from Puppet’s owner, </span><a href="https://thenewstack.io/puppets-open-source-community-plans-to-fork-the-program/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Perforce, moving Puppet’s binaries and packages</span></a><span style="font-weight:400;"> to private, hardened, and controlled locations. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In addition, community contributors would have limited access to the program, and usage beyond 25 nodes will require commercial licenses.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">These changes have been resisted by long-time Puppet users and contributors who started this fork. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Initially referred to as the OpenPuppetProject, the community, now known as </span><a href="https://voxpupuli.org/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Vox Pupuli</span></a><span style="font-weight:400;">, has settled on </span><a href="https://blog.netways.de/blog/2025/01/10/community-fork-von-puppet-soll-openvox-heissen/?fbclid=IwY2xjawH9CkRleHRuA2FlbQIxMAABHfwWN7usd9w6eHzlv8KLA0Ct48QnfeJy0ybDRoSh4D8rTiQn1B7jlpp_Rw_aem_6g2Oty_P_hlthiaFtj5wCQ" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OpenVox as the fork’s name</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They intend to continue Puppet’s work while adhering to the open source principles. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">A github repository has been set up, and discussions are ongoing regarding the project organizational structure and future direction. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The intent is this to be a </span><a href="https://voxpupuli.org/blog/2025/01/21/openvox-release/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">soft fork</span></a><span style="font-weight:400;">, with the desire to maintain downstream compatibility for as long as possible. As well as the puppet standards steering committee will include seats representing the whole community, including perforce, whether they want to join or not. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They don’t fully plan to follow puppet with plans including:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Modernizing the OpenVox codebase and ecosystem, in particular the developers plan to support current OS and Ruby versions rather than relying on fifteen-year-old unmaintained ruby gems</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Recentering and focusing on community requirements. Actual usage patterns will drive development rather than which customers have the deepest pockets</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Democratizing platform support, instead of waiting for Puppet to support the current Unbuntu Linux, community members can contribute to the projects themselves.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Maintaining an active and responsive open-source community. Ie: YES, your pull request will finally get reviewed.  </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">35:12  Jonathan – “</span></i><i><span style="font-weight:400;">I think with AI, as mature as it is and as mature as it’s getting, it’s not going to be long before you can point a set of AI agents at any product you like and say, build me this thing that does exactly the same thing as this. And by the way, work around these patterns that they have. And we’ll be able to reproduce anything very cheaply, very quickly. I think I wouldn’t want to be in SAS right now or any kind of software, to be honest.”</span></i></p>
<h2><b>AWS</b></h2>
<p><b>36:44</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/01/cloudwatch-execution-plan-capture-aurora-postgresql/" target="_blank" rel="noreferrer noopener"><b>CloudWatch provides execution plan capture for Aurora PostgreSQL</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Cloudwatch Database insights now collects the query execution plans of top sql queries running on Aurora PostgreSQL instances and stores them over time.  This feature helps you identify if a change in the query execution plan is the cause of the performance degradation or a stalled query. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Execution plans are available exclusively in the advanced mode of cloudwatch database insights. </span></li>
</ul>
<p><b>38:06</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/01/aws-client-vpn-concurrent-vpn-connections/" target="_blank" rel="noreferrer noopener"><b>AWS Client VPN announces support for concurrent VPN connections</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS is announcing the general availability of concurrent VPN connections for AWS client VPN, making your security people sad – but the people who have to do real work are going to be really happy. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This feature allows you to securely connect to multiple Client VPN connections simultaneously, enabling access to your resources across the different environments.   </span></li>
</ul>
<p><i><span style="font-weight:400;">38:19  Matthew – “</span></i><i><span style="font-weight:400;">And now we have to use Wireshark to figure out where all of our connections are going.”</span></i></p>
<p><b>40:01</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/01/new-edge-location-kingdom-saudi-arabia/" target="_blank" rel="noreferrer noopener"><b>AWS announces new edge location in the Kingdom of Saudi Arabia</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS is expanding the KSA region with Amazon </span><a href="https://aws.amazon.com/cloudfront/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">CloudFront</span></a><span style="font-weight:400;"> edge location in Jeddah. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The new AWS edge location brings the full suite of benefits provided by Amazon Cloudfront, a secure, highly distributed, and scalable CDN.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">When doing research we came across this gem:</span><i><span style="font-weight:400;"> For the Kingdom of Saudi Arabia (KSA) location, you must use location-specific URLs to access the jurisdictional Google Cloud console, as well as some methods and commands in the gcloud CLI, the Cloud Client Libraries, and the Security Command Center API. </span></i><span style="font-weight:400;">WHAT? WHY? </span></li>
</ul>
<p><b>42:23</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/01/general-availability-aws-managed-notifications/" target="_blank" rel="noreferrer noopener"><b>Announcing general availability of AWS Managed Notifications</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS is announcing the GA of AWS Managed notifications, a new feature of AWS user Notifications that enhances how customers receive and manage AWS health notifications.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Justin loves these, and would love everyone to send him some. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This feature allows you to view and modify default AWS health notifications in the console notifications center, alongside your custom notifications such as cloudwatch alarms. </span></li>
</ul>
<p><i><span style="font-weight:400;">43:09  Ryan – “I </span></i><i><span style="font-weight:400;">mean, they’ve been working towards this in a while, you know, for a long while. remember previewing something that was similar to this. The idea is that  instead of blasting the email account that you associate with your AWS account, you can tune it to specific things and, to be specific, you can have multiple targets depending on the alert, right? And that makes a lot more sense. But it still hasn’t really reconciled itself into something usable in a lot of ways. it’s, I don’t know how to get, you know, anyone to read them, you know, their database engine is, you know, two versions out of support and they need to update and, then also have the same list, you know, manage the outages that AWS might experience. so like, it’s, it’s just sort of weird in order to configure this and deal with this and it’s a strange problem that I don’t quite know the right solution to.”</span></i></p>
<p><b>47:42 </b> <a href="https://aws.amazon.com/blogs/security/announcing-upcoming-changes-to-the-aws-security-token-service-global-endpoint/" target="_blank" rel="noreferrer noopener"><b>Announcing upcoming changes to the AWS Security Token Service global </b></a><a href="https://aws.amazon.com/blogs/security/announcing-upcoming-changes-to-the-aws-security-token-service-global-endpoint/" target="_blank" rel="noreferrer noopener"><b>endpoint</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS launched </span><a href="https://docs.aws.amazon.com/STS/latest/APIReference/welcome.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">STS</span></a><span style="font-weight:400;"> in August 2011 with a single global endpoint (</span><a href="https://sts.amazonaws.com" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">https://sts.amazonaws.com</span></a><span style="font-weight:400;">), hosted in the US East Region. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">To reduce dependencies on a single region, STS launched </span><a href="https://docs.aws.amazon.com/sdkref/latest/guide/feature-sts-regionalized-endpoints.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS STS Regional endpoints</span></a><span style="font-weight:400;"> in February 2015.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">These regional endpoints allow you to use STS in the same region as your workloads, improving performance and reliability.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">However, customers and third-party tools continue to call the STS global endpoint, and as a result, these customers don’t get the benefits of the regional endpoints. To help improve resiliency and performance, they are making changes to the STS global endpoint, with no action required for you. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Today all requests to the global endpoint are processed in the US east region. Starting in a few weeks, the STS global endpoint will be automatically served in the same region as your AWS deployed workloads.  For example, if your app calls sts.amazonaws.com from the us-west region, your call will be served locally via the US west region STS service.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This will apply for all regions that are enabled by default, for opt-in regions or if you’re using STS outside of AWS they will still be handled by US_east. </span></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/cloudtrail/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">CloudTrail</span></a><span style="font-weight:400;"> logs for global STS endpoints will still be sent to the US-East region. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">CloudTrail logs will have additional metadata fields including EndpointType and awsServingRegion to clarify which endpoint and region served the request. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Requests made to STS.amazonaws.com endpoints will have a value of us-east-1 for the requested region condition key, regardless of which region served the request. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Requests handled by the STS endpoint will not share a request quota with the region STS endpoint. </span></li>
</ul>
<p><i><span style="font-weight:400;">52:009  Justin – “</span></i><i><span style="font-weight:400;">I imagine if they retire this, it breaks all of us East one forever.”</span></i></p>
<p><b>53:09</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/01/amazon-s3-metadata-generally-available/" target="_blank" rel="noreferrer noopener"><b>Amazon S3 Metadata is now generally available</b></a><span style="font-weight:400;">  </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS is announcing the GA of Amazon S3 metadata.  S3 metadata provides automated and easily queried metadata that updates in near real time, simplifying business analytics, real-time inference applications, and more.  S3 metadata supports object metadata, which includes system defined details like size and source of the object, and custom metadata, which allows you to use tags to annotate your objects with information like product SKU, transaction ID or content rating. </span></li>
</ul>
<p><i><span style="font-weight:400;">53:39  Ryan- I’ve needed this for a long time, and I’ve done some crazy work arounds. I’m glad to see they’re rolling it out there, because it is super useful.”</span></i></p>
<h2><b>GCP</b></h2>
<p><b>54:28</b> <a href="https://cloud.google.com/blog/products/data-analytics/introducing-bigquery-metastore-fully-managed-metadata-service/" target="_blank" rel="noreferrer noopener"><b>Introducing BigQuery metastore, a unified metadata service with Apache </b></a><a href="https://cloud.google.com/blog/products/data-analytics/introducing-bigquery-metastore-fully-managed-metadata-service/" target="_blank" rel="noreferrer noopener"><b>Iceberg support</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is releasing the public preview of </span><a href="https://cloud.google.com/bigquery/docs/about-bqms" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Bigquery Metastore</span></a><span style="font-weight:400;">, a fully managed unified metadata service that provides processing engine interoperability while enabling consistent data governance. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">BigQuery metastore is a highly scalable runtime metadata service that works with multiple engines, for example, BigQuery, </span><a href="https://spark.apache.org/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Apache Spark</span></a><span style="font-weight:400;">, </span><a href="https://hive.apache.org/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Hive</span></a><span style="font-weight:400;"> and </span><a href="https://flink.apache.org/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Flink</span></a><span style="font-weight:400;"> and supports the </span><a href="https://iceberg.apache.org/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Apache Iceberg</span></a><span style="font-weight:400;"> table format. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This allows analytics engines to query one copy of the data with a single schema, whether the data is stored in BigQuery storage tables, BigQuery tables for Apache Iceberg, or BigLake External tables. </span></li>
</ul>
<p><b>54:48</b> <a href="https://cloud.google.com/blog/products/devops-sre/new-cloud-deploy-features-for-automated-deployments/" target="_blank" rel="noreferrer noopener"><b>Safer automated deployments with new Cloud Deploy features</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><a href="http://cloud.google.com/deploy" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Cloud Deploy</span></a><span style="font-weight:400;"> is getting several new features this week, but all of these are in </span><a href="https://cloud.google.com/products?hl=en#product-launch-stages" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">preview</span></a><span style="font-weight:400;">, so don’t rip out your current CD solutions yet.</span></li>
<li style="font-weight:400;"><a href="https://cloud.google.com/deploy/docs/automation-rules#repairrollout_rule" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Repair Rollouts</span></a><span style="font-weight:400;">, lets you retry failed deployments or automatically roll back to a previously successful release when an error occurs. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This can come in any phase of the deployment from a sql migration, a misconfiguration detected when talking to a GKE cluster or as part of a deployment verification step. </span></li>
<li style="font-weight:400;"><a href="https://cloud.google.com/deploy/docs/deploy-app-policy" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Deploy policies</span></a><span style="font-weight:400;"> limit what the automation or users can do. Initially, their launching time-windows policy, which can, for example, inhibit deployments during evenings, weekends, or during important events. While an on-caller with the policy overrider role could “break glass” to get around the policies, automated deployments won’t be able to trigger during the middle of a big demo</span></li>
<li style="font-weight:400;"><a href="https://cloud.google.com/deploy/docs/automation-rules#timedpromote_rule" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Time promotions</span></a><span style="font-weight:400;">, after a release is successfully rolled out, you may want to automatically deploy it to the next environment. Previous auto-promote features let you promote a release after a specified duration, for example moving it into prod 12 hours after it went to staging. But often you want promotions to happen on a schedule, not based on a delay.  </span></li>
</ul>
<p><i><span style="font-weight:400;">56:56  Matthew – “</span></i><i><span style="font-weight:400;">I miss a good code deploy cloud deploy tool. That’s all I have to say here.”</span></i></p>
<p><b>59:53</b> <a href="https://cloud.google.com/blog/products/ai-machine-learning/introducing-agent-evaluation-in-vertex-ai-gen-ai-evaluation-service/" target="_blank" rel="noreferrer noopener"><b>Introducing agent evaluation in Vertex AI Gen AI evaluation service</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is announcing </span><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/models/evaluation-overview" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Vertex AI Gen AI evaluation service</span></a><span style="font-weight:400;"> in preview. This new feature empowers developers to rigorously assess and understand their AI agents. It includes a powerful set of evaluation metrics specifically designed for agents built with different frameworks, and provides native agent inference capabilities to streamline the evaluation process. </span></li>
</ul>
<p><i><span style="font-weight:400;">1:00:58 Justin – “I don’t know how it works, I just know that’s what they’re doing.”</span></i></p>
<p><b>1:02:18 </b><a href="https://cloud.google.com/blog/products/compute/announcing-smaller-machine-types-for-a3-high-vms/" target="_blank" rel="noreferrer noopener"><b>Announcing smaller machine types for A3 High VMs</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">You can now get A3 High VM powered by Nvidia H100 80gb GPUs in multiple machine types including 1, 2, 4 and 8 GPU options. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">As well as support for Spot market pricing as well as integration into vertex.</span></li>
</ul>
<h2><b>Off Topic, But Interesting,,,</b></h2>
<p><b>1:04:38 </b><a href="https://cloud.google.com/blog/products/chrome-enterprise/new-year-new-os-supporting-your-business-with-chromeos-flex/" target="_blank" rel="noreferrer noopener"><b>New Year, New OS. Supporting your business with ChromeOS Flex</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">If you have some old laptops or computers hanging around, you can now deploy a no-cost, easy to deploy solution to breathe new life into them.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With just a USB stick, you can install </span><a href="https://chromeos.google/products/chromeos-flex/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">ChromeOS Flex</span></a><span style="font-weight:400;"> and transform aging laptops, kiosks and more into fast, secure and modern devices.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google says it’s the perfect solution for businesses hoping to refresh devices, improve security, and embrace sustainability. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Going into 2025 they’ve certified over 600 devices to work effortlessly with Chrome Flex.</span></li>
</ul>
<p><i><span style="font-weight:400;">1:06:15  Jonathan- “</span></i><i><span style="font-weight:400;">I like the idea of what they’re doing. I think if it saves a bunch of stuff going in a landfill or something and brings some new life into things for a few more years, that’s great. Especially as Windows 11 is only supporting newer CPUs and TPMv2 and things like that. It’s super annoying that the OS vendor would do that.”</span></i></p>
<h2><span style="font-weight:400;">Closing</span></h2>
<p><span style="font-weight:400;">And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</span></p>
]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/1968352/c1e-8m9mb9jzodhx61og-1p4pk1gzcgmz-xinuyb.mp3" length="84230185"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 290 of The Cloud Pod – where the forecast is always cloudy! It’s a full house this week – and a good thing too, since there’s a lot of news! Justin, Jonathan, Ryan, and Matthew are all in the house to bring you news on DeepSeek, OpenVox, CloudWatch, and more. 
Titles we almost went with this week:

☁️The cloud pod wonders if azure is still hung over from new years
Stratoshark sends the Cloud pod to the stratosphere
Cutting-Edge Chinese “Reasoning” Model Rivals OpenAI… and it’s FREE?!
Wireshark turns 27, Cloud Pod Hosts feel old
☠️Operator: DeepSeek is here to kill OpenAI
Time for a deepthink on buying all that Nvidia stock
AWS Token Service finally goes cloud native
The CloudPod wonders if OpenAI’s Operator can order its own $200 subscription

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
AI IS Going Great – Or How ML Makes All Its Money
01:29 Introducing the GenAI Platform: Simplifying AI Development for All 

If you’re struggling to find that AI GPU capacity, Digital Ocean is pleased to announce their DigitalOcean GenAI Platform is now available to everyone.
The platform aims to democratize AI development, empowering everyone – from solo developers to large teams – to leverage the transformative potential of generative AI. 
On the Gen AI platform you can:

Build Scalable AI Agents
Seamlessly integrate with workflows
Leverage guardrails
Optimize Efficiency. 


Some of the use cases they are highlighting are chatbots, e-commerce assistance, support automation, business insights, AI-Driven CRMs, Personalized Learning and interactive tools. 

02:23  Jonathan – “Inference cost is really the big driver there. So once you once you build something that’s that’s done, but it’s nice to see somebody focusing on delivering it as a service rather than, you know, a $50 an hour compute for training models. This is right where they need to be.”
04:21 OpenAI: Introducing Operator

]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/1968352/c1a-k5d5-5zxqk5pnukv-m1bv7f.jpg"></itunes:image>
                                                                            <itunes:duration>01:10:12</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                            </item>
                    <item>
                <title>
                    <![CDATA[289: DORA The Explorer… Of EU Regulations]]>
                </title>
                <pubDate>Fri, 31 Jan 2025 16:07:10 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/1955248</guid>
                                    <link>https://tcpfm.castos.com/episodes/289-dora-the-explorer-of-eu-regulations</link>
                                <description>
                                            <![CDATA[<p><span style="font-weight:400;">Welcome to episode 289 of The Cloud Pod – where the forecast is always cloudy! Justin, Ryan, and Matt are here this week to bring you a riveting podcast on EU regulations! Are you asleep yet? No? Ok great. We promise it will be a good show – despite the title. </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">Stargate: We’re not saying its Aliens, but its $500 Billion</span></li>
<li><span style="font-weight:400;">️AWS: Now with extra sessions</span></li>
<li><span style="font-weight:400;">EC2 Flex: Bigger, Badder and Probably still expensive</span></li>
<li><span style="font-weight:400;">SNS FIFO: So fast, it’ll give you whiplash</span></li>
<li><span style="font-weight:400;">‍⚖️Azure: Now with added Legalese (Thanks, EU)</span></li>
<li><span style="font-weight:400;">OpenAI’s Stargate: From Chatbots to Interdimensional Travel (maybe)</span></li>
<li><span style="font-weight:400;">☢️GCP’s Biochar Initiative: Turning Waste into… Well, Less Waste (hopefully)</span></li>
<li><span style="font-weight:400;">️AWS Console Multiple Sessions: So you can prove you dropped those databases from multiple accounts</span></li>
<li><span style="font-weight:400;">☁️Amazon still adds new features to SNS and the cloud pod is impressed</span></li>
<li><span style="font-weight:400;">☠️AWS tries to kill chrome profiles</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>AI IS Going Great – Or How ML Makes All Its Money</b></h2>
<p><b>01:47 </b><a href="https://openai.com/index/announcing-the-stargate-project/" target="_blank" rel="noreferrer noopener"><b>Announcing The Stargate Project</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Open AI</span></a><span style="font-weight:400;"> announced a joint investment of $500 billion dollars over the next four years building new AI infrastructure for OpenAI in the US, with the intent to deploy $100B immediately.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This infrastructure will secure American leadership in AI, create hundreds of thousands of American jobs, and generate massive economic benefits for the entire world. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The initial equity funders in stargate are </span><a href="https://group.softbank/en/news/press/20250122" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">SoftBank</span></a><span style="font-weight:400;">, OpenAI, </span><a href="https://www.oracle.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Oracle</span></a><span style="font-weight:400;"> and </span><a href="https://www.mgx.ae/en" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">MGX</span></a><span style="font-weight:400;">.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Softbank and OpenAI are the lead partners for Stargate, with Softbank having financial responsibility, and OpenAI having operational responsibility. </span></li>
<li style="font-weight:400;"><a href="https://www.arm.com/markets/artificial-intelligence" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Arm</span></a><span style="font-weight:400;">, </span><a href="https://www.microsoft.com/en-us/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Microsoft</span></a><span style="font-weight:400;">, </span><a href="http://www.nvidia.com/page/home.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Nvidia</span></a></li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 289 of The Cloud Pod – where the forecast is always cloudy! Justin, Ryan, and Matt are here this week to bring you a riveting podcast on EU regulations! Are you asleep yet? No? Ok great. We promise it will be a good show – despite the title. 
Titles we almost went with this week:

Stargate: We’re not saying its Aliens, but its $500 Billion
️AWS: Now with extra sessions
EC2 Flex: Bigger, Badder and Probably still expensive
SNS FIFO: So fast, it’ll give you whiplash
‍⚖️Azure: Now with added Legalese (Thanks, EU)
OpenAI’s Stargate: From Chatbots to Interdimensional Travel (maybe)
☢️GCP’s Biochar Initiative: Turning Waste into… Well, Less Waste (hopefully)
️AWS Console Multiple Sessions: So you can prove you dropped those databases from multiple accounts
☁️Amazon still adds new features to SNS and the cloud pod is impressed
☠️AWS tries to kill chrome profiles

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
AI IS Going Great – Or How ML Makes All Its Money
01:47 Announcing The Stargate Project

Open AI announced a joint investment of $500 billion dollars over the next four years building new AI infrastructure for OpenAI in the US, with the intent to deploy $100B immediately.
This infrastructure will secure American leadership in AI, create hundreds of thousands of American jobs, and generate massive economic benefits for the entire world. 
The initial equity funders in stargate are SoftBank, OpenAI, Oracle and MGX.  
Softbank and OpenAI are the lead partners for Stargate, with Softbank having financial responsibility, and OpenAI having operational responsibility. 
Arm, Microsoft, Nvidia]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[289: DORA The Explorer… Of EU Regulations]]>
                </itunes:title>
                                    <itunes:episode>289</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p><span style="font-weight:400;">Welcome to episode 289 of The Cloud Pod – where the forecast is always cloudy! Justin, Ryan, and Matt are here this week to bring you a riveting podcast on EU regulations! Are you asleep yet? No? Ok great. We promise it will be a good show – despite the title. </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">Stargate: We’re not saying its Aliens, but its $500 Billion</span></li>
<li><span style="font-weight:400;">️AWS: Now with extra sessions</span></li>
<li><span style="font-weight:400;">EC2 Flex: Bigger, Badder and Probably still expensive</span></li>
<li><span style="font-weight:400;">SNS FIFO: So fast, it’ll give you whiplash</span></li>
<li><span style="font-weight:400;">‍⚖️Azure: Now with added Legalese (Thanks, EU)</span></li>
<li><span style="font-weight:400;">OpenAI’s Stargate: From Chatbots to Interdimensional Travel (maybe)</span></li>
<li><span style="font-weight:400;">☢️GCP’s Biochar Initiative: Turning Waste into… Well, Less Waste (hopefully)</span></li>
<li><span style="font-weight:400;">️AWS Console Multiple Sessions: So you can prove you dropped those databases from multiple accounts</span></li>
<li><span style="font-weight:400;">☁️Amazon still adds new features to SNS and the cloud pod is impressed</span></li>
<li><span style="font-weight:400;">☠️AWS tries to kill chrome profiles</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>AI IS Going Great – Or How ML Makes All Its Money</b></h2>
<p><b>01:47 </b><a href="https://openai.com/index/announcing-the-stargate-project/" target="_blank" rel="noreferrer noopener"><b>Announcing The Stargate Project</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://openai.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Open AI</span></a><span style="font-weight:400;"> announced a joint investment of $500 billion dollars over the next four years building new AI infrastructure for OpenAI in the US, with the intent to deploy $100B immediately.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This infrastructure will secure American leadership in AI, create hundreds of thousands of American jobs, and generate massive economic benefits for the entire world. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The initial equity funders in stargate are </span><a href="https://group.softbank/en/news/press/20250122" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">SoftBank</span></a><span style="font-weight:400;">, OpenAI, </span><a href="https://www.oracle.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Oracle</span></a><span style="font-weight:400;"> and </span><a href="https://www.mgx.ae/en" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">MGX</span></a><span style="font-weight:400;">.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Softbank and OpenAI are the lead partners for Stargate, with Softbank having financial responsibility, and OpenAI having operational responsibility. </span></li>
<li style="font-weight:400;"><a href="https://www.arm.com/markets/artificial-intelligence" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Arm</span></a><span style="font-weight:400;">, </span><a href="https://www.microsoft.com/en-us/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Microsoft</span></a><span style="font-weight:400;">, </span><a href="http://www.nvidia.com/page/home.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Nvidia</span></a><span style="font-weight:400;">, Oracle and OpenAI are the key initial technology partners. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The buildout is currently underway starting in Texas, and they are evaluating potential sites across the country for more campuses as they finalize definitive agreements. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">As part of Stargate, Oracle, Nvidia and OpenAI will closely collaborate to build and operate this computing system. This builds on a deep collaboration between OpenAI and NVIDIA going back to 2016, and a newer partnership between OpenAI and Oracle. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This also builds on the existing OpenAI partnership with Microsoft. OpenAI will continue to increase its consumption of Azure as OpenAI continues its work with Microsoft with this additional computer to train leading models and deliver great products and services. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">“All of us look forward to continuing to build and develop AI—and in particular AGI—for the benefit of all of humanity.” This quote TOTALLY didn’t terrify us…</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Our conversations back in December about OpenAI trying to figure out their ownership model makes a lot more sense now. </span></li>
</ul>
<p><i><span style="font-weight:400;">07:22  Justin – “…</span></i><i><span style="font-weight:400;">it’s interesting that SoftBank is investing so much money into it considering, you know, the trade issues with China and SoftBank, you know, being mostly Chinese owned and invested in. Yeah. It’s one of the things about SoftBank that’s interesting as well as I didn’t think their funds had done that well after crypto kind of blew up on them in pretty spectacular ways, although it’s back apparently. So yeah, it’s interesting, you know, post inauguration day.”</span></i></p>
<h2><b>AWS</b></h2>
<p><b>06:11</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/01/aws-management-console-simultaneous-sign-in-multiple-accounts/" target="_blank" rel="noreferrer noopener"><b>The AWS Management Console now supports simultaneous sign-in for </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2025/01/aws-management-console-simultaneous-sign-in-multiple-accounts/" target="_blank" rel="noreferrer noopener"><b>multiple AWS accounts</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS is announcing multi-session support, which enables AWS customers to access multiple AWS accounts simultaneously in the AWS Console. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can sign in to up to 5 sessions in a single browser</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This can be a combination of root, IAM or federated roles in different accounts or the same account. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">WOW! This is a huge improvement and we’re REALLY excited about this. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Tab junkies, rejoice! </span></li>
</ul>
<p><i><span style="font-weight:400;">07:05  Ryan – “</span></i><i><span style="font-weight:400;">This is the biggest thing they’ve ever announced ever.”</span></i></p>
<p><b>15:27</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/01/amazon-ec2-flex-instances-new-larger-sizes/" target="_blank" rel="noreferrer noopener"><b>Introducing new larger sizes on Amazon EC2 Flex instances</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://duckduckgo.com/y.js?ad_domain=amazon.com&amp;ad_provider=bingv7aa&amp;ad_type=txad&amp;click_metadata=RSZ5ubgasDP7G4o9boUzAd2y4gkkFKb2jlbLX7I4IbSpkT0P4CWedStJKXJAQ53wwTrQ4dRSyI63ofG-q4UwFLFGYlDr9EWfZ5bG43p6WDSjBfMKP4rE3GUaurHQTXjU.-lsbjuQQcNbPCyzQX4W_rQ&amp;eddgt=mBAMopHqY3hoyg3FFGTmlw%3D%3D&amp;rut=eb97557b5eae00446ea638d1e741e0f83da9a5e3a7a9afb37424c1ef6c968388&amp;u3=https%3A%2F%2Fwww.bing.com%2Faclick%3Fld%3De8nJ6v-bFyOC4cVCAT-PkdWzVUCUy_lQSPLBoZhoJoT1-yxGnEowfbRPgSK5eL_96kxbi2Yd1WOjuqJufHnDuAklXfcn0c346c9_tFwTZpH_fYWjAShBN8cDOBBOrkJ9aANfqwGETuc9duIoKh2gm-T_qAbH5VsLpa-2Xp8yauxkw3wHRFewD_zn0MIAQG7GnzbZWdPDsqccn4-yGCofyau5ijhfdATjCcrYcAAy0O6SivkLJOwNZf6aKQUD9M2Np5UX-Gw_21gf0fYXEugd_dE3z7ZalEUnV0oC3IfomIdeH0l2GTiQo9jRzHh7r_9R_67t37oJacsqtBFMhs9OgouK-0e5dPvzhbwL1edZ6He37L2WT0Y7Guo8LWtJ5VMhKjWMl9RVJkv5m0mcE7r33Su0MohbFJEGi2V9mNFMcKSUpLiwB1yUTYQ_FTArmfwqE8s3BDS1S0OfFyzEX-j3E-PCrINITg2snY47ip09GYPuXvuEqfvS5CnyMZyLvEKOGZNjyBc82M-Px0qV9HAj08YRopK9dc5413QH_vUx4bgHZoRVZAQ7jjNjU5XlgXVxsKGezqLhbvjWL27HGZB5eySrLGZFIyWwJw2SNM51LO0Ea4zC7nAdtRoL-OTxQZ8FL3DgjwdaeggEUIjitBAw2H4CdML3KzNidBlxt6zGgiMhLn_qzIm5FHYIIX6PEh2D5LM1fpzje8qSWtHQkWSInXPfLOg7BOfsjdW6ww9_gCOBtSbSekNysAisrzrTd5bjknjRVWLA%26u%3DaHR0cHMlM2ElMmYlMmZwaXhlbC5ldmVyZXN0dGVjaC5uZXQlMmY0NDIyJTJmY3ElM2Zldl9zaWQlM2QxMCUyNmV2X2xuJTNkYXdzJTI2ZXZfbHR4JTNkJTI2ZXZfbHglM2Rrd2QtNzExMjU0MjE3ODQ2MzQlM2Fsb2MtNzEyMjglMjZldl9jcnglM2Q3MTEyNDg5NDE5NTAyMCUyNmV2X210JTNkZSUyNmV2X2R2YyUzZGMlMjZldl9waHklM2Q4MDE1NCUyNmV2X2xvYyUzZCUyNmV2X2N4JTNkNDgyNTEwNzU0JTI2ZXZfYXglM2QxMTM3OTk1NTM2MDU1ODU3JTI2ZXZfZXglM2QlMjZldl9lZmlkJTNkNjUwMjZmYjI2ZDUxMTA3NWYzMzAwODJiZDUxYTNlYjElM2FHJTNhcyUyNnVybCUzZGh0dHBzJTI1M0ElMjUyRiUyNTJGYXdzLmFtYXpvbi5jb20lMjUyRmZyZWUlMjUyRiUyNTNGdHJrJTI1M0Q0NTFkNDM1Ni0xODg2LTQyZjktYTFhZS0wOGY4Y2ZhMWJjMGIlMjUyNnNjX2NoYW5uZWwlMjUzRHBzJTI1MjZzX2t3Y2lkJTI1M0RBTCE0NDIyITEwITcxMTI0ODk0MTk1MDIwITcxMTI1NDIxNzg0NjM0JTI1MjZlZl9pZCUyNTNENjUwMjZmYjI2ZDUxMTA3NWYzMzAwODJiZDUxYTNlYjElMjUzQUclMjUzQXMlMjZtc2Nsa2lkJTNkNjUwMjZmYjI2ZDUxMTA3NWYzMzAwODJiZDUxYTNlYjE%26rlid%3D65026fb26d511075f330082bd51a3eb1&amp;vqd=4-200020001140237269217679917190605643581&amp;iurl=%7B1%7DIG%3DB506D26F073740CD9523B59DC4C03F5D%26CID%3D20CF21706C386A870AF634F36D8D6B40%26ID%3DDevEx%2C5048.1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS</span></a><span style="font-weight:400;"> announced the general availability of two new larger size (12xlarge and 16xlarge) EC2 Flex instances (</span><a href="https://aws.amazon.com/ec2/instance-types/c7i/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">c7i</span></a><span style="font-weight:400;"> and </span><a href="https://aws.amazon.com/ec2/instance-types/m7i/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">m7i</span></a><span style="font-weight:400;"> variants)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This new size expands the EC2 flex portfolio, providing additional compute options to scale up existing workloads or run larger-sized applications that need extra memory.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">These instances are powered by custom 4th gen </span><a href="https://www.intel.com/content/www/us/en/products/details/processors/xeon/scalable.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Intel Xeon scalable processors</span></a><span style="font-weight:400;">, only available on AWS, and offer up to 15% better performance over comparable x86-based intel processors.  </span></li>
</ul>
<p><i><span style="font-weight:400;">17:08 Ryan – “</span></i><i><span style="font-weight:400;">It’s the way that you had to provision memory and CPU and the relationship between the chosen Amazon was the instance type. And now I think if you select this instance type, you can tune those specifically and do a little bit of shaping.”</span></i></p>
<p><b>17:59 </b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/01/aws-codebuild-test-splitting-parallelism/" target="_blank" rel="noreferrer noopener"><b>AWS CodeBuild now supports test splitting and parallelism</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">You can now split and run your tests across multiple parallel-running compute environments. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Based on your sharding strategy, </span><a href="https://aws.amazon.com/codebuild/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">CodeBuild</span></a><span style="font-weight:400;"> will divide and run your tests across the specified number of parallel environments.</span></li>
</ul>
<p><i><span style="font-weight:400;">18:12 Justin – “</span></i><i><span style="font-weight:400;">Now I appreciate this, but, uh, I would like to run multiple tasks on the same environment versus spending more money on more parallel environments. Or I’d like you to handle all the automation of spinning up all those parallel environments so I don’t have to do that. So if CodeBuild could get on that part of it, I’d be much happier.”</span></i></p>
<p><b>20:17</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/01/aws-codepipeline-debugging-experience-aws-management-console/" target="_blank" rel="noreferrer noopener"><b>AWS CodePipeline introduces new debugging experience in AWS </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2025/01/aws-codepipeline-debugging-experience-aws-management-console/" target="_blank" rel="noreferrer noopener"><b>Management Console</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS Code pipeline now offers an enhanced debugging experience in the AWS Management Console, enabling you to identify and resolve pipeline failures more efficiently. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The new debugging interface gives you a dedicated debugging page, accessible via the action details button. </span></li>
</ul>
<p><span style="font-weight:400;">20:47  Justin – “…</span><span style="font-weight:400;">now I can curse out CodeBuild and Code Pipeline at the same time!”</span></p>
<p><b>24:20</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/01/high-throughput-mode-amazon-sns-fifo-topics/" target="_blank" rel="noreferrer noopener"><b>Announcing high-throughput mode for Amazon SNS FIFO Topics</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon continues to push the </span><a href="https://docs.aws.amazon.com/sns/latest/dg/fifo-high-throughput.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">high-throughput boundaries of SNS FIFO topics</span></a><span style="font-weight:400;">.  Now with default throughput matching </span><a href="https://docs.aws.amazon.com/general/latest/gr/sns.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">SNS</span></a><span style="font-weight:400;"> standard topics across all regions.  When you enable high-throughput mode, SNS fifo topics will maintain order within the message group, while reducing the deduplication scope to the message-group level. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">WIth this change you can leverage up to 30k messages per second per account by default in US-East region, and 9k MPS per account in US west and Europe regions, and request quota increases for additional throughput in any region.  </span></li>
</ul>
<p><i><span style="font-weight:400;">25:09  Justin – “It’s still cheaper than Kafka!”</span></i></p>
<h2><b>GCP</b></h2>
<p><b>26:37</b> <a href="https://cloud.google.com/blog/products/containers-kubernetes/rearchitected-gke-hpa-improves-scaling-performance/" target="_blank" rel="noreferrer noopener"><b>GKE delivers breakthrough Horizontal Pod Autoscaler performance</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google Cloud is committed to providing the fastest and most reliable K8 platform with GKE.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Now, they are announcing an improved </span><a href="https://cloud.google.com/kubernetes-engine/docs/concepts/horizontalpodautoscaler" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Horizontal Pod Autoscaler</span></a><span style="font-weight:400;"> (HPA), the K8 features that automatically update workload resources to match demand. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With the new performance HPA profile you get</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">2x faster scaling</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Improved metrics resolution</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Linear scaling to up to 1000 HPA objects</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">This matters because customers are regularly asking for it, and they frequently over provision resources to account for delays in the autoscaling stack, resulting in lower efficiency and higher costs. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">You’ll minimize waste</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Improve application responsiveness</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Increase operational efficiency</span></li>
</ul>
</li>
<li style="font-weight:400;"><i><span style="font-weight:400;">“With GKE’s Performance HPA profile, we’ve witnessed a remarkable boost in horizontal auto-scaling speed. In our tests with over 1000 HPA objects, workloads scaled up twice as fast. We’re excited to leverage this performance enhancement in our production environments.”</span></i><span style="font-weight:400;"> – Sophy Cao, Senior Engineer, Spotify</span><span style="font-weight:400;">. </span></li>
</ul>
<p><i><span style="font-weight:400;">27:34  Ryan – “</span></i><i><span style="font-weight:400;">So I am dubious because every time I’ve. If my day job scaled up larger events while if you can create the containers great, but something else is going to fall down within the Kubernetes infrastructure. And so I was flabbergasted when joining a new team and I found out that they still have a huge process to warm their pods by pre-launching containers, because they found that they would crash the DNS server container or another sidecar that did proxying or something else in there. So I’m hoping that this profile will fix a lot of those issues.”</span></i></p>
<p><b>29:19</b> <a href="https://cloud.google.com/blog/products/identity-security/the-eus-dora-has-arrived-google-cloud-is-ready-to-help/" target="_blank" rel="noreferrer noopener"><b>The EU’s DORA regulation has arrived. Google Cloud is ready to help</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">No, it’s not *that* DORA. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The </span><a href="https://finance.ec.europa.eu/regulation-and-supervision/financial-services-legislation/implementing-and-delegated-acts/digital-operational-resilience-regulation_en" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Digital Operational Resilience Act</span></a><span style="font-weight:400;"> (DORA) takes effect as of Jan 17th, and financial entities in the EU must rise to a new level of operational resilience in the face of ever evolving digital threats.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">To help you tackle this new set of regulations, Google is sharing the DORA customers guides on </span><a href="https://services.google.com/fh/files/misc/eu_dora_customer_guide-register_of_information-googlecloud2.pdf" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Register of Information</span></a><span style="font-weight:400;"> and </span><a href="https://services.google.com/fh/files/misc/eu_dora_customer_guide_googlecloud.pdf" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Information and Communications Technology (ICT) Risk Management</span></a><span style="font-weight:400;"> and their new Google Cloud </span><a href="http://cloud.google.com/solutions/ciso-third-party-risk-management" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Third-Party Risk Management Resource Center</span></a><span style="font-weight:400;">.   </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In addition, financial entities can request their DORA subcontractor listing today. </span></li>
</ul>
<p><i><span style="font-weight:400;">29:52  Matthew – “</span></i><i><span style="font-weight:400;">When I had to do some research on this for my day job, it looks like it mainly maps over to ISO. So if you are ISO 2700 and one, you’re mostly covered for this, which does make your life easier. I’m waiting for Azure to kind of come out with their same offering because it will make my life a little bit easier.”</span></i></p>
<p><b>31:19</b> <a href="https://cloud.google.com/blog/products/compute/first-google-axion-processor-c4a-now-ga-with-titanium-ssd/" target="_blank" rel="noreferrer noopener"><b>C4A, the first Google Axion Processor, now GA with Titanium SSD</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is making the new C4A virtual machines with </span><a href="https://cloud.google.com/compute/docs/disks/local-ssd%5C" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Titaniums SSDS</span></a><span style="font-weight:400;"> generally available. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The Titanium SSD is custom designed for Google Cloud workloads that require real-time data processing, with low-latency and high-throughput storage performance. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Titanium SSDs on C4A VMs deliver storage performance of up to 2.4M random read IOPS, up to 10.4GiB/s of read throughput, and up to 35% lower access latency than previous-generation SSDs. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">C4A is a VM instance family, based on </span><a href="https://cloud.google.com/products/axion" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Google Axion Processors</span></a><span style="font-weight:400;">, that give you a 65% better price performance and up to 60% better energy efficiency than comparable current generation x86 based instances. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">C4A with Titanium SSDs, offer up to 72 vCPU’s, 576 GB of memory and 6tb of local storage in two shapes – standard with 4gb of memory per vcpu and high-memory with 8gb of memory per vcpu. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They both support up to 50 gbps of standard bandwidth and up to 100 gbps with tier 1 networking for high traffic applications, as well as supporting the latest generations of Balanced And Extreme </span><a href="https://cloud.google.com/compute/docs/disks/hyperdisks" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">hyperdisk</span></a><span style="font-weight:400;"> storage. </span></li>
</ul>
<p><i><span style="font-weight:400;">34:13  Justin – “</span></i><i><span style="font-weight:400;">This is the not the first axion processor. This is one of the second models they’ve released with it. This is the first one with the axion and the titanium SSD.”</span></i></p>
<p><b>35:08</b> <a href="https://cloud.google.com/release-notes#January_20_2025" target="_blank" rel="noreferrer noopener"><b>Smaller Releases of note</b></a><b>:</b></p>
<ul>
<li style="font-weight:400;"><b>Generally available</b><span style="font-weight:400;">: Managed instance groups (MIGs) let you create pools of </span><a href="https://cloud.google.com/compute/docs/instance-groups/suspended-and-stopped-vms-in-mig" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">suspended and stopped virtual machine (VM) instance</span></a><span style="font-weight:400;">s. You can manually suspend and stop VMs in a MIG to save on costs, or use suspended and stopped pools to speed up scale out operations of your MIG.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Congratulations Google, you’re doing what everyone else is already doing. </span></li>
</ul>
<p><b>36:56</b> <a href="https://blog.google/feed/solar-energy-oklahoma/" target="_blank" rel="noreferrer noopener"><b>Google is supporting new solar projects in Oklahoma</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google has entered into long-term agreements with Leeward Renewable Energy to support over 700 megawatts of solar projects in Oklahoma. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google says it’s strategically located to support their data center operations, with one being less than one mile from their data center in Pryor, Oklahoma. </span></li>
</ul>
<p><b>38:11</b> <a href="https://blog.google/feed/were-announcing-our-first-partnerships-to-scale-biochar-for-co2-removal/" target="_blank" rel="noreferrer noopener"><b>We’re announcing our first partnerships to scale biochar for CO2 removal</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is announcing two long-term purchase agreements to help scale biochar as a carbon removal solution. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Partnering with Varaha and Charm to purchase 100,000 tons of biochar carbon removal from each company by 2030.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This will enable them to remove 200,000 tons of carbon, helping them achieve their net zero emissions goal, as well as help catalyze biochar production towards a scale that helps the planet mitigate climate change. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">We think this one is cool, and hope companies continue with their green initiatives in this new administration. </span></li>
</ul>
<p><i><span style="font-weight:400;">Show copywriter note re: velociraptors with feathers. If you’ve ever seen a chicken stalk and eat a mouse or a lizard – you’ve seen velociraptors in action. They’re *terrifying*. Thank God chickens are small.  </span></i></p>
<h2><b>Azure</b></h2>
<p><b>40:36 </b> <a href="https://blogs.microsoft.com/on-the-issues/2025/01/15/innovating-in-line-with-the-european-unions-ai-act/" target="_blank" rel="noreferrer noopener"><b>Innovating in line with the European Union’s AI Act</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft during its recent </span><a href="https://aitour.microsoft.com/en-US/home" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AI tours</span></a><span style="font-weight:400;">, took a chance to meet with EU regulators and politicians to discuss AI and the new European Union AI Act, this is the first comprehensive legal framework for AI. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It aims to ensure that AI systems developed and used in the EU are safe, trustworthy and respect fundamental rights. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The Act classifies AI systems based on their risk level into: Unacceptable, High, LImited or Minimal or No Risk. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft approach to compliance against the act includes:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Proactive approach – they are preparing by conducting internal reviews, updating internal policies and contracts and engaging with policymakers directly</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Focus on customer support by ensuring that documentation, tools and guidance all consider the act.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Shared responsibilities between AI providers and users, and emphasizing the need for compliance</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Building compliant products</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Plan to publish transparency reports and provide documentation for its AI systems to help customers understand their capabilities and limitations</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Collaboration with policy makers, regulators and industry groups to shape the implementation of the act and ensure effective and interoperable practices  </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">41:40  Ryan – “I both love and hate this. I </span></i><i><span style="font-weight:400;">feel like we have no idea what we’re doing yet and we’re trying to regulate it. And so it seems like that’s going to be a problem – because there’s so much in here where it’s like there’s plans to do a thing. They’re going to put the frameworks together. None of it exists. And we all know how fast compliance can grow and adapt to a changing technology ecosystem because our day jobs are super fun at times.”</span></i></p>
<p><b>44:25 </b> <a href="https://www.theregister.com/2025/01/21/microsoft_joins_cispe/" target="_blank" rel="noreferrer noopener"><b>Microsoft joins CISPE, the Euro cloud crew that tried to curb its licensing</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">In the old adage if you can’t beat them, join them – and Microsoft is the latest member of CISPE months after it </span><a href="https://www.theregister.com/2024/07/10/microsoft_avoids_antitrust_probe/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">negotiated a settlement</span></a><span style="font-weight:400;"> with the trade association of European Cloud Providers over alleged anti-competitive software practices. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Of course not all </span><a href="https://cispe.cloud/members/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">members</span></a><span style="font-weight:400;"> of the group are happy with the move. </span></li>
<li style="font-weight:400;"><a href="https://www.theregister.com/2022/07/25/aws_slams_microsoft_cloud_licenses/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS</span></a><span style="font-weight:400;"> of course opposed Microsoft joining, but was outvoted by the board. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google also attempted to join the board, but ended up joining the </span><a href="https://www.theregister.com/2024/10/29/open_cloud_coalition_microsoft_google/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Open Cloud Coalition</span></a><span style="font-weight:400;"> instead.  </span></li>
</ul>
<p><i><span style="font-weight:400;">45:26  Ryan – “</span></i><i><span style="font-weight:400;">I don’t know if it’s a good model, right? It’s like lobbying and bribery just out in the open.”</span></i></p>
<p><b>47:08 </b> <a href="https://blogs.microsoft.com/blog/2025/01/21/microsoft-and-openai-evolve-partnership-to-drive-the-next-phase-of-ai/" target="_blank" rel="noreferrer noopener"><b>Microsoft and OpenAI evolve partnership to drive the next phase of AI</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft is thrilled to continue their strategic partnership with </span><a href="https://blogs.microsoft.com/blog/tag/azure-openai-service/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OpenAI</span></a><span style="font-weight:400;"> and to partner on </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=a96f30d832c33d9d5ef597f872fa7359449e80eb430ee9ba67450c843422fc39JmltdHM9MTczODAyMjQwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=111326c2-38f8-6e94-3a5e-34a8396a6f52&amp;psq=stargate&amp;u=a1aHR0cHM6Ly9vcGVuYWkuY29tL2luZGV4L2Fubm91bmNpbmctdGhlLXN0YXJnYXRlLXByb2plY3Qv&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Stargate</span></a><span style="font-weight:400;">.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The announcement is complementary to what the two companies have been working on since they got together in 2019. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The key elements of the partnership remain in place for the duration of the contract through 2030, with their access to Open AI’s IP, revenue sharing arrangements and exclusivity on OpenAI API’s all continuing going forward.</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft has rights to OpenAI IP (inclusive of model and infrastructure) for use within our products like </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=b3cac2e98e014bb434983cd773eb667cd9b8a97fd673a6bb07cd3defba991d8fJmltdHM9MTczODAyMjQwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=111326c2-38f8-6e94-3a5e-34a8396a6f52&amp;psq=copilot&amp;u=a1aHR0cHM6Ly9jb3BpbG90Lm1pY3Jvc29mdC5jb20vP21zb2NraWQ9MTExMzI2YzIzOGY4NmU5NDNhNWUzNGE4Mzk2YTZmNTI&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Copilot</span></a><span style="font-weight:400;">. This means our customers have access to the best model for their needs.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The OpenAI API is exclusive to Azure, runs on Azure and is also available through the Azure OpenAI Service. This agreement means customers benefit from having access to leading models on Microsoft platforms and direct from OpenAI.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft and OpenAI have revenue sharing agreements that flow both ways, ensuring that both companies benefit from increased use of new and existing models.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft remains a major investor in OpenAI, providing funding and capacity to support their advancements and, in turn, benefiting from their growth in valuation.</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Open AI recently made a new, large Azure commitment that will continue to support all OpenAI products as well as training. This new agreement also includes changes to the exclusivity on new capacity, moving to a model where MSFT has a right of first refusal. To further support OpenAI, Microsoft has approved OpenAI’s ability to build additional capacity, primarily for research and training of models. </span></li>
</ul>
<p><i><span style="font-weight:400;">49:18  Matthew – “</span></i><i><span style="font-weight:400;">It’s interesting also that Microsoft had to approve the opening ability to even be a part of that. At least that was the last sentence I read, you know, so I guess in the original agreement, maybe there was some like control that.”</span></i></p>
<h2><span style="font-weight:400;">Closing</span></h2>
<p><span style="font-weight:400;">And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</span></p>
]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/1955248/c1e-p8j8u5kxqjavrm7j-ww64rgqpu8m-bj6wbl.mp3" length="61935197"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 289 of The Cloud Pod – where the forecast is always cloudy! Justin, Ryan, and Matt are here this week to bring you a riveting podcast on EU regulations! Are you asleep yet? No? Ok great. We promise it will be a good show – despite the title. 
Titles we almost went with this week:

Stargate: We’re not saying its Aliens, but its $500 Billion
️AWS: Now with extra sessions
EC2 Flex: Bigger, Badder and Probably still expensive
SNS FIFO: So fast, it’ll give you whiplash
‍⚖️Azure: Now with added Legalese (Thanks, EU)
OpenAI’s Stargate: From Chatbots to Interdimensional Travel (maybe)
☢️GCP’s Biochar Initiative: Turning Waste into… Well, Less Waste (hopefully)
️AWS Console Multiple Sessions: So you can prove you dropped those databases from multiple accounts
☁️Amazon still adds new features to SNS and the cloud pod is impressed
☠️AWS tries to kill chrome profiles

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
AI IS Going Great – Or How ML Makes All Its Money
01:47 Announcing The Stargate Project

Open AI announced a joint investment of $500 billion dollars over the next four years building new AI infrastructure for OpenAI in the US, with the intent to deploy $100B immediately.
This infrastructure will secure American leadership in AI, create hundreds of thousands of American jobs, and generate massive economic benefits for the entire world. 
The initial equity funders in stargate are SoftBank, OpenAI, Oracle and MGX.  
Softbank and OpenAI are the lead partners for Stargate, with Softbank having financial responsibility, and OpenAI having operational responsibility. 
Arm, Microsoft, Nvidia]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/1955248/c1a-k5d5-7z28o4d4a8d4-fggytk.jpg"></itunes:image>
                                                                            <itunes:duration>00:51:37</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                            </item>
                    <item>
                <title>
                    <![CDATA[288: You Might Be Able to Retrain Notebook LM Hosts to be Less Annoyed, But Not Your Cloud Pod Hosts]]>
                </title>
                <pubDate>Wed, 22 Jan 2025 14:46:24 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/1947529</guid>
                                    <link>https://tcpfm.castos.com/episodes/288-notebook-lm-annoyed</link>
                                <description>
                                            <![CDATA[<p><span style="font-weight:400;">Welcome to episode 288 of The Cloud Pod – where the forecast is always cloudy! Justin, Ryan, and Jonathan are your hosts as we make our way through this week’s cloud and AI news, including back to Vertex AI, Project Digits, Notebook LM, and some major improvements to AI image generation. </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">Digits… I’ll show you 5 digits…</span></li>
<li><span style="font-weight:400;">The only digit the AWS local zone in New York shows me is the middle one</span></li>
<li><span style="font-weight:400;">️Keep one eye open near Mercedes with Agentic AI</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>General News</b></h2>
<p><b>01:59 </b> <a href="https://www.theverge.com/2025/1/6/24337530/nvidia-ces-digits-super-computer-ai" target="_blank" rel="noreferrer noopener"><b>Nvidia announces $3,000 personal AI supercomputer called Digits</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">If you don’t want to hand over all your money to the cloud providers, you will be able to hand over $3,000 dollars to Nvidia… for a computer that is probably going to be obsolete in &lt;12 months. That’s fun! </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The new personal AI supercomputer, called </span><a href="https://www.nvidia.com/en-eu/project-digits/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Project Digits</span></a><span style="font-weight:400;">, will launch in May. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The heart of Digits is the new GB10 </span><a href="https://www.nvidia.com/en-us/data-center/technologies/blackwell-architecture/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Grace Blackwell Superchip</span></a><span style="font-weight:400;">, which packs enough processing power to run sophisticated AI models, while being compact enough to fit on a desk and run from a standard power outlet.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Digits can handle AI models with up to 200 billion parameters, and looks very similar to a </span><a href="https://www.theverge.com/24289730/apple-mac-mini-m4-review" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Mac Mini</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">“</span><i><span style="font-weight:400;">AI will be mainstream in every application for every industry. With Project Digits, the Grace Blackwell Superchip comes to millions of developers</span></i><span style="font-weight:400;">,” Nvidia CEO Jensen Huang </span><a href="https://nvidianews.nvidia.com/news/nvidia-puts-grace-blackwell-on-every-desk-and-at-every-ai-developers-fingertips" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">said in a press release</span></a><span style="font-weight:400;">. “</span><i><span style="font-weight:400;">Placing an AI supercomputer on the desks of every data scientist, AI researcher, and student empowers them to engage and shape the age of AI</span></i><span style="font-weight:400;">.”</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The Digits system comes with 128gb of unified coherent memory and up to 4tb of NVME storage.  For even more demanding apps, two digit systems can be linked together to handle models with 405b parameters. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The GB10 chip delivers up to 1 petaflop of AI performance, meaning it can perform 1 qu...</span></li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 288 of The Cloud Pod – where the forecast is always cloudy! Justin, Ryan, and Jonathan are your hosts as we make our way through this week’s cloud and AI news, including back to Vertex AI, Project Digits, Notebook LM, and some major improvements to AI image generation. 
Titles we almost went with this week:

Digits… I’ll show you 5 digits…
The only digit the AWS local zone in New York shows me is the middle one
️Keep one eye open near Mercedes with Agentic AI

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
General News
01:59  Nvidia announces $3,000 personal AI supercomputer called Digits

If you don’t want to hand over all your money to the cloud providers, you will be able to hand over $3,000 dollars to Nvidia… for a computer that is probably going to be obsolete in <12 months. That’s fun! 
The new personal AI supercomputer, called Project Digits, will launch in May. 
The heart of Digits is the new GB10 Grace Blackwell Superchip, which packs enough processing power to run sophisticated AI models, while being compact enough to fit on a desk and run from a standard power outlet.
Digits can handle AI models with up to 200 billion parameters, and looks very similar to a Mac Mini. 
“AI will be mainstream in every application for every industry. With Project Digits, the Grace Blackwell Superchip comes to millions of developers,” Nvidia CEO Jensen Huang said in a press release. “Placing an AI supercomputer on the desks of every data scientist, AI researcher, and student empowers them to engage and shape the age of AI.”
The Digits system comes with 128gb of unified coherent memory and up to 4tb of NVME storage.  For even more demanding apps, two digit systems can be linked together to handle models with 405b parameters. 
The GB10 chip delivers up to 1 petaflop of AI performance, meaning it can perform 1 qu...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[288: You Might Be Able to Retrain Notebook LM Hosts to be Less Annoyed, But Not Your Cloud Pod Hosts]]>
                </itunes:title>
                                    <itunes:episode>288</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p><span style="font-weight:400;">Welcome to episode 288 of The Cloud Pod – where the forecast is always cloudy! Justin, Ryan, and Jonathan are your hosts as we make our way through this week’s cloud and AI news, including back to Vertex AI, Project Digits, Notebook LM, and some major improvements to AI image generation. </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">Digits… I’ll show you 5 digits…</span></li>
<li><span style="font-weight:400;">The only digit the AWS local zone in New York shows me is the middle one</span></li>
<li><span style="font-weight:400;">️Keep one eye open near Mercedes with Agentic AI</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>General News</b></h2>
<p><b>01:59 </b> <a href="https://www.theverge.com/2025/1/6/24337530/nvidia-ces-digits-super-computer-ai" target="_blank" rel="noreferrer noopener"><b>Nvidia announces $3,000 personal AI supercomputer called Digits</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">If you don’t want to hand over all your money to the cloud providers, you will be able to hand over $3,000 dollars to Nvidia… for a computer that is probably going to be obsolete in &lt;12 months. That’s fun! </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The new personal AI supercomputer, called </span><a href="https://www.nvidia.com/en-eu/project-digits/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Project Digits</span></a><span style="font-weight:400;">, will launch in May. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The heart of Digits is the new GB10 </span><a href="https://www.nvidia.com/en-us/data-center/technologies/blackwell-architecture/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Grace Blackwell Superchip</span></a><span style="font-weight:400;">, which packs enough processing power to run sophisticated AI models, while being compact enough to fit on a desk and run from a standard power outlet.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Digits can handle AI models with up to 200 billion parameters, and looks very similar to a </span><a href="https://www.theverge.com/24289730/apple-mac-mini-m4-review" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Mac Mini</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">“</span><i><span style="font-weight:400;">AI will be mainstream in every application for every industry. With Project Digits, the Grace Blackwell Superchip comes to millions of developers</span></i><span style="font-weight:400;">,” Nvidia CEO Jensen Huang </span><a href="https://nvidianews.nvidia.com/news/nvidia-puts-grace-blackwell-on-every-desk-and-at-every-ai-developers-fingertips" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">said in a press release</span></a><span style="font-weight:400;">. “</span><i><span style="font-weight:400;">Placing an AI supercomputer on the desks of every data scientist, AI researcher, and student empowers them to engage and shape the age of AI</span></i><span style="font-weight:400;">.”</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The Digits system comes with 128gb of unified coherent memory and up to 4tb of NVME storage.  For even more demanding apps, two digit systems can be linked together to handle models with 405b parameters. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The GB10 chip delivers up to 1 petaflop of AI performance, meaning it can perform 1 quadrillion AI calculations per second. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Suppose you plunk down the money for Digits. In that case, you will also get access to Nvidia’s AI software library, including development kits, orchestration tools and pre-trained models available through the Nvidia NGC catalog. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The system runs on a Linux-based NVidia NGC catalog, and supports popular frameworks like </span><a href="https://pytorch.org/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">PyTorch</span></a><span style="font-weight:400;">, </span><a href="https://www.python.org/downloads/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Python</span></a><span style="font-weight:400;"> and </span><a href="https://jupyter.org/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Jupyter</span></a><span style="font-weight:400;"> notebooks. </span></li>
</ul>
<p><i><span style="font-weight:400;">09:25  Jonathan – ““</span></i><i><span style="font-weight:400;">The Blackwell is pretty recent, it’s the one that had a lot of problems with yield. And I kind of suspect that they’re sort of packaging this up and selling some of the chips which didn’t pass all the tests for the commercial products. And so they’re enabling whatever cores they can in these things to sell to consumers… Having all the memories is really great for the big models. It’s not going to be particularly performant now. I think the spec I saw was like one teraflop at quite low precision – like fb4 precision – which is quite low, and I think it’d be better off if you’re really interested in buying some like 3090s or 5090s or something like that. Obviously you don’t get the memory, but far better performance for the price.”</span></i></p>
<p><b>06:46</b> <a href="https://www.theverge.com/2025/1/8/24338939/nvidia-jensen-huang-hints-arm-desktop-cpu" target="_blank" rel="noreferrer noopener"><b>Nvidia’s Jensen Huang hints at ‘plans’ for its own desktop CPU</b></a><span style="font-weight:400;">   </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">It’s long been rumored that Nvidia is planning to break into the consumer CPU market in 2025, and we finally got some insight into those plans. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Nvidia CEO Jenen Huagh said there are bigger plans for the arm-based cpu within the GB10 chip </span><a href="https://www.theverge.com/2025/1/6/24337530/nvidia-ces-digits-super-computer-ai" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">introduced in the Digits computer</span></a><span style="font-weight:400;">, and is co-developed with </span><a href="https://www.mediatek.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Mediatek</span></a><span style="font-weight:400;">.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Huang told investors that they obviously have plans, and they can’t wait to tell us – or sell us – more. </span></li>
</ul>
<p><i><span style="font-weight:400;">07:22  Justin – “It’s </span></i><i><span style="font-weight:400;">interesting to see the dominance of Intel fall to the dominance of Nvidia and Nvidia just basically repeating the whole whole set of stuff all over again.”</span></i></p>
<h2><b>AI Is Going Great – Or, How ML Makes All its Money </b></h2>
<p><b>08:23</b> <a href="https://www.snowflake.com/en/blog/anthropic-claude-sonnet-cortex-ai/" target="_blank" rel="noreferrer noopener"><b>Build RAG and Agent-based AI Apps with Anthropic’s Claude 3.5 Sonnet in </b></a><a href="https://www.snowflake.com/en/blog/anthropic-claude-sonnet-cortex-ai/" target="_blank" rel="noreferrer noopener"><b>Snowflake Cortex AI</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Snowflake is announcing the GA of </span><a href="https://www.anthropic.com/claude/sonnet" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Claude 3.5</span></a><span style="font-weight:400;"> </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Sonnet as the first </span><a href="https://www.anthropic.com/claude" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Anthropic Foundation</span></a><span style="font-weight:400;"> model available in </span><a href="https://www.snowflake.com/en/data-cloud/cortex/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Snowflake Cortex AI.</span></a><span style="font-weight:400;">  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Customers can now access the most intelligent model in the Claude model family from Anthropic using familiar SQL, Python and REST API interfaces, within the Snowflake security perimeter. </span></li>
</ul>
<p><i><span style="font-weight:400;">16:43 Justin – “</span></i><i><span style="font-weight:400;">that’s actually nice. I didn’t realize that Snowflake was going to be making Claude available. Missed the EA, but glad to see my favorite model is at least available there.”</span></i></p>
<h2><b>AWS</b></h2>
<p><b>09:33</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/01/aws-compute-optimizer-idle-rightsizing-recommendations-amazon-ec2-auto-scaling-groups/" target="_blank" rel="noreferrer noopener"><b>AWS Compute Optimizer now expands idle and rightsizing </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2025/01/aws-compute-optimizer-idle-rightsizing-recommendations-amazon-ec2-auto-scaling-groups/" target="_blank" rel="noreferrer noopener"><b>recommendations for Amazon EC2 Auto Scaling groups</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Computer optimizer will now expand to idle and rightsizing recommendations for ASG’s with scaling policies and multiple instance types. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With the new recommendations, you can take actions to optimize cost and performance for these groups without requiring specialized knowledge or engineering resources to analyze them. </span></li>
</ul>
<p><i><span style="font-weight:400;">09:56 Ryan – “</span></i><i><span style="font-weight:400;">Well, this is long overdue, Because you’ve always had, or for a long time anyway, you’ve had optimizations for standalone EC2 instances. But ASGs have always been ignored. And a huge amount of waste of people that set a minimum scale level for these things. And they’re just sitting there, burning through coal, but not taking any requests. So I’m glad to see these making the list.”</span></i></p>
<p><b>12:37</b> <a href="https://aws.amazon.com/about-aws/whats-new/2025/01/general-availability-aws-local-zone-new-york-city/" target="_blank" rel="noreferrer noopener"><b>Announcing the general availability of a new AWS Local Zone in New York </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2025/01/general-availability-aws-local-zone-new-york-city/" target="_blank" rel="noreferrer noopener"><b>City</b></a> <span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS is announcing the GA of AWS </span><a href="https://aws.amazon.com/about-aws/global-infrastructure/localzones/locations/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Local Zone</span></a><span style="font-weight:400;"> in New York City, supporting a wide range of workloads, including C7i, R7i, M6i and M6in EC2 instances, EBS volumes and ECS, EKS, ALB and AWS Direct connect, all available in the local zone. </span></li>
</ul>
<p><b>13:42 </b> <a href="https://www.theverge.com/24338171/aws-ceo-matt-garman-ai-chips-anthropic-cloud-computing-trainium-decoder-podcast-interview" target="_blank" rel="noreferrer noopener"><b>Why CEO Matt Garman is willing to bet AWS on AI</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">The excellent Decoder podcast with Nilay Patel recently invited Matt Garman on to talk about stepping into the AWS CEO role. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Matt hits on the same talking points you’ve heard in the past, that most companies are still barely in the cloud, there is a huge market, etc. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Matt talks about reorienting the computing infrastructure to support the evolving world of Generative AI.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It’s clear from listening to the interview that Amazon is thinking about AI beyond just the model, but the monetization of the service around the model, etc. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They touch on several other interesting topics like AGI, Netflix as a customer, etc and it’s worth a listen too. </span></li>
</ul>
<p><i><span style="font-weight:400;">15:51 Justin – “</span></i><i><span style="font-weight:400;">I mean, basically building infrastructure services that support the needs of AI driven worlds. And we’ll talk about a little bit later in an Azure story, it will come up about AI first apps and what that’s going to mean and kind of some of those things. But I think that’s what he was referring to basically without using as catchy a phrase as Microsoft came up with.”</span></i></p>
<p><b>16:32</b> <a href="https://aws.amazon.com/blogs/aws/now-open-aws-mexico-central-region/" target="_blank" rel="noreferrer noopener"><b>Now open — AWS Mexico (Central) Region</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">In February 2024, </span><a href="https://aws.amazon.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS</span></a> <a href="https://aws.amazon.com/es/blogs/aws/new-aws-region-in-mexico-is-in-the-works/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">announced its plan</span></a><span style="font-weight:400;"> to expand into Mexico. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Today – 11 months later, they are excited to announce the GA of the </span><a href="https://aws.amazon.com/about-aws/whats-new/2023/01/aws-local-zones-lagos-lima-queretaro/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Mexico central region</span></a><span style="font-weight:400;"> with three AZ’s and API code mx-central-1. </span></li>
</ul>
<p><b>18:14</b> <a href="https://aws.amazon.com/blogs/opensource/aws-cdk-is-splitting-construct-library-and-cli/" target="_blank" rel="noreferrer noopener"><b>AWS CDK is splitting Construct Library and CLI</b></a><b>  </b></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/cdk/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS CDK</span></a><span style="font-weight:400;"> is a software development framework for defining cloud infrastructure in code and provisioning it through </span><a href="https://aws.amazon.com/cloudformation/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS CloudFormation</span></a><span style="font-weight:400;">.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It consists of two primary components; The Construct Library that you use in a programming language to model your AWS app and a CLI.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The Construct Library synthesizes a model of your application to a directory on disk, and the CLI reads that directory file to deploy your application on AWS. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Starting in Feb 2025, the CDK CLI and CDK Construct Library will no longer be released in lockstep. Instead, they will both have their own independent release cadence, which means their version numbers are going to diverge.  There will be no impact to the CDK API or User Experience.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They are doing this as they have matured the library, they have found that changes to the different components proceed at different paces and require different testing strategies, this change gives them the ability to make changes to release cadence of one subproject without affecting the other, giving the entire project more agility. </span></li>
</ul>
<p><i><span style="font-weight:400;">19:42 Ryan – “</span></i><i><span style="font-weight:400;">I’ve really tried over and over and over to get into the CDK model, and it just doesn’t work for me. And I think I wonder if it’s just because I was sort of a sysadmin that turned into a programmer over time, if it came from that direction, or if it’s just my utter hatred of TypeScript.”</span></i></p>
<h2><b>GCP</b></h2>
<p><b>22:08 </b> <a href="https://cloud.google.com/blog/products/identity-security/unique-immersive-security-experience-coming-to-next-25/" target="_blank" rel="noreferrer noopener"><b>Get ready for a unique, immersive security experience at Next ‘25</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google Next is shockingly just around the corner (at the beginning of April) and Google is getting ready by telling you about all the great things you can look forward to. This week they highlight what to look forward to as a security person:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Access to a security lounge, a dedicated area in the expo where you can meet security leaders engineering Google Cloud’s </span><a href="https://cloud.google.com/security/trusted-cloud%5C" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">secure by design platform and products</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Interactive </span><a href="https://cloud.google.com/security/sec-ops" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Security Operations</span></a><span style="font-weight:400;"> Center to see Google Secops from the eyes of both the defender and adversary.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Mandiant threatspace where you’ll learn from frontline defenders nd incident responders</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Overviews on Securing your AI Experience</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Capture the flag challenge, where you can test and hone your cybersecurity skills. With real world data, random notes and information from the dark web simulate a real world threat hunt. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Security tabletop exercises where you can </span><a href="https://cloud.google.com/transform/the-empty-chair-guess-whos-missing-from-your-cybersecurity-tabletop-exercise/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">role-play and analyze</span></a><span style="font-weight:400;"> aspects of hypothetical but realistic security incidents. And Bird of a feather sessions. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Plus over 40 security breakout sessions. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">For CISO they have a dedicated programming track to equip CISO’s and other security leaders with insights and strategies that they need to navigate the evolving threat landscape. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Want to register? You can do that </span><a href="https://cloud.withgoogle.com/next/25" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">here</span></a><span style="font-weight:400;">. </span></li>
</ul>
<p><b>24:25</b> <a href="https://cloud.google.com/blog/products/ai-machine-learning/introducing-vertex-ai-rag-engine/" target="_blank" rel="noreferrer noopener"><b>Introducing Vertex AI RAG Engine: Scale your Vertex AI RAG pipeline with </b></a><a href="https://cloud.google.com/blog/products/ai-machine-learning/introducing-vertex-ai-rag-engine/" target="_blank" rel="noreferrer noopener"><b>confidence</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is announcing the General Availability of the </span><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/rag-overview" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Vertex AI’s RAG engine</span></a><span style="font-weight:400;">, a fully managed service that helps you build and deploy RAG implementations with your data and methods. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google’s AI RAG engine allows you to:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Adapt to any architecture: from models, vector databases and data sources that work for your use case. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Evolve with your use case: add new data sources, updating models, and/or adjusting retrieval parameters through simple configuration changes. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Evaluate in simple steps with different configurations to find what works best for your use case</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Feature set of the RAG Engine</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">DIY capabilities: DIY RAG empowers users to tailor their solutions by mixing and matching different components. It works great for low to medium complexity use cases with easy-to-get-started API, enabling fast experimentation, proof-of-concept and RAG-based application with a few clicks. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Search functionality: Vertex AI Search stands out as a robust, fully managed solution. It supports a wide variety of use cases, from simple to complex, with high out-of-the-box quality, easiness to get started and minimum maintenance.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Connectors: A rapidly growing list of connectors helps you quickly connect to various data sources, including Cloud Storage, Google Drive, Jira, Slack, or local files. RAG Engine handles the ingestion process (even for multiple sources) through an intuitive interface.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Enhanced performance and scalability: Vertex AI Search is designed to handle large volumes of data with exceptionally low latency. This translates to faster response times and improved performance for your RAG applications, especially when dealing with complex or extensive knowledge bases.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Simplified data management: Import your data from various sources, such as websites, BigQuery datasets, and Cloud Storage buckets, that can streamline your </span><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/rag-overview" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">data ingestion process</span></a><span style="font-weight:400;">.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Improved LLM output quality: By using the retrieval capabilities of </span><a href="https://cloud.google.com/vertex-ai/docs/vector-search/overview" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Vertex AI Search</span></a><span style="font-weight:400;">, you can help to ensure that your RAG application retrieves the most relevant information from your corpus, which leads to more accurate and informative LLM-generated outputs.</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">And customizable</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Parsing and Retrievable customizations. </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">26:22  Jonathan – “</span></i><i><span style="font-weight:400;">It must be really tough, I think, being a service provider in this industry right now, because things are changing so quickly. It’s like, well, do we launch this Vertex AI rag product, or do we wait three months and this paper we just wrote about Titans, which is kind of like a slightly modified architecture that sort of separates episodic memory, like specific facts that you must remember as facts in themselves from the general training sort of pool of the network. And so that will help address hallucinations.”</span></i></p>
<p><b>32:07</b> <a href="https://blog.google/feed/mercedes-google-cloud-automotive-ai-agent/" target="_blank" rel="noreferrer noopener"><b>Google Cloud’s Automotive AI Agent arrives for Mercedes-Benz.</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is unveiling the </span><a href="https://cloud.google.com/solutions/vertical-ai-agents" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Automotive AI Agent</span></a><span style="font-weight:400;">, a new way for automakers to create helpful generative AI experiences.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Built using Gemini with Vertex AI, the Automotive AI Agent is specially tuned to allow automakers to create highly personalized and intuitive in-car agents that go beyond vehicle voice control. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This will allow you to ask via natural conversations like “is there an Italian restaurant nearby? As well as follow up questions like “does it have good reviews? What’s the most popular dish?”</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Mercedes-Benz is among the first to implement the Automotive AI Agent in its MBUX virtual assistant, coming to the new Mercedes-Benz CLA later this year. </span></li>
</ul>
<p><i><span style="font-weight:400;">32:49  Ryan – “</span></i><i><span style="font-weight:400;">Well, I keep thinking about the manufacturer-specific GPS interfaces. That was a terrible choice, because it was immediately out of date and not getting updates. And then everything just shifted to a mobile device that you can keep up to date. And this is going to be no different. Why? This is not a good idea.”</span></i></p>
<p><b>36:26</b> <a href="https://blog.google/technology/google-labs/video-image-generation-update-december-2024/" target="_blank" rel="noreferrer noopener"><b>State-of-the-art video and image generation with Veo 2 and Imagen 3</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://blog.google/technology/ai/google-generative-ai-veo-imagen-3/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Last year</span></a><span style="font-weight:400;"> Google released VEO and </span><a href="https://deepmind.google/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Imagen 3</span></a><span style="font-weight:400;">, and creators have brought their ideas to life with the help of these models.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Now they are introducing the latest version of Veo, in Veo 2, and the latest version of Imagen 3, both of which achieve state-of-the-art results.  These models are now available in </span><a href="https://labs.google/fx/tools/video-fx" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">VideoFX</span></a><span style="font-weight:400;">, </span><a href="https://labs.google/fx/tools/image-fx" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">ImageFX</span></a><span style="font-weight:400;"> and their latest experiment, </span><a href="https://labs.google/fx/tools/whisk" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Whisk</span></a><span style="font-weight:400;">.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Veo 2 can create high-quality video in a wide range of subjects and styles. In head-to-head comparisons judged by human raters, Veo2 achieved </span><a href="https://deepmind.google/technologies/veo/veo-2" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">state of the art results</span></a><span style="font-weight:400;"> against leading models.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Veo 2 will deliver resolution up to 4k, and be extended to minutes in length. You can specify things like the lens to use, blur out background or focus on a subject by putting a shallow depth of field into the prompt. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">While many video models hallucinate unwanted details like extra fingers or unexpected objects, Veo 2 produces these less frequently, making the outputs more realistic. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Imagen 3 is improving and includes brighter, better-composed images. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It can now render more diverse art styles more accurately, from photo realism to impressionism, from abstract to anime. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Whisk is their newest experiment, it lets you input or create images that convey the subject, scene and style you have in mind.  You can bring them together and remix them to create something uniquely your own, from a digital plushie to an enamel pin or sticker. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Whisk combines imagen 3 with Gemini’s visual understanding and description capabilities. </span></li>
</ul>
<p><i><span style="font-weight:400;">36:41  Justin – “</span></i><i><span style="font-weight:400;">I tried to try Wisk 3 or Wisk here with Imogen 3, cause I was curious. And it only can make digital plushies, enamel pins or stickers. So literally choose one of those three things and then what image would you like to use? And then here, here’s your result, which I thought was sort I’m like, well, that’s not really helpful.”</span></i></p>
<p><b>40:49</b> <a href="https://blog.google/around-the-globe/google-europe/united-kingdom/the-cmas-assessment-of-google-search/" target="_blank" rel="noreferrer noopener"><b>The CMA’s assessment of Google Search</b></a><span style="font-weight:400;">  </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">The UK CMA has announced they will be assessing whether Google Search has “Strategic Market Status” SMS under the new digital markets, competition and consumer regime and what new requirements Google Search may need to follow.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google plans to engage constructively to lay out how services benefit UK consumers and businesses, as well as trade-offs of new regulations. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Will keep an eye on this one. </span></li>
</ul>
<p><i><span style="font-weight:400;">41:21  Justin – “</span></i><i><span style="font-weight:400;">We’ll keep an eye on this one. This would be probably a fun story because what Google wants and what the UK wants are probably completely different things; and this will probably eventually turn into an EU issue as well.”</span></i></p>
<p><b>42:02</b> <a href="https://techcrunch.com/2025/01/14/googles-notebooklm-had-to-teach-its-ai-podcast-hosts-not-to-act-annoyed-at-humans/" target="_blank" rel="noreferrer noopener"><b>Google’s NotebookLM had to teach its AI podcast hosts not to act annoyed </b></a><a href="https://techcrunch.com/2025/01/14/googles-notebooklm-had-to-teach-its-ai-podcast-hosts-not-to-act-annoyed-at-humans/" target="_blank" rel="noreferrer noopener"><b>at humans</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Techcrunch has an article about their NoteBookLM feature from Google, and apparently they had to teach them not to be annoyed. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In December 2024, they added the ability to call in to the podcast and ask questions, essentially interrupting the AI hosts.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">When the features were first rolled out, the AI hosts seemed annoyed at such interruptions, and would occasionally give snippy comments to human callers like “I was getting to that” or “as I was about to say” which felt adversarial. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">NotebookLM’s team decided to do some friendliness tuning. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They posted on X… that friendliness tuning was in the “things i never thought would be my job, but are” category. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They tested a variety of different prompts, and landed on a new prompt that is more friendly and engaging. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Techcrunch tested the fix and said that it is working and the hosts even expressed surprise exclaiming “Woah” before politely asking the human to chime in. </span></li>
</ul>
<p><i><span style="font-weight:400;">43:09  Justin – “Maybe we can have NotebookLM call in to us and ask us questions!”</span></i></p>
<p><b>43:54</b> <a href="https://www.fierce-network.com/cloud/google-cloud-overtake-microsofts-no-2-cloud-position-year" target="_blank" rel="noreferrer noopener"><b>Google Cloud could overtake Microsoft’s No. 2 cloud position this year</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">First let me tell you my opinion… “yeah right”</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Analyst </span><a href="https://www.linkedin.com/in/jckgld/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Jack Gold</span></a><span style="font-weight:400;"> attempted to zero in on cloud hosting revenue for the big three hyperscalers, and he concluded that Google Cloud’s Pure cloud hosting revenue is likely much closer to Azure’s than Microsoft wants it to be.  In Fact he estimates it to be within $1 billion dollars. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">At current growth rates, he projects that Google Cloud’s revenue will be 55% greater than Azure. </span></li>
</ul>
<p><i><span style="font-weight:400;">45:25  Ryan – “</span></i><i><span style="font-weight:400;">I disagree with the time scale. And if you extend the time scale out too much longer, you just have to assume everything sort of stays the same. And there’s so many things that can change things. You know, like there was a, I’m sure there was a huge bump from AI for Microsoft, you know, a little while ago. has that been really spread across the other cloud providers? I don’t really know if they caught up.”</span></i></p>
<h2><b>Azure</b></h2>
<p><b>47:36</b> <a href="https://blogs.microsoft.com/blog/2025/01/13/introducing-core-ai-platform-and-tools/" target="_blank" rel="noreferrer noopener"><b>Introducing CoreAI – Platform and Tools</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Satya Nadella comes to us with an update he sent to Microsoft employees and is sharing publicly (I mean it would have been leaked anyways)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Satya indicates that we are heading into the next era of the AI Platform shift.  2025 will be about model-forward applications that reshape all application categories.  Unlike previous platform shifts this will impact every layer of the application stack.  GUI, Servers, Cloud Native Databases all being done at once… 30 years of change compressed into 3 years. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">He says they will build agentic applications with memory, entitlements and action space that will inherit powerful model capabilities. And will adapt those capabilities for enhanced performance and safety across roles, business processes and industry domains. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This will lead to what he calls the AI-first App stack, one with new UI/UX patterns, runtimes to build with agents, orchestrate multiple agents, and a reimagined management and observability layer.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">So it is imperative that Azure must become the infrastructure for AI, while they build AI platforms and developer tools spanning Azure AI, foundry github and VS Code on top of it. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The good news per Satya they have been working on it for 2 years already and have learned a lot in terms of the systems, app platform and tools required for the AI era.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">To further advance the roadmap across the layers they are creating an ew engineering organization: CoreAI – Platform and Tools. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The new division will bring together Dev Div, AI platform and some key teams from the office of the CTO, including AI supercomputer, AI Agentic runtime and Engineering thrive, with the mission to build the end-to-end copilot &amp; AI stack for both first-party and third-party customers to build and run AI apps and agents. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This group will also build out GitHub Copilot, thus having a tight feedback loop between the leading AI-first product and the AI platform to motivate the stack and its roadmap. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The new Core AI team will be led by Jay Parikh EVP. </span></li>
</ul>
<p><i><span style="font-weight:400;">51:02 Justin – “</span></i><i><span style="font-weight:400;">I mean, it’s kind of neat though. Like if you think about that and then they put that with the AI agentic team and that like, could be really, cause I mean, it is, that is my day to day life. Like it’s my challenge. How do I get AI here? And there’s so many hurdles to make it happen.”</span></i></p>
<h2><b>Oracle</b></h2>
<p><b>52:44  </b><a href="https://www.oracle.com/news/announcement/oraclexstore-2025-01-13/" target="_blank" rel="noreferrer noopener"><b>Oracle Supercharges Retail Operations with New POS</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Justin is a child and will never not laugh at Point of Sale being POS… so here’s a fun story to round out today’s show.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">There is nothing here really cloud related.… we just wanted to snicker about it. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Ok they do pitch you on using OCI and OCI Container instances to speed up your implementation, and a plug for OCI Roving Edge infrastructure for your store to run Xstore. So there – it’s cloud related. </span></li>
</ul>
<p><span style="font-weight:400;">Closing</span></p>
<p><span style="font-weight:400;">And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</span></p>
]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/1947529/c1e-qx4xb2g0pna7g083-257mm3dxb68n-uiepb1.mp3" length="67696765"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 288 of The Cloud Pod – where the forecast is always cloudy! Justin, Ryan, and Jonathan are your hosts as we make our way through this week’s cloud and AI news, including back to Vertex AI, Project Digits, Notebook LM, and some major improvements to AI image generation. 
Titles we almost went with this week:

Digits… I’ll show you 5 digits…
The only digit the AWS local zone in New York shows me is the middle one
️Keep one eye open near Mercedes with Agentic AI

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
General News
01:59  Nvidia announces $3,000 personal AI supercomputer called Digits

If you don’t want to hand over all your money to the cloud providers, you will be able to hand over $3,000 dollars to Nvidia… for a computer that is probably going to be obsolete in <12 months. That’s fun! 
The new personal AI supercomputer, called Project Digits, will launch in May. 
The heart of Digits is the new GB10 Grace Blackwell Superchip, which packs enough processing power to run sophisticated AI models, while being compact enough to fit on a desk and run from a standard power outlet.
Digits can handle AI models with up to 200 billion parameters, and looks very similar to a Mac Mini. 
“AI will be mainstream in every application for every industry. With Project Digits, the Grace Blackwell Superchip comes to millions of developers,” Nvidia CEO Jensen Huang said in a press release. “Placing an AI supercomputer on the desks of every data scientist, AI researcher, and student empowers them to engage and shape the age of AI.”
The Digits system comes with 128gb of unified coherent memory and up to 4tb of NVME storage.  For even more demanding apps, two digit systems can be linked together to handle models with 405b parameters. 
The GB10 chip delivers up to 1 petaflop of AI performance, meaning it can perform 1 qu...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/1947529/c1a-k5d5-ndovv3jza3v0-dh2i9q.jpg"></itunes:image>
                                                                            <itunes:duration>00:56:25</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                            </item>
                    <item>
                <title>
                    <![CDATA[287: The Cloud Pod Rebrands to The Cloud AI So We Can Get A 1B Valuation]]>
                </title>
                <pubDate>Wed, 15 Jan 2025 20:31:22 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/1941708</guid>
                                    <link>https://tcpfm.castos.com/episodes/287-1b-ai-valuation</link>
                                <description>
                                            <![CDATA[<p><span style="font-weight:400;">Welcome to episode 287 of The Cloud Pod – where the forecast is always cloudy! 2025 is already shaping up to be another year of “unprecedented” times, but have no fear, Justin, Ryan, Jonathan, and Matthew are all in the house and (mostly) recovered from the holidays – and just in time to bring you all the latest new year news in the cloud world. </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">☁️Everyone is investing in AI… but you could invest in the cloud pod</span></li>
<li><span style="font-weight:400;">Oracle Exadata X11M: Burn a big pile of money</span></li>
<li><span style="font-weight:400;">The cloud pod has better security than Microsoft – mk</span></li>
<li><span style="font-weight:400;">️The new and improved Cloud Pod 4.0</span></li>
<li><span style="font-weight:400;">️Cloud Nine… Figures (or $80 billion)</span></li>
<li><span style="font-weight:400;">⚔️$60 Billion and Counting: The Ai Arms Race</span></li>
<li><span style="font-weight:400;">Oracle Exadata X11M: For When You Absolutely, Positively, Have to Burn Money</span></li>
<li><span style="font-weight:400;">The Cloud Pod rebrands to The Cloud AI so we can get 11B in funding</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>General News</b></h2>
<p><b>2:42 </b> <a href="https://siliconangle.com/2024/12/09/oracles-rampant-cloud-growth-wasnt-enough-wall-street-stock-slides-hours/" target="_blank" rel="noreferrer noopener"><b>Oracle’s rampant cloud growth wasn’t enough for Wall Street, and its stock </b></a><a href="https://siliconangle.com/2024/12/09/oracles-rampant-cloud-growth-wasnt-enough-wall-street-stock-slides-hours/" target="_blank" rel="noreferrer noopener"><b>slides after-hours</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">We missed talking about </span><a href="https://www.oracle.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Oracle</span></a><span style="font-weight:400;">’s earnings call on December 9th, since we were in the middle of our re:Invent shows. Apparently, their rapid cloud growth was not sufficient to appease the Wall Street gods., but honestly – what is ever good enough for them?  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They </span><a href="https://investor.oracle.com/investor-news/news-details/2024/Oracle-Announces-Fiscal-2025-Second-Quarter-Financial-Results/default.aspx" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">reported</span></a><span style="font-weight:400;"> earnings of 1.47 a share, just shy of the 1.48 expected by the analysts. Revenue was up 9% from a year before, at $14.06B below the street’s target of $14.1 Billion.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Income was up 26% from prior year, to 3.15B.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Revenue from cloud services and license support was up 12% to 10.8 billion. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Oracle CEO Safra Catz said growth in the AI segment was nothing short of extraordinary, with 336% growth in GPU unit consumption from the prior year. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Despite positive signs, Oracle guidance was soft and this also angered the Wall Street gods. </span></li>
</ul>
<p><i><span style="font-weight:400;">04:09  Justin – “…</span></i><i><span style="font-weight:400;">now in January, their stock is, up a dollar 11 today, but, looking at the month, they haven’t really recovered from earnings quite yet. So...</span></i></p>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 287 of The Cloud Pod – where the forecast is always cloudy! 2025 is already shaping up to be another year of “unprecedented” times, but have no fear, Justin, Ryan, Jonathan, and Matthew are all in the house and (mostly) recovered from the holidays – and just in time to bring you all the latest new year news in the cloud world. 
Titles we almost went with this week:

☁️Everyone is investing in AI… but you could invest in the cloud pod
Oracle Exadata X11M: Burn a big pile of money
The cloud pod has better security than Microsoft – mk
️The new and improved Cloud Pod 4.0
️Cloud Nine… Figures (or $80 billion)
⚔️$60 Billion and Counting: The Ai Arms Race
Oracle Exadata X11M: For When You Absolutely, Positively, Have to Burn Money
The Cloud Pod rebrands to The Cloud AI so we can get 11B in funding

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
General News
2:42  Oracle’s rampant cloud growth wasn’t enough for Wall Street, and its stock slides after-hours 

We missed talking about Oracle’s earnings call on December 9th, since we were in the middle of our re:Invent shows. Apparently, their rapid cloud growth was not sufficient to appease the Wall Street gods., but honestly – what is ever good enough for them?  
They reported earnings of 1.47 a share, just shy of the 1.48 expected by the analysts. Revenue was up 9% from a year before, at $14.06B below the street’s target of $14.1 Billion.
Income was up 26% from prior year, to 3.15B.  
Revenue from cloud services and license support was up 12% to 10.8 billion. 
Oracle CEO Safra Catz said growth in the AI segment was nothing short of extraordinary, with 336% growth in GPU unit consumption from the prior year. 
Despite positive signs, Oracle guidance was soft and this also angered the Wall Street gods. 

04:09  Justin – “…now in January, their stock is, up a dollar 11 today, but, looking at the month, they haven’t really recovered from earnings quite yet. So...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[287: The Cloud Pod Rebrands to The Cloud AI So We Can Get A 1B Valuation]]>
                </itunes:title>
                                    <itunes:episode>287</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p><span style="font-weight:400;">Welcome to episode 287 of The Cloud Pod – where the forecast is always cloudy! 2025 is already shaping up to be another year of “unprecedented” times, but have no fear, Justin, Ryan, Jonathan, and Matthew are all in the house and (mostly) recovered from the holidays – and just in time to bring you all the latest new year news in the cloud world. </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">☁️Everyone is investing in AI… but you could invest in the cloud pod</span></li>
<li><span style="font-weight:400;">Oracle Exadata X11M: Burn a big pile of money</span></li>
<li><span style="font-weight:400;">The cloud pod has better security than Microsoft – mk</span></li>
<li><span style="font-weight:400;">️The new and improved Cloud Pod 4.0</span></li>
<li><span style="font-weight:400;">️Cloud Nine… Figures (or $80 billion)</span></li>
<li><span style="font-weight:400;">⚔️$60 Billion and Counting: The Ai Arms Race</span></li>
<li><span style="font-weight:400;">Oracle Exadata X11M: For When You Absolutely, Positively, Have to Burn Money</span></li>
<li><span style="font-weight:400;">The Cloud Pod rebrands to The Cloud AI so we can get 11B in funding</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>General News</b></h2>
<p><b>2:42 </b> <a href="https://siliconangle.com/2024/12/09/oracles-rampant-cloud-growth-wasnt-enough-wall-street-stock-slides-hours/" target="_blank" rel="noreferrer noopener"><b>Oracle’s rampant cloud growth wasn’t enough for Wall Street, and its stock </b></a><a href="https://siliconangle.com/2024/12/09/oracles-rampant-cloud-growth-wasnt-enough-wall-street-stock-slides-hours/" target="_blank" rel="noreferrer noopener"><b>slides after-hours</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">We missed talking about </span><a href="https://www.oracle.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Oracle</span></a><span style="font-weight:400;">’s earnings call on December 9th, since we were in the middle of our re:Invent shows. Apparently, their rapid cloud growth was not sufficient to appease the Wall Street gods., but honestly – what is ever good enough for them?  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They </span><a href="https://investor.oracle.com/investor-news/news-details/2024/Oracle-Announces-Fiscal-2025-Second-Quarter-Financial-Results/default.aspx" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">reported</span></a><span style="font-weight:400;"> earnings of 1.47 a share, just shy of the 1.48 expected by the analysts. Revenue was up 9% from a year before, at $14.06B below the street’s target of $14.1 Billion.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Income was up 26% from prior year, to 3.15B.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Revenue from cloud services and license support was up 12% to 10.8 billion. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Oracle CEO Safra Catz said growth in the AI segment was nothing short of extraordinary, with 336% growth in GPU unit consumption from the prior year. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Despite positive signs, Oracle guidance was soft and this also angered the Wall Street gods. </span></li>
</ul>
<p><i><span style="font-weight:400;">04:09  Justin – “…</span></i><i><span style="font-weight:400;">now in January, their stock is, up a dollar 11 today, but, looking at the month, they haven’t really recovered from earnings quite yet. So we’ll see how they do as they continue through the year. But, yeah, I mean, tech in general is down. I mean, everything’s down. Everyone’s waiting for the election to, election, the, the soaring in and the new administration to come in as we’re past that.”</span></i></p>
<p><b>04:34</b> <a href="https://www.hashicorp.com/blog/hashicorp-2024-year-in-review" target="_blank" rel="noreferrer noopener"><b>HashiCorp 2024 year in review</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">2024 was a busy year for Hashicorp, and they wrote up a blog post to point out the highlights.</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">IBM + Hashicorp </span><a href="https://www.hashicorp.com/blog/hashicorp-joins-ibm" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">signed an agreement</span></a><span style="font-weight:400;"> to be acquired by Big Blue. With IBM, they believe they can bring modern infrastructure and security practices to an even greater number of organizations around the world, and they are excited for the possibilities. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Terraform got numerous updates including:</span>
<ul>
<li style="font-weight:400;"><a href="https://www.hashicorp.com/blog/terraform-stacks-explained" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Terraform Stacks</span></a></li>
<li style="font-weight:400;"><a href="https://www.hashicorp.com/blog/terraform-packer-nomad-and-waypoint-updates-help-scale-ilm-at-hashiconf-2024" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Module Lifecycle Management</span></a><span style="font-weight:400;"> to simplify day 2</span></li>
<li style="font-weight:400;"><a href="https://www.hashicorp.com/blog/terraform-packer-nomad-and-waypoint-updates-help-scale-ilm-at-hashiconf-2024" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Terraform Migrate for HCP Terraform adoption</span></a></li>
<li style="font-weight:400;"><a href="https://developer.hashicorp.com/terraform/cloud-docs/registry/test" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Test-integrated module published</span></a></li>
<li style="font-weight:400;"><a href="https://www.hashicorp.com/blog/terraform-1-10-improves-handling-secrets-in-state-with-ephemeral-values" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Ephemeral Values</span></a></li>
<li style="font-weight:400;"><a href="https://www.hashicorp.com/blog/terraform-1-7-adds-test-mocking-and-config-driven-remove#config-driven-remove" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Config-driven state updates</span></a><span style="font-weight:400;"> for refactoring and importing resources</span></li>
<li style="font-weight:400;"><a href="https://www.hashicorp.com/blog/simplify-policy-adoption-in-terraform-with-pre-written-sentinel-policies-for-aws" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Pre-written sentinel policy library</span></a><span style="font-weight:400;"> co-developed with AWS</span></li>
</ul>
</li>
<li style="font-weight:400;"><a href="https://www.hashicorp.com/blog/predictable-plugin-loading-in-packer-1-11" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Packer 1.11</span></a>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">New plugin loading process</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Packer and plugin version tracking</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">CI/CD Pipeline metadata</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Nomad got significant upgrades this year</span>
<ul>
<li style="font-weight:400;"><a href="https://www.hashicorp.com/blog/nomad-bench-load-testing-and-benchmarking-for-nomad%5C" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Nomad Bench</span></a><span style="font-weight:400;"> for load testing and benchmarking Nomad</span></li>
<li style="font-weight:400;"><a href="https://www.hashicorp.com/blog/nomad-1-9-adds-nvidia-mig-support-golden-job-versions-and-more" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">NVIDIA device driver</span></a><span style="font-weight:400;"> added support</span></li>
<li style="font-weight:400;"><a href="https://www.hashicorp.com/blog/nomad-1-9-adds-nvidia-mig-support-golden-job-versions-and-more#quotas-for-device-resources-enterprise" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Enhancements for GPU scheduling</span></a><span style="font-weight:400;"> and resource quotas</span></li>
<li style="font-weight:400;"><a href="https://www.hashicorp.com/blog/nomad-1-9-adds-nvidia-mig-support-golden-job-versions-and-more" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Exec2 task driver</span></a></li>
<li style="font-weight:400;"><a href="https://www.hashicorp.com/blog/nomad-1-9-adds-nvidia-mig-support-golden-job-versions-and-more" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Libvirt task driver beta for improved virtual machine support</span></a></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Vault</span>
<ul>
<li style="font-weight:400;"><a href="https://www.youtube.com/watch?v=cel1GQ0wpQA" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Secrets Sync</span></a></li>
<li style="font-weight:400;"><a href="https://www.hashicorp.com/blog/hcp-vault-secrets-adds-enterprise-capabilities-for-auto-rotation-dynamic-secrets" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Autorotation and dynamic secrets</span></a></li>
<li style="font-weight:400;"><a href="https://www.hashicorp.com/blog/vault-1-17-brings-wif-est-support-for-pki-and-more" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">WIF Support</span></a></li>
<li style="font-weight:400;"><a href="https://www.hashicorp.com/blog/new-vault-boundary-offerings-advance-security-lifecycle-management-hashidays-2024#vault-secrets-operator" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Vault Secrets Operator for K8</span></a></li>
</ul>
</li>
<li style="font-weight:400;"><a href="https://developer.hashicorp.com/hcp/docs/vault-radar" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">HCP Vault Radar</span></a><span style="font-weight:400;"> (New product)</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Scans your digital estate for unmanaged secrets and PII</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Consul</span>
<ul>
<li style="font-weight:400;"><a href="https://www.hashicorp.com/blog/consul-1-18-ga-improves-enterprise-reliability-with-long-term-support" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Transparent proxy for ECS</span></a></li>
<li style="font-weight:400;"><a href="https://developer.hashicorp.com/consul/docs/k8s/dns/views" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Consul DNS views for K8</span></a></li>
<li style="font-weight:400;"><a href="https://www.hashicorp.com/blog/consul-1-19-improves-kubernetes-workflows-snapshot-support-and-nomad-integration" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Registration CRD for Consul for K8</span></a></li>
</ul>
</li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">06:28  Ryan- “</span></i><i><span style="font-weight:400;">I was as surprised as you are with the Nomad news. But then I was thinking about it and it’s just like, there isn’t the greatest of options for managing infrastructure if you’re not on a cloud hyperscan or so. It’s like, you can use OpenStack, which gets a little bit of support, but I don’t think it’s really, I don’t know if it’s, I still don’t know what that is for, and I don’t know if it manages infrastructure.”</span></i></p>
<p><b>07:17 </b> <a href="https://www.techzine.eu/news/infrastructure/127466/ibm-acquisition-of-hashicorp-again-in-peril-as-antritrust-looms/" target="_blank" rel="noreferrer noopener"><b>IBM acquisition of HashiCorp again in peril as antitrust looms</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Not so fast </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=55da0a592b5c4c6951ceea959a4a22c03d592746a8a565dc3f7360f22f40efb8JmltdHM9MTczNjgxMjgwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=111326c2-38f8-6e94-3a5e-34a8396a6f52&amp;psq=Hashicorp&amp;u=a1aHR0cHM6Ly93d3cuaGFzaGljb3JwLmNvbS8&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Hashi</span></a><span style="font-weight:400;"> on that acquisition…</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Britains Competitions &amp; Markets authority is going to investigate IBM’s acquisition of Hashicorp. It has launched a merge inquiry, with a deadline of February 25th. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They are asking for parties to comment before Jan 16th. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The big prize </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=d6cc88156764f5b8d1bfdbbe828b876c5925d767c4818ae251db3d38c5a055f9JmltdHM9MTczNjgxMjgwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=111326c2-38f8-6e94-3a5e-34a8396a6f52&amp;psq=IBM&amp;u=a1aHR0cHM6Ly93d3cuaWJtLmNvbS8&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">IBM</span></a><span style="font-weight:400;"> is after is of course Terraform… which has some conflicts with IBM owned </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=6a78ea7182bec4c92f10a2e7dc9353881a5dc88f56ae54f1faf3d17c079debebJmltdHM9MTczNjgxMjgwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=111326c2-38f8-6e94-3a5e-34a8396a6f52&amp;psq=Redhat+ansible&amp;u=a1aHR0cHM6Ly93d3cucmVkaGF0LmNvbS9lbi90ZWNobm9sb2dpZXMvbWFuYWdlbWVudC9hbnNpYmxl&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Red Hat Ansible</span></a><span style="font-weight:400;">. </span></li>
</ul>
<p><i><span style="font-weight:400;">09:25  Jonathan – “</span></i><i><span style="font-weight:400;">Well, just like any government agency they are somewhat self, what’s the word, you create an agency that does something, they have to do it, they have to justify their own existence, you know?”</span></i></p>
<h2><b>AI Is Going Great – Or, How ML Makes All its Money </b></h2>
<p><b>10:00</b> <a href="https://openai.com/index/why-our-structure-must-evolve-to-advance-our-mission/" target="_blank" rel="noreferrer noopener"><b>Why OpenAI’s Structure Must Evolve To Advance Our Mission</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Open AI’s board of directors is evaluating their corporate structure in order to best support the mission of ensuring AGI benefits all of humanity, with three objectives:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Choose a non-profit / </span><a href="https://openai.com/index/openai-lp/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">for-profit</span></a><span style="font-weight:400;"> structure that is best for the long-term success of the </span><a href="https://openai.com/index/openai-lp/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">mission</span></a><span style="font-weight:400;">.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Make the non-profit sustainable.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Equip each arm to do its part.</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">As they enter 2025, the Open AI business has expanded beyond a research lab, and then a startup to now they need to become an enduring company. The board is consulting with outside legal and financial advisors to determine the best structure of OpenAI to advance the mission.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">WIll keep an eye on this as it continues to develop. </span></li>
</ul>
<p><b>11:36 </b> <a href="https://openai.com/index/o1-and-new-tools-for-developers/" target="_blank" rel="noreferrer noopener"><b>OpenAI o1 and new tools for developers</b></a><b>  </b></p>
<ul>
<li style="font-weight:400;"><a href="https://www.bing.com/ck/a?!&amp;&amp;p=8e73930a2cb9b78175dc932f89035113b31ce1ed9afb6971a9fcc9dc9846163bJmltdHM9MTczNjgxMjgwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=111326c2-38f8-6e94-3a5e-34a8396a6f52&amp;psq=openai&amp;u=a1aHR0cHM6Ly9vcGVuYWkuY29tLw&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OpenAI</span></a><span style="font-weight:400;"> is introducing their more capable models, new tools for customization of those models and upgrades that improve performance, flexibility and cost-efficiency for developers building with AI. </span>
<ul>
<li style="font-weight:400;"><a href="https://platform.openai.com/docs/models#o1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OpenAI o1 in the API</span></a><span style="font-weight:400;">, with support for function calling, developer messages, structured outputs, and vision capabilities. </span></li>
<li style="font-weight:400;"><a href="https://platform.openai.com/docs/guides/realtime" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Realtime API updates</span></a><span style="font-weight:400;">, including simple WebRTC integration, a 60% price reduction for GPT-4o audio, and support for GPT-4o mini at one-tenth of previous audio rates</span></li>
<li style="font-weight:400;"><a href="https://platform.openai.com/docs/guides/fine-tuning#preference" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Preference fine-tuning</span></a><span style="font-weight:400;">, a new model customization technique that makes it easier to tailor models based on user and developer preferences</span></li>
<li style="font-weight:400;"><a href="https://platform.openai.com/docs/libraries" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">New Go and Java SDKS</span></a><span style="font-weight:400;"> available in Beta</span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">12:39  Jonathan – “</span></i><i><span style="font-weight:400;">I think the branding’s kind of messed up. Because they were really the first to launch a decent consumer-facing service, ChatGPT is like the brand name, just like Google. And so the fact that they’re not using, not calling it ChatGPT 01, it just boggles me. don’t know. I understand why they want to separate the web service from the underlying models, but at the same time, who really cares?”</span></i></p>
<p><b>16:01 </b><a href="https://www.nytimes.com/2025/01/07/technology/anthropic-ai-funding.html?unlocked_article_code=1.nk4.hTXa.enTNus36cWFH&amp;smid=url-share" target="_blank" rel="noreferrer noopener"><b>A.I. Start-Up Anthropic Is in Talks That Could Value It at $60 Billion</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Anthropic is in talks to raise a new round of funding that could value the company at $60B, up from the </span><a href="https://www.nytimes.com/2024/02/20/technology/anthropic-funding-ai.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">$16B less than a year ago</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Led by Lightspeed Venture Partners, the new round could pump an additional $2B into the company. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Since the company was founded in 2021, it has raised more than 11.3B from Venture firms.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;"> The talks come in the midst of a new surge of funding talks with the industry’s most prominent AI startups. Including the 6B raised by xAI and the 6.6B raised by OpenAI. </span></li>
</ul>
<p><i><span style="font-weight:400;">16:43 Jonathan – “</span></i><i><span style="font-weight:400;">What’s weird is of course the Chinese models like DeepSeek and Qwenn, which are trained on cheaper hardware that they have access to for much less money. I think DeepSeek was trained for like a tenth the price of any of the competing models. We’ve kind of forced the Chinese AI engineers to be really innovative because of the trade restrictions and they’re gonna eat the lunch of these companies here.”</span></i></p>
<h2><b>AWS</b></h2>
<p><b>18:15 </b> <a href="https://www.csoonline.com/article/3625205/amazon-refuses-microsoft-365-deployment-because-of-lax-cybersecurity.html" target="_blank" rel="noreferrer noopener"><b>Amazon refuses Microsoft 365 deployment because of lax cybersecurity</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon CISO CJ Moses publicly shamed Microsoft security, halting his employer’s deployment of M365 for a full year as the vendor tries to fix a long list of security problems Amazon identified. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Industry security executives are of two minds. Some applauded Amazon, saying that the online retail giant has the revenue and employees to push Microsoft to fix issues like this. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Others though were cynical, saying that the move is less altruistic, and more to improve cybersecurity and a thinly disguised sales pitch for AWS.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Moses says they conducted their own analysis of the software and asked for changes to guard against unauthorized access and create a more detailed accounting of user activity in the apps. He said they deep-dived O365 and all of the controls around it and held. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon requests included modifying tools to verify the users accessing the apps are properly authorized and once in, that their actions are tracked in a manner that Amazon’s automated systems can monitor for changes that might indicate a security risk.   </span></li>
</ul>
<p><i><span style="font-weight:400;">20:07  Matthew – “</span></i><i><span style="font-weight:400;">That’s a stretch because also looking at S3 and it doesn’t really follow the same IAM model and you know EC2 and VPC falls into the EC2 world, which doesn’t really follow the same model. like any legacy services that you try to fit into the box don’t really work great. So I almost feel like, yes, I’m not disagreeing with them where it is a hodgepodge of technologies they’ve merged together over the years into what it is now. There definitely are things that they need to clean up. But I also think that AWS still has some things there too that need to be improved.”</span></i></p>
<p><b>22:11 </b> <a href="https://aws.amazon.com/blogs/aws/stable-diffusion-3-5-large-is-now-available-in-amazon-bedrock/" target="_blank" rel="noreferrer noopener"><b>Stable Diffusion 3.5 Large is now available in Amazon Bedrock</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Pre-announced at Re:Invent 2024, </span><a href="https://aws.amazon.com/bedrock/stability-ai/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Stable Diffusion 3.5 Large</span></a><span style="font-weight:400;"> is now actually available in </span><a href="https://aws.amazon.com/bedrock/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Bedrock</span></a><span style="font-weight:400;"> allowing you to generate high-quality images from text descriptions in a wide range of styles to accelerate the creation of concept art, visual effects and detailed product imagery for customers in media, gaming, advertising and retail. </span></li>
</ul>
<p><i><span style="font-weight:400;">23:21  Jonathan – “</span></i><i><span style="font-weight:400;">One of the interesting use cases I think is becoming more popular is people making AI generated adult content and publishing it on things like OnlyFans. And they’re not even real people behind these accounts, they’re literally just machines cranking out images of people that don’t exist.”</span></i></p>
<p><b>29:50 </b> <a href="https://techcrunch.com/2025/01/07/aws-says-itll-invest-at-least-11b-to-expand-data-center-infrastructure-in-georgia/" target="_blank" rel="noreferrer noopener"><b>AWS says it’ll invest ‘at least’ $11B to expand data center infrastructure in </b></a><a href="https://techcrunch.com/2025/01/07/aws-says-itll-invest-at-least-11b-to-expand-data-center-infrastructure-in-georgia/" target="_blank" rel="noreferrer noopener"><b>Georgia</b></a><span style="font-weight:400;">  </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS has announced plans to </span><a href="https://www.aboutamazon.com/news/aws/aws-investment-georgia-ai-cloud-infrastructure" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">invest $11B in Georgia</span></a><span style="font-weight:400;"> to expand its infrastructure to support various cloud and AI technologies. AWS estimates it will create roughly 550 jobs in the state. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This comes 8 months after they </span><a href="https://www.reuters.com/technology/amazon-invest-11-billion-indiana-build-data-centers-2024-04-25/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">announced</span></a><span style="font-weight:400;"> the intent to invest 11B in datacenters in Indiana as well. </span></li>
</ul>
<p><i><span style="font-weight:400;">30:40  Justin – “11 billion dollars seems like a lot for a local zone…”</span></i></p>
<p><b>32:00 </b> <a href="https://aws.amazon.com/blogs/aws/announcing-the-new-aws-asia-pacific-thailand-region/" target="_blank" rel="noreferrer noopener"><b>Announcing the new AWS Asia Pacific (Thailand) Region</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon is announcing that the AWS Asia Pacific (Thailand) region is now generally available with three AZ’s. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This is the first region in Thailand and the fourteenth region in Asia Pac. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The adoption of cloud computing has gained significant momentum in Thailand, driven by evolving business needs and government initiatives such as </span><a href="https://www.boi.go.th/upload/content/Thailand,%20Taking%20off%20to%20new%20heights%20@%20belgium_5ab4e8042850e.pdf" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Thailand 4.0</span></a><span style="font-weight:400;">. </span></li>
</ul>
<h2><b>GCP</b></h2>
<p><b>33:27 </b> <a href="https://www.cnbc.com/2024/12/27/google-ceo-pichai-tells-employees-the-stakes-are-high-for-2025.html" target="_blank" rel="noreferrer noopener"><b>Tech Google CEO Pichai tells employees to gear up for big 2025: ‘The </b></a><a href="https://www.cnbc.com/2024/12/27/google-ceo-pichai-tells-employees-the-stakes-are-high-for-2025.html" target="_blank" rel="noreferrer noopener"><b>stakes are high’</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google CEO </span><a href="https://www.cnbc.com/sundar-pichai/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Sundar Pichai</span></a><span style="font-weight:400;"> told his employees that the stakes are high in 2025, as the company faces increased competition and regulatory hurdles and contends with rapid advances in AI. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">He addressed the need to move faster as a company as this is a disruptive moment. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">“It’s not lost on me that we are facing scrutiny across the world,” Pichai said. “It comes with our size and success. It’s part of a broader trend where tech is now impacting society at scale. So more than ever, through this moment, we have to make sure we don’t get distracted.”</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">AI, Regulation around them being a search monopoly…. Indeed it’s going to be a busy year for Google. </span></li>
</ul>
<p><i><span style="font-weight:400;">34:40  Ryan – “</span></i><i><span style="font-weight:400;">I think that just goes to show you that, you know, why some of the stakes are high and why the antitrust is there, right? Like, it’s sort of a fallacy that there’s multiple businesses within the Google ecosystem. You know, they did all the separation, but that was mostly for financial reasons and I think maybe it had some sort of driving force behind it.”</span></i></p>
<p><b>35:43</b> <a href="https://cloud.google.com/blog/products/databases/database-centers-supports-bigtable-memorystore-and-firestore/" target="_blank" rel="noreferrer noopener"><b>Database Center: Now with support for Bigtable, Firestore, and </b></a><a href="https://cloud.google.com/blog/products/databases/database-centers-supports-bigtable-memorystore-and-firestore/" target="_blank" rel="noreferrer noopener"><b>Memorystore</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is expanding the capabilities of Google </span><a href="https://cloud.google.com/database-center/docs/set-up-database-center" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Database Center</span></a><span style="font-weight:400;"> with the addition of support for </span><a href="https://www.bing.com/search?q=bigtable&amp;filters=dtbk:%22MCFvdmVydmlldyFvdmVydmlldyFjY2ZjODk2YS0zYWY0LTNiOWEtNzJjMi1mMDQ2MTE1ZDZiYzY%3D%22+sid:%22ccfc896a-3af4-3b9a-72c2-f046115d6bc6%22+tphint:%22f%22&amp;FORM=DEPNAV" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Bigtable</span></a><span style="font-weight:400;">, </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=6d953c5bcf6461930ca693534ed6e9d112ddcdb25145e74e0aad96fe72d32ca5JmltdHM9MTczNjgxMjgwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=111326c2-38f8-6e94-3a5e-34a8396a6f52&amp;psq=memorystore&amp;u=a1aHR0cHM6Ly9jbG91ZC5nb29nbGUuY29tL21lbW9yeXN0b3JlP2hsPWVu&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Memorystore</span></a><span style="font-weight:400;"> and </span><a href="https://www.bing.com/ck/a?!&amp;&amp;p=fbfaa2a716cb9eca83a75e7aabd91b35a0199beadc9ece9ad306f9d35e650f2dJmltdHM9MTczNjgxMjgwMA&amp;ptn=3&amp;ver=2&amp;hsh=4&amp;fclid=111326c2-38f8-6e94-3a5e-34a8396a6f52&amp;psq=firestore+database&amp;u=a1aHR0cHM6Ly9maXJlYmFzZS5nb29nbGUuY29tL2RvY3MvZmlyZXN0b3JlLw&amp;ntb=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Firestore Database</span></a><span style="font-weight:400;"> in preview. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Gain a comprehensive view of your entire database fleet.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Proactively de-risk your database fleet.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Optimize your database fleet with AI powered assistance. </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">36:41  Ryan – “</span></i><i><span style="font-weight:400;">Yeah, I’ve seen many people screw this up too, because they’re thinking they’re using a cache, but they actually set a non-expiration date on it. So the data just lives in the cache forever. Yeah, where the data they put in there. Yeah, why does the data disappear? Yeah, I’ve seen that one too.”</span></i></p>
<h2><b>Azure</b></h2>
<p><b>39:22</b> <a href="https://azure.microsoft.com/en-us/blog/announcing-the-o1-model-in-azure-openai-service-multimodal-reasoning-with-astounding-analysis/" target="_blank" rel="noreferrer noopener"><b>Announcing the o1 model in Azure OpenAI Service: Multimodal reasoning </b></a><a href="https://azure.microsoft.com/en-us/blog/announcing-the-o1-model-in-azure-openai-service-multimodal-reasoning-with-astounding-analysis/" target="_blank" rel="noreferrer noopener"><b>with “astounding” analysis</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft </span><a href="https://azure.microsoft.com/en-us/products/ai-services/openai-service/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Azure OpenAI Service</span></a><span style="font-weight:400;"> will support the o1 model soon.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The new model brings advanced reasoning capabilities and improvements that will significantly enhance your AI applications and solutions. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Weird to pre announce – and do a full court press release for something you can’t even use. We don’t understand your tactics, Azure.  </span></li>
</ul>
<p><b>40:29 </b> <a href="https://techcrunch.com/2025/01/03/microsoft-to-spend-80-billion-in-fy25-on-data-centers-for-ai/" target="_blank" rel="noreferrer noopener"><b>Microsoft to spend $80 billion in FY’25 on data centers for AI</b></a><b>  </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft has earmarked $80B in fiscal 2025 to build data centers designed to handle AI workloads. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">These AI enabled data centers will be designed to train AI models and deploy AI and cloud-based applications around the world. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">“</span><i><span style="font-weight:400;">As we look into the future, it’s clear that artificial intelligence is poised to become a world-changing GPT. AI promises to drive innovation and boost productivity in every sector of the economy</span></i><span style="font-weight:400;">,” Brad Smith Microsoft Vice Chair and President wrote. “</span><i><span style="font-weight:400;">The United States is poised to stand at the forefront of this new technology wave, especially if it doubles down on its strengths and effectively partners internationally</span></i><span style="font-weight:400;">.”</span></li>
</ul>
<p><i><span style="font-weight:400;">41:13  Justin – “</span></i><i><span style="font-weight:400;">They’re building Skynet. It’s Microsoft. They’re building Skynet. It’s not going to be secure. It’s going to get taken over by somebody. We already talked about Microsoft. Were you here earlier?”</span></i></p>
<h2><b>Oracle</b></h2>
<p><b>42:58  </b><a href="https://www.oracle.com/news/announcement/oracle-introduces-exadata-x11m-platform-2025-01-07/" target="_blank" rel="noreferrer noopener"><b>Oracle Exadata X11M Delivers Extreme Performance, Increased Efficiency, </b></a><a href="https://www.oracle.com/news/announcement/oracle-introduces-exadata-x11m-platform-2025-01-07/" target="_blank" rel="noreferrer noopener"><b>and Improved Energy Savings for Data and AI Workloads</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">If you need a great way to burn that shiny new budget you received in 2025, Oracle is announcing the new and improved Oracle Exadata X11M, the latest generation of the Exadata platform. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The X11M has significant performance improvements over the X10M with 55% faster Vector Searches, 25% faster OLTP transactions and concurrent transactions and 25% faster analytic query processing. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The initial Exadata infrastructure that includes 2 database servers and 3 storage servers with 8 ECPU hours, will run you 12,799 dollars per month… it gets crazy real fast. Buyer beware! </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Just bumping to 64 ECPU increases the price to 26,800 dollars. </span></li>
</ul>
<p><i><span style="font-weight:400;">43:41  Matthew – “</span></i><i><span style="font-weight:400;">How many seconds can you burn your budget in?”</span></i></p>
<p><span style="font-weight:400;">Closing</span></p>
<p><span style="font-weight:400;">And that is the week in the cloud! Visit our website, the home of the Cloud Pod, where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet, toot, bluesky us with hashtag #theCloudPod The Cloud Pod, a renowned cloud technology platform, has recently rebranded itself as The Cloud AI with the objective of achieving a staggering 1B valuation. This strategic move aligns with their vision to leverage artificial intelligence capabilities for exponential growth. Stay updated with the latest developments and insights by visiting their website, theCloud Pod.net, where you can also subscribe to their newsletter, join their slack team, and engage with them on social media using the hashtag #theCloudPod. </span></p>
]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/1941708/c1e-mq1qan9jw8hx9wmp-ndon0zwguoqw-th1q49.mp3" length="56705483"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 287 of The Cloud Pod – where the forecast is always cloudy! 2025 is already shaping up to be another year of “unprecedented” times, but have no fear, Justin, Ryan, Jonathan, and Matthew are all in the house and (mostly) recovered from the holidays – and just in time to bring you all the latest new year news in the cloud world. 
Titles we almost went with this week:

☁️Everyone is investing in AI… but you could invest in the cloud pod
Oracle Exadata X11M: Burn a big pile of money
The cloud pod has better security than Microsoft – mk
️The new and improved Cloud Pod 4.0
️Cloud Nine… Figures (or $80 billion)
⚔️$60 Billion and Counting: The Ai Arms Race
Oracle Exadata X11M: For When You Absolutely, Positively, Have to Burn Money
The Cloud Pod rebrands to The Cloud AI so we can get 11B in funding

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
General News
2:42  Oracle’s rampant cloud growth wasn’t enough for Wall Street, and its stock slides after-hours 

We missed talking about Oracle’s earnings call on December 9th, since we were in the middle of our re:Invent shows. Apparently, their rapid cloud growth was not sufficient to appease the Wall Street gods., but honestly – what is ever good enough for them?  
They reported earnings of 1.47 a share, just shy of the 1.48 expected by the analysts. Revenue was up 9% from a year before, at $14.06B below the street’s target of $14.1 Billion.
Income was up 26% from prior year, to 3.15B.  
Revenue from cloud services and license support was up 12% to 10.8 billion. 
Oracle CEO Safra Catz said growth in the AI segment was nothing short of extraordinary, with 336% growth in GPU unit consumption from the prior year. 
Despite positive signs, Oracle guidance was soft and this also angered the Wall Street gods. 

04:09  Justin – “…now in January, their stock is, up a dollar 11 today, but, looking at the month, they haven’t really recovered from earnings quite yet. So...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/1941708/c1a-k5d5-25n8kgz9ujvd-4zyhhi.jpg"></itunes:image>
                                                                            <itunes:duration>00:47:16</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                            </item>
                    <item>
                <title>
                    <![CDATA[286: I Can Sum Up 2024 – AI AI AI AI and uhh… ML]]>
                </title>
                <pubDate>Wed, 01 Jan 2025 21:48:33 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/1932764</guid>
                                    <link>https://tcpfm.castos.com/episodes/286-i-can-sum-up-2024</link>
                                <description>
                                            <![CDATA[<p><span style="font-weight:400;">Welcome to episode 286 of The Cloud Pod – where the forecast is always cloudy! Welcome to the final show of 2024! We thank you for joining us on our cloud journey over the past year. During this last show of the year, we look back on all the tech that changed our jobs and lives, and make predictions for an AI filled 2025. Join Justin, Jonathan, Ryan, and Matthew as they look forward to even more discussions about undersea cables. Happy New Year! </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">We thought 2024 would never end</span></li>
<li><span style="font-weight:400;">I can sum up 2024 – AI AI AI AI and uhh AI</span></li>
<li><span style="font-weight:400;">️AI has taken over the Cloud Pod – we are not really here</span></li>
<li><span style="font-weight:400;">‍2024 the year we hoped AI would replace us… close but not yet</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>General News</b></h2>
<h2><b>00:31   2024 Predictions Look Back</b></h2>
<p><span style="font-weight:400;">Matt</span></p>
<ul>
<li><b>Simpler and Easier to access LLM with new services</b></li>
<li><span style="font-weight:400;">Kubernetes will become simpler for smaller companies to operate that doesn’t require Highly Paid Devops/Scientists</span></li>
<li><span style="font-weight:400;">Low Employee Churn Rates and increased Tenure (Quiet Quitting)</span></li>
</ul>
<p><i><span style="font-weight:400;">02:07  Matthew – “</span></i><i><span style="font-weight:400;">How is it simpler and easier? I think that there are more ways to run it. The general public has an easier way to access it. And they are simpler as Justin said that they are becoming easier and more efficient and better to use for the average user. So I know that I talked to many people that I work with now and just in general and people that are not in tech, which I feel like a year ago.”</span></i></p>
<p><span style="font-weight:400;">Jonathan</span></p>
<ul>
<li><span style="font-weight:400;">There will be mass layoffs in tech directly attributed to AI in Q1 2024 (10k or more)</span></li>
<li><span style="font-weight:400;">Someone will start a cult that follows an AI LLM God believing in sentience, a higher power. </span></li>
<li><b>AI will find a new home in education. Lesson Plans, Personalized Learning plans by students, etc. </b></li>
</ul>
<p><i><span style="font-weight:400;">02:07  Jonathan – “</span></i><i><span style="font-weight:400;">Well, there is a religion called the First Church of Artificial Intelligence, but it’s been around for longer than this year. I think it’s like five, six years old at this point. So that’s kind of cheating.</span></i></p>
<p><span style="font-weight:400;">Ryan</span></p>
<ul>
<li><span style="font-weight:400;">Start seeing the financial impact of AI to better profitability by using AI.</span></li>
<li><b>AI Solution tied towards new employee onboarding (replace wiki technology)</b></li>
<li><span style="font-weight:400;">Removal of stateful firewalls as traffic ruleset (next-gen next-gen firewall)</span></li>
</ul>
<p><i><span style="font-weight:400;">02:07  Ryan – “</span></i><i><span style="font-weight:400;">I mean, agentic AI is something that’s been rolled out in a lot of companies. I know in my day job, it’s been rolled out. I hope to see this get even stronger and more obvious just because I think that, you know, the days of searching through thousands of documents or the one, you know, unmaintained team page that someone built three years ago when they were new are over. And so I’d like to see this continue.</span></i></p>
<p><span style="font-weight:400;">Justi...</span></p>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 286 of The Cloud Pod – where the forecast is always cloudy! Welcome to the final show of 2024! We thank you for joining us on our cloud journey over the past year. During this last show of the year, we look back on all the tech that changed our jobs and lives, and make predictions for an AI filled 2025. Join Justin, Jonathan, Ryan, and Matthew as they look forward to even more discussions about undersea cables. Happy New Year! 
Titles we almost went with this week:

We thought 2024 would never end
I can sum up 2024 – AI AI AI AI and uhh AI
️AI has taken over the Cloud Pod – we are not really here
‍2024 the year we hoped AI would replace us… close but not yet

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
General News
00:31   2024 Predictions Look Back
Matt

Simpler and Easier to access LLM with new services
Kubernetes will become simpler for smaller companies to operate that doesn’t require Highly Paid Devops/Scientists
Low Employee Churn Rates and increased Tenure (Quiet Quitting)

02:07  Matthew – “How is it simpler and easier? I think that there are more ways to run it. The general public has an easier way to access it. And they are simpler as Justin said that they are becoming easier and more efficient and better to use for the average user. So I know that I talked to many people that I work with now and just in general and people that are not in tech, which I feel like a year ago.”
Jonathan

There will be mass layoffs in tech directly attributed to AI in Q1 2024 (10k or more)
Someone will start a cult that follows an AI LLM God believing in sentience, a higher power. 
AI will find a new home in education. Lesson Plans, Personalized Learning plans by students, etc. 

02:07  Jonathan – “Well, there is a religion called the First Church of Artificial Intelligence, but it’s been around for longer than this year. I think it’s like five, six years old at this point. So that’s kind of cheating.
Ryan

Start seeing the financial impact of AI to better profitability by using AI.
AI Solution tied towards new employee onboarding (replace wiki technology)
Removal of stateful firewalls as traffic ruleset (next-gen next-gen firewall)

02:07  Ryan – “I mean, agentic AI is something that’s been rolled out in a lot of companies. I know in my day job, it’s been rolled out. I hope to see this get even stronger and more obvious just because I think that, you know, the days of searching through thousands of documents or the one, you know, unmaintained team page that someone built three years ago when they were new are over. And so I’d like to see this continue.
Justi...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[286: I Can Sum Up 2024 – AI AI AI AI and uhh… ML]]>
                </itunes:title>
                                    <itunes:episode>286</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p><span style="font-weight:400;">Welcome to episode 286 of The Cloud Pod – where the forecast is always cloudy! Welcome to the final show of 2024! We thank you for joining us on our cloud journey over the past year. During this last show of the year, we look back on all the tech that changed our jobs and lives, and make predictions for an AI filled 2025. Join Justin, Jonathan, Ryan, and Matthew as they look forward to even more discussions about undersea cables. Happy New Year! </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">We thought 2024 would never end</span></li>
<li><span style="font-weight:400;">I can sum up 2024 – AI AI AI AI and uhh AI</span></li>
<li><span style="font-weight:400;">️AI has taken over the Cloud Pod – we are not really here</span></li>
<li><span style="font-weight:400;">‍2024 the year we hoped AI would replace us… close but not yet</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>General News</b></h2>
<h2><b>00:31   2024 Predictions Look Back</b></h2>
<p><span style="font-weight:400;">Matt</span></p>
<ul>
<li><b>Simpler and Easier to access LLM with new services</b></li>
<li><span style="font-weight:400;">Kubernetes will become simpler for smaller companies to operate that doesn’t require Highly Paid Devops/Scientists</span></li>
<li><span style="font-weight:400;">Low Employee Churn Rates and increased Tenure (Quiet Quitting)</span></li>
</ul>
<p><i><span style="font-weight:400;">02:07  Matthew – “</span></i><i><span style="font-weight:400;">How is it simpler and easier? I think that there are more ways to run it. The general public has an easier way to access it. And they are simpler as Justin said that they are becoming easier and more efficient and better to use for the average user. So I know that I talked to many people that I work with now and just in general and people that are not in tech, which I feel like a year ago.”</span></i></p>
<p><span style="font-weight:400;">Jonathan</span></p>
<ul>
<li><span style="font-weight:400;">There will be mass layoffs in tech directly attributed to AI in Q1 2024 (10k or more)</span></li>
<li><span style="font-weight:400;">Someone will start a cult that follows an AI LLM God believing in sentience, a higher power. </span></li>
<li><b>AI will find a new home in education. Lesson Plans, Personalized Learning plans by students, etc. </b></li>
</ul>
<p><i><span style="font-weight:400;">02:07  Jonathan – “</span></i><i><span style="font-weight:400;">Well, there is a religion called the First Church of Artificial Intelligence, but it’s been around for longer than this year. I think it’s like five, six years old at this point. So that’s kind of cheating.</span></i></p>
<p><span style="font-weight:400;">Ryan</span></p>
<ul>
<li><span style="font-weight:400;">Start seeing the financial impact of AI to better profitability by using AI.</span></li>
<li><b>AI Solution tied towards new employee onboarding (replace wiki technology)</b></li>
<li><span style="font-weight:400;">Removal of stateful firewalls as traffic ruleset (next-gen next-gen firewall)</span></li>
</ul>
<p><i><span style="font-weight:400;">02:07  Ryan – “</span></i><i><span style="font-weight:400;">I mean, agentic AI is something that’s been rolled out in a lot of companies. I know in my day job, it’s been rolled out. I hope to see this get even stronger and more obvious just because I think that, you know, the days of searching through thousands of documents or the one, you know, unmaintained team page that someone built three years ago when they were new are over. And so I’d like to see this continue.</span></i></p>
<p><span style="font-weight:400;">Justin</span></p>
<ol>
<li style="font-weight:400;"><span style="font-weight:400;">LLM will hit the trough of disillusionment either on Cost, Environmental impact or people realizing how limited these models are</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Another AI model other than Transformer based</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">We will see another large defector from Public Cloud (not 37 Signals or X/Twitter)</span></li>
</ol>
<p><i><span style="font-weight:400;">13:26  Justin – “</span></i><i><span style="font-weight:400;">I feel partially vindicated that I was sort of right, just I thought we didn’t be in the trough a little faster, but maybe it’s coming still. I don’t know. they’re innovating pretty quickly. I don’t think they’ll get there, but definitely environmental is going to become a big, big conversation around AI.”</span></i></p>
<h2><b>17:02  Favorite Story of 2024</b></h2>
<p><span style="font-weight:400;">Did you remember that Gemini wasn’t a thing in 2023? It feels like it’s been around forever. 2024 saw some serious jumps forward in tech and innovation, as well as a lot of quality of life improvements overall. But here’s a quick rundown of our favorite articles from the past year: </span></p>
<p><b>Ryan</b></p>
<p><span style="font-weight:400;">Introduction of RAG into the AI models</span></p>
<p><a href="https://aws.amazon.com/blogs/aws/knowledge-bases-for-amazon-bedrock-now-supports-amazon-aurora-postgresql-and-cohere-embedding-models/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">https://aws.amazon.com/blogs/aws/knowledge-bases-for-amazon-bedrock-now-supports-amazon-aurora-postgresql-and-cohere-embedding-models/</span></a></p>
<p><b>Matthew</b></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-cloudfront-vpc-origins/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-cloudfront-vpc-origins/</span></a><span style="font-weight:400;"> </span></p>
<p><b>Jonathan</b></p>
<p><a href="https://www.theinformation.com/articles/sam-altman-to-return-to-openai-board-of-directors?rc=3t8xtd" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Open AI Sam Altman Drama</span></a></p>
<p><b>Justin</b></p>
<p><a href="https://cloud.google.com/blog/products/infrastructure/announcing-humboldt-the-first-cable-route-between-south-america-and-asia-pacific" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Announcing Humboldt, the first cable route between South America and Asia-Pacific</span></a></p>
<p><b>Other 2024 things of note:</b></p>
<ul>
<li><span style="font-weight:400;">Call chat gpt</span></li>
<li><a href="https://blog.google/technology/google-deepmind/google-deepmind-demis-hassabis-john-jumper-nobel-prize-chemistry-alphafold/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">https://blog.google/technology/google-deepmind/google-deepmind-demis-hassabis-john-jumper-nobel-prize-chemistry-alphafold/</span></a><span style="font-weight:400;"> </span></li>
<li><a href="https://focus.finops.org/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Finops FOCUS</span></a></li>
<li><a href="https://siliconangle.com/2024/01/10/terraform-fork-opentofu-launches-general-availability/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Terraform fork OpenTofu launches into general availability</span></a><span style="font-weight:400;"> </span></li>
<li><a href="https://www.theregister.com/2024/01/10/broadcom_ends_vmware_partner_program/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Broadcom ditches VMware Cloud Service Providers</span></a><span style="font-weight:400;"> </span></li>
<li><a href="https://azure.microsoft.com/en-us/updates/azure-elastic-san-is-now-generally-available/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Azure Elastic SAN is now generally available</span></a><span style="font-weight:400;"> </span></li>
<li><a href="https://openai.com/index/hello-gpt-4o/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Hello GPT-4o</span></a></li>
<li><a href="https://aws.amazon.com/blogs/aws/introducing-amazon-guardduty-malware-protection-for-amazon-s3/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Introducing Amazon GuardDuty Malware Protection for Amazon S3</span></a></li>
<li><a href="https://www.businessinsider.com/aws-deprioritized-cloud-services-surprising-customers-salespeople-2024-8" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon decision to deprioritize 7 cloud services caught customers and even some salespeople by surprise</span></a><span style="font-weight:400;"> </span></li>
<li><a href="https://www.powermag.com/aws-acquiring-data-center-campus-powered-by-nuclear-energy/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">https://www.powermag.com/aws-acquiring-data-center-campus-powered-by-nuclear-energy/</span></a><span style="font-weight:400;"> </span></li>
<li><a href="https://cloud.google.com/blog/products/databases/announcing-memorystore-for-valkey" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">https://cloud.google.com/blog/products/databases/announcing-memorystore-for-valkey</span></a><span style="font-weight:400;"> </span></li>
<li><a href="https://techcommunity.microsoft.com/t5/azure-sql-blog/elastic-pools-for-azure-sql-database-hyperscale-now-generally/ba-p/4242658" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Elastic pools for Azure SQL Database Hyperscale now Generally Available!</span></a></li>
<li><a href="https://techcommunity.microsoft.com/blog/azuresqlblog/introducing-database-watcher-for-azure-sql/4085637" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Database watcher</span></a><span style="font-weight:400;"> (Preview)</span></li>
<li><a href="https://aws.amazon.com/about-aws/whats-new/2024/08/amazon-s3-conditional-writes/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">https://aws.amazon.com/about-aws/whats-new/2024/08/amazon-s3-conditional-writes/</span></a></li>
<li><span style="font-weight:400;">Flex consumption webapp (</span><a href="https://techcommunity.microsoft.com/blog/appsonazureblog/announcing-azure-functions-flex-consumption-sign-up-for-the-early-access-preview/3983621" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">https://techcommunity.microsoft.com/blog/appsonazureblog/announcing-azure-functions-flex-consumption-sign-up-for-the-early-access-preview/3983621</span></a><span style="font-weight:400;">) </span></li>
<li><a href="https://azure.microsoft.com/en-us/blog/enhance-your-security-capabilities-with-azure-bastion-premium/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Enhance your security capabilities with Azure Bastion Premium</span></a><span style="font-weight:400;">  </span></li>
<li><a href="https://aws.amazon.com/about-aws/whats-new/2024/03/aws-cost-allocation-tags-retroactive-application/?ck_subscriber_id=512838477" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Cost Allocation Tags now support retroactive application</span></a></li>
<li><a href="https://azure.microsoft.com/en-us/updates/general-availability-of-the-automatic-scaling-feature" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">General Availability: Automatic Scaling for App Service Web Apps</span></a></li>
<li><a href="https://techcrunch.com/2024/10/17/microsoft-said-it-lost-weeks-of-security-logs-for-its-customers-cloud-products/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Microsoft said it lost weeks of security logs for its customers’ cloud products</span></a></li>
</ul>
<h2><b>32:11   2025 Predictions</b></h2>
<p><b>Ryan</b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Someone will come up with the ability to quickly provide an LLM model for individuals.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">AI will go to the edge of the computing layer, in a more native edge stack of some kind. In a native way (Lambda on the edge-esque but AI.)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Cloud native security mesh for multi-cloud hybrid environments. App to App at the edge </span></li>
</ul>
<p><b>Matthew</b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">We are going to see FOCUS be adopted by Snowflake or Databricks, who sell consumption models outside of hyperscalers</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Lot more security in AI, ethical focus and features in AI. A SOC or ISO specific standard for AI. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon is going to keep deprecating at least 5 more services. Workmail for an extra point</span></li>
</ul>
<p><b>Jonathan</b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">A company will claim that Artificial General Intelligence has been achieved (sentience?)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Delegation of work to existing AI Agent Personal assistance that work in the real world. (for example Google booking reservations, but AI)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Models that can learn in real-time. (not referencing current information) but incorporating it via what they learned in conversations or interactions with other systems. </span></li>
</ul>
<p><b>Justin</b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Over 10 companies after Q2 (Amazon and AT&amp;T) will announce they are returning to office 5 days a week</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Open AI will not be seen as the leader that they are in 2024. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">We will have a GPT 5, a Claude 4 and a Gemini 3.0 </span></li>
</ul>
<p><i><span style="font-weight:400;">45:01  Justin – “</span></i><i><span style="font-weight:400;">I just feel like their innovation curve has definitely slowed down where I still see Claude and Gemini and Alibaba you mentioned. They’re all innovating quite a bit and I would not be shocked to see the market shift.”</span></i></p>
<p><i><span style="font-weight:400;">53:36  Jonathan – “</span></i><i><span style="font-weight:400;">That was kind of, that was going to be one of my predictions, but I couldn’t really quantify it in a way which would be measurable to win the point. I think there’s obviously a need for tons of data. I’m not going to say that we’re running out of data exactly, although the quality is a bit questionable, but I think access to data is going to be super important. And I didn’t know how to turn that into a prediction, but like when I, when I go to Safeway and buy my groceries, I want a way to get my, my like receipt electronically, so that I can plug that into an AI. So then I go to do my groceries, my AI will know what’s in my pantry. And if I say, what can I cook to eat today? And it can be like, well, you’ve got this stuff. Why don’t you make this? I just think there’s so many places where access to data would make life easier for a consumer. And right now, it’s very asymmetric. Safeway or Albertsons have access to all the data. They can market the shit out of me because they know exactly what I buy, when I buy – patterns of all kinds of stuff, but I have none of that. So I want to see some of that asymmetry go away and I want access to the data that other people have about me.”</span></i></p>
<p><b>50:27 And since we suck at predictions, here are other experts who may also </b><b>suck:</b></p>
<ul>
<li class="display-heading-04"><a href="https://www.ciodive.com/news/6-enterprise-technology-predictions-2025/735181/" target="_blank" rel="noreferrer noopener">6 enterprise technology predictions to watch in 2025</a></li>
<li><a href="https://www.allthingsdistributed.com/2024/12/tech-predictions-for-2025-and-beyond.html" target="_blank" rel="noreferrer noopener">Werner Vogels – Tech predictions for 2025 and beyond</a></li>
</ul>
<p> </p>
<h2><span style="font-weight:400;">Closing</span></h2>
<p><span style="font-weight:400;">And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</span></p>
]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/1932764/c1e-2okob822vgbm26rz-ok37v7n1igp0-s2msuu.mp3" length="71565499"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 286 of The Cloud Pod – where the forecast is always cloudy! Welcome to the final show of 2024! We thank you for joining us on our cloud journey over the past year. During this last show of the year, we look back on all the tech that changed our jobs and lives, and make predictions for an AI filled 2025. Join Justin, Jonathan, Ryan, and Matthew as they look forward to even more discussions about undersea cables. Happy New Year! 
Titles we almost went with this week:

We thought 2024 would never end
I can sum up 2024 – AI AI AI AI and uhh AI
️AI has taken over the Cloud Pod – we are not really here
‍2024 the year we hoped AI would replace us… close but not yet

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
General News
00:31   2024 Predictions Look Back
Matt

Simpler and Easier to access LLM with new services
Kubernetes will become simpler for smaller companies to operate that doesn’t require Highly Paid Devops/Scientists
Low Employee Churn Rates and increased Tenure (Quiet Quitting)

02:07  Matthew – “How is it simpler and easier? I think that there are more ways to run it. The general public has an easier way to access it. And they are simpler as Justin said that they are becoming easier and more efficient and better to use for the average user. So I know that I talked to many people that I work with now and just in general and people that are not in tech, which I feel like a year ago.”
Jonathan

There will be mass layoffs in tech directly attributed to AI in Q1 2024 (10k or more)
Someone will start a cult that follows an AI LLM God believing in sentience, a higher power. 
AI will find a new home in education. Lesson Plans, Personalized Learning plans by students, etc. 

02:07  Jonathan – “Well, there is a religion called the First Church of Artificial Intelligence, but it’s been around for longer than this year. I think it’s like five, six years old at this point. So that’s kind of cheating.
Ryan

Start seeing the financial impact of AI to better profitability by using AI.
AI Solution tied towards new employee onboarding (replace wiki technology)
Removal of stateful firewalls as traffic ruleset (next-gen next-gen firewall)

02:07  Ryan – “I mean, agentic AI is something that’s been rolled out in a lot of companies. I know in my day job, it’s been rolled out. I hope to see this get even stronger and more obvious just because I think that, you know, the days of searching through thousands of documents or the one, you know, unmaintained team page that someone built three years ago when they were new are over. And so I’d like to see this continue.
Justi...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/1932764/c1a-k5d5-8d96548nb07x-lhl1gv.jpg"></itunes:image>
                                                                            <itunes:duration>00:59:39</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                            </item>
                    <item>
                <title>
                    <![CDATA[285: 6 years of cloud news… and we’re still talking about FPGAs and PowerPC]]>
                </title>
                <pubDate>Thu, 26 Dec 2024 19:26:24 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/1929542</guid>
                                    <link>https://tcpfm.castos.com/episodes/286-6-years-cloud-pod-birthday-fpga</link>
                                <description>
                                            <![CDATA[<p><span style="font-weight:400;">Welcome to episode 285 of the Explain it to me Like I’m 5 Podcast, formerly known as The Cloud Pod – where the forecast is always cloudy! We’ve got a lot of news this week, including the last of our coverage from re:Invent, ChatGTP Pro, FPGA, and even some major staffing turnovers.</span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul style="list-style-type:circle;">
<li><span style="font-weight:400;">Throw $200 dollars in a fire with ChatGPT Pro</span></li>
<li><span style="font-weight:400;">Jeff Barr is wrapped up by Agentic AI</span></li>
<li><span style="font-weight:400;">️The Tribble with Trilliums</span></li>
<li><span style="font-weight:400;">️The Wind in the Quantum Willows </span></li>
<li><span style="font-weight:400;">⚰️Rise of the dead instances FPGA and PowerPC</span></li>
<li><span style="font-weight:400;">Jeff Barr is replaced by Nova</span></li>
<li><span style="font-weight:400;">The Cloud Pod: Return of the dead instances types</span></li>
<li><span style="font-weight:400;">After 6 year Jeff Barr hands over the reigns to the CloudPod</span></li>
<li><span style="font-weight:400;">⌚For our 6th birthday Jeff barr Retires</span></li>
<li><span style="font-weight:400;">For our 6th birthday jeff barr delegates announcements to the cloud pod</span></li>
<li><span style="font-weight:400;">6 years of meaningless PR drivel</span></li>
<li><span style="font-weight:400;">‍6 years of cloud news and we still don’t know what Quantum computing is</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>General News</b></h2>
<p><span style="font-weight:400;">HAPPY 6th BIRTHDAY! </span></p>
<p><b>2:00 </b><a href="https://www.hashicorp.com/blog/hashicorp-at-re-invent-2024-security-lifecycle-management-with-aws" target="_blank" rel="noreferrer noopener"><b>HashiCorp at re:Invent 2024: Security Lifecycle Management with AWS</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Hashi is a big sponsor of </span><a href="https://www.hashicorp.com/blog/hashicorp-at-aws-re-invent-your-blueprint-for-cloud-success" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">re:Invent</span></a><span style="font-weight:400;">, so of course they had some news of their own to release. </span></li>
<li style="font-weight:400;"><a href="https://www.hashicorp.com/products/vault/hcp-vault-secrets" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">HCP Vault Secrets</span></a> <a href="https://www.youtube.com/watch?v=XdvKbzOM8h4" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">auto-rotation</span></a><span style="font-weight:400;"> is now generally available. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Dynamic secrets are generally available via HCP Vault Secrets.</span></li>
<li style="font-weight:400;"><a href="https://developer.hashicorp.com/vault/docs/sync" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Secrets sync</span></a><span style="font-weight:400;"> will help keep your secrets synced with </span><a href="https://developer.hashicorp.com/vault/docs/sync/awssm" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Secrets Manager</span></a><span style="font-weight:400;">. It still appears to be one direction, but you can now also view secrets in AWS Secrets Manager that are managed by vault. </span></li>
<li style="font-weight:400;"><a href="https://developer.hashicorp.com/hcp/docs/vault-radar" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">HCP Vault Radar</span></a><span style="font-weight:400;">, now in beta, auto...</span></li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 285 of the Explain it to me Like I’m 5 Podcast, formerly known as The Cloud Pod – where the forecast is always cloudy! We’ve got a lot of news this week, including the last of our coverage from re:Invent, ChatGTP Pro, FPGA, and even some major staffing turnovers.
Titles we almost went with this week:

Throw $200 dollars in a fire with ChatGPT Pro
Jeff Barr is wrapped up by Agentic AI
️The Tribble with Trilliums
️The Wind in the Quantum Willows 
⚰️Rise of the dead instances FPGA and PowerPC
Jeff Barr is replaced by Nova
The Cloud Pod: Return of the dead instances types
After 6 year Jeff Barr hands over the reigns to the CloudPod
⌚For our 6th birthday Jeff barr Retires
For our 6th birthday jeff barr delegates announcements to the cloud pod
6 years of meaningless PR drivel
‍6 years of cloud news and we still don’t know what Quantum computing is

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
General News
HAPPY 6th BIRTHDAY! 
2:00 HashiCorp at re:Invent 2024: Security Lifecycle Management with AWS

Hashi is a big sponsor of re:Invent, so of course they had some news of their own to release. 
HCP Vault Secrets auto-rotation is now generally available. 
Dynamic secrets are generally available via HCP Vault Secrets.
Secrets sync will help keep your secrets synced with AWS Secrets Manager. It still appears to be one direction, but you can now also view secrets in AWS Secrets Manager that are managed by vault. 
HCP Vault Radar, now in beta, auto...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[285: 6 years of cloud news… and we’re still talking about FPGAs and PowerPC]]>
                </itunes:title>
                                    <itunes:episode>285</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p><span style="font-weight:400;">Welcome to episode 285 of the Explain it to me Like I’m 5 Podcast, formerly known as The Cloud Pod – where the forecast is always cloudy! We’ve got a lot of news this week, including the last of our coverage from re:Invent, ChatGTP Pro, FPGA, and even some major staffing turnovers.</span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul style="list-style-type:circle;">
<li><span style="font-weight:400;">Throw $200 dollars in a fire with ChatGPT Pro</span></li>
<li><span style="font-weight:400;">Jeff Barr is wrapped up by Agentic AI</span></li>
<li><span style="font-weight:400;">️The Tribble with Trilliums</span></li>
<li><span style="font-weight:400;">️The Wind in the Quantum Willows </span></li>
<li><span style="font-weight:400;">⚰️Rise of the dead instances FPGA and PowerPC</span></li>
<li><span style="font-weight:400;">Jeff Barr is replaced by Nova</span></li>
<li><span style="font-weight:400;">The Cloud Pod: Return of the dead instances types</span></li>
<li><span style="font-weight:400;">After 6 year Jeff Barr hands over the reigns to the CloudPod</span></li>
<li><span style="font-weight:400;">⌚For our 6th birthday Jeff barr Retires</span></li>
<li><span style="font-weight:400;">For our 6th birthday jeff barr delegates announcements to the cloud pod</span></li>
<li><span style="font-weight:400;">6 years of meaningless PR drivel</span></li>
<li><span style="font-weight:400;">‍6 years of cloud news and we still don’t know what Quantum computing is</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>General News</b></h2>
<p><span style="font-weight:400;">HAPPY 6th BIRTHDAY! </span></p>
<p><b>2:00 </b><a href="https://www.hashicorp.com/blog/hashicorp-at-re-invent-2024-security-lifecycle-management-with-aws" target="_blank" rel="noreferrer noopener"><b>HashiCorp at re:Invent 2024: Security Lifecycle Management with AWS</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Hashi is a big sponsor of </span><a href="https://www.hashicorp.com/blog/hashicorp-at-aws-re-invent-your-blueprint-for-cloud-success" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">re:Invent</span></a><span style="font-weight:400;">, so of course they had some news of their own to release. </span></li>
<li style="font-weight:400;"><a href="https://www.hashicorp.com/products/vault/hcp-vault-secrets" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">HCP Vault Secrets</span></a> <a href="https://www.youtube.com/watch?v=XdvKbzOM8h4" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">auto-rotation</span></a><span style="font-weight:400;"> is now generally available. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Dynamic secrets are generally available via HCP Vault Secrets.</span></li>
<li style="font-weight:400;"><a href="https://developer.hashicorp.com/vault/docs/sync" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Secrets sync</span></a><span style="font-weight:400;"> will help keep your secrets synced with </span><a href="https://developer.hashicorp.com/vault/docs/sync/awssm" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Secrets Manager</span></a><span style="font-weight:400;">. It still appears to be one direction, but you can now also view secrets in AWS Secrets Manager that are managed by vault. </span></li>
<li style="font-weight:400;"><a href="https://developer.hashicorp.com/hcp/docs/vault-radar" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">HCP Vault Radar</span></a><span style="font-weight:400;">, now in beta, automates the detection and identification of unmanaged secrets in your code, including AWS infrastructure configurations</span></li>
</ul>
<p><i><span style="font-weight:400;">03:10  Matthew – “</span></i><i><span style="font-weight:400;">This qualifies under the category of things that I feel like we talked about so long ago, I just already assumed was GA. I’m surprised that it wasn’t.”</span></i></p>
<p><b>03:34 </b><a href="https://www.hashicorp.com/blog/hashicorp-at-re-invent-2024-infrastructure-lifecycle-management-with-aws" target="_blank" rel="noreferrer noopener"><b>HashiCorp at re:Invent 2024: Infrastructure Lifecycle Management with AWS</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Terraform AWS provider is now at </span><a href="https://www.hashicorp.com/blog/terraform-aws-provider-tops-3-billion-downloads" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">3 billion downloads</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The </span><a href="https://www.hashicorp.com/blog/terraform-aws-cloud-control-api-provider-now-generally-available" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Cloud Control Provider is also now generally available</span></a><span style="font-weight:400;"> with the 1.0 release.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This is the provider built around </span><a href="https://registry.terraform.io/providers/hashicorp/awscc/latest" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Cloud Control API</span></a><span style="font-weight:400;"> to bring new services to Hashicorp Terraform faster. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In June, AWS and Hashi partnered to </span><a href="https://www.globenewswire.com/news-release/2024/06/04/2893429/0/en/HashiCorp-and-AWS-sign-strategic-collaboration-agreement-to-expand-joint-product-and-go-to-market-initiatives.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">co-develop a comprehensive set of terraform policies</span></a><span style="font-weight:400;"> in compliance with standards like </span><a href="https://www.cisecurity.org/cis-benchmarks" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">CIS</span></a><span style="font-weight:400;">, HIPAA, FINOS and the AWS Well-Architected Framework. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In beta now, </span><a href="https://www.hashicorp.com/blog/simplify-policy-adoption-in-terraform-with-pre-written-sentinel-policies-for-aws" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">pre-written sentinel policy sets for AWS</span></a><span style="font-weight:400;"> available via the </span><a href="https://registry.terraform.io/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Terraform Registry</span></a><span style="font-weight:400;">. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Supported services include: EC2, KMS, Cloudtrail, S3, IAm, VPC, RDS, EFS</span></li>
</ul>
</li>
<li style="font-weight:400;"><a href="https://www.hashicorp.com/blog/terraform-stacks-explained" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Terraform Stacks</span></a><span style="font-weight:400;"> are now in public beta to simplify infrastructure provisioning and management at scale. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">When deploying and managing infrastructure at scale, teams usually need to provision the same infrastructure multiple times with different input values, across multiple cloud provider accounts, regions and environments and landing zones.     </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Before stacks, there was no built-in way to provision and manage the lifecycle of these instances as a single unit in Terraform, making it difficult to manage each infrastructure root module individually. </span></li>
</ul>
<p><i><span style="font-weight:400;">05:43  Ryan – “</span></i><i><span style="font-weight:400;">I’m a big fan of doing policy evaluation at, you know, Terraform and VolkTime just to get that feedback directly to whoever’s executing that Terraform, rather than have it be a security ticket later or just blocked by permissions. I feel like it’s very good feedback. So having pre-built policies makes life easy, because developing those policies isn’t exactly fun, but that’s super cool.”</span></i></p>
<p><b>08:14 </b><a href="https://www.hashicorp.com/blog/terraform-1-10-improves-handling-secrets-in-state-with-ephemeral-values" target="_blank" rel="noreferrer noopener"><b>Terraform 1.10 improves handling secrets in state with ephemeral values</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://developer.hashicorp.com/terraform/downloads" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Terraform 1.10</span></a><span style="font-weight:400;"> is now </span><a href="https://developer.hashicorp.com/terraform/install?product_intent=terraform" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">generally available</span></a><span style="font-weight:400;">, with several new features, including:</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Handling secrets. Ephemeral Values to enable secure handling of secrets.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Before secrets get persisted in the plan or state file. Since the secrets are stored in plaintext within these artifacts, any mismanaged access to the file would compromise the secrets. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">To address this, ephemeral values. These values are not stored in any artifact. Not the plan file or the statefile. They are not expected to remain consistent from plan to apply, or from one plan/apply round to the next. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Ephemeral supports marking input and output variables as ephemeral.  Within ephemeral blocks, which declare that something needs to be created or fetched separately for each terraform phase, then used to configure some other ephemeral object, and then explicitly closed before the end of the phase. </span></li>
</ul>
<p><i><span style="font-weight:400;">09:22  Ryan – “</span></i><i><span style="font-weight:400;">I’ve had to battle this with security teams who are looking at, you know, approving Terraform enterprise. I’ve had people pull secrets out of the state file and then use them inappropriately. This is a great feature to see. So pretty psyched about it.”</span></i></p>
<p><b>09:49 </b><a href="https://www.reuters.com/business/intel-ceo-pat-gelsinger-retire-2024-12-02/" target="_blank" rel="noreferrer noopener"><b>Intel CEO Gelsinger forced out after board lost confidence in turnaround </b></a></p>
<p><a href="https://www.reuters.com/business/intel-ceo-pat-gelsinger-retire-2024-12-02/" target="_blank" rel="noreferrer noopener"><b>plan</b></a><b>.</b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Interestingly on stage at AWS, they made the claim that 50% of new CPU capacity was on AWS Graviton. Note great for Intel. </span></li>
<li style="font-weight:400;"><a href="https://www.reuters.com/technology/inside-intel-ceo-pat-gelsinger-fumbled-revival-an-american-icon-2024-10-29/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">CEO Pat Gelsinger</span></a><span style="font-weight:400;"> has been forced out of </span><a href="https://www.reuters.com/markets/companies/INTC.O" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Intel</span></a><span style="font-weight:400;"> after 4 years, handing control to two lieutenants as they search for a successor. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Reports are that he left after a board meeting where the directors felt his plan was too costly and ambitious to turn Intel around – efforts so far weren’t working, and the progress of change wasn’t fast enough. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Because yeah replacing the top guy is a sure fire way to make things happen faster…</span></li>
</ul>
</li>
</ul>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Gelsinger inherited a company in 2021 rife with challenges which he only made worse in many aspects. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">He made claims about AI chip deals that exceeded Intel’s own estimates, leading the company to </span><a href="https://www.reuters.com/technology/artificial-intelligence/year-intels-touted-ai-chip-deals-have-fallen-short-2024-11-01/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">scrap revenue forecasts a month ago</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The full results of his turnaround won’t be known till next year, when he plans to bring a flagship laptop chip back into its own factories. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Intel started construction on a </span><a href="https://www.reuters.com/technology/intel-plans-new-chip-manufacturing-site-ohio-report-2022-01-21/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">$20B suite of new factories in Ohio</span></a><span style="font-weight:400;">, and hired a larger workforce to try and reclaim the crown.  This eventually led to layoffs and potential sales or spinouts of assets. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Gelsinger’s plan included becoming a major player in contract manufacturing for others, a business model called “foundry”.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Intel has announced foundry customers including Microsoft and Amazon, but neither would bring to INtel the volumes of chips needed to reach profitability. (I mean at least until it’s proven it works).</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In addition, they were looking to TSMC to build some of its chips, at the same time trying to compete with TSMC resulting in them not getting great pricing on TSMC fab.  </span></li>
</ul>
<p><i><span style="font-weight:400;">11:37  Jonathan – “</span></i><i><span style="font-weight:400;">We could do a whole episode on the screwups that Intel’s made over the years. I think they just got, they were in such a dominant position and they became complacent into a risk averse, which is kind of funny to hear that the board were complaining that Gelsinger’s plan was too risky, basically is what they were saying. So they were too risk averse, they still are risk averse. They never took AMD seriously as a competitor…I don’t think anybody could have turned Intel around in 4 years.”</span></i></p>
<h2><b>AI Is Going Great – Or, How ML Makes All its Money </b></h2>
<p><b>17:00 </b><a href="https://openai.com/index/introducing-chatgpt-pro/" target="_blank" rel="noreferrer noopener"><b>Introducing ChatGPT Pro</b></a><b>  </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Open AI is adding ChatGPT pro, a $200 monthly plan that enables scaled access to the best of OpenAI’s models and tools. This plan includes unlimited access to their smartest model, OpenAI o1, as well as to o1-mini, GPT-4o and advanced voice. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Yeah sorry… I also canceled this subscription recently. </span></li>
</ul>
<p><i><span style="font-weight:400;">17:37  Jonathan – “</span></i><i><span style="font-weight:400;">I cancelled my ChatGPT subscription a long time ago and switched to Claude and that’s $20 a month and I regularly run out of credits on there. I would imagine it’s comparatively priced in terms of the number of tokens in and out every day. I mean, I know some people are shocked by the cost, like, my God, it’s $200. But really think about the productivity increase that I’ve seen in using AI over the past few months. I’d pay it in a heartbeat, you know, if Anthropic had an equivalent plan, $200 a month, unlimited access to Claude, even slightly slowed down, you know, I don’t necessarily need like instantaneous responses, but the value you’re getting for $200 is immense.”</span></i></p>
<h2><b>AWS</b></h2>
<p><b>20:40 </b><a href="https://aws.amazon.com/blogs/aws/now-available-second-generation-fpga-powered-amazon-ec2-instances-f2/" target="_blank" rel="noreferrer noopener"><b>Now Available – Second-Generation FPGA-Powered Amazon EC2 instances </b></a></p>
<p><a href="https://aws.amazon.com/blogs/aws/now-available-second-generation-fpga-powered-amazon-ec2-instances-f2/" target="_blank" rel="noreferrer noopener"><b>(F2)</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Justin was actually surprised about this announcement – one that they didn’t cover at re:Invent – but that there is a second FPGA powered instance at all. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">AWS is announcing the F2 instance with up to 8 AMD Field-programmable gate arrays (FPGAs), </span><a href="https://www.amd.com/en/products/processors/server/epyc/7003-series.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AMD EPYC</span></a><span style="font-weight:400;"> (Milan) processors with up to 192 cores, high bandwidth memory, up to 8 TiB of SSD based instance storage and up to 2 TiB of memory, the new F2 instances are available in two sizes, and are ready to accelerate your genomics, multimedia processing, big data, satellite communication, networking, silicon simulation and live video workloads. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Some cool examples of how you might use these things:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Genomics – Astrazeneca used thousands of F1 instances to build the world’s fastest genomics pipeline, able to process over 400k whole genome samples in under two months.  They will adopt Illumina DRAGEN for F2 to realize better performance at lower cost.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Satellite operators are moving from inflexible and expensive physical infrastructure (modulators, demodulators, combiners, splitters, etc) toward, agile software-defined, FPGA powered solutions. Using DSP (digital Signal processor) elements on the FPGA, they can be reconfigured in the field to support new waveforms and meet changing requirements. Combined wit the 8 FPGAs, generous amounts of network bandwidth and support for the </span><a href="https://github.com/aws/aws-fpga" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Data Plan Development Kit</span></a><span style="font-weight:400;"> and </span><a href="https://github.com/aws/aws-fpga/tree/f2/sdk/apps/virtual-ethernet" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Virtual Ethernet</span></a><span style="font-weight:400;"> satellite providers can support processing of multiple, complex waveforms in parallel. </span></li>
<li style="font-weight:400;"><a href="https://www.neuroblade.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Neroblade</span></a><span style="font-weight:400;"> SQL processing Unit (SPU) integrates with Preso, Spark, and other open source query engines, delivering faster query processing and market-leading query throughput efficiency when running on F2. </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">22:39  Ryan – “</span></i><i><span style="font-weight:400;">Yeah, I didn’t understand what it did then. I don’t understand what it does now.”</span></i></p>
<p><b>26:37 </b><a href="https://aws.amazon.com/blogs/aws/introducing-storage-optimized-amazon-ec2-i8g-instances-powered-by-aws-graviton4-processors-and-3rd-gen-aws-nitro-ssds/" target="_blank" rel="noreferrer noopener"><b>Introducing storage optimized Amazon EC2 I8g instances powered by AWS </b></a><a href="https://aws.amazon.com/blogs/aws/introducing-storage-optimized-amazon-ec2-i8g-instances-powered-by-aws-graviton4-processors-and-3rd-gen-aws-nitro-ssds/" target="_blank" rel="noreferrer noopener"><b>Graviton4 processors and 3rd gen AWS Nitro SSDs</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/ec2/instance-types/i8g/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">EC2 I8g instances</span></a><span style="font-weight:400;"> are now available to you! </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">These new storage optimized instance types provide the highest real-time storage performance among storage-optimized EC2 instances with the third generation of </span><a href="https://aws.amazon.com/blogs/aws/aws-nitro-ssd-high-performance-storage-for-your-i-o-intensive-applications/%5C" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Nitro SSDs</span></a><span style="font-weight:400;"> and </span><a href="https://aws.amazon.com/ec2/graviton/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Graviton Processors</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">AWS Graviton 4 is the most powerful and energy efficient processor they have ever designed for a broad range of workloads running on EC2 instances using a 64-bit ARM instruction set architecture. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">I8g is the first instance type to use third-generation </span><a href="https://aws.amazon.com/ec2/nitro/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Nitro</span></a><span style="font-weight:400;"> SSds. These instances offer up to 22.5 TB of local NVME SSD storage with up to 65% better real-time storage performance per TB and 60 percent lower latency variability compared to the previous generation of i4g. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can get these new shiny instances with up to 96vcpu, 768gb of memory and 22.5 tb of storage.  Usual network caps and ebs caps are there with smaller instances, etc. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon suggests you consider these servers for I/O intensive workloads that require low-latency access to data such as transactions databases, real-time databases, noSQL and real time analytics such as Spark. </span></li>
</ul>
<p><i><span style="font-weight:400;">28:02  Matthew- “</span></i><i><span style="font-weight:400;">I always liked the iSeries. I’ve used them a few times. The free storage there when you don’t care about this type of data and it’s really truly ephemeral or you built it so you have, you know, three NoSQL replicas and you know, one each AZ gives you that free storage layer and doesn’t really cost you that much extra is really nice. And this performance of it was, you know, blazingly fast when I think I did it with the i3. So I can’t imagine what the i8 is.”</span></i></p>
<p><b>29:31 </b><a href="https://aws.amazon.com/blogs/aws/and-thats-a-wrap/" target="_blank" rel="noreferrer noopener"><b>And that’s a wrap!</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Jeff Barr is announcing that after 20 years, 3283 posts, and 1,577,105 words he is wrapping up as lead blogger on the AWS news blog. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Jeff is apparently stepping back to being a builder, and says he went from a developer who could market to a marketer who used to be able to develop.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">While there is nothing wrong with that, he wants to go back to building. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">He will still appear on the AWS OnAir twitch show and will speak at community events around the globe, but will be primarily building. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">But don’t worry – there is a robust AWS News blog team that will keep cranking out new announcements for us to cover. All of us at TCP look forward to seeing what Jeff gets up to next!</span></li>
</ul>
<p><i><span style="font-weight:400;">30:08  Justin – “</span></i><i><span style="font-weight:400;">I look forward to seeing what you’re up to next and if there’s a new lead blogger – or if lead blogger becomes Nova over time.”</span></i></p>
<h2><b>GCP</b></h2>
<p><b>31:28 </b><a href="https://cloud.google.com/blog/products/databases/new-proxy-adapter-eases-cassandra-to-spanner-migration/" target="_blank" rel="noreferrer noopener"><b>New Cassandra to Spanner adapter simplifies Yahoo’s migration journey</b></a></p>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Cassandra, a key value noSQL database, is prized for its speed and scalability, and used broadly for applications that require rapid data retrieval and storage such as Caching, Session management, and real-time analytics. The simple key value pair structure gives you high performance and easy management, especially for large datasets. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">But the simplicity means poor support for complex queries, potential data redundancy and difficulty in modeling intricate relationships. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">To help solve this, they are making it easier than ever to switch from Cassandra to Spanner, with the introduction of the </span><a href="https://github.com/cloudspannerecosystem/cassandra-to-spanner-proxy" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Cassandra to Spanner Proxy Adapter,</span></a><span style="font-weight:400;"> an open source tool for plug and play migrations of Cassandra workloads to Spanner, without any changes to the application logic. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">If you’re wondering if the proxy adapter will scale for your needs, don’t worry. Its battle tested by none other than Yahoo. </span></li>
</ul>
</li>
</ul>
<ul>
<li style="font-weight:400;"><i><span style="font-weight:400;">“The Cassandra Adapter has provided a foundation for migrating the Yahoo Contacts workload from Cassandra to Spanner without changing any of our CQL queries. Our migration strategy has more flexibility, and we can focus on other engineering activities while utilizing the scale, redundancy, and support of Spanner without updating the codebase. Spanner is cost-effective for our specific needs, delivering the performance required for a business of our scale. This transition enables us to maintain operational continuity while optimizing cost and performance.” </span></i><b><i>– Patrick JD Newnan, Principal Product Manager, Core Mail and Analytics, Yahoo </i></b></li>
</ul>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">To get started here are the high level steps to taking advantage of the new Proxy Adapter:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Assess your schema, data model and query patterns to determine which you can simplify after moving to Spanner</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Schema Design. Luckily the table declaration and data types are similar to Cassandras, and with spanner you can take advantage of relational capabilities and features like </span><a href="https://cloud.google.com/spanner/docs/schema-and-data-model#parent-child" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">interleaved tables</span></a><span style="font-weight:400;"> for optimal performance. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Migrate your data through either a bulk load or use Cassandra’s CDC for real time replication. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Setup the proxy adapter and update your Cassandra configuration. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Test it thoroughly – not in production first</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Cutover to the new adapter. </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">32:45  Ryan – “</span></i><i><span style="font-weight:400;">I didn’t work with the context team much when I was there (Yahoo) but I was on the platform engineering team that sort of created the internal services that provided this functionality. And one of the things that was just starting as I was leaving is the migration to Cassandra away from our internal tool. So it’s exciting. That’s how long ago it was. But it’s, from a Google perspective, that’s a fantastic business model, right? If you can get people using your service by making it really easy to adopt, and then as they slowly transition, you know, the application can probably get better functionality and more features by calling it natively. And it’s a lot easier to consume rather than like a giant migration and rewrite type of thing.”</span></i></p>
<p><b>36:28 </b><a href="https://cloud.google.com/blog/products/identity-security/announcing-expanded-custom-org-policy-portfolio-of-supported-products/" target="_blank" rel="noreferrer noopener"><b>Improve your security posture with expanded Custom Org Policy</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is adding support for more than 30 additional services to </span><a href="https://cloud.google.com/resource-manager/docs/organization-policy/creating-managing-custom-constraints" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Custom Org Policies</span></a><span style="font-weight:400;">.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Originally limited to GKE, DataProc, Compute Engine and Cloud Storage, they are adding some very common ones include BigQuery, Cert Manager, KMS, Load Balancing, NGFW, Cloud Run, Cloud SQL, Cloud VPN, Data Flow, Firewstore, IAM, Identity Platform, Redis, PSC, Secret Manager and VPC.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This allows you to enforce conditional restrictions such as specific roles to resources in a project. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can also now set custom org policy to </span><a href="https://cloud.google.com/resource-manager/docs/organization-policy/restricting-domains%5C" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Domain Restricted Sharing</span></a><span style="font-weight:400;"> principals including all users of an org, specific partner identities, service accounts and service agents. </span></li>
</ul>
<p><i><span style="font-weight:400;">37:36  Ryan – “</span></i><i><span style="font-weight:400;">I want to grant everyone primitive roles so I don’t have to manage like very fine grained policies, but I also don’t want them to create, you know, API keys that are going to get proliferated everywhere. And so now with this policy, you can say, you know, you can’t, even with all the permissions, you can’t export this big query dataset to somewhere public or, you know, that, depending on what the conditionals allowed are. So that’s pretty cool. I like that.”</span></i></p>
<p><b>38:29 </b><a href="https://blog.google/technology/google-deepmind/google-gemini-ai-update-december-2024/" target="_blank" rel="noreferrer noopener"><b>Introducing Gemini 2.0: our new AI model for the agentic era</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google says Sit down Nova… announcing a week after re:Invent the </span><a href="https://gemini.google.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Gemini 2.0</span></a><span style="font-weight:400;"> model is available and ready for the agentic era. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Of course, this announcement comes just 2 weeks after Justin cancelled his Gemini subscription. Figures. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">A year ago </span><a href="https://blog.google/technology/ai/google-gemini-ai/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Gemini 1.0 was launched</span></a><span style="font-weight:400;">, with the intent to focus on information as the key to human progress.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The first Gemini model built to be natively multi-modal, Gemini 1.0 and 1.5 drove big advances with multi-modality and long context to understand information across text, video, images, audio and code, and process a lot of it. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Gemini 2.0 is the most capability multi-modal capable model yet per google. With new advances in multi-modality like native image and audio output and native tool use, it will enable them to build new AI agents that bring them closer to their vision of a universal assistant. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Gemini 2.0 flash experimental model will be available to all gemini users. And they are launching a new feature called </span><a href="https://blog.google/products/gemini/google-gemini-deep-research/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Deep Research</span></a><span style="font-weight:400;">, which uses advanced reasoning and long context capabilities to act as research assistant, exploring complex topics and compiling reports on your behalf.  Available in Gemini Advanced today. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">2.0 flash replaces 1.5 flash and outperforms 1.5 and even outperforms 1.5 pro on key benchmarks. (See article for some examples)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Updates to </span><a href="https://deepmind.google/technologies/gemini/project-astra/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Project Astra</span></a><span style="font-weight:400;"> that they announced at I/O.  From feedback they have made improvements with the Gemini 2.0 version of Astra. Better dialogue, new tool use including google search, lens and maps. Better memory allowing up to 10 minutes of in -session memory and improved latency. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Project Mariner is a new agent that helps you accomplish complex tasks.  Starting with your web browser. This research prototype is able to understand and reason across information in your browser screen, including pixels and web elements like text, code, images and forms. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Jules is a new AI agent to assist developers with code. It integrates directly into your github workflow. It can tackle an issue, develop a plan and execute it, all under a developers direction and supervision. </span></li>
</ul>
<p><i><span style="font-weight:400;">40:41  Justin – “</span></i><i><span style="font-weight:400;">I think it’s just the way to announce a new model. And then they give you some purpose-built agents versus having to build agents from scratch, which you said you would do before.”</span></i></p>
<p><b>41:33 </b><a href="https://cloud.google.com/blog/products/compute/trillium-tpu-is-ga/" target="_blank" rel="noreferrer noopener"><b>Announcing the general availability of Trillium, our sixth-generation TPU</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Trilium the 6th generation TPU is now generally available. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Trillium TPUs were used to train the new </span><a href="https://blog.google/technology/google-deepmind/google-gemini-ai-update-december-2024" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Gemini 2.0</span></a><span style="font-weight:400;">, google’s most capable AI model yet. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Some of the key improvements of Trillium:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">4x improvement in training performance</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">3x increase in inference throughput</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">67% increase in energy efficiency</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">4.7x increase in peak compute performance per chip</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Doubled the high bandwidth memory</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Doubled the interchip interconnect bandwidth</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">100k trillium chips in a single Jupiter network fabric</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Up to 2.5x improvement in training performance dollar and up to 1.4x improvement in inference performance dollar. </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">42:15  Jonathan – “…</span></i><i><span style="font-weight:400;">relative to the old ones. Okay. Yeah, that’s a slight red flag for me. Maybe an orange flag. They’re not comparing it with things that people actually know.”</span></i></p>
<p><b>42:46 </b><a href="https://cloud.google.com/blog/topics/google-cloud-next/registration-open-for-google-cloud-next-25/" target="_blank" rel="noreferrer noopener"><b>Registration is open for Google Cloud Next 2025</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google Next returns to Beautiful Las Vegas at Mandalay Bay, April 9th-11th, 2025.  </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">In fact you can </span><a href="https://cloud.withgoogle.com/next?utm_source=cgc-blog&amp;utm_medium=blog&amp;utm_campaign=FY25-Q2-global-EXP106-physicalevent-er-next25-mc&amp;utm_content=reg-is-open-blog-dec-5-cgc-blog&amp;utm_term=-" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">register now</span></a><span style="font-weight:400;"> using the last bits of your 2024 budgets. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Early bird pricing is $999 for a limited time (February 14th or when tickets sell out – whichever comes first.)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Experience AI in action!</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Forge Powerful Connections (meet The Cloud Pod Hosts)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Build and Learn Live. </span></li>
</ul>
<p><i><span style="font-weight:400;">43:08  Ryan – “</span></i><i><span style="font-weight:400;">I’m terrified of what they mean by experience AI in action. Absolutely terrified.”</span></i></p>
<p><b>44:47 </b><a href="https://cloud.google.com/blog/products/infrastructure/google-cloud-announces-41st-cloud-region-in-mexico/" target="_blank" rel="noreferrer noopener"><b>¡Hola Mexico! Google Cloud region in Querétaro now open</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google cloud is opening their 41st cloud region in Queretaro, Mexico. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This is the third cloud region in Latin America, after Santiago, Chile and Sao Paulo, Brazil. </span></li>
</ul>
<p><i><span style="font-weight:400;">45:12  Matthew – “</span></i><i><span style="font-weight:400;">It’s amazing how many regions all these cloud providers have. It used to be like, my god, they’re opening a region. Now it’s like, right, they’re opening another region. Like, is news now, cool.”</span></i></p>
<p><b>45:32 </b><a href="https://cloud.google.com/blog/products/infrastructure-modernization/converge-enterprise-cloud-with-ibm-power-for-google-cloud-ip4g/" target="_blank" rel="noreferrer noopener"><b>(Re)Introducing IBM Power for Google Cloud</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is reminding you that they continue to offer IBM Power systems on the Google Cloud. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Originally launched in 2020, the service then partnered with Converge Technology Solutions in 2022 to upgrade the service by enhancing network connectivity and bringing full support to the IBM i operating system. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Today, their announcing </span><a href="https://console.cloud.google.com/marketplace/product/ibm-sg/ibm-power-cloud-for-gcp" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Converge Enterprise Cloud with IBM Power for Google Cloud</span></a><span style="font-weight:400;">, or simply IP4G supports all three major environments in Power: AIX, IBM i and Linux.  It is now available in 4 regions; two in Canada and two in EMEa – in addition to the two in North America.</span></li>
<li style="font-weight:400;"><i><span style="font-weight:400;">“Infor was one of the original IP4G subscribers, and years later, we continue to run mission-critical IBM Power workloads in IP4G for our clients. IP4G’s availability and performance have more than met our requirements, and we are extremely satisfied with our overall IP4G experience.”</span></i><span style="font-weight:400;"> – Scott Vassh, Vice President, WMS Development</span><span style="font-weight:400;"> </span></li>
</ul>
<p><i><span style="font-weight:400;">46:13  Matthew – “</span></i><i><span style="font-weight:400;">This just feels like you’re not cloud native.”</span></i></p>
<p><b>48:38 </b><a href="https://cloud.google.com/blog/products/sap-google-cloud/compute-engine-x4-machine-types-for-sap-workloads/" target="_blank" rel="noreferrer noopener"><b>Achieve peak SAP S/4HANA performance with Compute Engine X4 </b></a><a href="https://cloud.google.com/blog/products/sap-google-cloud/compute-engine-x4-machine-types-for-sap-workloads/" target="_blank" rel="noreferrer noopener"><b>machines</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">For those of you trying to make HANA scale GCP has a new machine type for you the</span><a href="https://cloud.google.com/blog/products/compute/compute-engine-c3-bare-metal-and-x4-machine-types-now-ga" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;"> X4</span></a><span style="font-weight:400;">.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The X4 is purpose-built to handle the demanding workloads of SAP Hana OLTP and OLAP workloads.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">These machines deliver strong performance, scalability, and reliability empowering businesses to unlock the full potential of their SAP S/4 Hana, SAP Business Suite on SAP HANA and SAP Industry Solutions.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">X4 is also built to support OLAP workloads such as BW/4HANA and BW on HANA</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Please no follow up questions on what any of those HANA things are. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">X4 is available in 16tb, 24tb, and 32tb memory configurations and 960, 1440, 1920 vCPU cores respectively with “standard sizing” SAP certification for SAP HANA OLTP and OLAP capabilities. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The X4 16tb machine achieved an SAP benchmark result that was 8% higher than the closest IaaS solution (thanks Google)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They are also the only cloud provider providing a certified 32TB SAP machine. </span></li>
<li style="font-weight:400;"><i><span style="font-weight:400;">“In the past few years, our SAP HANA systems have seen significant data growth with an increasing need for higher performance. With the 24TB X4 machines and Hyperdisk storage, we have been able to raise the ceiling for our future data growth and are also looking to see improvements in our performance. Added to this, Google’s X4 machines are cloud native, giving us opportunities to automate system management and operations.” </span></i><span style="font-weight:400;">– </span><b>Shawn Lund, US Chief Technology Officer, Deloitte</b></li>
</ul>
<p><i><span style="font-weight:400;">49:08  Justin – “I’ll I’ve learned about SAP HANA is that I don’t ever want to manage it.”</span></i></p>
<p><b>50:42 </b><a href="https://cloud.google.com/blog/products/ai-machine-learning/bringing-ai-agents-to-enterprises-with-google-agentspace/" target="_blank" rel="noreferrer noopener"><b>Introducing Google Agentspace: Bringing AI agents and AI-powered search </b></a><a href="https://cloud.google.com/blog/products/ai-machine-learning/bringing-ai-agents-to-enterprises-with-google-agentspace/" target="_blank" rel="noreferrer noopener"><b>to enterprises</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is introducing </span><a href="https://cloud.google.com/products/agentspace" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Google Agentspace</span></a><span style="font-weight:400;">, which is a *terrible* name. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Agentspace unlocks enterprise expertise for employees with agents that bring together Google’s advanced reasoning, Google-quality search and enterprise data regardless of where it is hosted. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It will make your employees highly productive by helping them accomplish complex tasks that require planning, research, content generation, and actions all with a single prompt. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Agentspace unlocks enterprise expertise by:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">New ways to interact and engage with your enterprise data using NotebookLM.  Including </span><a href="http://notebooklm.google/plus" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">NotebookLM Plus</span></a><span style="font-weight:400;">, your employees can upload information to synthesize, uncover insights, and enjoy new ways of engaging with data, such as podcast audio like summaries and more. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Information discovery across the enterprise including searching unstructured data such as emails and documents. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Expert agents to automate your business functions like expense reports, or other multistep processes. </span></li>
</ul>
</li>
</ul>
<h2><b>Azure</b></h2>
<p><b>52:46 </b><a href="https://techcrunch.com/2024/12/12/microsoft-debuts-phi-4-a-new-generative-ai-model-in-research-preview/" target="_blank" rel="noreferrer noopener"><b>Microsoft debuts Phi-4, a new generative AI model, in research preview</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft has also told Nova to sit down.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Phi-4 is improved in several areas over its predecessor per Microsoft, particularly in math problem solving. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Phi-4 is available in limited access via </span><a href="https://learn.microsoft.com/en-us/azure/ai-studio/what-is-ai-studio" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Azure AI Foundry</span></a><span style="font-weight:400;"> development platform and only for research purposes. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This is Microsoft’s smallest model, coming in at 14 billion parameters in size. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It competes with other small models such as GPT-4o minim, Gemini 2.0 flash and Claude 3.5 Haiku. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft attributes its performance improvements to high-quality synthetic datasets alongside high-quality datasets of human-generated content and some unspecified post-training improvements. </span></li>
</ul>
<p><i><span style="font-weight:400;">53:29  Justin – “</span></i><i><span style="font-weight:400;">They tell you it’s for research purpose only and then it goes and becomes very toxic, you can just say, well, it was only in research.”</span></i></p>
<h2><b>Oracle</b></h2>
<p><b>54:57  </b><a href="https://www.oracle.com/news/announcement/oracle-database-at-aws-available-in-limited-preview-2024-12-02/" target="_blank" rel="noreferrer noopener"><b>Oracle Database@AWS Available in Limited Preview</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">For those waiting with baited breath for Oracle Database@AWS, you might still be waiting unless you can get into the limited preview. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">“</span><i><span style="font-weight:400;">Up until now, it has been impossible to replicate the performance and functionality of Oracle Database on Exadata in AWS</span></i><span style="font-weight:400;">,” said Dave McCarthy, research vice president, IDC. “</span><i><span style="font-weight:400;">With Oracle Database@AWS, customers can finally enjoy that same experience with an easy migration path to the cloud for their on-premises mission-critical workloads. This allows them to reap the benefits of simplifying their daily management and operations to prioritize modernization initiatives.</span></i><span style="font-weight:400;">”</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">“</span><i><span style="font-weight:400;">We want our customers to have access to our data services and to be able to seamlessly use multiple clouds,” said Karan Batta, senior vice president, Oracle Cloud Infrastructure. “This partnership provides a unified way for customers to use the best of Oracle and AWS to take advantage of the latest AI innovations and simplify operations. The introduction of Oracle Exadata Database Service in the AWS US East Region is only the beginning, and we plan to continue to work with AWS to meet customer demand.</span></i><span style="font-weight:400;">”</span></li>
</ul>
<p><i><span style="font-weight:400;">55:42  Justin – “</span></i><i><span style="font-weight:400;">I mean, I feel like they’re actually, I think, I think this is exit. think they’re actually installing exit data in the data center. And this hardware is highly tuned for this purpose.”</span></i></p>
<h2><span style="font-weight:400;">Closing</span></h2>
<p><span style="font-weight:400;">And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</span></p>
]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/1929542/c1e-1xdxbjmqp1s4mx8q-qd4nvzm3upzq-grbgr6.mp3" length="69486675"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 285 of the Explain it to me Like I’m 5 Podcast, formerly known as The Cloud Pod – where the forecast is always cloudy! We’ve got a lot of news this week, including the last of our coverage from re:Invent, ChatGTP Pro, FPGA, and even some major staffing turnovers.
Titles we almost went with this week:

Throw $200 dollars in a fire with ChatGPT Pro
Jeff Barr is wrapped up by Agentic AI
️The Tribble with Trilliums
️The Wind in the Quantum Willows 
⚰️Rise of the dead instances FPGA and PowerPC
Jeff Barr is replaced by Nova
The Cloud Pod: Return of the dead instances types
After 6 year Jeff Barr hands over the reigns to the CloudPod
⌚For our 6th birthday Jeff barr Retires
For our 6th birthday jeff barr delegates announcements to the cloud pod
6 years of meaningless PR drivel
‍6 years of cloud news and we still don’t know what Quantum computing is

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
General News
HAPPY 6th BIRTHDAY! 
2:00 HashiCorp at re:Invent 2024: Security Lifecycle Management with AWS

Hashi is a big sponsor of re:Invent, so of course they had some news of their own to release. 
HCP Vault Secrets auto-rotation is now generally available. 
Dynamic secrets are generally available via HCP Vault Secrets.
Secrets sync will help keep your secrets synced with AWS Secrets Manager. It still appears to be one direction, but you can now also view secrets in AWS Secrets Manager that are managed by vault. 
HCP Vault Radar, now in beta, auto...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/1929542/c1a-k5d5-rk4gdw89bngz-hnjhrv.jpg"></itunes:image>
                                                                            <itunes:duration>00:57:55</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                            </item>
                    <item>
                <title>
                    <![CDATA[284: Amazon Q uses machine learning to get smarter, but Bond’s Q can turn a wristwatch into a laser beam. Your move, AI.]]>
                </title>
                <pubDate>Thu, 19 Dec 2024 12:27:21 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/1924121</guid>
                                    <link>https://tcpfm.castos.com/episodes/284-amazon-q-reinvent</link>
                                <description>
                                            <![CDATA[<p><span style="font-weight:400;">Welcome to episode 284 of The Cloud Pod – where the forecast is always cloudy! Everybody is in the house this week, and it’s a good thing because since we’ve last recorded re:Invent happened, and we have a LOT to talk about. So let’s jump right in! </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">Amazon Steals from Azure…. We Are Doomed </span></li>
<li><span style="font-weight:400;">️The Cloud Pod Can Now Throw Away a lot of Code</span></li>
<li><span style="font-weight:400;">The Cloud Pod Controls the Future</span></li>
<li><span style="font-weight:400;">The Cloud Pod Observes More Insights</span></li>
<li><span style="font-weight:400;">We Are Simplicity</span></li>
<li><span style="font-weight:400;">❌X None of the Above</span></li>
<li><span style="font-weight:400;">Stop Trying to Make Bedrock &amp; Q Happen</span></li>
<li><span style="font-weight:400;">My Head Went SuperNova over all the Q Announcements</span></li>
<li><span style="font-weight:400;">These are Not the Gadgets Bond Needed, Q! </span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>AWS</b><span style="font-weight:400;"> </span></h2>
<p><b>08:12 It’s the re:Invent recap! </b></p>
<p><b>Did you make any announcement predictions? Let’s see how our hosts’  </b><b>predictions stacked up to reality. </b></p>
<p><b>Matt – 1</b></p>
<ul>
<li><span style="font-weight:400;">Large Green Computing Reinvent</span></li>
<li><span style="font-weight:400;">LLM at the Edge</span></li>
<li><b>Something new on S3✅</b></li>
</ul>
<p><b>Ryan (AI) – 1</b></p>
<ul>
<li><span style="font-weight:400;">Improved serverless observability tools</span></li>
<li><b>Expansion of AI Driven workflows in datalakes✅</b></li>
<li><span style="font-weight:400;">Greater Focus on Multi-Account or Multi-region orchestration, centralized compliance management, or enhanced security services</span></li>
</ul>
<p><b>Jonathan – 0</b></p>
<ol>
<li style="font-weight:400;"><span style="font-weight:400;">New Edge Computing Capabilities better global application deployment type features. (Cloudflare competitor maybe)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">New automated cost optimization tools</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Automated RAG/vector to S3</span></li>
</ol>
<p><b>Justin  – 2</b></p>
<ol>
<li style="list-style-type:none;">
<ol>
<li style="font-weight:400;"><span style="font-weight:400;">Managed Backstage or platform like service</span></li>
</ol>
</li>
</ol>
<ul>
<li><b>New LLM multi-modal replacement or upgrade to Titan✅</b></li>
</ul>
<ol>
<li style="font-weight:400;"><span style="font-weight:400;">Competitor VM offering to Broadcom✅</span></li>
</ol>
<p><b>Honorable Mentions:</b></p>
<p><span style="font-weight:400;">Jonathan:</span></p>
<p><span style="font-weight:400;">Deeper integration between serverless and container services</span></p>
<p><span style="font-weight:400;">New region</span></p>
<p><b>Enhanced Observability with AI driven debugging tool✅</b></p>
<p><span style="font-weight:400;">Justin:</span></p>
<p><span style="font-weight:400;">Multicloud management – in a bigger way (Anthos competitor)</span></p>
<p><span style="font-weight:400;">Agentic AI toolings</span></p>
<p><span style="font-weight:400;">New ARM graviton chip</span></p>
<p><b>How many will AI or Artificial Intelligence be said: 45</b></p>
<p><b>Justin – 35✅</b></p>
<p><span style="font-weight:400;">Jonathan – 72</span></p>
<p><b>Pre:Invent</b></p>
<p><span style="font-weight:400;">There were over 180 announcements, and yes – we have them all listed here for you...</span></p>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 284 of The Cloud Pod – where the forecast is always cloudy! Everybody is in the house this week, and it’s a good thing because since we’ve last recorded re:Invent happened, and we have a LOT to talk about. So let’s jump right in! 
Titles we almost went with this week:

Amazon Steals from Azure…. We Are Doomed 
️The Cloud Pod Can Now Throw Away a lot of Code
The Cloud Pod Controls the Future
The Cloud Pod Observes More Insights
We Are Simplicity
❌X None of the Above
Stop Trying to Make Bedrock & Q Happen
My Head Went SuperNova over all the Q Announcements
These are Not the Gadgets Bond Needed, Q! 

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
AWS 
08:12 It’s the re:Invent recap! 
Did you make any announcement predictions? Let’s see how our hosts’  predictions stacked up to reality. 
Matt – 1

Large Green Computing Reinvent
LLM at the Edge
Something new on S3✅

Ryan (AI) – 1

Improved serverless observability tools
Expansion of AI Driven workflows in datalakes✅
Greater Focus on Multi-Account or Multi-region orchestration, centralized compliance management, or enhanced security services

Jonathan – 0

New Edge Computing Capabilities better global application deployment type features. (Cloudflare competitor maybe)
New automated cost optimization tools
Automated RAG/vector to S3

Justin  – 2



Managed Backstage or platform like service




New LLM multi-modal replacement or upgrade to Titan✅


Competitor VM offering to Broadcom✅

Honorable Mentions:
Jonathan:
Deeper integration between serverless and container services
New region
Enhanced Observability with AI driven debugging tool✅
Justin:
Multicloud management – in a bigger way (Anthos competitor)
Agentic AI toolings
New ARM graviton chip
How many will AI or Artificial Intelligence be said: 45
Justin – 35✅
Jonathan – 72
Pre:Invent
There were over 180 announcements, and yes – we have them all listed here for you...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[284: Amazon Q uses machine learning to get smarter, but Bond’s Q can turn a wristwatch into a laser beam. Your move, AI.]]>
                </itunes:title>
                                    <itunes:episode>284</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p><span style="font-weight:400;">Welcome to episode 284 of The Cloud Pod – where the forecast is always cloudy! Everybody is in the house this week, and it’s a good thing because since we’ve last recorded re:Invent happened, and we have a LOT to talk about. So let’s jump right in! </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">Amazon Steals from Azure…. We Are Doomed </span></li>
<li><span style="font-weight:400;">️The Cloud Pod Can Now Throw Away a lot of Code</span></li>
<li><span style="font-weight:400;">The Cloud Pod Controls the Future</span></li>
<li><span style="font-weight:400;">The Cloud Pod Observes More Insights</span></li>
<li><span style="font-weight:400;">We Are Simplicity</span></li>
<li><span style="font-weight:400;">❌X None of the Above</span></li>
<li><span style="font-weight:400;">Stop Trying to Make Bedrock &amp; Q Happen</span></li>
<li><span style="font-weight:400;">My Head Went SuperNova over all the Q Announcements</span></li>
<li><span style="font-weight:400;">These are Not the Gadgets Bond Needed, Q! </span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>AWS</b><span style="font-weight:400;"> </span></h2>
<p><b>08:12 It’s the re:Invent recap! </b></p>
<p><b>Did you make any announcement predictions? Let’s see how our hosts’  </b><b>predictions stacked up to reality. </b></p>
<p><b>Matt – 1</b></p>
<ul>
<li><span style="font-weight:400;">Large Green Computing Reinvent</span></li>
<li><span style="font-weight:400;">LLM at the Edge</span></li>
<li><b>Something new on S3✅</b></li>
</ul>
<p><b>Ryan (AI) – 1</b></p>
<ul>
<li><span style="font-weight:400;">Improved serverless observability tools</span></li>
<li><b>Expansion of AI Driven workflows in datalakes✅</b></li>
<li><span style="font-weight:400;">Greater Focus on Multi-Account or Multi-region orchestration, centralized compliance management, or enhanced security services</span></li>
</ul>
<p><b>Jonathan – 0</b></p>
<ol>
<li style="font-weight:400;"><span style="font-weight:400;">New Edge Computing Capabilities better global application deployment type features. (Cloudflare competitor maybe)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">New automated cost optimization tools</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Automated RAG/vector to S3</span></li>
</ol>
<p><b>Justin  – 2</b></p>
<ol>
<li style="list-style-type:none;">
<ol>
<li style="font-weight:400;"><span style="font-weight:400;">Managed Backstage or platform like service</span></li>
</ol>
</li>
</ol>
<ul>
<li><b>New LLM multi-modal replacement or upgrade to Titan✅</b></li>
</ul>
<ol>
<li style="font-weight:400;"><span style="font-weight:400;">Competitor VM offering to Broadcom✅</span></li>
</ol>
<p><b>Honorable Mentions:</b></p>
<p><span style="font-weight:400;">Jonathan:</span></p>
<p><span style="font-weight:400;">Deeper integration between serverless and container services</span></p>
<p><span style="font-weight:400;">New region</span></p>
<p><b>Enhanced Observability with AI driven debugging tool✅</b></p>
<p><span style="font-weight:400;">Justin:</span></p>
<p><span style="font-weight:400;">Multicloud management – in a bigger way (Anthos competitor)</span></p>
<p><span style="font-weight:400;">Agentic AI toolings</span></p>
<p><span style="font-weight:400;">New ARM graviton chip</span></p>
<p><b>How many will AI or Artificial Intelligence be said: 45</b></p>
<p><b>Justin – 35✅</b></p>
<p><span style="font-weight:400;">Jonathan – 72</span></p>
<p><b>Pre:Invent</b></p>
<p><span style="font-weight:400;">There were over 180 announcements, and yes – we have them all listed here for you. You’re welcome. </span></p>
<p><b>17:12 </b><a href="https://aws.amazon.com/blogs/aws/time-based-snapshot-copy-for-amazon-ebs/" target="_blank" rel="noreferrer noopener"><b>Time-based snapshot copy for Amazon EBS</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Now you can specify a desired completion duration, from 15 minutes to 48 hours when you copy an Amazon EBS snapshot within or between Amazon regions or accounts. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This will allow you to meet your time-based compliance and business requirements for critical workloads, mostly around DR capabilities. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">We’re just glad to see this one finally, because having it built in directly to the console to guarantee that EBS snapshots make it to the other region is a big quality of life enhancement.</span></li>
</ul>
<p><a href="https://aws.amazon.com/blogs/aws/announcing-future-dated-amazon-ec2-on-demand-capacity-reservations/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Announcing future-dated Amazon EC2 On-Demand Capacity Reservations</span></a></p>
<p><a href="https://aws.amazon.com/blogs/aws/introducing-a-new-experience-for-aws-system-manager/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Introducing a new experience for AWS Systems Manager</span></a><span style="font-weight:400;">  </span></p>
<p><a href="https://aws.amazon.com/blogs/aws/introducing-new-capabilities-to-aws-cloudtrail-lake-to-enhance-your-cloud-visibility-and-investigations/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Introducing new capabilities to AWS CloudTrail Lake to enhance your cloud visibility and investigations</span></a></p>
<p><a href="https://aws.amazon.com/blogs/aws/improve-your-app-authentication-workflow-with-new-amazon-cognito-features/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Improve your app authentication workflow with new Amazon Cognito features</span></a><span style="font-weight:400;"> </span></p>
<p><a href="https://aws.amazon.com/blogs/aws/track-performance-of-serverless-applications-built-using-aws-lambda-with-application-signals/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Track performance of serverless applications built using AWS Lambda with Application Signals</span></a><span style="font-weight:400;"> </span></p>
<p><a href="https://aws.amazon.com/blogs/aws/announcing-a-visual-update-to-the-aws-management-console-preview/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Announcing a visual update to the AWS Management Console (preview)</span></a><span style="font-weight:400;"> </span></p>
<p><a href="https://aws.amazon.com/blogs/aws/introducing-amazon-cloudfront-vpc-origins-enhanced-security-and-streamlined-operations-for-your-applications/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Introducing Amazon CloudFront VPC origins: Enhanced security and streamlined operations for your applications</span></a></p>
<p><a href="https://aws.amazon.com/blogs/aws/amazon-cloudfront-now-accepts-your-applications-grpc-calls/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon CloudFront now accepts your applications’ gRPC calls</span></a></p>
<p><b>20:50</b> <a href="https://www.aboutamazon.com/news/aws/amazon-invests-additional-4-billion-anthropic-ai?utm_source=convertkit&amp;utm_medium=email&amp;utm_campaign=AWS%20Graviton%20Weekly%20#%20114%20-%2015734543" target="_blank" rel="noreferrer noopener"><b>Amazon and Anthropic deepen strategic collaboration</b></a> <b>  </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon and Anthropic deepened their strategic collaboration with another $4 billion investment from Amazon to also use their Neutronium chips, which came up later on Mainstage at Monday Night Live and as well as on Matt’s presentation.</span></li>
</ul>
<p><a href="https://aws.amazon.com/blogs/aws/introducing-amazon-guardduty-extended-threat-detection-aiml-attack-sequence-identification-for-enhanced-cloud-security/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Introducing Amazon GuardDuty Extended Threat Detection: AI/ML attack sequence identification for enhanced cloud security</span></a></p>
<p><a href="https://aws.amazon.com/blogs/aws/container-insights-with-enhanced-observability-now-available-in-amazon-ecs/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Container Insights with enhanced observability now available in Amazon ECS</span></a></p>
<p><a href="https://aws.amazon.com/blogs/aws/aws-clean-rooms-now-supports-multiple-clouds-and-data-sources/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Clean Rooms now supports multiple clouds and data sources</span></a></p>
<p><b>21:34 </b><a href="https://aws.amazon.com/blogs/aws/new-physical-aws-data-transfer-terminals-let-you-upload-to-the-cloud-faster/" target="_blank" rel="noreferrer noopener"><b>New physical AWS Data Transfer Terminals let you upload to the cloud </b></a><a href="https://aws.amazon.com/blogs/aws/new-physical-aws-data-transfer-terminals-let-you-upload-to-the-cloud-faster/" target="_blank" rel="noreferrer noopener"><b>faster</b></a><b>   </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">New physical AWS data transfer terminals let you upload to the cloud faster. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">So, we got rid of the trucks. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">We got rid of the disks that we send you in the mail. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">BUT If you have your own disks that you’d like to bring to a physical location in either Los Angeles or New York, you can connect them with the cable directly to the Amazon cloud through a public endpoint that is available. (We assume it’s in a secure building or something.)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Basically you reserve a time slot to visit your nearest location and upload that data quickly to your AWS public endpoint. </span></li>
</ul>
<p><a href="https://aws.amazon.com/blogs/aws/enhance-your-productivity-with-new-extensions-and-integrations-in-amazon-q-business/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Enhance your productivity with new extensions and integrations in Amazon Q Business</span></a></p>
<p><a href="https://aws.amazon.com/blogs/aws/announcing-amazon-fsx-intelligent-tiering-a-new-storage-class-for-fsx-for-openzfs/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Announcing Amazon FSx Intelligent-Tiering, a new storage class for FSx for OpenZFS</span></a></p>
<p><a href="https://aws.amazon.com/blogs/aws/new-rag-evaluation-and-llm-as-a-judge-capabilities-in-amazon-bedrock/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">New RAG evaluation and LLM-as-a-judge capabilities in Amazon Bedrock</span></a></p>
<p><a href="https://aws.amazon.com/blogs/aws/securely-share-aws-resources-across-vpc-and-account-boundaries-with-privatelink-vpc-lattice-eventbridge-and-step-functions/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Securely share AWS resources across VPC and account boundaries with PrivateLink, VPC Lattice, EventBridge, and Step Functions</span></a></p>
<p><b>23: 52 </b><a href="https://aws.amazon.com/blogs/aws/new-aws-security-incident-response-helps-organizations-respond-to-and-recover-from-security-events/" target="_blank" rel="noreferrer noopener"><b>New AWS Security Incident Response helps organizations respond to and </b></a></p>
<p><a href="https://aws.amazon.com/blogs/aws/new-aws-security-incident-response-helps-organizations-respond-to-and-recover-from-security-events/" target="_blank" rel="noreferrer noopener"><b>recover from security events</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS announced that the new AWS Security Incident Response Service designed to help organizations manage security events quickly and effectively, services purpose-built to help customers prepare for, respond to, and recover from various security events, including account takeovers, data breaches, and ransomware is now available. It essentially automates the triage, and there’s 24 hour customer service for assistance. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Your security response team will appreciate this one. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">We approve. </span></li>
</ul>
<p><a href="https://aws.amazon.com/blogs/aws/new-apis-in-amazon-bedrock-to-enhance-rag-applications-now-available/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">New APIs in Amazon Bedrock to enhance RAG applications, now available</span></a></p>
<p><a href="https://aws.amazon.com/blogs/aws/connect-users-to-data-through-your-apps-with-storage-browser-for-amazon-s3/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Connect users to data through your apps with Storage Browser for Amazon S3</span></a></p>
<p><a href="https://aws.amazon.com/blogs/aws/introducing-new-partyrock-capabilities-and-free-daily-usage/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Introducing new PartyRock capabilities and free daily usage</span></a></p>
<p><a href="https://aws.amazon.com/blogs/aws/amazon-memorydb-multi-region-is-now-generally-available/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon MemoryDB Multi-Region is now generally available</span></a></p>
<p><a href="https://aws.amazon.com/blogs/aws/introducing-default-data-integrity-protections-for-new-objects-in-amazon-s3/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Introducing default data integrity protections for new objects in Amazon S3</span></a></p>
<p><a href="https://aws.amazon.com/blogs/aws/aws-data-migration-service-improves-database-schema-conversion-with-generative-ai/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Database Migration Service now automates time-intensive schema conversion tasks using generative AI</span></a></p>
<p><a href="https://aws.amazon.com/blogs/aws/simplify-governance-with-declarative-policies/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Simplify governance with declarative policies</span></a></p>
<p><a href="https://aws.amazon.com/blogs/aws/aws-verified-access-now-supports-secure-access-to-resources-over-non-https-protocols/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Verified Access now supports secure access to resources over non-HTTP(S) protocols (in preview)</span></a><span style="font-weight:400;">     </span></p>
<p><a href="https://aws.amazon.com/blogs/aws/announcing-aws-transfer-family-web-apps-for-fully-managed-amazon-s3-file-transfers/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Announcing AWS Transfer Family web apps for fully managed Amazon S3 file transfers</span></a></p>
<p><a href="https://aws.amazon.com/blogs/aws/introducing-amazon-opensearch-service-zero-etl-integration-for-amazon-security-lake/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Introducing Amazon OpenSearch Service and Amazon Security Lake integration to simplify security analytics</span></a></p>
<p><a href="https://aws.amazon.com/blogs/aws/use-your-on-premises-infrastructure-in-amazon-eks-clusters-with-amazon-eks-hybrid-nodes/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Use your on-premises infrastructure in Amazon EKS clusters with Amazon EKS Hybrid Nodes</span></a></p>
<p><a href="https://aws.amazon.com/blogs/aws/streamline-kubernetes-cluster-management-with-new-amazon-eks-auto-mode/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Streamline Kubernetes cluster management with new Amazon EKS Auto Mode</span></a></p>
<p><a href="https://aws.amazon.com/blogs/aws/introducing-storage-optimized-amazon-ec2-i8g-instances-powered-by-aws-graviton4-processors-and-3rd-gen-aws-nitro-ssds/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Introducing storage optimized Amazon EC2 I8g instances powered by AWS Graviton4 processors and 3rd gen AWS Nitro SSDs</span></a></p>
<p><a href="https://aws.amazon.com/blogs/aws/now-available-storage-optimized-amazon-ec2-i7ie-instances/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Now available: Storage optimized Amazon EC2 I7ie instances</span></a></p>
<p><a href="https://aws.amazon.com/blogs/aws/new-amazon-cloudwatch-database-insights-comprehensive-database-observability-from-fleets-to-instances/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">New Amazon CloudWatch Database Insights: Comprehensive database observability from fleets to instances</span></a></p>
<p><a href="https://aws.amazon.com/blogs/aws/new-amazon-cloudwatch-and-amazon-opensearch-service-launch-an-integrated-analytics-experience/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">New Amazon CloudWatch and Amazon OpenSearch Service launch an integrated analytics experience</span></a></p>
<p><a href="https://aws.amazon.com/blogs/aws/amazon-fsx-for-lustre-unlocks-full-network-bandwidth-and-gpu-performance/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon FSx for Lustre increases throughput to GPU instances by up to 12x</span></a><span style="font-weight:400;">         </span></p>
<p><b>Networking</b></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/block-public-access-amazon-virtual-private-cloud/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS announces Block Public Access for Amazon Virtual Private Cloud</span></a><span style="font-weight:400;"> </span></p>
<p><b>25:39</b> <a href="https://aws.amazon.com/about-aws/whats-new/2024/11/aws-privatelink-across-region-connectivity/" target="_blank" rel="noreferrer noopener"><b>AWS PrivateLink now supports cross-region connectivity</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">PrivateLink now supports cross-region connectivity. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Until now, interface VPC endpoints only support connectivity to VPC endpoint services in the same region. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This allows neighboring customers to connect to VPC endpoint services hosted in other AWS regions in the same AWS partition over interface endpoints. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">We like this one, because some of the limitations of being restricted to specific regional targets was a bit difficult.</span></li>
</ul>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/aws-cloud-wan-on-premises-connectivity-direct-connect/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Cloud WAN simplifies on-premises connectivity via AWS Direct Connect</span></a><span style="font-weight:400;"> </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/aws-application-load-balancer-certificate-authority-advertisement/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Application Load Balancer introduces Certificate Authority advertisement to simplify client behavior while using Mutual TLS</span></a></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/cross-zone-application-load-balancer-zonal-shift-autoshift/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Cross-zone enabled Application Load Balancer now supports zonal shift and zonal autoshift</span></a><span style="font-weight:400;"> </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/aws-application-load-balancer-header-modification-enhanced-traffic-control-security/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Application Load Balancer introduces header modification for enhanced traffic control and security</span></a><span style="font-weight:400;"> </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-vpc-ipam-organizational-units-aws-organizations/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon VPC IPAM now supports enabling IPAM for organizational units within AWS Organizations</span></a><span style="font-weight:400;"> </span></p>
<p><b>26:23 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-cloudfront-vpc-origins/" target="_blank" rel="noreferrer noopener"><b>Amazon CloudFront announces VPC origins</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon CloudFront now announces VPC Origins. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This is a feature Justin especially has wanted forever. It basically allows a customer to use CloudFront to deliver content from applications hosted in VPC private subnets, and with the VPC Origins, customers can have their ALB, NLB, or EC2 instance in that private subnet that’s accessible only through their CloudFront distribution. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Now you don’t have to do the dance where you go from CloudFront to a public endpoint to go to your private endpoint anymore. Woohoo!</span></li>
</ul>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/load-balancer-capacity-unit-reservation-application-balancers/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Load Balancer Capacity Unit Reservation for Application and Network Load Balancers</span></a></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-cloudfront-supports-grpc-delivery/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon CloudFront now supports gRPC delivery</span></a><span style="font-weight:400;">  </span></p>
<p><b>Compute</b></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-ec2-auto-scaling-highly-responsive-scaling-policies/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon EC2 Auto Scaling introduces highly responsive scaling policies</span></a><span style="font-weight:400;">  </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-ec2-provisioning-control-instances-on-demand-capacity/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon EC2 introduces provisioning control to launch instances on On-Demand Capacity</span></a></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/aws-resilience-hub-summary-view/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Resilience Hub introduces a summary view</span></a><span style="font-weight:400;">  </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-ec2-cpu-performance-attribute-instance-type-selection/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon EC2 added New CPU-Performance Attribute for Instance Type Selection</span></a><span style="font-weight:400;"> </span></p>
<p><b>27:36 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-ec2-lineage-information-amis/" target="_blank" rel="noreferrer noopener"><b>Amazon EC2 now provides lineage information for your AMIs</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon EC2 has taken the great container lineage capabilities you have there, where you can see where the container got created and then how many times people added or modified it. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They brought that to you AMIs. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">So if you want AMI lineage, you can now get that. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can easily trace and copy or find the derived AMI back to the original AMI source through the records, which is important for some organizations who have heavy duty FOM requirements and/or they have image factory type solutions that basically create golden images of AMIs and they need to be able to see if it’s the one.</span></li>
</ul>
<p><i><span style="font-weight:400;">37:14  Matthew – “…</span></i><i><span style="font-weight:400;">this solves a Lambda that they posted, I think, probably like five, seven years ago, which was just a Lambda that watches the public endpoints, IP addresses for CloudFront, and just would update your security group rules so that you could only have that accessing it. I think I’ve deployed like 30 times, and every time you have to do a security group expansion, because it’s over 50 IP ranges, it’s always fun.”</span></i></p>
<p><b>Databases</b></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/provisioned-tcus-amazon-timestream-liveanalytics/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Announcing Provisioned Timestream Compute Units (TCUs) for Amazon Timestream for LiveAnalytics</span></a><span style="font-weight:400;"> </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-redshift-multi-data-warehouse-through-data-sharing/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Redshift multi-data warehouse writes through data sharing is now generally available</span></a></p>
<p><b>28:25</b> <a href="https://aws.amazon.com/about-aws/whats-new/2024/11/aws-dms-data-masking/" target="_blank" rel="noreferrer noopener"><b>AWS DMS now supports Data Masking</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon database migration service now supports data masking, allowing you to automatically remove sensitive data at the column level during migrations to help comply with GDPR, et cetera. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This makes DMS now even more interesting if you’re trying to keep a dev environment replicated with somewhat accurate production data without having actual customer data there.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">DMS is more than just migrations; it can also keep things in sync, so this is a nice capability, that you don’t have to build in glue or some other terrible ETL process.</span></li>
</ul>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/aws-dms-improved-performance-data-validation/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS DMS now delivers improved performance for data validation</span></a><span style="font-weight:400;"> </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-rds-blue-green-deployments-green-storage-performant-switchover/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon RDS Blue/Green Deployments Green storage fully performant prior to switchover</span></a><span style="font-weight:400;"> </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/elasticache-version-8-0-for-valkey-scaling-memory-efficiency/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon ElastiCache version 8.0 for Valkey brings faster scaling and improved memory efficiency</span></a><span style="font-weight:400;"> </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-rds-blue-green-deployments-storage-volume-shrink/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon RDS Blue/Green Deployments support storage volume shrink</span></a><span style="font-weight:400;"> </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-aurora-serverless-v2-scaling-zero-capacity/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Aurora Serverless v2 supports scaling to zero capacity</span></a><span style="font-weight:400;"> </span></p>
<p><b>Storage</b></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-ebs-time-based-copy-snapshots/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon EBS announces Time-based Copy for EBS Snapshots</span></a><span style="font-weight:400;"> </span></p>
<p><b>29:01 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-s3-enforcement-conditional-write-operations-general-purpose-buckets/" target="_blank" rel="noreferrer noopener"><b>Amazon S3 now supports enforcement of conditional write operations for S3 general purpose buckets</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon S3 now supports enforcement of conditional write operations for S3 general purpose buckets. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Using bucket policies, this enforcement of conditional writes, you can mandate the S3 check the existence of an object before creating it in your bucket. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Then you can also mandate the S3 check the state of the object content before updating your bucket. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This will help you simplify distributed apps for preventing unintentional data overwrites, especially in high concurrency and multi-writer scenarios. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">So… it only took them how many years to fix this problem? Thanks. </span></li>
</ul>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-s3-functionality-conditional-writes/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon S3 adds new functionality for conditional writes</span></a><span style="font-weight:400;">  </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/mountpoint-amazon-s3-high-performance-shared-cache/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Mountpoint for Amazon S3 now supports a high performance shared cache</span></a><span style="font-weight:400;"> </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/aws-backup-s3-adds-new-restore-parameter/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Backup for Amazon S3 adds new restore parameter</span></a><span style="font-weight:400;"> </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/customized-delete-protection-amazon-ebs-snapshots-ebs-backed-amis/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Announcing customized delete protection for Amazon EBS Snapshots and EBS-backed AMIs</span></a><span style="font-weight:400;"> </span></p>
<p><b>Containers</b></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-ecs-az-rebalancing-speeds-mean-time-recovery-event/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon ECS announces AZ rebalancing that speeds up mean time to recovery after an infrastructure event</span></a></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/predictive-scaling-for-amazon-ecs-services/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS announces support for predictive scaling for Amazon ECS services</span></a><span style="font-weight:400;"> </span></p>
<p><b>Devops/System Management</b></p>
<p><b>30:03 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/aws-systems-manager-experience-simplifying-node-management/" target="_blank" rel="noreferrer noopener"><b>The new AWS Systems Manager experience: Simplifying node management</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">They now streamline your node management, and now provide you access to see if it’s an EC2 instance, if it’s an on-prem instance, or if it’s a hybrid instance on top of Outpost or something else. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This wasn’t quite what we were looking for in the systems manager improvement camp, but that’s what they gave us. Wop wop. </span></li>
</ul>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/aws-cloudformation-hooks-cloud-control-api-configurations-evaluation/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS CloudFormation Hooks now allows AWS Cloud Control API resource configurations evaluation</span></a><span style="font-weight:400;"> </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/aws-cloudformation-recycle-bin-rules/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Announcing AWS CloudFormation support for Recycle Bin rules</span></a><span style="font-weight:400;"> </span></p>
<p><b>Observability</b></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/application-signals-otel-x-ray-otlp-endpoint-traces/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Application Signals provides OTEL support via X-Ray OTLP endpoint for traces</span></a><span style="font-weight:400;"> </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-cloudwatch-metrics-aws-lambda-esms/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Announcing new Amazon CloudWatch Metrics for AWS Lambda Event Source Mappings (ESMs)</span></a><span style="font-weight:400;"> </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-cloudwatch-visibility-application-transactions/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon CloudWatch launches full visibility into application transactions</span></a><span style="font-weight:400;"> </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-cloudwatch-internet-monitor-aws-local-zones-vpc-subnets/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon CloudWatch Internet Monitor adds AWS Local Zones support for VPC subnets</span></a><span style="font-weight:400;"> </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-cloudwatch-application-signals-runtime-metrics/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon CloudWatch Application Signals launches support for Runtime Metrics</span></a><span style="font-weight:400;"> </span></p>
<p><b>AI/Machine Learning</b></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-bedrock-agents-custom-orchestration/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Bedrock Agents now supports custom orchestration</span></a><span style="font-weight:400;">   </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/introducing-advanced-scaling-amazon-emr-managed-scaling/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Introducing Advanced Scaling in Amazon EMR Managed Scaling</span></a><span style="font-weight:400;"> </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/inlineagents-agents-amazon-bedrock/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Announcing InlineAgents for Agents for Amazon Bedrock</span></a><span style="font-weight:400;"> </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-ec2-capacity-blocks-instant-start-times-extensions/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon EC2 Capacity Blocks now supports instant start times and extensions</span></a><span style="font-weight:400;"> </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-bedrock-flows-new-capabilities/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Bedrock Flows is now generally available with two new capabilities</span></a><span style="font-weight:400;"> </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/prompt-optimization-preview-amazon-bedrock/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Introducing Prompt Optimization in Preview in Amazon Bedrock</span></a><span style="font-weight:400;"> </span></p>
<p><b>Q</b></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-q-business-browser-extension/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Q Business now available as browser extension</span></a><span style="font-weight:400;"> </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-q-developer-pro-tier-dashboard-user-activity/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Q Developer Pro tier introduces a new, improved dashboard for user activity</span></a></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-q-developer-personalized-chat-answers-console-context/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Q Developer can now provide more personalized chat answers based on console context</span></a><span style="font-weight:400;">  </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-q-apps-private-sharing/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Introducing Amazon Q Apps with private sharing</span></a></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-q-apps-data-collection-preview/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Q Apps introduces data collection (Preview)</span></a></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-q-developer-chat-customizations-generally-available/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Q Developer Chat Customizations is now generally available</span></a><span style="font-weight:400;"> </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/smartsheet-connector-amazon-q-business-generally-available/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Smartsheet connector for Amazon Q Business is now generally available</span></a></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/ses-sail-manager-delivery-email-amazon-q-business-applications/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">SES Mail Manager adds delivery of email to Amazon Q Business applications</span></a><span style="font-weight:400;">  </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/aws-announces-amazon-q-account-console-mobile-app/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Announces Amazon Q account resources chat in the AWS Console Mobile App</span></a><span style="font-weight:400;"> </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazonq-business-answers-tables-embedded-documents/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Q Business now supports answers from tables embedded in documents</span></a><span style="font-weight:400;"> </span></p>
<p><b>Finops</b></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-q-developer-natural-language-cost-analysis/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Q Developer now provides natural language cost analysis</span></a><span style="font-weight:400;">  </span></p>
<p><b>31:51</b> <a href="https://aws.amazon.com/about-aws/whats-new/2024/11/aws-enhanced-root-cause-insights-cost-anomalies/" target="_blank" rel="noreferrer noopener"><b>AWS delivers enhanced root cause insights to help explain cost anomalies</b></a> <a href="https://aws.amazon.com/about-aws/whats-new/2024/11/aws-billing-cost-management-savings-plans-purchase-analyzer/" target="_blank" rel="noreferrer noopener"><b>AWS Billing and Cost Management announces Savings Plans Purchase Analyzer</b></a><b> </b></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/aws-compute-optimizer-idle-resource-recommendation/" target="_blank" rel="noreferrer noopener"><b>AWS Compute Optimizer now supports idle resource recommendation</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">New enhanced root cause insights are available to help explain cost anomalies. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They’ll tell you why your cost has ballooned three or four thousand dollars, without you having to go figure it out yourself, which is handy. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They also gave you a new savings plan purchase analyzer, which allows you to quickly estimate the cost, coverage, and utilization impact of your plan savings plan purchase. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">That’s sort of the opposite of giving you the prediction –  or like giving you the recommender is now saying, okay, if you bought the recommendation, here’s what it actually would do. So now you get both directions of modeling, which is good. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">AWS compute optimizer now supports idle resource recommendations for you as well. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">So three nice Finops improvements.</span></li>
</ul>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/12/aws-invoice-configuration/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS announces Invoice Configuration</span></a><span style="font-weight:400;"> </span></p>
<p><b>Quicksight</b></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-quicksight-import-visual-capability-preview/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon QuickSight now supports import visual capability (preview)</span></a></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-quicksight-highcharts-visual-preview/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon QuickSight launches Highcharts visual (preview)</span></a></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-quicksight-image-component/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon QuickSight launches Image component</span></a></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-quicksight-layer-map/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon QuickSight launches Layer Map</span></a></p>
<p><b>Serverless</b></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/aws-lambda-provisioned-mode-kafka-esms/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Lambda announces Provisioned Mode for Kafka event source mappings (ESMs)</span></a><span style="font-weight:400;">    </span></p>
<p><b>34:25</b> <a href="https://aws.amazon.com/about-aws/whats-new/2024/11/aws-lambda-application-performance-monitoring-cloudwatch-signals/" target="_blank" rel="noreferrer noopener"><b>AWS Lambda supports application performance monitoring (APM) via CloudWatch Application Signals</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon Lambda now supports application performance monitoring or APM via CloudWatch application signals. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This gives you the ability to see the health and performance of the service application built using Lambda, and makes it easy for you to identify and troubleshoot performance issues to minimize the MTTR and operational costs of running your service app, which you only wanted for a thousand years to have better telemetry inside of Lambda. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">We’ve only wanted this for a thousand years, so thank you for finally delivering that.</span></li>
</ul>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/aws-lambda-s3-failed-event-destination-stream-event-sources/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Lambda supports Amazon S3 as a failed-event destination for asynchronous and stream event sources</span></a><span style="font-weight:400;"> </span></p>
<p><span style="font-weight:400;">Security</span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/new-feature-tiers-essentials-plus-amazon-cognito/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Announcing new feature tiers: Essentials and Plus for Amazon Cognito</span></a><span style="font-weight:400;"> </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/aws-amplify-passwordless-authentication-amazon-cognito/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Amplify introduces passwordless authentication with Amazon Cognito</span></a><span style="font-weight:400;">  </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-cognito-passwordless-authentication-low-friction-secure-logins/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Cognito now supports passwordless authentication for low-friction and secure logins</span></a><span style="font-weight:400;"> </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/aws-control-tower-hooks-management-proactive-controls-support-additional-regions/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Control Tower improves Hooks management for proactive controls and extends proactive controls support in additional regions</span></a><span style="font-weight:400;">  </span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/12/amazon-ec2-allowed-amis-enhance-ami-governance/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon EC2 introduces Allowed AMIs to enhance AMI governance</span></a><span style="font-weight:400;"> </span></p>
<p><span style="font-weight:400;">Other</span></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-workspaces-rocky-linux/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon WorkSpaces introduces support for Rocky Linux</span></a><span style="font-weight:400;"> </span></p>
<h2><b>RE:INVENT</b></h2>
<p><b>36:07 Monday Night Live – Said AI or Artificial Intelligence – 10</b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Only one announcement during MNL. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">If you’re a hardware nerd, this is definitely the talk to watch. </span></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/about-aws/whats-new/2024/12/latency-optimized-inference-foundation-models-amazon-bedrock/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Introducing latency-optimized inference for foundation models in Amazon Bedrock</span></a><span style="font-weight:400;"> </span></li>
</ul>
<p><i><span style="font-weight:400;">37:14  Jonathan – “</span></i><i><span style="font-weight:400;">It’s hard to connect to as a consumer or a user because it’s not off the shelf stuff. You don’t read about it in PC Magazine and then think, wow, Amazon’s deployed 10,000 of these things. It’s like, no, they built this thing. They designed this thing for this very specific purpose and it’s absolutely amazing and you’re never going to get your hands on it.”</span></i></p>
<p><b>38:02 Tuesday – Matt Garman – Said AI or Artificial Intelligence – 19</b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Probably the worst “what is AWS” intro, but we’ll forgive him for that. </span></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/introducing-amazon-nova-frontier-intelligence-and-industry-leading-price-performance/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Introducing Amazon Nova: Frontier intelligence and industry leading price performance</span></a><span style="font-weight:400;"> </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon Nova – replacement for Titan. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Has 4 models; will be a complex reasoning model. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Nova also understands rag functions, and has multiple additional components, including:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Nova Canvas – image generating function </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Nova Reel – state of the art video generation model (Hello, Amazon Prime content.) </span></li>
</ul>
</li>
</ul>
<p><b>43:39 S3 Tables </b></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/introducing-queryable-object-metadata-for-amazon-s3-buckets-preview/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Introducing queryable object metadata for Amazon S3 buckets (preview)</span></a></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/new-amazon-s3-tables-storage-optimized-for-analytics-workloads/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">New Amazon S3 Tables: Storage optimized for analytics workloads</span></a><span style="font-weight:400;">  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This is their new native Apache iceberg format support inside of S3. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It comes as a competitor to Parquet files, and allows you to have basically table buckets that can act as iceberg tables, which can be handy for your AI ML use cases and training models. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They also announced inquirable object metadata for Amazon S3 buckets, which the guys kind of mocked earlier.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This is basically providing a rich metadata service that’ll allow you to store 20 elements, including the bucket name, object key, creation, modification time, storage class, encryption status, tags, and user metadata that you can define. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They showed on stage an example of this using a hike image and basically showed several of the parameters of an image, including the image size, et cetera. </span></li>
</ul>
<p><i><span style="font-weight:400;">44:51  Ryan – “</span></i><i><span style="font-weight:400;">Yeah, I can’t remember if we were actually making fun of this during the show or when we were just preparing for the show, but it’s definitely a feature for Amazon themselves because it was… I’ve abused Amazon as three queries for this exact purpose. I’m sure I wasn’t alone.”</span></i></p>
<p><b>45:35 Q Continuum</b></p>
<p><span style="font-weight:400;">Matt went a little off the deep end t walking about Q and Bedrock stuff, including: </span></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/amazon-q-business-is-adding-new-workflow-automation-capability-and-50-action-integrations/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Q Business is adding new workflow automation capability and 50+ action integrations</span></a></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/new-capabilities-from-amazon-q-business-enable-isvs-to-enhance-generative-ai-experiences/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">New capabilities from Amazon Q Business enable ISVs to enhance generative AI experiences</span></a></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/new-amazon-q-developer-agent-capabilities-include-generating-documentation-code-reviews-and-unit-tests/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">New Amazon Q Developer agent capabilities include generating documentation, code reviews, and unit tests</span></a></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/announcing-amazon-q-developer-transformation-capabilities-for-net-preview/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Announcing Amazon Q Developer transformation capabilities for .NET (preview)</span></a></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/announcing-amazon-q-developer-transformation-capabilities-for-net-mainframe-and-vmware-workloads-preview/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Announcing Amazon Q Developer transformation capabilities for .NET, mainframe, and VMware workloads (preview)</span></a></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/investigate-and-remediate-operational-issues-with-amazon-q-developer/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Investigate and remediate operational issues with Amazon Q Developer (in preview)</span></a></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/introducing-gitlab-duo-with-amazon-q/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Introducing GitLab Duo with Amazon Q</span></a><span style="font-weight:400;">       </span></li>
</ul>
<p><b>Bedrock</b></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/introducing-multi-agent-collaboration-capability-for-amazon-bedrock/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Introducing multi-agent collaboration capability for Amazon Bedrock (preview)</span></a><span style="font-weight:400;"> </span></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/prevent-factual-errors-from-llm-hallucinations-with-mathematically-sound-automated-reasoning-checks-preview/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Prevent factual errors from LLM hallucinations with mathematically sound Automated Reasoning checks (preview)</span></a></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/build-faster-more-cost-efficient-highly-accurate-models-with-amazon-bedrock-model-distillation-preview/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Build faster, more cost-efficient, highly accurate models with Amazon Bedrock Model Distillation (preview)</span></a><span style="font-weight:400;"> </span></li>
</ul>
<p><b>50:39 Sagemaker </b><span style="font-weight:400;">– the next kitchen sink! It’s going to be really confusing; don’t say we didn’t warn you. </span></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/introducing-the-next-generation-of-amazon-sagemaker-the-center-for-all-your-data-analytics-and-ai/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Introducing the next generation of Amazon SageMaker: The center for all your data, analytics, and AI</span></a><span style="font-weight:400;"> </span></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/introducing-amazon-sagemaker-lakehouse-support-for-zero-etl-integrations-from-applications/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon SageMaker Lakehouse and Amazon Redshift supports zero-ETL integrations from applications</span></a><span style="font-weight:400;"> </span></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/amazon-sagemaker-lakehouse-integrated-access-controls-now-available-in-amazon-athena-federated-queries/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon SageMaker Lakehouse integrated access controls now available in Amazon Athena federated queries</span></a></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/simplify-analytics-and-aiml-with-new-amazon-sagemaker-lakehouse/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Simplify analytics and AI/ML with new Amazon SageMaker Lakehouse</span></a></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/new-amazon-dynamodb-zero-etl-integration-with-amazon-sagemaker-lakehouse/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">New Amazon DynamoDB zero-ETL integration with Amazon SageMaker Lakehouse</span></a></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/discover-govern-and-collaborate-on-data-and-ai-securely-with-amazon-sagemaker-data-and-ai-governance/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Discover, govern, and collaborate on data and AI securely with Amazon SageMaker Data and AI Governance</span></a></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/announcing-the-general-availability-of-data-lineage-in-the-next-generation-of-amazon-sagemaker-and-amazon-datazone/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Announcing the general availability of data lineage in the next generation of Amazon SageMaker and Amazon DataZone</span></a></li>
</ul>
<p><i><span style="font-weight:400;">52:21  Ryan- “</span></i><i><span style="font-weight:400;">I mean SageMaker was already a kitchen sink for ML solutions, right? Like all the different things that and it made it really difficult to sort of summarize what it was useful for. And now it’s so much worse.”</span></i></p>
<p><b>54:12 EC2 (Matt Garman’s favorite service)</b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Matt mentioned that this was his favorite service, since he was the head of it for a while. </span></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/amazon-ec2-trn2-instances-and-trn2-ultraservers-for-aiml-training-and-inference-is-now-available/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon EC2 Trn2 Instances and Trn2 UltraServers for AI/ML training and inference are now available</span></a></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/new-amazon-ec2-p5en-instances-with-nvidia-h200-tensor-core-gpus-and-efav3-networking/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">New Amazon EC2 P5en instances with NVIDIA H200 Tensor Core GPUs and EFAv3 networking</span></a><span style="font-weight:400;"> </span></li>
</ul>
<p><b>56:48 Wednesday (Swamy) – 15 Times</b></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/accelerate-foundation-model-training-and-fine-tuning-with-new-amazon-sagemaker-hyperpod-recipes/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Accelerate foundation model training and fine-tuning with new Amazon SageMaker HyperPod recipes</span></a><span style="font-weight:400;"> </span></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/about-aws/whats-new/2024/12/amazon-sagemaker-partner-ai-apps/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS announces Amazon SageMaker Partner AI Apps</span></a><span style="font-weight:400;"> </span></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/amazon-bedrock-marketplace-access-over-100-foundation-models-in-one-place/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Bedrock Marketplace: Access over 100 foundation models in one place</span></a><span style="font-weight:400;"> </span></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/reduce-costs-and-latency-with-amazon-bedrock-intelligent-prompt-routing-and-prompt-caching-preview/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Reduce costs and latency with Amazon Bedrock Intelligent Prompt Routing and prompt caching (preview)</span></a><span style="font-weight:400;"> </span></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/about-aws/whats-new/2024/12/genai-index-amazon-kendra/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Announcing GenAI Index in Amazon Kendra</span></a><span style="font-weight:400;"> </span></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/new-amazon-bedrock-capabilities-enhance-data-processing-and-retrieval/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">New Amazon Bedrock capabilities enhance data processing and retrieval</span></a><span style="font-weight:400;"> </span></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/amazon-bedrock-guardrails-now-supports-multimodal-toxicity-detection-with-image-support/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Bedrock Guardrails now supports multimodal toxicity detection with image support (preview)</span></a><span style="font-weight:400;"> </span></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/use-amazon-q-developer-to-build-ml-models-in-amazon-sagemaker-canvas/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Use Amazon Q Developer to build ML models in Amazon SageMaker Canvas</span></a><span style="font-weight:400;"> </span></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/solve-complex-problems-with-new-scenario-analysis-capability-in-amazon-q-in-quicksight/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Solve complex problems with new scenario analysis capability in Amazon Q in QuickSight</span></a><span style="font-weight:400;"> </span></li>
</ul>
<p><b>59:04 Non Keynote or at Partner Keynote</b></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/introducing-buy-with-aws-an-accelerated-procurement-experience-on-aws-partner-sites-powered-by-aws-marketplace/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Introducing Buy with AWS: an accelerated procurement experience on AWS Partner sites, powered by AWS Marketplace</span></a></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/aws-education-equity-initiative-applying-generative-ai-to-educate-the-next-wave-of-innovators/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Education Equity Initiative: Applying generative AI to educate the next wave of innovators</span></a><span style="font-weight:400;"> </span></li>
</ul>
<p><b>1:00:09 Thursday (Werner) – 1</b></p>
<p><span style="font-weight:400;">Complexity isn’t bad.</span></p>
<p><span style="font-weight:400;">No announcements</span></p>
<p><i><span style="font-weight:400;">AI or Artificial Intelligence was said 45 times</span></i></p>
<p><i><span style="font-weight:400;">1:00:25  Jonathan – “…</span></i><i><span style="font-weight:400;">complexity is weird though, because complexity kind of emerges from what he builds. Like, you never go out to build a complex system. It’s just something that naturally happens. And so I appreciated him calling it out and saying that it’s not inherently bad unless it’s something that becomes unreliable or unmanageable.”</span></i></p>
<h2><span style="font-weight:400;">Closing</span></h2>
<p><span style="font-weight:400;">And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</span></p>
]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/1924121/c1e-qx4xb2q4x3b7g083-1pdmx2ogik0j-6lqeed.mp3" length="75976536"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 284 of The Cloud Pod – where the forecast is always cloudy! Everybody is in the house this week, and it’s a good thing because since we’ve last recorded re:Invent happened, and we have a LOT to talk about. So let’s jump right in! 
Titles we almost went with this week:

Amazon Steals from Azure…. We Are Doomed 
️The Cloud Pod Can Now Throw Away a lot of Code
The Cloud Pod Controls the Future
The Cloud Pod Observes More Insights
We Are Simplicity
❌X None of the Above
Stop Trying to Make Bedrock & Q Happen
My Head Went SuperNova over all the Q Announcements
These are Not the Gadgets Bond Needed, Q! 

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
AWS 
08:12 It’s the re:Invent recap! 
Did you make any announcement predictions? Let’s see how our hosts’  predictions stacked up to reality. 
Matt – 1

Large Green Computing Reinvent
LLM at the Edge
Something new on S3✅

Ryan (AI) – 1

Improved serverless observability tools
Expansion of AI Driven workflows in datalakes✅
Greater Focus on Multi-Account or Multi-region orchestration, centralized compliance management, or enhanced security services

Jonathan – 0

New Edge Computing Capabilities better global application deployment type features. (Cloudflare competitor maybe)
New automated cost optimization tools
Automated RAG/vector to S3

Justin  – 2



Managed Backstage or platform like service




New LLM multi-modal replacement or upgrade to Titan✅


Competitor VM offering to Broadcom✅

Honorable Mentions:
Jonathan:
Deeper integration between serverless and container services
New region
Enhanced Observability with AI driven debugging tool✅
Justin:
Multicloud management – in a bigger way (Anthos competitor)
Agentic AI toolings
New ARM graviton chip
How many will AI or Artificial Intelligence be said: 45
Justin – 35✅
Jonathan – 72
Pre:Invent
There were over 180 announcements, and yes – we have them all listed here for you...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/1924121/c1a-k5d5-6zwv08v4s54z-1rhst4.jpg"></itunes:image>
                                                                            <itunes:duration>01:03:19</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                            </item>
                    <item>
                <title>
                    <![CDATA[283: You’ve Got Re:Invent Predictions]]>
                </title>
                <pubDate>Wed, 27 Nov 2024 09:46:59 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/1909571</guid>
                                    <link>https://tcpfm.castos.com/episodes/283-youve-got-reinvent-predictions</link>
                                <description>
                                            <![CDATA[<p><span style="font-weight:400;">Welcome to episode 283 of The Cloud Pod, where the forecast is always cloudy! Break out your crystal balls and shuffle those tarot decks, because it’s Re:Invent prediction time! Sorry we missed you all last week – the plague has been strong with us. But Justin and Jonathan are BACK, and we’ve got a ton of news, so buckle in and let’s get started! </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">Not My Snowcones! </span></li>
<li><span style="font-weight:400;">Lambda at 10: Still Better Than Windows Containers </span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>General News </b><span style="font-weight:400;"> </span></h2>
<p><b>01:27 </b><a href="https://arstechnica.com/gadgets/2024/11/the-voice-of-america-onlines-youve-got-mail-has-died-at-age-74/" target="_blank" rel="noreferrer noopener"><b>The voice of America Online’s “You’ve got mail” has died at age 74</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Elwoods Edwards, the voice behind the online service </span><a href="https://www.aol.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AOL</span></a><span style="font-weight:400;">’s iconic “You’ve got mail” sound notification has died at the age of 74. He was just one day shy of his 75th birthday. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The “you’ve got mail” soundbite started in 1989 when Steve Case, CEO of Quantum Computer Services (which will later become America Online or AOL,) wanted to add a human voice to their Quantum online service.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Karen Edwards, who worked as a customer service representative, heard Case discussing the plan and suggested her husband Elwood, a professional broadcaster. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Edwards recorded the famous phrase and others (“Welcome” “File’s done” and “Goodbye” among them) on a cassette recorder in his living room. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">He was paid $200 for the service.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">His voice is still used to greet users of the current AOL service. </span></li>
</ul>
<h2><b>AWS</b><span style="font-weight:400;"> </span></h2>
<p><b>03:04 It’s Time for </b><a href="https://reinvent.awsevents.com/" target="_blank" rel="noreferrer noopener"><b>RE:Invent</b></a><b> Predictions!</b></p>
<p><b>Matt</b></p>
<ol>
<li style="font-weight:400;"><span style="font-weight:400;">Large Green Computing Reinvent</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">LLM at the Edge</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Something new On S3</span></li>
</ol>
<p><b>Ryan (AI)</b></p>
<ol>
<li style="font-weight:400;"><span style="font-weight:400;">Improved serverless observability tools</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Expansion of AI Driven workflows in datalakes</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Greater Focus on Multi-Account or Multi-region orchestration, centralized compliance management, or enhanced security services</span></li>
</ol>
<p><b>Jonathan</b></p>
<ol>
<li style="font-weight:400;"><span style="font-weight:400;">New Edge Computing Capabilities better global application deployment type features. (Cloudflare competitor maybe)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">New automated cost optimization tools</span></li>
<li></li></ol>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 283 of The Cloud Pod, where the forecast is always cloudy! Break out your crystal balls and shuffle those tarot decks, because it’s Re:Invent prediction time! Sorry we missed you all last week – the plague has been strong with us. But Justin and Jonathan are BACK, and we’ve got a ton of news, so buckle in and let’s get started! 
Titles we almost went with this week:

Not My Snowcones! 
Lambda at 10: Still Better Than Windows Containers 

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
General News  
01:27 The voice of America Online’s “You’ve got mail” has died at age 74

Elwoods Edwards, the voice behind the online service AOL’s iconic “You’ve got mail” sound notification has died at the age of 74. He was just one day shy of his 75th birthday. 
The “you’ve got mail” soundbite started in 1989 when Steve Case, CEO of Quantum Computer Services (which will later become America Online or AOL,) wanted to add a human voice to their Quantum online service.  
Karen Edwards, who worked as a customer service representative, heard Case discussing the plan and suggested her husband Elwood, a professional broadcaster. 
Edwards recorded the famous phrase and others (“Welcome” “File’s done” and “Goodbye” among them) on a cassette recorder in his living room. 
He was paid $200 for the service.  
His voice is still used to greet users of the current AOL service. 

AWS 
03:04 It’s Time for RE:Invent Predictions!
Matt

Large Green Computing Reinvent
LLM at the Edge
Something new On S3

Ryan (AI)

Improved serverless observability tools
Expansion of AI Driven workflows in datalakes
Greater Focus on Multi-Account or Multi-region orchestration, centralized compliance management, or enhanced security services

Jonathan

New Edge Computing Capabilities better global application deployment type features. (Cloudflare competitor maybe)
New automated cost optimization tools
]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[283: You’ve Got Re:Invent Predictions]]>
                </itunes:title>
                                    <itunes:episode>283</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p><span style="font-weight:400;">Welcome to episode 283 of The Cloud Pod, where the forecast is always cloudy! Break out your crystal balls and shuffle those tarot decks, because it’s Re:Invent prediction time! Sorry we missed you all last week – the plague has been strong with us. But Justin and Jonathan are BACK, and we’ve got a ton of news, so buckle in and let’s get started! </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">Not My Snowcones! </span></li>
<li><span style="font-weight:400;">Lambda at 10: Still Better Than Windows Containers </span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>General News </b><span style="font-weight:400;"> </span></h2>
<p><b>01:27 </b><a href="https://arstechnica.com/gadgets/2024/11/the-voice-of-america-onlines-youve-got-mail-has-died-at-age-74/" target="_blank" rel="noreferrer noopener"><b>The voice of America Online’s “You’ve got mail” has died at age 74</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Elwoods Edwards, the voice behind the online service </span><a href="https://www.aol.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AOL</span></a><span style="font-weight:400;">’s iconic “You’ve got mail” sound notification has died at the age of 74. He was just one day shy of his 75th birthday. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The “you’ve got mail” soundbite started in 1989 when Steve Case, CEO of Quantum Computer Services (which will later become America Online or AOL,) wanted to add a human voice to their Quantum online service.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Karen Edwards, who worked as a customer service representative, heard Case discussing the plan and suggested her husband Elwood, a professional broadcaster. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Edwards recorded the famous phrase and others (“Welcome” “File’s done” and “Goodbye” among them) on a cassette recorder in his living room. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">He was paid $200 for the service.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">His voice is still used to greet users of the current AOL service. </span></li>
</ul>
<h2><b>AWS</b><span style="font-weight:400;"> </span></h2>
<p><b>03:04 It’s Time for </b><a href="https://reinvent.awsevents.com/" target="_blank" rel="noreferrer noopener"><b>RE:Invent</b></a><b> Predictions!</b></p>
<p><b>Matt</b></p>
<ol>
<li style="font-weight:400;"><span style="font-weight:400;">Large Green Computing Reinvent</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">LLM at the Edge</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Something new On S3</span></li>
</ol>
<p><b>Ryan (AI)</b></p>
<ol>
<li style="font-weight:400;"><span style="font-weight:400;">Improved serverless observability tools</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Expansion of AI Driven workflows in datalakes</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Greater Focus on Multi-Account or Multi-region orchestration, centralized compliance management, or enhanced security services</span></li>
</ol>
<p><b>Jonathan</b></p>
<ol>
<li style="font-weight:400;"><span style="font-weight:400;">New Edge Computing Capabilities better global application deployment type features. (Cloudflare competitor maybe)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">New automated cost optimization tools</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Automated RAG/vector to S3</span></li>
</ol>
<p><b>Justin </b></p>
<ol>
<li style="font-weight:400;"><span style="font-weight:400;">Managed Backstage or platform like service</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">New LLM multi-modal replacement or upgrade to Titan</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Competitor VM offering to Broadcom </span></li>
</ol>
<p><b>Honorable Mentions</b></p>
<p><span style="font-weight:400;">Jonathan:</span></p>
<p><span style="font-weight:400;">Deeper integration between serverless and container services</span></p>
<p><span style="font-weight:400;">New Region</span></p>
<p><span style="font-weight:400;">Enhanced Observability with AI driven debugging tool</span></p>
<p><span style="font-weight:400;">Justin:</span></p>
<p><span style="font-weight:400;">Multi Cloud management – in a bigger way (Anthos competitor)</span></p>
<p><span style="font-weight:400;">Agentic AI toolings</span></p>
<p><span style="font-weight:400;">New ARM graviton chip</span></p>
<p><b>How many times will AI or Artificial Intelligence be said: </b></p>
<p><span style="font-weight:400;">Justin – 35</span></p>
<p><span style="font-weight:400;">Jonathan – 72</span></p>
<p><span style="font-weight:400;">And now it’s time for Pre:Invent announcements: </span></p>
<p><b>20:09 </b><a href="https://aws.amazon.com/blogs/aws/introducing-express-brokers-for-amazon-msk-to-deliver-high-throughput-and-faster-scaling-for-your-kafka-clusters/" target="_blank" rel="noreferrer noopener"><b>Introducing Express brokers for Amazon MSK to deliver high throughput </b></a><a href="https://aws.amazon.com/blogs/aws/introducing-express-brokers-for-amazon-msk-to-deliver-high-throughput-and-faster-scaling-for-your-kafka-clusters/" target="_blank" rel="noreferrer noopener"><b>and faster scaling for your Kafka clusters</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon is announcing the general availability of Express Brokers, a new broker type for </span><a href="https://aws.amazon.com/msk" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Managed Streaming for Apache Kafka</span></a><span style="font-weight:400;"> (MSK).  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The new Express Broker is designed to deliver up to 3x more throughput per-broker, scale up to 20 times faster, and reduce recovery time by 90 percent – as compared to standard brokers running </span><a href="https://kafka.apache.org/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Apache Kafka</span></a><span style="font-weight:400;">.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Express Brokers come preconfigured with Kafka </span><a href="https://docs.aws.amazon.com/msk/latest/developerguide/bestpractices.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">best practices</span></a><span style="font-weight:400;"> by default.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They also support Kafka API’s and provide the same low latency performance that Amazon MSK customers expect, so they can continue using existing client applications without any changes. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Express Broker provided improved compute and storage elasticity for Kafka applications when using </span><a href="https://console.aws.amazon.com/msk/home?#/clusters" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon MSK</span></a><span style="font-weight:400;"> provisioned clusters.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Some of the key features of the new express brokers include: </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Easier operations with hand-free storage management</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Fewer brokers with up to 3x throughput per broker</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Higher utilization with 20 times faster scaling</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Higher resilience with 90 percent faster recovery</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Cost wise (Ohio)</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Express.m7g.4xlarge – 16 vcpu – 64gib – 3.264 per hour</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Standard Broker – 16 vcpu – 64gb – 1.632</span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">21:10  Jonathan – “it seems like would be a no-brainer if you’re running enough single brokers to meet their capacity, then switching to these as long as you maintain your redundancy would be kind of a no-brainer. I wonder what they’ve done exactly to make this new class of instances. They’re not just bigger instances, surely.”</span></i></p>
<p><b>22:13 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-ebs-performance-statistics-ebs-volume-health/" target="_blank" rel="noreferrer noopener"><b>Amazon EBS now supports detailed performance statistics on EBS volume </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-ebs-performance-statistics-ebs-volume-health/" target="_blank" rel="noreferrer noopener"><b>health</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon is really ticking off a ton of Justin’s requests for </span><a href="https://docs.aws.amazon.com/AmazonCloudWatch/latest/monitoring/WhatIsCloudWatch.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">CloudWatch</span></a><span style="font-weight:400;">!  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This week, CW gets detailed performance statistics for EBS volumes. This new capability provides you with real-time visibility into the performance of your EBS volumes, making it easier to monitor the health of your storage resources and take action sooner if things go south.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can access 11 metrics at up to per-second granularity to monitor input/output statistics of your EBS volumes, including driven I/O and I/O latency histograms. </span></li>
</ul>
<p><i><span style="font-weight:400;">22:44  Justin – “So, you know, in the early days of auto scaling, one of the things that a lot of customers would do was they would create testing when the node would come up and they would actually test the IO throughput to the EBS volume because they were not always created equal. And so if you got a bad EBS volume, you create another one or rescale or kill that node and try again until you get one that performs to your specifications. So now, at least exposing this to you so you can actually just monitor it from CloudWatch, which is a much simpler way than running a bunch of automated tests.”</span></i></p>
<p><b>24:00 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/ec2-auto-scaling-strict-availability-zone-balance/" target="_blank" rel="noreferrer noopener"><b>EC2 Auto Scaling introduces provisioning control on strict availability zone </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/ec2-auto-scaling-strict-availability-zone-balance/" target="_blank" rel="noreferrer noopener"><b>balance</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/autoscaling/ec2/userguide/auto-scaling-groups.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon EC2 auto scaling groups</span></a><span style="font-weight:400;"> (ASG) introduce a new capability for customers to strictly balance their workloads across Availability Zones, enabling greater control over provisioning and management of EC2 instances,</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Previously, if you wanted to strictly balance ASG instances across AZs, you had to override the default behavior in EC2 and invest in custom code to modify the ASG’s existing behaviors with life cycle hooks or maintain multiple ASGs.</span></li>
</ul>
<p><i><span style="font-weight:400;">24:24  Justin – “…one of the things, if you are in a region with three zones and you want three nodes in your auto scaling group, it’ll spin up A and B and then they say C doesn’t have the capacity. It’ll just keep spinning away at C – letting you know that it’s not launching that server forever, which is just terrible. So now you at least say like look, I still want segmentation. I would still want at least two regions, but that third node can’t spin up in C. You can just put it in B or A.”</span></i></p>
<p><b>25:55 </b><a href="https://aws.amazon.com/blogs/machine-learning/amazon-bedrock-prompt-management-is-now-available-in-ga/" target="_blank" rel="noreferrer noopener"><b>Amazon Bedrock Prompt Management is now available in GA</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon is announcing the GA of </span><a href="https://aws.amazon.com/es/bedrock/prompt-management/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Bedrock Prompt Management</span></a><span style="font-weight:400;">, with new features that provide enhanced options for configuring your prompts and enabling seamless integration for invoking them in your generative AI applications. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon Bedrock Prompt Management simplifies the creation, evaluation, versioning and sharing of prompts to help developers and prompt engineers get better responses from foundation models (FMs) for their use cases.       </span></li>
</ul>
<p><i><span style="font-weight:400;">26:19  Jonathan – “ Yeah, you can always ask A.I. to write a prompt for you, which has always worked really well for me. Yeah, this is kind of nice. I’ve been using Langchain in Python recently. I think it’s also available for TypeScript as well. But Langchain supports creating prompt templates, and then you can string a whole series of things together and build agents and all kinds of stuff. So it’s nice to see that they’re kind of catching up with what the open source community already has in terms of usability for this.”</span></i></p>
<p><b>27:03 </b><a href="https://aws.amazon.com/blogs/storage/aws-snow-device-updates/" target="_blank" rel="noreferrer noopener"><b>AWS Snow device updates</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon is taking our snowcones, and reducing options for snowballs. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Effective November 12, 2024, AWS has discontinued three previous generation, end of life snowball device models; specifically the Storage optimized 80TB, Edge Compute optimized with 52vcpu, and the Compute optimized with GPU devices.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You will no longer be able to order these models, and if you have one in your environment you have one year to return the unit. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The only snowballs that will continue to be supported are the </span><a href="https://aws.amazon.com/blogs/aws/new-snowball-edge-storage-optimized-devices-with-more-storage-and-bandwidth/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Storage optimized 210TB devices</span></a><span style="font-weight:400;"> with NVME storage, and </span><a href="https://aws.amazon.com/about-aws/whats-new/2022/10/aws-snowball-edge-compute-optimized-double-compute-capacatiy-ssd-nvme-storage/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Compute Optimized with 104 vCPU</span></a><span style="font-weight:400;"> with full SSD 28TB NVME for edge workloads. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">If these two options don’t work for your edge computing needs, they have AWS Outpost solutions in 1U, 2U and 42U configurations. </span></li>
</ul>
<p><i><span style="font-weight:400;">28:11  Jonathan – “It’s interesting, kind of in the hindsight, we wondered who really used these things to begin with. And maybe it was just a good idea. Maybe it was internally used and they thought other people would want to use them and there just wasn’t a market for it.”</span></i></p>
<p><b>29:57 </b><a href="https://aws.amazon.com/blogs/aws/aws-lambda-snapstart-for-python-and-net-functions-is-now-generally-available/" target="_blank" rel="noreferrer noopener"><b>AWS Lambda SnapStart for Python and .NET functions is now generally </b></a><a href="https://aws.amazon.com/blogs/aws/aws-lambda-snapstart-for-python-and-net-functions-is-now-generally-available/" target="_blank" rel="noreferrer noopener"><b>available</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/new-accelerate-your-lambda-functions-with-lambda-snapstart/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Snapstart</span></a><span style="font-weight:400;"> now supports Python and .Net, coming 2 years after they introduced it for Java functions.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Lambda Snapstart caches and reuses snapshotted memory and disk state of any one-time initialization code, or code that runs only the first time the Lambda Function is invoked. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">For Python functions, startup latency from initialization code can be several seconds; when you add in dependencies, this can balloon to 10+ seconds.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Snapstart can reduce latency from several seconds to as low as sub-second for these scenarios. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">For .net functions, they expect most use cases to benefit because .net just-in-time compilation takes up to several seconds. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Latency variability associated with the initialization of Lambda functions has been a long-standing barrier to lambda adoption for .net use cases.  </span></li>
</ul>
<p><i><span style="font-weight:400;">30:58  Jonathan – “Wow, mean, just think of the cost saving. In usage, let alone the virtual capacity increase they’ve just got if everyone just suddenly starts using this. Even if it’s just two seconds per invocation that they’re saving, that’s two seconds they can sell to somebody else.”</span></i></p>
<p><b>31:51 </b><a href="https://aws.amazon.com/blogs/aws/aws-lambda-turns-ten-the-first-decade-of-serverless-innovation/" target="_blank" rel="noreferrer noopener"><b>AWS Lambda turns ten – looking back and looking ahead</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Lambda turns 10! </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">As many services are now reaching this milestone, we’re not sure how much we’ll talk about these, but </span><a href="https://aws.amazon.com/blogs/aws/run-code-cloud/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Lambda</span></a><span style="font-weight:400;"> was a big deal when it was launched, and deserves a mention. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Jeff Barr writes that today over 1.5 million lambda users collecting makes tens of trillion function </span><a href="https://docs.aws.amazon.com/lambda/latest/api/API_Invoke.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">invocations</span></a><span style="font-weight:400;"> per month.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Key milestones:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">2014 – Lambda announced in preview ahead of Re:Invent with support for </span><a href="http://node.js" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">node.js</span></a><span style="font-weight:400;"> and ability to respond to event triggers from S3 buckets, </span><a href="https://aws.amazon.com/blogs/aws/dynamodb-update-triggers-streams-lambda-cross-region-replication-app/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">DynamoDB</span></a><span style="font-weight:400;"> and </span><a href="https://aws.amazon.com/kinesis/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Kinesis</span></a><span style="font-weight:400;"> streams. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">2015 – </span><a href="https://aws.amazon.com/blogs/aws/aws-lambda-update-production-status-and-a-focus-on-mobile-apps/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">GA</span></a><span style="font-weight:400;"> supports </span><a href="https://aws.amazon.com/sns/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">SNS</span></a><span style="font-weight:400;"> notifications as triggers and now supports functions written in </span><a href="https://aws.amazon.com/blogs/aws/aws-lambda-update-run-java-code-in-response-to-events/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Java</span></a><span style="font-weight:400;">.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">2016 – </span><a href="https://docs.aws.amazon.com/lambda/latest/dg/lambda-python.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Python</span></a><span style="font-weight:400;"> support, increased function duration to 5 minutes (it was later increased to 15 minutes), ability to access </span><a href="https://aws.amazon.com/blogs/aws/aws-lambda-update-python-vpc-increased-function-duration-scheduling-and-more/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">resources in a VPC</span></a><span style="font-weight:400;">, and the </span><a href="https://github.com/awslabs/serverless-application-model" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Serverless Application Model</span></a><span style="font-weight:400;">, as well as the </span><a href="https://aws.amazon.com/blogs/aws/new-aws-step-functions-build-distributed-applications-using-visual-workflows/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">launch of Step Functions</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">2017 – </span><a href="https://aws.amazon.com/blogs/aws/aws-lambda-support-for-aws-x-ray/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Xray support</span></a></li>
<li style="font-weight:400;"><span style="font-weight:400;">2018 – </span><a href="https://aws.amazon.com/blogs/aws/aws-lambda-adds-amazon-simple-queue-service-to-supported-event-sources/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">SQS support</span></a><span style="font-weight:400;">, </span><a href="https://aws.amazon.com/blogs/aws/cloudformation-macros/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Cloudformation extensions</span></a><span style="font-weight:400;"> and ability to </span><a href="https://aws.amazon.com/blogs/aws/new-for-aws-lambda-use-any-programming-language-and-share-common-components/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">write lambda functions in any language</span></a><span style="font-weight:400;">.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">2019 – </span><a href="https://aws.amazon.com/blogs/aws/new-provisioned-concurrency-for-lambda-functions/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Provisioned concurrency</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">2020 – Savings Plan, and </span><a href="https://aws.amazon.com/blogs/aws/new-use-aws-privatelink-to-access-aws-lambda-over-private-aws-network/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Private Link support</span></a><span style="font-weight:400;">, </span><a href="https://aws.amazon.com/blogs/aws/new-for-aws-lambda-1ms-billing-granularity-adds-cost-savings/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">1ms billing granularity</span></a><span style="font-weight:400;"> and you can now use up to </span><a href="https://aws.amazon.com/blogs/aws/new-for-aws-lambda-functions-with-up-to-10-gb-of-memory-and-6-vcpus/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">10GB of memory</span></a><span style="font-weight:400;"> and 6 CPU as well as support for </span><a href="https://aws.amazon.com/blogs/aws/new-for-aws-lambda-container-image-support/." target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">container images</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">2021 – </span><a href="https://aws.amazon.com/blogs/aws/introducing-amazon-s3-object-lambda-use-your-code-to-process-data-as-it-is-being-retrieved-from-s3/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">S3 Object Lamba</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">2022 – </span><a href="https://aws.amazon.com/blogs/aws/aws-lambda-now-supports-up-to-10-gb-ephemeral-storage/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">10GB of temporary storage</span></a><span style="font-weight:400;"> (which was controversial, if we recall.)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">2024 – New observability capabilities with Logs, </span><a href="https://docs.aws.amazon.com/lambda/latest/dg/snapstart.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Java functions</span></a><span style="font-weight:400;"> that use ARM, </span><a href="https://aws.amazon.com/blogs/compute/aws-lambda-introduces-recursive-loop-detection-apis/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">recursive loop</span></a><span style="font-weight:400;"> and new </span><a href="https://aws.amazon.com/blogs/compute/introducing-an-enhanced-local-ide-experience-for-aws-lambda-developers/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">IDE methods</span></a><span style="font-weight:400;">. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Looking ahead Jeff barr talks about the next decade of serverless, where he believes:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Serverless will be the default choice</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Continued shift toward composability</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Automated, AI-optimized infra management</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Extensibility and integration</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Security – Threat detection and AI assisted remediation will work to make serverless apps more secure. </span></li>
</ul>
</li>
</ul>
<p><b>36:15 </b><a href="https://aws.amazon.com/blogs/aws/centrally-managing-root-access-for-customers-using-aws-organizations/" target="_blank" rel="noreferrer noopener"><b>Centrally managing root access for customers using AWS Organizations</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/iam/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">IAM</span></a><span style="font-weight:400;"> is launching a new capability to allow security teams to centrally manage root access for member accounts in </span><a href="https://aws.amazon.com/organizations/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS organizations</span></a><span style="font-weight:400;">.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can now easily manage root credentials and perform highly privileged actions. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Since the beginning, </span><a href="https://aws.amazon.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS</span></a><span style="font-weight:400;"> accounts have been provisioned with highly privileged root user credentials, which had unrestricted access across the account.  While powerful, it posted significant security risks. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Many customers built manual approaches to ensure MFA was enabled, regular root credential rotations and secure storage of credentials in vaults. This becomes problematic, however, as you scale into the 100’s of accounts that most enterprises run. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In addition specific root actions such as unlocking </span><a href="https://aws.amazon.com/s3/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">S3</span></a><span style="font-weight:400;"> bucket policies or </span><a href="https://aws.amazon.com/sqs/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">SQS</span></a> <a href="https://aws.amazon.com/premiumsupport/knowledge-center/sqs-queue-access-issues-deny-policy" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">resource policies</span></a><span style="font-weight:400;">, required the root credentials. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Now with this new ability you get central management of root credentials and root sessions. Together, they offer security teams a secure, scalable and compliant way to manage root access across AWS organization member accounts. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Central management of root credentials:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Remove long term root credentials programmatically from member accounts. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Prevent credential recovery</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Provisioned secure-by-default accounts  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Help you stay compliant.</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">But sometimes you may still need the ability to do something with root, and for that they are launching root sessions: </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Secure alternative to maintaining long-term root access. Now you gain short-term, task-scoped root access to member accounts.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Root Session benefits:</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Task scoped root access</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Centralized management</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Alignment with AWS best practices</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">This new capability isn’t giving you full root access, just temporary credentials to perform one of the following actions:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Auditing root user credentials</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Re-enabling account recovery</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Deleting root user credentials</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Unlocking an S3 bucket policy</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Unlocking an SQS queue policy</span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">39:12  Jonathan – “It’s wonderful. No longer have to explain to the security team that setting the root password at some 64 character random password and then discarding it was actually a secure option, which I still think was a secure option after use.”</span></i></p>
<p><b>40:30 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-route-53-resolver-dns-firewall-advanced/" target="_blank" rel="noreferrer noopener"><b>Introducing Amazon Route 53 Resolver DNS Firewall Advanced</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon must have hired someone from Azure to build this capability… </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">We are now getting another flavor of Route 53 resolver DNS firewall advanced, a new set of capabilities to the existing firewall that will allow you to monitor and block suspicious DNS traffic associated with advanced DNS threats, such as DNS tunneling and Domain Generation Algorithms (DGAs), that are designed to avoid detection by threat intelligence feeds or are difficult for threat intelligence feeds alone to track and block in time. </span></li>
</ul>
<p><b>41:35 </b><a href="https://aws.amazon.com/blogs/database/new-amazon-dynamodb-lowers-pricing-for-on-demand-throughput-and-global-tables/" target="_blank" rel="noreferrer noopener"><b>Amazon DynamoDB lowers pricing for on-demand throughput and global </b></a></p>
<p><a href="https://aws.amazon.com/blogs/database/new-amazon-dynamodb-lowers-pricing-for-on-demand-throughput-and-global-tables/" target="_blank" rel="noreferrer noopener"><b>tables</b></a><span style="font-weight:400;">      </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS engineering has been working on making </span><a href="https://aws.amazon.com/dynamodb/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">DynamoDB</span></a><span style="font-weight:400;"> more efficient, and through this they have identified and are passing along cost savings to you. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Effective November 1st, DynamoDB has reduced prices for on-demand throughput by 50% and global tables by up to 67%, making it more cost-effective than ever to build, scale, and optimize applications.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">AWS points out that while provisioned capacity workloads were reasonable in the past, the new on-demand pricing benefits will result in most customers achieving a lower price with on-demand mode. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This also allows you to skip capacity planning, get automatic pricing, usage based pricing instead of capacity and the ability to scale to 0, as well as this makes it easier to adopt Serverless capabilities.</span></li>
</ul>
<p><i><span style="font-weight:400;">41:58  Justin – “…</span></i><span style="font-weight:400;">one of the interesting things I have found in this article was that it points out that while provisioning capacity, where those were reasonable in the past, the new on-demand pricing benefit will result in most customers achieving a lower price with on-demand nodes. We’ll still meet the capacity need without having to capacity plan or do scaling of that capacity throughput. So they’re actually saying that, because of this price adjustment, the cost benefit is much better. And so you should definitely consider moving back to on-demand Dynamo DP.”</span></p>
<p><b>43:52 </b><a href="https://aws.amazon.com/blogs/aws/introducing-resource-control-policies-rcps-a-new-authorization-policy/" target="_blank" rel="noreferrer noopener"><b>Introducing resource control policies (RCPs), a new type of authorization </b></a><a href="https://aws.amazon.com/blogs/aws/introducing-resource-control-policies-rcps-a-new-authorization-policy/" target="_blank" rel="noreferrer noopener"><b>policy in AWS Organizations</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon is introducing </span><a href="https://docs.aws.amazon.com/organizations/latest/userguide/orgs_manage_policies_rcps.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">resource control policies (RCPs)</span></a><span style="font-weight:400;"> – a new authorization policy managed in AWS organizations that can be used to set the maximum available permissions on resources within your entire organization.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They are a type of preventative control that help you establish </span><a href="https://aws.amazon.com/identity/data-perimeters-on-aws/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">data perimeters</span></a><span style="font-weight:400;"> in your AWS environment and restrict access to resources at scale. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Currently supports are in place for </span><a href="https://aws.amazon.com/s3/getting-started/?nc=sn&amp;loc=6&amp;dn=1" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">S3</span></a><span style="font-weight:400;">, </span><a href="https://docs.aws.amazon.com/STS/latest/APIReference/welcome.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">STS</span></a><span style="font-weight:400;">, </span><a href="https://aws.amazon.com/kms/getting-started/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">KMS</span></a><span style="font-weight:400;">, </span><a href="https://docs.aws.amazon.com/AWSSimpleQueueService/latest/SQSDeveloperGuide/sqs-getting-started.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">SQS</span></a><span style="font-weight:400;">, and </span><a href="https://aws.amazon.com/secrets-manager/getting-started/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Secrets Manager</span></a><span style="font-weight:400;">.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You might be asking what are differences between </span><a href="https://docs.aws.amazon.com/organizations/latest/userguide/orgs_manage_policies_scps.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Service Control Policies</span></a><span style="font-weight:400;"> and RCPs? We got you. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">SCPs limit permissions granted to principles (IAM role/users)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">RCPs limit permissions granted to resources themselves</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">RCPs are evaluated when resources are accessed, regardless of who is making the API request</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Some key use cases:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Enforcing organization wide resource access controls</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Ensure S3 buckets can only be accessed by principals within your organization</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Prevent unauthorized external access even if developers accidentally configure overly permissive policies</span></li>
</ul>
</li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Combining SCP and RCP give you an ability to set maximum allowable permissions from different angles (Principals vs resources) and used together they create a comprehensive security baseline for organizations needing strict access controls. </span></li>
</ul>
<p><i><span style="font-weight:400;">45:54  Justin – “…it sounds boring, but then when you think about it, it’s like, this is actually really cool.”</span></i></p>
<h2><b>GCP</b></h2>
<p><b>46:31 </b><a href="https://cloud.google.com/blog/products/data-analytics/dataplex-discovers-and-catalogs-cloud-storage-data/" target="_blank" rel="noreferrer noopener"><b>Dataplex Automatic Discovery makes Cloud Storage data available for </b></a><a href="https://cloud.google.com/blog/products/data-analytics/dataplex-discovers-and-catalogs-cloud-storage-data/" target="_blank" rel="noreferrer noopener"><b>Analytics and governance</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Ever Growing data – both structured and unstructured – continues to make it a challenge to locate the right data at the right time, and a significant portion of enterprise data remains undiscovered or underutilized, often referred as “dark data”.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">To help address dark data, Google is announcing automatic discovery and cataloging of Google Cloud Storage data with </span><a href="https://cloud.google.com/dataplex/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Dataplex</span></a><span style="font-weight:400;">, part of </span><a href="https://cloud.google.com/bigquery/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">BigQuery’s</span></a><span style="font-weight:400;"> unified platform for intelligent data to AI governance. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Automatically discover valuable data assets residing within cloud storage, including structured and unstructured data such as documents, files, PDFs, images and more</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Harvest and catalog metadata for your discovered assets by keeping schema definitions up to date with built-in compatibility checks and partition detection, as data evolves</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Enable analytics for data science and AI uses cases at scale with auto-created </span><a href="https://cloud.google.com/biglake" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">BigLake</span></a><span style="font-weight:400;">, external or object tables, eliminating the need for data duplication or manually creating table definitions. </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">47:41  Justin – “…you know, data is the new currency. So finding your data and your organization can be somewhat a needle in the haystack; because everyone stores data where they think they need it. And then you have different enterprise systems, different SaaS applications are using… so, you know, to have a system that’s kind of inside of your environment, that’s able to automatically scan and find your data assets and then pull them into a data lake. Even if you don’t need them, that’s just incredibly valuable just for discovery.”</span></i></p>
<p><b>49:39 </b><a href="https://cloud.google.com/blog/products/identity-security/shift-left-your-cloud-compliance-auditing-with-audit-manager/" target="_blank" rel="noreferrer noopener"><b>Shift-left your cloud compliance auditing with Audit Manager</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/products/audit-manager" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Audit manager</span></a><span style="font-weight:400;"> from Google is now generally available. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Audit manager will help you accelerate your compliance efforts by providing:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Cleared </span><a href="https://cloud.google.com/architecture/framework/security/shared-responsibility-shared-fate" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">shared responsibility</span></a><span style="font-weight:400;"> outlines; including a matrix of shared responsibilities that delineates compliance duties between cloud providers and customers, offering actionable recommendations tailored to your workloads.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Automated Compliance Assessments: Evaluation of your workloads against industry-standard technical control requirements in a simple and automated manner. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Audit-ready evidence – Automated generation of comprehensive, verifiable evidence reports to support your compliance claims and overarching governance activity. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Actionable Remediation Guidance.</span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">50:56  Jonathan – “I wonder if compliance auditors in general will eventually die off, not literally, but I wonder if Google or Amazon or somebody else could actually build a tool which you say, I want to be compliant with X framework will reach a point where it can be trusted enough to go and do assessments, collect data, generate reports, and then give you findings without the involvement of the PWCs or anybody else of the world.”</span></i></p>
<p><b>53:20 </b><a href="https://cloud.google.com/blog/products/containers-kubernetes/gke-65k-nodes-and-counting/" target="_blank" rel="noreferrer noopener"><b>65,000 nodes and counting: Google Kubernetes Engine is ready for </b></a><a href="https://cloud.google.com/blog/products/containers-kubernetes/gke-65k-nodes-and-counting/" target="_blank" rel="noreferrer noopener"><b>trillion-parameter AI models</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">For the masochists out there, you can now support up to 65,000 GKE nodes, which GKE believes is 10x more what either AWS or Azure can do,</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Why would you want 65,000 nodes you might ask? Well AI of course! </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">That would be combined with access to things like GPU, Cloud TPU v5e node, and giving the ability to manage over 250,000 accelerators in one cluster.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Some recent GKE innovations:</span>
<ul>
<li style="font-weight:400;"><a href="https://cloud.google.com/kubernetes-engine/docs/how-to/data-container-image-preloading" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Secondary boot disks</span></a></li>
<li style="font-weight:400;"><a href="https://cloud.google.com/kubernetes-engine/docs/how-to/dcgm-metrics" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Fully managed DCGM metrics</span></a></li>
<li style="font-weight:400;"><a href="https://cloud.google.com/kubernetes-engine/docs/how-to/persistent-volumes/hyperdisk-ml" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Hyperdisk ML</span></a></li>
<li style="font-weight:400;"><a href="https://cloud.google.com/run/docs/configuring/services/gpu" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Serverless GPU’s</span></a></li>
<li style="font-weight:400;"><a href="https://cloud.google.com/kubernetes-engine/docs/concepts/about-custom-compute-classes" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Custom Compute Classes</span></a></li>
<li style="font-weight:400;"><a href="https://cloud.google.com/blog/products/compute/introducing-trillium-6th-gen-tpus?e=48754805" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Trillium</span></a><span style="font-weight:400;"> support</span></li>
<li style="font-weight:400;"><a href="https://cloud.google.com/blog/products/compute/trillium-sixth-generation-tpu-is-in-preview" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">A3 Ultra VM</span></a><span style="font-weight:400;"> </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">53:51  Justin – “You’re gonna need to communicate with your account rep before you spin up your 65,000 GKE nodes.”</span></i></p>
<h2><b>Azure</b></h2>
<p><b>55:55 </b><a href="https://www.microsoft.com/en-us/windows-server/blog/2024/11/04/windows-server-2025-now-generally-available-with-advanced-security-improved-performance-and-cloud-agility/" target="_blank" rel="noreferrer noopener"><b>Windows Server 2025 now generally available, with advanced security, </b></a><a href="https://www.microsoft.com/en-us/windows-server/blog/2024/11/04/windows-server-2025-now-generally-available-with-advanced-security-improved-performance-and-cloud-agility/" target="_blank" rel="noreferrer noopener"><b>improved performance, and cloud agility </b></a><b>  </b></p>
<ul>
<li style="font-weight:400;"><a href="https://www.microsoft.com/windows-server/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">WIndows Server 2025</span></a><span style="font-weight:400;"> (mentioned earlier) is now Generally Available, which also means Windows Server 2019 is now entering “end of servicing” and will reach end of Support in January 2029.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Note to listeners: As a reminder, Windows 2016 is end of support in Jan 2027. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft’s goal is to deliver a secure and high-performance windows server platform tailored to meet the diverse needs of their customers. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This release is designed to let you deploy apps in any environment, whether its on-premises, hybrid or in the cloud. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Some of the key investments areas of investment are interesting in 2025</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Advanced Multi-layered Security</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AD – gets new security capabilities including improvements in protocols, encryption, hardening and new cryptographic support</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">File services/Message block (SMB) hardening.  2025 includes SMB over QUIC to enable secure access to file shares over the internet. SMB security also has hardened firewall defaults, brute force attack prevention and protections for man in the middle, relay and spoofing attacks. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Delegated Managed Service Accounts (dMSA): Unlike traditional service accounts, dMSAs don’t require manual password management since AD takes care of it. With dMSAs, specific permissions can be delegated to access resources in the domain, which reduces security risks and provides better visibility and logs of service account activity</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Cloud Agility anywhere</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Hotpatching enabled by Azure Arc-  Customers operating fully in the cloud have inherent modern security advantages like automatic software updates and back-up and recovery.  And their bringing some of those cloud t hings to Windows 2025 on premise with new hotpatching subscription service, enabled by Azure Arc.  With hotpatching, customers will experience fewer reboots and minimal disruption to operations. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Easy Azure Arc onboarding, enabling hybrid features and enhanced operational flexibility</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">SDN Multisite Feature – Software defined SDN multi-site feature offers native L2 and L3 connectivity for workload migrations across various locations, coupled with unified network policy management</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Unified policy management allowing for centralized management of network policies, making it easier to maintain consistent security and performance standards across your hybrid cloud environment</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">AI, performance and scale</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Hyper-V, AI, Machine Learning – with built in support for GPU partitioning and the ability to process large data sets across distributed environments, Windows Server 2025 offers high-performance platform for both traditional applications and advanced AI workloads with live migration and high availability</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">NVME storage performance – Windows Server 2025 delivers up to 60% more storage IOPS performance compared to windows server 2022 on identical systems. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Storage Spaces Direct and storage flexibility – Windows Server supports a wide range of storage solutions such as local, NAS, and SAN for decades and continues. But Windows Server 2025 delivers more storage innovation with Native REFS deduplication and compression, thinly provisioned storage spaces, and storage replica compression now available in all editions of Windows Server 2025</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Hyper V performance and Scale: Windows Server 2025 Hyper V can now support 240TB of memory per VM and 2048 VPs per VM. </span></li>
</ul>
</li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">53:51  Jonathan – “Wow, that’s lot of new stuff. guess I was thinking, well, who, you know, in the cloud, they typically don’t allow virtualization anyway. So who would need all these features? Well, they need it for themselves. They need it for them. They built this, this is Windows 2025 Azure release.”</span></i></p>
<p><b>1:02:35 </b><a href="https://azure.microsoft.com/en-us/blog/enhance-the-security-and-operational-capabilities-of-your-azure-kubernetes-service-with-advanced-container-networking-services-now-generally-available/" target="_blank" rel="noreferrer noopener"><b>Enhance the security and operational capabilities of your Azure </b></a><a href="https://azure.microsoft.com/en-us/blog/enhance-the-security-and-operational-capabilities-of-your-azure-kubernetes-service-with-advanced-container-networking-services-now-generally-available/" target="_blank" rel="noreferrer noopener"><b>Kubernetes Service with Advanced Container Networking Services, now </b></a><a href="https://azure.microsoft.com/en-us/blog/enhance-the-security-and-operational-capabilities-of-your-azure-kubernetes-service-with-advanced-container-networking-services-now-generally-available/" target="_blank" rel="noreferrer noopener"><b>generally </b></a><a href="https://azure.microsoft.com/en-us/blog/enhance-the-security-and-operational-capabilities-of-your-azure-kubernetes-service-with-advanced-container-networking-services-now-generally-available/" target="_blank" rel="noreferrer noopener"><b>available</b></a></p>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Azure is announcing the general availability of </span><a href="https://learn.microsoft.com/en-us/azure/aks/advanced-container-networking-services-overview" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Advanced Container Networking Services</span></a><span style="font-weight:400;"> for </span><a href="https://azure.microsoft.com/en-us/products/kubernetes-service/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Azure Kubernetes Service</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">ACNS focuses on delivering a seamless and integrated experience that allows you to maintain robust security postures and gain deep insights into your network traffic and application performance.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This ensures that your containerized applications are not only secure but also meet your performance and reliability goals allowing you to confidently manage and scale your infrastructure. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">ACNS observability features:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Node-level metrics</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Hubble Metrics, DNS and Pod level metrics</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Hubble flow logs</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Service Dependency Map</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">ACNS Container Network Security Features:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">FQDN filtering and security agent DNS proxy</span></li>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/en-us/azure/aks/azure-cni-powered-by-cilium" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Cilium Agent</span></a></li>
<li style="font-weight:400;"><span style="font-weight:400;">Security Agent DNS proxy</span></li>
</ul>
</li>
</ul>
</li>
<li style="font-weight:400;"><i><span style="font-weight:400;">At H&amp;M Group, platform engineering is a core practice, supported by our cloud-native internal developer platform, which enables autonomous product teams to build and host microservices. Deep network observability and robust security are key to our success, and the Advanced Container Networking Service features help us achieve this. Real-time flow logs accelerate our ability to troubleshoot connectivity issues, while FQDN filtering ensures secure communication with trusted external domains.” </span></i><b><i>— Magnus Welson, Engineering manager, container platform, H&amp;M Group</i></b></li>
</ul>
<p><b>1:05:04 </b><a href="https://azure.microsoft.com/en-us/blog/unlocking-the-future-azure-networking-updates-on-security-reliability-and-high-availability/" target="_blank" rel="noreferrer noopener"><b>Unlocking the future: Azure networking updates on security, reliability, </b></a><a href="https://azure.microsoft.com/en-us/blog/unlocking-the-future-azure-networking-updates-on-security-reliability-and-high-availability/" target="_blank" rel="noreferrer noopener"><b>and high availability</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Several new networking updates to help with security, reliability and high availability</span></li>
<li style="font-weight:400;"><a href="https://azure.microsoft.com/en-us/solutions/network-security/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Security enhancements</span></a>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Bastion Developer SKu GA</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Virtual network Encryption: FPGA powered encryption for VM to VM Communication</span></li>
<li style="font-weight:400;"><a href="https://learn.microsoft.com/azure/dns/dnssec" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">DNSSEC support</span></a><span style="font-weight:400;"> in preview</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Reliability</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">ExpressRoute Metro SKU</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Maximum Resiliency (4 independent ingress paths to Azure)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">New Guided configuration for multi-site express routes</span></li>
</ul>
</li>
<li style="font-weight:400;"><a href="https://azure.microsoft.com/en-us/products/load-balancer/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Load Balancer </span></a><span style="font-weight:400;">Improvements</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Admin Stage</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Cross Subscription Support</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Enhanced Health Status Monitoring with detailed reason codes</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Scaling and Management</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Increased IP address Support: up to 1 million routable IP addresses per Virtual network</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">IPAM in preview</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Virtual Network Verifier Static analysis of packet flow validation</span></li>
</ul>
</li>
</ul>
<p><b>1:07:00 </b><a href="https://azure.microsoft.com/en-us/blog/announcing-the-availability-of-azure-openai-data-zones-and-latest-updates-from-azure-ai/" target="_blank" rel="noreferrer noopener"><b>Announcing the availability of Azure OpenAI Data Zones and latest </b></a><a href="https://azure.microsoft.com/en-us/blog/announcing-the-availability-of-azure-openai-data-zones-and-latest-updates-from-azure-ai/" target="_blank" rel="noreferrer noopener"><b>updates from Azure AI</b></a><b>  </b></p>
<ul>
<li style="font-weight:400;"><a href="https://azure.microsoft.com/en-us/blog/enterprise-trust-in-azure-openai-service-strengthened-with-data-zones/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Open AI Datazones</span></a><span style="font-weight:400;"> for the US and EU gives you new deployment options that provide enterprises with more flexibility and control over data privacy and residency needs. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This ensures that your data is stored and processed within specific geographic boundaries, ensuring compliance within a regional data residency requirement while maintaining optimal performance. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Azure has also enabled </span><a href="https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/prompt-caching" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Prompt caching for o1-preview, o1-mini, GPT-4o and GPT-4o-mini</span></a><span style="font-weight:400;"> on Azure OpenAI service. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With prompt caching, they’re giving you a </span><a href="https://azure.microsoft.com/en-us/pricing/details/cognitive-services/openai-service/?msockid=247c4e088bea6619334d5ce08a5067f1#pricing" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">50% discount on cached input tokens</span></a><span style="font-weight:400;"> on standard </span><a href="https://azure.microsoft.com/en-us/products/ai-services/openai-service/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Azure OpenAI</span></a><span style="font-weight:400;"> on standard offering and faster processing times. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Provisioned global deployment offering: They are lowering the initial deployment quantity for PT-4o model to 15 provisioned throughput until with additional increments for 5PTUs.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They are also lowering the price for Provisioned global hourly by 50% to broaden access to OpenAI Services. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Several new models are available</span>
<ul>
<li style="font-weight:400;"><a href="https://techcommunity.microsoft.com/blog/aiplatformblog/announcing-healthcare-ai-models-in-azure-ai-model-catalog/4282460" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Healthcare industry models</span></a><span style="font-weight:400;"> include MedImageInsight, MedImageParse, CXRReportGen</span></li>
<li style="font-weight:400;"><a href="https://techcommunity.microsoft.com/blog/machinelearningblog/ministral-3b-small-model-from-mistral-is-now-available-in-the-azure-ai-model-cat/4277641" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Minstral 3B</span></a><span style="font-weight:400;"> from Mistral AI</span></li>
<li style="font-weight:400;"><a href="https://techcommunity.microsoft.com/t5/ai-machine-learning-blog/introducing-multimodal-embed-3-powering-enterprise-search-across/ba-p/4276660" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Cohere Embed 3</span></a></li>
<li style="font-weight:400;"><a href="https://aka.ms/phi3.5ft-techblog" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Fine tuning</span></a><span style="font-weight:400;"> is GA for </span><a href="https://azure.microsoft.com/en-us/products/phi/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Phi 3.5 family</span></a></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">1:07:52  Jonathan – “Prompt caching is probably a poor name for it actually, it really isn’t. Well, it’s kind of caching the… I guess it’s caching parts of Prompt. It’s caching… it’s like not reloading tokens into memory before inference. It’s like you can reuse the same or common parts.”</span></i></p>
<p><b>1:08:57 </b><a href="https://opensource.microsoft.com/blog/2024/11/07/introducing-hyperlight-virtual-machine-based-security-for-functions-at-scale/" target="_blank" rel="noreferrer noopener"><b>Introducing Hyperlight: Virtual machine-based security for functions at </b></a><a href="https://opensource.microsoft.com/blog/2024/11/07/introducing-hyperlight-virtual-machine-based-security-for-functions-at-scale/" target="_blank" rel="noreferrer noopener"><b>scale</b></a> <span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft Azure Core Upstream team is excited to announce the Hyperlight project, an open-source Rust library you can use to execute small, embedded functions using hypervisor-based protection for each function call at scale. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It can do this at a speed that enables each function request to have its own hypervisor for protection.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Hyperlight is a library to execute functions as fast as possible while isolating those functions within a VM.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Developers and software architects can use hyperlight to add serverless customizations to their applications that are able to securely run untrusted code.  Hyperlight enables these for IoT gateway function embedding, high throughput cloud services and so on. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;"> Hyperlight can create a new VM in 1-2 milliseconds. While this is still slower than using sandboxed runtimes like V8 or WasmTime directly, with Hyperlight you can take those same runtimes and place inside a VM to protect you in the event of a sandbox escape. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Hyperlight is so fast, that a one-two millisecond cold start for each VM is fast enough that it becomes practical to spin up VMs as needed in response to events. Also make it possible to scale to 0, meaning that you might not need to keep idle VM’s. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft will be submitting this CNCF. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It sounds like firecracker but is something slightly different based on comments on </span><a href="https://news.ycombinator.com/item?id=42078476" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Hacker News</span></a><span style="font-weight:400;">. </span></li>
</ul>
<p><i><span style="font-weight:400;">1:10:04  Jonathan – “I think it will complement Firecracker really nicely because it’s meant for function-based workloads, not VM-based workloads. so, a millisecond startup time, just… That’s almost… It’s close enough to zero to be zero compared with 125 milliseconds for a Firecracker cold start time. And to be fair, an eighth of a second to start up a VM is amazingly impressive, but…But one to two milliseconds to fire up a virtualized function that can run is just great. Wow.”</span></i></p>
<h2><span style="font-weight:400;">Closing</span></h2>
<p><span style="font-weight:400;">And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</span></p>
]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/1909571/c1e-v0z0c90r92t4jwrm-34gwwn8ga336-b507zm.mp3" length="88320438"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 283 of The Cloud Pod, where the forecast is always cloudy! Break out your crystal balls and shuffle those tarot decks, because it’s Re:Invent prediction time! Sorry we missed you all last week – the plague has been strong with us. But Justin and Jonathan are BACK, and we’ve got a ton of news, so buckle in and let’s get started! 
Titles we almost went with this week:

Not My Snowcones! 
Lambda at 10: Still Better Than Windows Containers 

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
General News  
01:27 The voice of America Online’s “You’ve got mail” has died at age 74

Elwoods Edwards, the voice behind the online service AOL’s iconic “You’ve got mail” sound notification has died at the age of 74. He was just one day shy of his 75th birthday. 
The “you’ve got mail” soundbite started in 1989 when Steve Case, CEO of Quantum Computer Services (which will later become America Online or AOL,) wanted to add a human voice to their Quantum online service.  
Karen Edwards, who worked as a customer service representative, heard Case discussing the plan and suggested her husband Elwood, a professional broadcaster. 
Edwards recorded the famous phrase and others (“Welcome” “File’s done” and “Goodbye” among them) on a cassette recorder in his living room. 
He was paid $200 for the service.  
His voice is still used to greet users of the current AOL service. 

AWS 
03:04 It’s Time for RE:Invent Predictions!
Matt

Large Green Computing Reinvent
LLM at the Edge
Something new On S3

Ryan (AI)

Improved serverless observability tools
Expansion of AI Driven workflows in datalakes
Greater Focus on Multi-Account or Multi-region orchestration, centralized compliance management, or enhanced security services

Jonathan

New Edge Computing Capabilities better global application deployment type features. (Cloudflare competitor maybe)
New automated cost optimization tools
]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/1909571/c1a-k5d5-25kwwnn9cm1-zrlxdm.jpg"></itunes:image>
                                                                            <itunes:duration>01:13:36</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                            </item>
                    <item>
                <title>
                    <![CDATA[282: Search – ChatGPT vs Google…. Fight!]]>
                </title>
                <pubDate>Thu, 14 Nov 2024 10:17:37 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/1889271</guid>
                                    <link>https://tcpfm.castos.com/episodes/282-search-chatgpt-vs-google-fight</link>
                                <description>
                                            <![CDATA[<p><span style="font-weight:400;">Welcome to episode 282 of The Cloud Pod, where the forecast is always cloudy! This week Justin, Ryan, and Matthew are happy to be joining you in the clouds versus watching election information. This week we’re talking nuclear energy, AI Search tools, and all things Pre:Invent. Welcome, and thanks for joining us! </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">️The Cloud Pod Would Much Rather Record This Show Than Watch the Election </span><span style="font-weight:400;">Results</span></li>
<li><span style="font-weight:400;">️IBM Comes for Your AI Dollars</span></li>
<li><span style="font-weight:400;">AWS Goes Limitless with the PostgreSQL Possibilities</span></li>
<li><span style="font-weight:400;">⌚It is Upon Us the Pre-Invent Period and AWS Does Not Disappoint</span></li>
<li><span style="font-weight:400;">⚛️Amazon Loses Its Nuclear Superhero</span></li>
</ul>
<h3><b>A big thanks to this week’s <a href="https://www.thecloudpod.net/sponsor/">sponsor</a>:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>Follow Up</b></h2>
<p><b>01:13 </b><a href="https://www.axios.com/2024/11/04/ferc-amazon-data-center-susquehanna-nuclear" target="_blank" rel="noreferrer noopener"><b>Energy regulators scrutinizing data center use reject Amazon bid</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Late Friday, the Federal Energy Regulatory Commission </span><a href="https://www.ferc.gov/media/er24-2172-pjms-susquehanna-co-location-proposal" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">rejected a proposal</span></a><span style="font-weight:400;"> that would have allowed an Amazon data center to </span><a href="https://www.aboutamazon.com/news/sustainability/amazon-nuclear-small-modular-reactor-net-carbon-zero" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">co-locate with an existing</span></a><span style="font-weight:400;"> nuclear power plant in Pennsylvania.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The commission voted it down 2-1 </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">FERC chairman Willie Phillips said that the commission should encourage the development of data centers and semiconductor manufacturing as national security and economic development priorities.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Commissioners Mark Christie and Lindsay See (both R) voted to reject the proposal, while Davis Rosner and Judy Change (D) didn’t vote. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Talen Energy, who signed the agreement, drew challenges from neighboring utilities AEP and Exelon – who challenged the novel arrangement, arguing it would unfairly shift costs of running the broader grid to other consumers. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">FERC’s order found the region’s grid operator, PJM Interconnection, failed to show why the proposal was necessary and prove such a deal would be limited to the Susquehanna plant given the widespread interest in placing data centers next to power plants. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Talen said the ruling would have a chilling effect on the region’s economic development and it is weighing its options. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Will see what happens with Microsoft/Constellation energies plan to </span><a href="https://www.constellationenergy.com/newsroom/2024/Constellation-to-Launch-Crane-Clean-Energy-Center-Restoring-Jobs-and-..."></a></li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 282 of The Cloud Pod, where the forecast is always cloudy! This week Justin, Ryan, and Matthew are happy to be joining you in the clouds versus watching election information. This week we’re talking nuclear energy, AI Search tools, and all things Pre:Invent. Welcome, and thanks for joining us! 
Titles we almost went with this week:

️The Cloud Pod Would Much Rather Record This Show Than Watch the Election Results
️IBM Comes for Your AI Dollars
AWS Goes Limitless with the PostgreSQL Possibilities
⌚It is Upon Us the Pre-Invent Period and AWS Does Not Disappoint
⚛️Amazon Loses Its Nuclear Superhero

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
Follow Up
01:13 Energy regulators scrutinizing data center use reject Amazon bid 

Late Friday, the Federal Energy Regulatory Commission rejected a proposal that would have allowed an Amazon data center to co-locate with an existing nuclear power plant in Pennsylvania.  
The commission voted it down 2-1 
FERC chairman Willie Phillips said that the commission should encourage the development of data centers and semiconductor manufacturing as national security and economic development priorities.  
Commissioners Mark Christie and Lindsay See (both R) voted to reject the proposal, while Davis Rosner and Judy Change (D) didn’t vote. 
Talen Energy, who signed the agreement, drew challenges from neighboring utilities AEP and Exelon – who challenged the novel arrangement, arguing it would unfairly shift costs of running the broader grid to other consumers. 
FERC’s order found the region’s grid operator, PJM Interconnection, failed to show why the proposal was necessary and prove such a deal would be limited to the Susquehanna plant given the widespread interest in placing data centers next to power plants. 
Talen said the ruling would have a chilling effect on the region’s economic development and it is weighing its options. 
Will see what happens with Microsoft/Constellation energies plan to ]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[282: Search – ChatGPT vs Google…. Fight!]]>
                </itunes:title>
                                    <itunes:episode>282</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p><span style="font-weight:400;">Welcome to episode 282 of The Cloud Pod, where the forecast is always cloudy! This week Justin, Ryan, and Matthew are happy to be joining you in the clouds versus watching election information. This week we’re talking nuclear energy, AI Search tools, and all things Pre:Invent. Welcome, and thanks for joining us! </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">️The Cloud Pod Would Much Rather Record This Show Than Watch the Election </span><span style="font-weight:400;">Results</span></li>
<li><span style="font-weight:400;">️IBM Comes for Your AI Dollars</span></li>
<li><span style="font-weight:400;">AWS Goes Limitless with the PostgreSQL Possibilities</span></li>
<li><span style="font-weight:400;">⌚It is Upon Us the Pre-Invent Period and AWS Does Not Disappoint</span></li>
<li><span style="font-weight:400;">⚛️Amazon Loses Its Nuclear Superhero</span></li>
</ul>
<h3><b>A big thanks to this week’s <a href="https://www.thecloudpod.net/sponsor/">sponsor</a>:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>Follow Up</b></h2>
<p><b>01:13 </b><a href="https://www.axios.com/2024/11/04/ferc-amazon-data-center-susquehanna-nuclear" target="_blank" rel="noreferrer noopener"><b>Energy regulators scrutinizing data center use reject Amazon bid</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Late Friday, the Federal Energy Regulatory Commission </span><a href="https://www.ferc.gov/media/er24-2172-pjms-susquehanna-co-location-proposal" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">rejected a proposal</span></a><span style="font-weight:400;"> that would have allowed an Amazon data center to </span><a href="https://www.aboutamazon.com/news/sustainability/amazon-nuclear-small-modular-reactor-net-carbon-zero" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">co-locate with an existing</span></a><span style="font-weight:400;"> nuclear power plant in Pennsylvania.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The commission voted it down 2-1 </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">FERC chairman Willie Phillips said that the commission should encourage the development of data centers and semiconductor manufacturing as national security and economic development priorities.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Commissioners Mark Christie and Lindsay See (both R) voted to reject the proposal, while Davis Rosner and Judy Change (D) didn’t vote. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Talen Energy, who signed the agreement, drew challenges from neighboring utilities AEP and Exelon – who challenged the novel arrangement, arguing it would unfairly shift costs of running the broader grid to other consumers. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">FERC’s order found the region’s grid operator, PJM Interconnection, failed to show why the proposal was necessary and prove such a deal would be limited to the Susquehanna plant given the widespread interest in placing data centers next to power plants. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Talen said the ruling would have a chilling effect on the region’s economic development and it is weighing its options. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Will see what happens with Microsoft/Constellation energies plan to </span><a href="https://www.constellationenergy.com/newsroom/2024/Constellation-to-Launch-Crane-Clean-Energy-Center-Restoring-Jobs-and-Carbon-Free-Power-to-The-Grid.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">restart</span></a><span style="font-weight:400;"> 3-Mile Island. </span></li>
</ul>
<p><i><span style="font-weight:400;">3:21  Justin – “It’s sort of sad because I kind like the idea of nuclear power to solve a bunch of problems, but it has to be done in the right way for sure.”</span></i></p>
<h2><b>General News </b><span style="font-weight:400;"> </span></h2>
<p><b>04:12  IT’S EARNINGS TIME!  </b></p>
<p><b>04:22 </b><a href="https://siliconangle.com/2024/10/23/ibm-revenue-misses-execs-say-ai-will-drive-future-growth/" target="_blank" rel="noreferrer noopener"><b>IBM revenue misses, but execs say AI will drive future growth</b></a><b> </b></p>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">This week, we have an additional company we don’t typically talk about… but IBM kicked off this quarter’s earning seasons which indicated that the AI dividend has yet to pay off for big infrastructure players. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Earnings were 2.30 a share, excluding non-recurring items, and were eight cents better than consensus estimates.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Although revenue rose 2% on a constant currency basis to $14.97 billion, they were slightly below the $15.08 billion consensus. </span></li>
</ul>
</li>
<li style="font-weight:400;"><i><span style="font-weight:400;">“We are very focused on ensuring we get an early lead position and establish IBM consulting as a strategic provider of choice for gen AI,” </span></i><span style="font-weight:400;">said Chief Financial Officer James Kavanaugh.</span><i><span style="font-weight:400;"> “This is a long-term growth factor with a multiplier effect across our  software, our platforms and our infrastructure.” </span></i><span style="font-weight:400;">About three-quarters of the gen AI business is consulting, and one-quarter is software.</span></li>
</ul>
<p><i><span style="font-weight:400;">05:32  Ryan – “…it seems like that’s a pretty good play to beef up, you know, your consultant side of the business to implement that. Because a lot of businesses are going to need to do that. And a lot of them don’t have the in house skills to do it.”</span></i></p>
<p><b>05:50 </b><a href="https://finance.yahoo.com/news/alphabet-stock-soars-as-earnings-crush-estimates-on-strong-cloud-growth-213627854.html" target="_blank" rel="noreferrer noopener"><b>Alphabet stock soars as earnings crush estimates on strong cloud growth</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Alphabet reported earnings per share of $2.12 on revenue of $88.27 billion for the quarter ended September 30th.  Representing a profit and sales increase from the same period last year of 37% and 15%, respectively. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Analysts had expected revenue of $1.83 per share and 86.44 billion. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Advertising revenue topped expectations at 65.85 billion vs expectations of 65.5. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Cloud revenue was $11.4 b up 35% from the same period last year exceeding expectations. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Sundar Pichai said “</span><i><span style="font-weight:400;">this business has real momentum, and the overall opportunity is increasing as customers embrace gen AI</span></i><span style="font-weight:400;">”</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google plans to spend $13 billion on capital expenditures.</span></li>
</ul>
<p><i><span style="font-weight:400;">07:09  Matthew – “I mean, I was talking recently with some people and they were saying how a lot more of the really small companies are leveraging Google just because their developer experience inside the platform is much better than the other ones. It’s interesting to kind of see if that’s it, but it’s a ton of small companies to keep up with.”</span></i></p>
<p><b>07:40 </b><a href="https://www.geekwire.com/2024/amazon-stock-jumps-6-as-q3-revenue-up-11-to-158-9b-profits-hit-15-3b-aws-sales-up-19/" target="_blank" rel="noreferrer noopener"><b>Amazon stock jumps 6%; Q3 revenue up 11% to $158.9B; profits hit $15.3B; </b></a><a href="https://www.geekwire.com/2024/amazon-stock-jumps-6-as-q3-revenue-up-11-to-158-9b-profits-hit-15-3b-aws-sales-up-19/" target="_blank" rel="noreferrer noopener"><b>AWS sales up 19%</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon topped estimates for the 3rd quarter, reporting $158.9 billion in revenue, up 11% YOY and earnings per share of $1.43.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Profits jumped to 15.3 billion, from 9.9 billion a year ago. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">AWS came in just below expectations at 27.4 billion in revenue, up 19%, with 10.4 billion in operating income. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Investors continue to keep a close eye on </span><a href="https://www.aboutamazon.com/news/innovation-at-amazon/amazon-generative-ai-seller-growth-shopping-experience" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AI</span></a><span style="font-weight:400;"> adoption on the cloud giant. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This is all interesting despite layoffs and unfavorable RTO policies; they are currently at 1.55 million employees, up 3% YOY. </span></li>
</ul>
<p><i><span style="font-weight:400;">09:17  Justin – “…it’s a crazy amount of people, by the way. I can’t even fathom having 1.5 million employees. Like, what do they all do?”</span></i></p>
<p><b>09:39 </b><a href="https://www.cnbc.com/2024/10/30/microsoft-msft-q1-earnings-report-2025.html" target="_blank" rel="noreferrer noopener"><b>Microsoft dips on weak guidance after beating on earnings</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft reported an earnings and revenue beat for the fiscal first quarter, but was bludgeoned for predicting slower growth than analysts expected. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Revenue was 65.59 or 3.30 per share vs the 64.51 or 3.10 per share expected. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">CEO Satya Nadella said he feels pretty good that going into the second half of this fiscal year that the supply-demand will match up. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Azure growth was 33%, with 12 points of that growth coming from AI services. </span></li>
</ul>
<p><i><span style="font-weight:400;">10:47  Justin – “No, they were, they were applauded for doing well and beating expectations, but they were beaten because they predicted slower growth for this quarter and the next quarter. So it was more, I don’t think they lowered their guidance, but I think they basically said to expect it to be on the lower side of the range that they gave, which made investors unhappy.”</span></i></p>
<h2><b>AI is Going Great – Or How ML Makes All Its Money</b><span style="font-weight:400;"> </span></h2>
<p><b>11:38 </b><a href="https://openai.com/index/introducing-chatgpt-search/" target="_blank" rel="noreferrer noopener"><b>Introducing ChatGPT search</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><a href="https://chatgpt.com/?hints=search" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">ChatGPT</span></a><span style="font-weight:400;"> has launched a </span><a href="https://chromewebstore.google.com/detail/chatgpt-search/ejcfepkfckglbgocfkanmcdngdijcgld" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Chrome extension</span></a><span style="font-weight:400;"> to take over the search experience from Google. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With </span><a href="https://openai.com/index/searchgpt-prototype/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">ChatGPT Search</span></a><span style="font-weight:400;">, you can search the web with fast, timely answers with links to relevant web sources, which you would have previously needed to go to a search engine for. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">ChatGPT will choose to search the web based on what you asked, or you can manually choose to search by clicking the web search icon. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Search will be available at </span><a href="http://chatgpt.com" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">chatgpt.com</span></a><span style="font-weight:400;"> and their desktop and mobile apps. </span></li>
<li style="font-weight:400;"><a href="https://openai.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Open AI</span></a><span style="font-weight:400;"> says that getting answers on the web can take a lot of effort, and sometimes requires multiple searches and digging through links to find quality sources and the right information.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Now with chat you can get a better answer. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">For real time sources, Chat GPT has partnered with news and data providers to get things like weather, stock, sports, news and maps. </span></li>
</ul>
<p><i><span style="font-weight:400;">12:58  Matthew – “I just like how their first real solution was, hey, let’s do a Chrome plugin, which is owned by Google. You’re just trying a weird next step.”</span></i></p>
<h2><b>AWS</b></h2>
<p><b>15:16 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/amazon-virtual-private-cloud-security-group-sharing/?ck_subscriber_id=512838477" target="_blank" rel="noreferrer noopener"><b>Amazon Virtual Private Cloud launches new security group sharing features</b></a><b>  </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS is making it easier to manage your security groups with a new security group sharing feature.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can associate a security group with multiple-VPCs in the same account using Security Group VPC associations. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">When using shared VPC, you can now also share security groups with participant accounts in that shared VPC using shared security groups.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This ensures security group consistency and simplifies configuration and maintenance for your admins. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Now make it possible to publish a managed security group for SaaS services customers may want to connect too…. </span></li>
</ul>
<p><i><span style="font-weight:400;">16:02  Matthew – “They had something that I definitely used in the past, which was a Lambda that watched the Amazon SNS topic for the public IP addresses. you could block it. In theory, you could do the same thing. Well, especially because you was over the default 50 group limit, 50 rule limit. So every time you wanted to use it, you always had to request the limit upgrade.”</span></i></p>
<p><b>18:08 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/lambda-application-building-vs-code-ide-aws-toolkit/" target="_blank" rel="noreferrer noopener"><b>AWS enhances the Lambda application building experience with VS Code </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/lambda-application-building-vs-code-ide-aws-toolkit/" target="_blank" rel="noreferrer noopener"><b>IDE and AWS Toolkit</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/pm/lambda/?trk=a6a233ef-5b73-4cf2-8e07-411e4cf5d402&amp;sc_channel=ps&amp;s_kwcid=AL!4422!10!71605931086581!71606456338081&amp;ef_id=ff3b18b8000f1d7a55ca7f8c37b8edbe:G:s&amp;msclkid=ff3b18b8000f1d7a55ca7f8c37b8edbe" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Lambda</span></a><span style="font-weight:400;"> is giving you a new experience to simplify the development of lambda based apps using VS Code IDE and the AWS Toolkit.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This experience streamlines the code-test-deploy-debug cycle, providing a guided walkthrough that assists developers from setting up their local development environment to run their first application on the cloud and adds enhanced user experience in each step in the cycle. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">When you install the AWS toolkit extension on VSCode, you will be greeted with a new app building experience. It will guide you through the necessary tooling installations and configurations required to set up your local environment for building Lambda-based apps. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In addition, you get a curated list of sample application walkthroughs, which guide them step-by-step through coding, testing and deploying their apps in the cloud. </span></li>
</ul>
<p><i><span style="font-weight:400;">16:02  Ryan – “My first thought when reading this is I’m curious on how this will like sort of fit in with my AWS SAM workflows, which does give you a CI-CD workflow because publishing directly with cloud formation. So it is sort of an interesting thing. I’m hoping that you could kind of seamlessly merge those experiences because it would be kind of nice if they made that easier.”</span></i></p>
<p><b>19:382 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/aws-lambda-fault-injection-service-actions/" target="_blank" rel="noreferrer noopener"><b>AWS Lambda now supports AWS Fault Injection Service (FIS) actions</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS Lambda now supports the </span><a href="https://aws.amazon.com/fis/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Fault Injection Service</span></a><span style="font-weight:400;"> (FIS) actions. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With FIS actions for AWS Lambda, developers and operators can now verify their application’s response to Lambda errors for all language runtimes without modifying the code. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Some of the tests can be to return custom HTTP status codes from the gateway or add one second of startup delay to 1% of invocations.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It’s nice to have some fault injection opportunities for your Lambda functions at once as well.</span></li>
</ul>
<p><b>12:32 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/aws-accepts-partial-card-payments/" target="_blank" rel="noreferrer noopener"><b>AWS now accepts partial card payments</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">In something that I feel took way too long, AWS customers can now pay with their cards to make partial payments toward their monthly bill. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Until now, customers could only pay their entire bill at once, prior to the due date.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With partial payments, customers can now split the amount due into smaller </span><a href="https://signin.aws.amazon.com/signin?redirect_uri=https%3A%2F%2Fus-east-1.console.aws.amazon.com%2Fbilling%2Fhome%3Fregion%3Dus-east-1%26state%3DhashArgs%2523%252Fpaymentsoverview%252Fpayments-due%26isauthcode%3Dtrue&amp;client_id=arn%3Aaws%3Aiam%3A%3A934814114565%3Auser%2Fportal-aws-auth&amp;forceMobileApp=0&amp;code_challenge=VWWwiqos9Jb6iKN4YRIuDU5HUxvu5yibJCcCBtn2zYk&amp;code_challenge_method=SHA-256" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">payments</span></a><span style="font-weight:400;"> which can be charged on different cards. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">To do this previously you would have had to call AWS </span><a href="https://support.console.aws.amazon.com/support/home" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">customer service</span></a><span style="font-weight:400;">, but now you can do it from your Console account. </span></li>
</ul>
<p><b>22:12 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/amazon-redshift-integration-bedrock-generative-ai/" target="_blank" rel="noreferrer noopener"><b>AWS announces Amazon Redshift integration with Amazon Bedrock for </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/amazon-redshift-integration-bedrock-generative-ai/" target="_blank" rel="noreferrer noopener"><b>generative AI</b></a></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/general-availability-auto-copy-amazon-redshift/" target="_blank" rel="noreferrer noopener"><b>Announcing general availability of auto-copy for Amazon Redshift</b></a></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/amazon-redshift-incremental-refresh-mvs-data-lake-tables/" target="_blank" rel="noreferrer noopener"><b>Amazon Redshift now supports incremental refresh on Materialized Views </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/amazon-redshift-incremental-refresh-mvs-data-lake-tables/" target="_blank" rel="noreferrer noopener"><b>(MVs) for data lake tables</b></a></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/amazon-redshift-serverless-ai-driven-scaling-optimization/" target="_blank" rel="noreferrer noopener"><b>Announcing Amazon Redshift Serverless with AI-driven scaling and </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/amazon-redshift-serverless-ai-driven-scaling-optimization/" target="_blank" rel="noreferrer noopener"><b>Optimization</b></a></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/csv-result-format-amazon-redshift-data-api/" target="_blank" rel="noreferrer noopener"><b>AWS announces CSV result format support for Amazon Redshift Data API</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Several new features for </span><a href="https://aws.amazon.com/redshift/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Redshift</span></a><span style="font-weight:400;"> this week including:</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Redshift Integration with </span><a href="https://aws.amazon.com/bedrock/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Bedrock</span></a><span style="font-weight:400;"> allowing you to leverage large language models from simple SQL commands alongside your Redshift data</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The next gen AI driven scaling and optimization in cloud data warehousing. </span><a href="https://docs.aws.amazon.com/redshift/latest/mgmt/serverless-whatis.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Redshift Serverless</span></a><span style="font-weight:400;"> uses AI techniques to automatically scale with workload changes across all key dimensions such as data volume changes, concurrent users and query complexity. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The redshift Data API now supports comma separated values (CSV) result format which provides flexibility in how you access and process data, allowing you to choose between JSON and CSV formats</span></li>
</ul>
<p><i><span style="font-weight:400;">22:51  Ryan – “I just keep thinking about the Redshift product team. Like, they must be just devastated because clearly these were made for mainstage announcements. It’s even got generative AI. They did all the things and they still didn’t make it.”</span></i></p>
<p><b>23:31 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/amazon-cloudwatch-ebs-volumes-exceeding-performance/" target="_blank" rel="noreferrer noopener"><b>Amazon CloudWatch now monitors EBS volumes exceeding provisioned </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/amazon-cloudwatch-ebs-volumes-exceeding-performance/" target="_blank" rel="noreferrer noopener"><b>Performance</b></a></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/amazon-cloudwatch-monitoring-io-latency-ebs-volumes/" target="_blank" rel="noreferrer noopener"><b>New Amazon CloudWatch metrics for monitoring I/O latency of Amazon </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/amazon-cloudwatch-monitoring-io-latency-ebs-volumes/" target="_blank" rel="noreferrer noopener"><b>EBS Volumes</b></a></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/amazon-elasticache-metrics-monitor-server-response-time/" target="_blank" rel="noreferrer noopener"><b>Amazon ElastiCache for Valkey adds new CloudWatch metrics to monitor </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/amazon-elasticache-metrics-monitor-server-response-time/" target="_blank" rel="noreferrer noopener"><b>server-side response time</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Cloudwatch will now </span><a href="https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/using_cloudwatch_ebs.html#ebs-volume-metrics" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">monitor EBS volumes</span></a><span style="font-weight:400;"> exceeding provisioned performance! About time!  This will allow you to quickly identify and respond to latency issues stemming from under provisioned EBS volumes that may impact the performance of your applications. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can now get two new </span><a href="https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/using_cloudwatch_ebs.html#ebs-volume-metrics" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Cloudwatch metrics</span></a><span style="font-weight:400;"> for your EBS volumes, including VolumeAvgReadLatency and VolumeAvgWriteLatency, to monitor the performance of your EBS volumes.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">And finally, Elasticache for Valeky node based clusters now support server side write request latency and read request latency metrics. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">None of these would have made the main stage, but they’re definitely quality of life improvements. So, thanks AWS. </span></li>
</ul>
<p><b>26:40 </b><a href="https://aws.amazon.com/blogs/aws/unlock-the-potential-of-your-supply-chain-data-and-gain-actionable-insights-with-aws-supply-chain-analytics/" target="_blank" rel="noreferrer noopener"><b>Unlock the potential of your supply chain data and gain actionable insights </b></a><a href="https://aws.amazon.com/blogs/aws/unlock-the-potential-of-your-supply-chain-data-and-gain-actionable-insights-with-aws-supply-chain-analytics/" target="_blank" rel="noreferrer noopener"><b>with AWS Supply Chain Analytics</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">In a sign that </span><a href="https://aws.amazon.com/aws-supply-chain/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS supply Chain</span></a><span style="font-weight:400;"> is not getting deprecated anytime soon, they are announcing the GA of AWS Supply Chain Analytics powered by </span><a href="https://quicksight.aws/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Quicksight</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The new feature helps you to build custom report dashboards using your data in AWS Supply Chain.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With this feature your business analysts or supply chain managers can perform custom analysis, visual data and gain actionable insights for your supply chain management operations. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Justin, being the executive among us, really appreciated the pretty graphs. </span></li>
</ul>
<p><b>27:38 </b><a href="https://aws.amazon.com/blogs/aws/amazon-aurora-postgresql-limitless-database-is-now-generally-available/" target="_blank" rel="noreferrer noopener"><b>Amazon Aurora PostgreSQL Limitless Database is now generally available</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon is announcing the GA of </span><a href="https://aws.amazon.com/aurora/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Aurora</span></a> <a href="https://aws.amazon.com/blogs/aws/join-the-preview-amazon-aurora-limitless-database/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">PostgreSQL Limitless Databases</span></a><span style="font-weight:400;">, a new serverless horizontal (sharding) capability for Aurora.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can scale beyond the existing Aurora Limits for write throughput and storage by distributing the database workload over multiple aurora writer instances while maintaining the ability to use it as a single database. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This was previewed last year at Re:invent 2023. </span></li>
</ul>
<p><i><span style="font-weight:400;">28:44  Justin – “</span></i><span style="font-weight:400;">So one of the things that will mess people up a little bit is that they, you know, way you size this as minimum and maximum capacity measured by Aurora capacity units, which, know, is magic numbers that they created that sort of represent CPUs and things. And so you can set up your minute, your, 16 ACUs as your minimum, and then you can go up to as many as 6,144 ACUs as the maximum, which, that seems like a lot of shards.”</span></p>
<p><b>29:48 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-ses-inline-template-support-send-email-apis/" target="_blank" rel="noreferrer noopener"><b>Amazon SES adds inline template support to send email APIs</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS continues to fix SES annoyances and eliminate platform toil. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">SES now allows customers to provide templates directly within the sendbulkemail or sendemail API request. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">SES will use the provided inline template content to render and assemble the email content for delivery, reducing the need to manage template resources in the SES account. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">We remember Justin asking for this 47 years ago, but it’s here, finally. So, yay?</span></li>
</ul>
<p><b>31:49 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/aws-udp-privatelink-dual-stack-network-load-balancers/" target="_blank" rel="noreferrer noopener"><b>AWS announces UDP support for AWS PrivateLink and dual-stack Network </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/aws-udp-privatelink-dual-stack-network-load-balancers/" target="_blank" rel="noreferrer noopener"><b>Load Balancers</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS is launching UDP protocol support on</span><a href="https://docs.aws.amazon.com/vpc/latest/privatelink/what-is-privatelink.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;"> AWS Privatelink</span></a><span style="font-weight:400;"> on IPv4 and IPv6 and on the </span><a href="https://docs.aws.amazon.com/elasticloadbalancing/latest/network/introduction.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Network Load Balancer</span></a><span style="font-weight:400;"> over Ipv6. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Previously, AWS Privatelink only supported TCP, while NLB supported UDP only over IPv4.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This enables customers who use AWS Privatelink and clients that use IPv6 to access UDP-based applications such as media-streaming, gaming, VOIP and other applications. </span></li>
</ul>
<p><b>33:09 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/aws-appsync-websocket-apis-web-mobile-experiences/" target="_blank" rel="noreferrer noopener"><b>AWS AppSync launches new serverless WebSocket APIs to power real-time </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/aws-appsync-websocket-apis-web-mobile-experiences/" target="_blank" rel="noreferrer noopener"><b>web and mobile experiences at any scale</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS is launching AWS AppSync Events, a new solution for building secure and performant serverless WebSocket APIs to power real-time web and mobile experiences at any scale. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">AWS AppSync Events let you easily broadcast real-time event data to a few or millions of subscribers using secure and performant serverless WebSocket APIs, without needing to manage connections or resource scaling. </span></li>
</ul>
<p><i><span style="font-weight:400;">33:19  Justin – “If I knew what AppSync did and I knew what my use case would be for this, I’d probably be really excited about it, but I don’t really know either, so that’s all I’m gonna say about it.”</span></i></p>
<p><b>35:13 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/amazon-route-53-https-sshfp-svcb-tlsa-dns-support/" target="_blank" rel="noreferrer noopener"><b>Amazon Route 53 announces HTTPS, SSHFP, SVCB, and TLSA DNS </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/amazon-route-53-https-sshfp-svcb-tlsa-dns-support/" target="_blank" rel="noreferrer noopener"><b>resource record support</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Route53 now supports HTTPS and Service Binding (SVCB) record types, which provide clients with improved performance and privacy.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Instead of only providing the IP addresses of endpoints in response to a DNS query, HTTPS and SVCB records respond with additional information needed to set up connections such as whether your endpoint supports HTTP/3, thereby letting supporting clients connect faster and more securely. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In addition you can create TLS Authentication (TLSA) records with route 53. TLSA records may be used to associate TLS server certificates or public keys with your domain name, leveraging DNS Security Extensions (DNSSEC).  This provides you with a prerequisite component of DNS-based authentication of named Entities (DANE), a protocol frequently used in conjunction with the SMTP to assure secure and confidential mail transport.       </span></li>
</ul>
<p><i><span style="font-weight:400;">36:12 Ryan – “Well, if all problems are DNS, you should just add more complexity, right?”</span></i></p>
<p><b>40:00 </b><a href="https://aws.amazon.com/blogs/enterprise-strategy/how-executives-can-avoid-being-disrupted-by-emerging-technologies/" target="_blank" rel="noreferrer noopener"><b>How Executives Can Avoid Being Disrupted by Emerging Technologies</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Ironic from the cloud company being disrupted by AI.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon says innovation happens 50x faster than five years ago… and to be good at staying ahead you need to:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Anticipate technology trends and Be a bit of a technology fortune teller</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They give you 5 ways to do this:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Engage in Technology monitoring and scouting</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Create a culture of curiosity and experimentation</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Use technology road mapping and scenario planning</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Form external partnerships</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">We are really looking forward to The Cloud Pod Center of Engagement. Details to follow. It will most likely take place at Disneyland; make those park reservations now. </span></li>
</ul>
</li>
</ul>
<h2><b>GCP</b></h2>
<p><b>42:03 </b><a href="https://cloud.google.com/blog/products/identity-security/mandatory-mfa-is-coming-to-google-cloud-heres-what-you-need-to-know/" target="_blank" rel="noreferrer noopener"><b>Mandatory MFA is coming to Google Cloud. Here’s what you need to know</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Like what Microsoft recently enacted, GCP plans to implement mandatory MFA for Google Cloud in a phased approach that will roll out to all users worldwide during 2025.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">To ensure a smooth transition, Google Cloud will provide advanced notifications to enterprises and users to help plan MFA deployments. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Phase 1: Starting in November 2024: Encourage MFA Adoption</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Phase 2: Early 2025: MFA required for password logins </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Phase 3: End of 2025: MFA for federated users</span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">42:26  Ryan – “I am a little nervous about that phase three just because there’s always differences when you do MFA through Federation as I’ve learned through AWS integrations. And so it’s like, I hope that goes smoothly.”</span></i></p>
<p><b>43:59 </b><a href="https://cloud.google.com/blog/products/compute/trillium-sixth-generation-tpu-is-in-preview/" target="_blank" rel="noreferrer noopener"><b>Powerful infrastructure innovations for your AI-first future</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is dumping money into AI Hardware at an impressive pace and so we get to geek out with some infrastructure! Woohoo! </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They are announcing Trillium, their 6th generation TPU, is now available to Google Cloud customers in preview</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Over 4x improvement in training performance</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Up to 3x increase in inference throughput</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">A 67% increase in energy efficiency</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">An impress 4.7x increase in peak compute performance per chip</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Double the high bandwidth memory</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Double the interchip interconnect</span></li>
</ul>
</li>
<li style="font-weight:400;"><a href="https://cloud.google.com/blog/products/compute/introducing-a3-supercomputers-with-nvidia-h100-gpus" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">New A3 and A3 Mega VMs</span></a><span style="font-weight:400;"> powered by the NVIDIA H100 Tensor Core GPU</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">2x the GPU to GPU bandwidth, Up to 2x higher LLM inference performance and ability to scale tens of thousands of GPUs in a dense, performance-optimized cluster for large AI and HPC workloads. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Support for the upcoming NVIDIA GB200 NVL72 GPUs, with more details coming soon.</span></li>
<li style="font-weight:400;"><a href="https://cloud.google.com/blog/products/compute/titanium-underpins-googles-workload-optimized-infrastructure" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Titanium</span></a><span style="font-weight:400;">, their system used to offload technologies that underpin their infrastructure, has been enhanced to support AI workloads.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;"> Titanium reduces processing overhead on the host through a combination of on-host and off-host offloads, to deliver more compute and memory resources for your workloads. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">And while AI infrastructure can benefit from all of Titanium’s core capabilities, AI workloads are unique in the accelerator-to-accelerator performance requirements. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">To meet these needs, they have introduced a new titanium ML network adapter that includes and builds on NVIDIA ConnectX-7 NICs to further support VPCs, traffic encryption and virtualization. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Hyperdisk ML </span><a href="https://cloud.google.com/blog/products/compute/whats-new-with-google-clouds-ai-hypercomputer-architecture" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">is now generally available</span></a><span style="font-weight:400;">.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Hyperdisk ML is their AI-focused block storage service that we announced in April 2024.  Now generally available, it complements the computing and networking innovations discussed in this blog with purpose-built storage for AI and HPC workloads. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Hyperdisk ML accelerates data load times effectively</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can attach 2500 instances to the same volume, and get 1.2tb/s of aggregate throughput per volume, which is more than 100x higher than offerings from major block storage competitors. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Shorter data load times translate to less accelerator idle time and greater cost efficiency</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">GKE now automatically creates multi-zone volumes for your data</span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">46:01  Justin – “ …we want you to know it’s coming because other our competitors are going to be offering these, but we also are going to offer them. So we want you to know that, but we don’t know what they’re going to cost or anything about them because Nvidia hasn’t given us any details, but we want to announce first.”</span></i></p>
<p><b>48:45 </b><a href="https://cloud.google.com/blog/products/compute/try-c4a-the-first-google-axion-processor/" target="_blank" rel="noreferrer noopener"><b>C4A VMs now GA: Our first custom Arm-based Axion CPU</b></a><b> </b></p>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">At Next 24, </span><a href="https://cloud.google.com/blog/products/compute/introducing-googles-new-arm-based-cpu" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Google announced the Axion processors</span></a><span style="font-weight:400;">, their first custom ARM based CPUs designed for the data center.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Now they are Generally Available, the first Axion based VM Series, the C4A, with up to 10% better price-performance than the latest generation Arm-based instances available from leading cloud providers. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">C4a Vms are a great option for a variety of general-purpose workloads like web and app servers, containerized microservices, open source databases, in-memory caches, data analytics engines, media processing and AI inference applications. </span></li>
</ul>
</li>
</ul>
<ul>
<li style="font-weight:400;"><i><span style="font-weight:400;">“Spanner is one of the most critical and complex services at Google, powering products including YouTube, Gmail, and Google Ads. In our initial tests on Axion processors, we’ve observed up to 60% better query performance per vCPU over prior generation servers. As we scale out our footprint, we expect this to translate to a more stable and responsive experience for our users, even under the most demanding conditions.” – </span></i><b>Andi Gutmans, VP/GM Databases, Google</b></li>
</ul>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">C4A broadens their general-purpose VM portfolio, and is offered in a range of configurations:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Standard 1:4 vcpu to memory</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">High Memory: 1:8 vcpu to memory</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">High CPU 1:2 vcpu to memory</span></li>
</ul>
</li>
</ul>
</li>
<li style="font-weight:400;"><i><span style="font-weight:400;">“Honeycomb.io helps engineering teams debug their production systems quickly and efficiently. Sampling is a key mechanism for controlling observability costs. For our customers who are running applications on Google Cloud, we have validated that the new Axion CPUs and C4A VMs offer the best price-performance on Google Cloud for running our Refinery sampling proxy to forward only the most important, representative samples to Honeycomb.” </span></i><b><i>– </i></b><b>Liz Fong-Jones, Field CTO, Honeycomb</b></li>
</ul>
<p><i><span style="font-weight:400;">50:10  Justin – “Yeah, that was a weird quote. For our customers that run on a different cloud than us, this works great. OK.”</span></i></p>
<p><b>51:13 </b><a href="https://cloud.google.com/blog/products/networking/cross-cloud-network-enhancements-for-distributed-workloads/" target="_blank" rel="noreferrer noopener"><b>Introducing an industry first: application awareness on Cloud Interconnect</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google introduced </span><a href="https://cloud.google.com/solutions/cross-cloud-network?hl=en" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Cross-Cloud Network</span></a><span style="font-weight:400;"> to transform and simplify hybrid and multi-cloud connectivity, and enable organizations to easily build distributed apps.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">As organizations modernize their infrastructure, leveraging AI/ML and other managed services, they have adopted Cross-Cloud Network to reduce operational complexity and lower the TCO.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The point of Cloud Interconnect was to provide robust, high bandwidth, </span><a href="https://cloud.google.com/network-connectivity/docs/interconnect/sla" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">SLA</span></a><span style="font-weight:400;"> backed connectivity to google cloud. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With </span><a href="https://cloud.google.com/network-connectivity/docs/interconnect/concepts/cci-overview" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Cross-Cloud interconnect</span></a><span style="font-weight:400;"> they enable dedicated and private connectivity from Google to another cloud provider. Together, they form the foundation for building hybrid and multi cloud distributed apps. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Customers have traditionally lacked the capability to prioritize traffic, forcing them to overprovision bandwidth or risk subpar performance during periods of congestion. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">TO address this need for traffic prioritization, google is introducing application awareness on Cloud Interconnect in preview.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google Cloud is the first major cloud service provider to offer a managed traffic differentiation solution that empowers you to solve the critical challenge of traffic prioritization over </span><a href="https://cloud.google.com/network-connectivity/docs/interconnect/concepts/overview" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Cloud Interconnect</span></a><span style="font-weight:400;">.  </span></li>
<li style="font-weight:400;"><a href="https://cloud.google.com/network-connectivity/docs/interconnect/how-to/cci/configure-traffic-differentiation" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Application awareness</span></a><span style="font-weight:400;"> enables flexibility with a choice of two policies: Strict priority across traffic classes and bandwidth shared per traffic class. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Application awareness on Cloud Interconnect provides multiple business benefits, including:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Prioritization of business critical traffic</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Lower total cost of ownership (TCO)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Fully managed, SLA backed solution</span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">52:29  Matthew – “I just wonder how much, how many people actually need this. Like for QOS, like I feel like I’ve really set it up on VoIP and like backups, offsite backups back in the day. Like that was about it…it just feels like the wrong way to manage it.”</span></i></p>
<p><b>55:01 </b><a href="https://cloud.google.com/blog/products/networking/speed-scale-reliability-25-years-of-data-center-networking/" target="_blank" rel="noreferrer noopener"><b>Speed, scale and reliability: 25 years of Google data-center networking </b></a><a href="https://cloud.google.com/blog/products/networking/speed-scale-reliability-25-years-of-data-center-networking/" target="_blank" rel="noreferrer noopener"><b>evolution</b></a><span style="font-weight:400;">    </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">We have talked often on this show how important it is to know the principles behind how the hyperscale of your choice is defined. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In the case of AWS, they have a strong regional/availability zone isolation model.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">For GCP, we have talked about their common storage layer and what it enables.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This blog post gives you key insights into the design thinking of the 25 year design of Google’s Network. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">As Google says, Rome wasn’t built in a day, and neither was Google’s network.  But 25 years in, they share some of the details of how they started out small and now run the 5th generation </span><a href="https://cloud.google.com/blog/topics/systems/the-evolution-of-googles-jupiter-data-center-network?e=48754805" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Jupiter Datacenter network</span></a><span style="font-weight:400;"> which now scales to 13 Petabits/Sec of bisection bandwidth. For perspective, this network could support a video call @1.5MB/s for all 8 billion people on Earth. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The network evolution has been guided by a few key principles:</span>
<ul>
<li style="font-weight:400;"><b>Anything, anywhere:</b><span style="font-weight:400;"> Our data center networks support efficiency and simplicity by allowing large-scale jobs to be placed anywhere among 100k+ servers within the same network fabric, with high-speed access to needed storage and support services. This scale improves application performance for internal and external workloads and eliminates internal fragmentation. </span></li>
<li style="font-weight:400;"><b>Predictable, low latency: </b><span style="font-weight:400;">We prioritize consistent performance and minimizing tail latency by provisioning bandwidth headroom, maintaining 99.999% network availability, and proactively managing congestion through end-host and fabric cooperation.</span></li>
<li style="font-weight:400;"><b>Software-defined and systems-centric:</b><span style="font-weight:400;"> Leveraging software-defined networking (SDN) for flexibility and agility, we qualify and globally release dozens of new features every two weeks across our global network.</span></li>
<li style="font-weight:400;"><b>Incremental evolution and dynamic topology: </b><span style="font-weight:400;">Incremental evolution helps us to refresh the network granularly (rather than bringing it down wholesale), while dynamic topology helps us to continuously adapt to changing workload demands. The combination of optical circuit switching and SDN supports in-place physical upgrades and an ever-evolving, heterogeneous network that supports multiple hardware generations in a single fabric.</span></li>
<li style="font-weight:400;"><b>Traffic engineering and application-centric QoS:</b><span style="font-weight:400;"> Optimizing traffic flows and ensuring Quality of Service helps us tailor the network to each application’s needs.</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">These principles lead to 2015 and Jupiter, the first petabit network with 1.3 Pb/S of aggregate bandwidth by leveraging merchan switch silicon, Clos Topologies and SDN. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">2022 they enabled 6 Pb/S with deep integration of optical circuit switching (OCS), wave division Multiplexing, and highly scalable </span><a href="https://www.usenix.org/conference/nsdi21/presentation/ferguson" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Orion</span></a><span style="font-weight:400;"> SDN controller.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">2023 13 Petabit per second network by enhanced jumper support to native 400G/s link speeds in the network core. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The fundamental building block of Jupiter networks now consists of 512 ports of 400GB/s of connectivity both to end hosts and to the rest of the data center, for an aggregate of 204.8 TB/s of bidirectional non-blocking bandwidth per block. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">2024 and Beyond. They are charting for the future with the next gen of network infrastructure, for example they are busy working on networking infrastructure needs for the </span><a href="https://cloud.google.com/blog/products/compute/trillium-sixth-generation-tpu-is-in-preview" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">A3 Ultra VMs</span></a><span style="font-weight:400;">, featuring </span><a href="https://www.nvidia.com/en-us/data-center/gb200-nvl72/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">NVIDIA ConnectX-7</span></a><span style="font-weight:400;"> networking, supports non-blocking 3.2 Tbps per server of GPU to GPU traffic over RDMA over converged ethernet. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They will deliver significant advances in network scale and bandwidth, both per port and network wide. </span></li>
</ul>
<h2><b>Azure</b></h2>
<p><b>1:00:03 </b><a href="https://devblogs.microsoft.com/devops/no-new-azure-devops-oauth-apps-beginning-february-2025/" target="_blank" rel="noreferrer noopener"><b>No new Azure DevOps OAuth apps beginning February 2025</b></a><b>  </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Starting Feb 3 2025, Microsoft will no longer accept new registrations of Azure Devops Oauth Apps. This is their first step in sunsetting the Azure Devops Oauth Platform.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Going forward they are advocating for you to build apps on top of the Azure Devops REST API to explore the Microsoft Identity platform and registering a new Entra application instead. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">All existing oauth apps will work until the official end of life in 2026.</span></li>
</ul>
<p><i><span style="font-weight:400;">1:00:16  Justin – “So run and provision those as quickly as possible so you have them if you’re working in middle of a project before they go away and you have to redo all your work.</span></i></p>
<p><b>1:01:29 </b><a href="https://blogs.microsoft.com/blog/2024/10/31/microsoft-names-jay-parikh-as-a-member-of-the-senior-leadership-team/" target="_blank" rel="noreferrer noopener"><b>Microsoft names Jay Parikh as a member of the senior leadership team</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Satya Nadella is welcoming Jay Parikh to Microsoft as a member of the Senior leadership team (SLT), reporting to Satya. Jay was the global head of engineering at Facebook (now Meta) and most recently was the CEO of Lacework. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">His focus will extend beyond technology, which his passion for and dedication to developing people will foster a strong culture and build world-class talent. Jay will be immersed in learning about Microsoft’s priorities and culture, spending time with senior leaders and meeting with customers, partners and employees around the world. They will share more on his role and focus in a few months…..</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Have to wonder what the long term viability of Charlie Bell is. </span></li>
</ul>
<p><i><span style="font-weight:400;">24:29  Justin – “…all I can think of is Azure has been beaten up pretty bad on security. Charlie Bell’s been there about two years, hasn’t seemed to move the needle and I don’t know, but if I was a betting man, I’d say the former CEO of a security startup is probably going to maybe be in charge of something security wise.”</span></i></p>
<h2><span style="font-weight:400;">Closing</span></h2>
<p><span style="font-weight:400;">And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</span></p>
]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/1889271/c1e-p8j8u5jz7zivrm7j-7zkv9qjduk7-5gssbk.mp3" length="76995312"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 282 of The Cloud Pod, where the forecast is always cloudy! This week Justin, Ryan, and Matthew are happy to be joining you in the clouds versus watching election information. This week we’re talking nuclear energy, AI Search tools, and all things Pre:Invent. Welcome, and thanks for joining us! 
Titles we almost went with this week:

️The Cloud Pod Would Much Rather Record This Show Than Watch the Election Results
️IBM Comes for Your AI Dollars
AWS Goes Limitless with the PostgreSQL Possibilities
⌚It is Upon Us the Pre-Invent Period and AWS Does Not Disappoint
⚛️Amazon Loses Its Nuclear Superhero

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
Follow Up
01:13 Energy regulators scrutinizing data center use reject Amazon bid 

Late Friday, the Federal Energy Regulatory Commission rejected a proposal that would have allowed an Amazon data center to co-locate with an existing nuclear power plant in Pennsylvania.  
The commission voted it down 2-1 
FERC chairman Willie Phillips said that the commission should encourage the development of data centers and semiconductor manufacturing as national security and economic development priorities.  
Commissioners Mark Christie and Lindsay See (both R) voted to reject the proposal, while Davis Rosner and Judy Change (D) didn’t vote. 
Talen Energy, who signed the agreement, drew challenges from neighboring utilities AEP and Exelon – who challenged the novel arrangement, arguing it would unfairly shift costs of running the broader grid to other consumers. 
FERC’s order found the region’s grid operator, PJM Interconnection, failed to show why the proposal was necessary and prove such a deal would be limited to the Susquehanna plant given the widespread interest in placing data centers next to power plants. 
Talen said the ruling would have a chilling effect on the region’s economic development and it is weighing its options. 
Will see what happens with Microsoft/Constellation energies plan to ]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/1889271/c1a-k5d5-pkjmv98xtq3r-cx5fps.jpg"></itunes:image>
                                                                            <itunes:duration>01:04:10</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                            </item>
                    <item>
                <title>
                    <![CDATA[281: Happy Birthday, ECS. You’re still so much better than K8 at 10!]]>
                </title>
                <pubDate>Thu, 07 Nov 2024 14:23:09 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/1876998</guid>
                                    <link>https://tcpfm.castos.com/episodes/281-happy-birthday-ecs</link>
                                <description>
                                            <![CDATA[<p><span style="font-weight:400;">Welcome to episode 281 of The Cloud Pod, where the forecast is always cloudy! Justin and Ryan are your hosts as we search the clouds for all the latest news and info. This week we’re talking about ECS turning 10 (yes, we were there when it was announced, and yes, we’re old,) some more drama from the CrowdStrike fiasco, lots of updates to GitHub, plus more. Join us!  </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">Github Universe full of ECS containers</span></li>
<li><span style="font-weight:400;">️Github Universe lives up to the Universal expectations </span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>Follow Up</b></h2>
<p><b>01:09</b> <b>Dr. Matt Woods ended up at PWC as chief innovation officer</b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">YAWN</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">What exactly does a chief innovation officer at PWC do? Is this like a semi-retirement? </span></li>
</ul>
<h2><b>General News </b></h2>
<p><b>01:44 </b><a href="https://arstechnica.com/tech-policy/2024/10/crowdstrike-accuses-delta-of-blaming-its-own-it-failures-on-global-outage/" target="_blank" rel="noreferrer noopener"><b>TSA silent on CrowdStrike’s claim Delta skipped required security update</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.delta.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Delta</span></a><span style="font-weight:400;"> isn’t backing down with </span><a href="https://www.crowdstrike.com/en-us/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">CrowdStrike</span></a><span style="font-weight:400;">, and in a </span><a href="https://cdn.arstechnica.net/wp-content/uploads/2024/10/Delta-v-CrowdStrike-Complaint-10-25-24.pdf" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">court filing</span></a><span style="font-weight:400;"> said CrowdStrike should be on the hook for the entire $500M in losses, partly because CrowdStrike has admitted that it should have done more testing and staggered deployments to catch bugs. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Delta further alleges that CrowdStrike postured as a certified best-in-class security provider who “never cuts corners,” while secretly designing its software to bypass Microsoft security certifications to make changes at the core of Delta’s computer systems without Delta’s knowledge. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Delta says they would never have agreed to such a dangerous process if it had been disclosed. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In its testimony to Congress, CrowdStrike said that they follow standard protocols, and that they are protecting against threats as they evolve.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">CrowdStrike is also accusing Delta of failing to follow laws, including best practices established by the TSA.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">According to CrowdStrike, most customers were up within a day of the issue – while Delta took 5 days. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Crowdstrike alleges that Delta’s negligence caused this in following the TSA requirements designed to ensure that no major airline ever experiences prolonged system outages. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">CrowdStrike realized Delta failed to follow the requirements when its efforts to help...</span></li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 281 of The Cloud Pod, where the forecast is always cloudy! Justin and Ryan are your hosts as we search the clouds for all the latest news and info. This week we’re talking about ECS turning 10 (yes, we were there when it was announced, and yes, we’re old,) some more drama from the CrowdStrike fiasco, lots of updates to GitHub, plus more. Join us!  
Titles we almost went with this week:

Github Universe full of ECS containers
️Github Universe lives up to the Universal expectations 

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
Follow Up
01:09 Dr. Matt Woods ended up at PWC as chief innovation officer

YAWN
What exactly does a chief innovation officer at PWC do? Is this like a semi-retirement? 

General News 
01:44 TSA silent on CrowdStrike’s claim Delta skipped required security update

Delta isn’t backing down with CrowdStrike, and in a court filing said CrowdStrike should be on the hook for the entire $500M in losses, partly because CrowdStrike has admitted that it should have done more testing and staggered deployments to catch bugs. 
Delta further alleges that CrowdStrike postured as a certified best-in-class security provider who “never cuts corners,” while secretly designing its software to bypass Microsoft security certifications to make changes at the core of Delta’s computer systems without Delta’s knowledge. 
Delta says they would never have agreed to such a dangerous process if it had been disclosed. 
In its testimony to Congress, CrowdStrike said that they follow standard protocols, and that they are protecting against threats as they evolve.
CrowdStrike is also accusing Delta of failing to follow laws, including best practices established by the TSA.
According to CrowdStrike, most customers were up within a day of the issue – while Delta took 5 days. 
Crowdstrike alleges that Delta’s negligence caused this in following the TSA requirements designed to ensure that no major airline ever experiences prolonged system outages. 
CrowdStrike realized Delta failed to follow the requirements when its efforts to help...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[281: Happy Birthday, ECS. You’re still so much better than K8 at 10!]]>
                </itunes:title>
                                    <itunes:episode>281</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p><span style="font-weight:400;">Welcome to episode 281 of The Cloud Pod, where the forecast is always cloudy! Justin and Ryan are your hosts as we search the clouds for all the latest news and info. This week we’re talking about ECS turning 10 (yes, we were there when it was announced, and yes, we’re old,) some more drama from the CrowdStrike fiasco, lots of updates to GitHub, plus more. Join us!  </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">Github Universe full of ECS containers</span></li>
<li><span style="font-weight:400;">️Github Universe lives up to the Universal expectations </span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><b>Follow Up</b></h2>
<p><b>01:09</b> <b>Dr. Matt Woods ended up at PWC as chief innovation officer</b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">YAWN</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">What exactly does a chief innovation officer at PWC do? Is this like a semi-retirement? </span></li>
</ul>
<h2><b>General News </b></h2>
<p><b>01:44 </b><a href="https://arstechnica.com/tech-policy/2024/10/crowdstrike-accuses-delta-of-blaming-its-own-it-failures-on-global-outage/" target="_blank" rel="noreferrer noopener"><b>TSA silent on CrowdStrike’s claim Delta skipped required security update</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.delta.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Delta</span></a><span style="font-weight:400;"> isn’t backing down with </span><a href="https://www.crowdstrike.com/en-us/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">CrowdStrike</span></a><span style="font-weight:400;">, and in a </span><a href="https://cdn.arstechnica.net/wp-content/uploads/2024/10/Delta-v-CrowdStrike-Complaint-10-25-24.pdf" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">court filing</span></a><span style="font-weight:400;"> said CrowdStrike should be on the hook for the entire $500M in losses, partly because CrowdStrike has admitted that it should have done more testing and staggered deployments to catch bugs. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Delta further alleges that CrowdStrike postured as a certified best-in-class security provider who “never cuts corners,” while secretly designing its software to bypass Microsoft security certifications to make changes at the core of Delta’s computer systems without Delta’s knowledge. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Delta says they would never have agreed to such a dangerous process if it had been disclosed. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In its testimony to Congress, CrowdStrike said that they follow standard protocols, and that they are protecting against threats as they evolve.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">CrowdStrike is also accusing Delta of failing to follow laws, including best practices established by the TSA.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">According to CrowdStrike, most customers were up within a day of the issue – while Delta took 5 days. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Crowdstrike alleges that Delta’s negligence caused this in following the TSA requirements designed to ensure that no major airline ever experiences prolonged system outages. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">CrowdStrike realized Delta failed to follow the requirements when its efforts to help remediate the issue revealed alleged technological shortcomings and failures to follow security best practices, including outdated IT systems, issues in Delta’s AD environment and thousands of compromised passwords.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Delta threatened to sue Microsoft as well as CrowdStrike, but has only named CrowdStrike to date in the lawsuits. </span></li>
</ul>
<p><i><span style="font-weight:400;">3:48  Ryan – “It’s a tool that needs to evolve very quickly to emerging threats. And while the change that was pushed through shouldn’t have gone through that particular workflow, and that’s a mistake, I do think that that should exist as part of it. Yes, could they have done better with documentation and all that? Of course.”</span></i></p>
<p><b>04:51 </b><a href="https://cloud.google.com/blog/products/infrastructure-modernization/google-is-a-leader-in-gartner-magic-quadrant-for-strategic-cloud-platform-services/" target="_blank" rel="noreferrer noopener"><b>Google is a Leader in Gartner Magic Quadrant for Strategic Cloud Platform </b></a><a href="https://cloud.google.com/blog/products/infrastructure-modernization/google-is-a-leader-in-gartner-magic-quadrant-for-strategic-cloud-platform-services/" target="_blank" rel="noreferrer noopener"><b>Services</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">It’s Magic Quadrant time! But let’s be real – when ISN’T it MQ time. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The Magic Quadrant is out for Cloud Platforms… and AWS is still top dog. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">BUT Microsoft and </span><a href="http://cloud.google.com/resources/gartner-strategic-cloud-platform-services" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Google</span></a><span style="font-weight:400;"> have moved further to the right than AWS – which is for completeness of vision.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Oracle also made the leaders quadrant. </span></li>
</ul>
</li>
</ul>
<ul>
<li><a href="https://aws.amazon.com/blogs/aws/read-the-2023-gartner-magic-quadrant-for-strategic-cloud-platform-services/" target="_blank" rel="noreferrer noopener"><b>AWS</b></a></li>
</ul>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Strengths</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Operational excellence</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Solutions support</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Robust Developer experience</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Cautions</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Complex and inconsistent service interfaces</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Limited traction for proprietary AI models</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Fewer Sovereign cloud options</span></li>
</ul>
</li>
</ul>
</li>
</ul>
</li>
</ul>
<ul>
<li><b>Google</b></li>
</ul>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Strengths</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AI Infused IT Modernization</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Environmental Sustainability</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Digital Sovereignty</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Cautions</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Incomplete understanding of traditional enterprise needs</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Uneven resilience</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Distributed cloud inconsistencies</span></li>
</ul>
</li>
</ul>
</li>
</ul>
</li>
</ul>
<ul>
<li><b>Azure</b></li>
</ul>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Strengths</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Cross-Microsoft Capabilities</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Industry Clouds</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Strategic partnership with OpenAI</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Cautions</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Ongoing Security Challenges</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Capacity Shortages</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Inconsistent Service and Support</span></li>
</ul>
</li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">07:04  Justin – “…it’s still a shared security model. You still have requirements you have to meet. So you’re not off the hook completely by checking assured workloads for sure.”</span></i></p>
<p><b>08:12 </b><a href="https://blog.cloudflare.com/ddos-threat-report-for-2024-q3/" target="_blank" rel="noreferrer noopener"><b>4.2 Tbps of bad packets and a whole lot more: Cloudflare’s Q3 DDoS report</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Cloudflare gives us the 19th edition of the CloudFlare DDOS threat report. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The number of DDoS attacks spiked in the third quarter of 2024. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Cloudflare mitigated nearly 6 million DDOS attacks, representing a 49% increase in QoQ and 55% increase YoY. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Out of those 6 million, Cloudflare’s autonomous DDOS defense systems detected and mitigated over 200 hyper-volumetric DDoS attacks exceeding rates of 3 terabits per second (Tbps) and 2 Billion packets per second (Bpps). </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The </span><a href="https://blog.cloudflare.com/how-cloudflare-auto-mitigated-world-record-3-8-tbps-ddos-attack" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">largest attack</span></a><span style="font-weight:400;"> peaked at 4.2TB and lasted a minute.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The Banking and Financial services industry is subjected to the most DDoS attacks. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">China was the country most targeted, and Indonesia was the largest source of attacks. </span></li>
</ul>
<p><i><span style="font-weight:400;">09:27  Justin – “DDoS is not an IF thing. It’s a WHEN problem for every company.”</span></i></p>
<h2><b>AI is Going Great – Or How ML Makes All Its Money</b><span style="font-weight:400;"> </span></h2>
<p><b>10:12 </b><a href="https://arstechnica.com/ai/2024/10/github-copilot-moves-beyond-openai-models-to-support-claude-3-5-gemini/" target="_blank" rel="noreferrer noopener"><b>GitHub Copilot moves beyond OpenAI models to support Claude 3.5, </b></a><a href="https://arstechnica.com/ai/2024/10/github-copilot-moves-beyond-openai-models-to-support-claude-3-5-gemini/" target="_blank" rel="noreferrer noopener"><b>Gemini</b></a></p>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">In a sign of continuing ruptures between OpenAI and Microsoft (in Justin’s opinion,) </span><a href="https://copilot.microsoft.com/onboarding" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Copilot</span></a><span style="font-weight:400;"> will switch from being exclusively </span><a href="https://openai.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OpenAI GPT</span></a><span style="font-weight:400;"> models to a </span><a href="https://github.blog/news-insights/product-news/bringing-developer-choice-to-copilot/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">multi-modal approach</span></a><span style="font-weight:400;"> over the coming weeks.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">First Anthropic </span><a href="https://www.anthropic.com/claude/sonnet" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">3.5 Sonnet</span></a><span style="font-weight:400;"> will roll out to Copilots chat web and VS Code interfaces, with </span><a href="https://gemini.google.com/app?hl=en" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Google Gemini 1.5 pro</span></a><span style="font-weight:400;"> coming a short term later. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In addition, Copilot will support gpt o1-preview and 01 mini, which are intended to be stronger at advanced reasoning than GPT-4 – which copilot has used until now. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The new approach makes sense for users as certain models are better at certain languages or types of tasks.</span></li>
</ul>
</li>
<li style="font-weight:400;"><i><span style="font-weight:400;">“There is no one model to rule every scenario,”</span></i><span style="font-weight:400;"> wrote GitHub CEO Thomas Dohmke </span><i><span style="font-weight:400;">“It is clear the next phase of AI code generation will not only be defined by multi-model functionality, but by multi-model choice.”</span></i></li>
</ul>
<p><i><span style="font-weight:400;">11:11  Ryan – “it’s very interesting that GitHub is doing that with Microsoft’s heavily involvement in OpenAI. But I also wonder if this is one of those things where the subsidiary is given a little bit more leniency, especially since it’s not really divorcing OpenAI or ChatGPT in general.”</span></i></p>
<h2><b>AWS</b></h2>
<p><b>12:32 </b><a href="https://aws.amazon.com/blogs/aws/ec2-image-builder-now-supports-building-and-testing-macos-images/" target="_blank" rel="noreferrer noopener"><b>EC2 Image Builder now supports building and testing macOS images</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><a href="https://en.wikipedia.org/wiki/MacOS" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">MacOS</span></a><span style="font-weight:400;"> is now supported in </span><a href="https://aws.amazon.com/image-builder/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">EC2 Image Builder</span></a><span style="font-weight:400;">.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This will allow you to create and manage machine images for your macOS workloads, in addition to the existing support for Windows and Linux.</span></li>
</ul>
<p><b>13:54 </b><a href="https://aws.amazon.com/blogs/aws/celebrating-10-years-of-amazon-ecs-powering-a-decade-of-containerized-innovation/" target="_blank" rel="noreferrer noopener"><b>Celebrating 10 Years of Amazon ECS: Powering a Decade of Containerized </b></a><a href="https://aws.amazon.com/blogs/aws/celebrating-10-years-of-amazon-ecs-powering-a-decade-of-containerized-innovation/" target="_blank" rel="noreferrer noopener"><b>Innovation</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/ecs/getting-started/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">ECS</span></a><span style="font-weight:400;"> is now 10 years old!! We still remember it being announced at Re:invent in 2014… and we’ve been fans ever since. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Its had a fun evolution:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">2014 </span><a href="https://aws.amazon.com/blogs/aws/cloud-container-management/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">EC2 Container Service Launch</span></a></li>
<li style="font-weight:400;"><span style="font-weight:400;">2015 </span><a href="https://aws.amazon.com/about-aws/whats-new/2015/12/amazon-ec2-container-service-supports-additional-cloudwatch-metrics/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">ECS Autoscaling</span></a></li>
<li style="font-weight:400;"><span style="font-weight:400;">2016 </span><a href="https://aws.amazon.com/blogs/aws/new-aws-application-load-balancer/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">ALB for ECS</span></a></li>
<li style="font-weight:400;"><span style="font-weight:400;">2017 </span><a href="https://aws.amazon.com/fargate/getting-started/?nc=sn&amp;loc=4" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Fargate</span></a></li>
<li style="font-weight:400;"><span style="font-weight:400;">2018 </span><a href="https://aws.amazon.com/blogs/aws/aws-auto-scaling-unified-scaling-for-your-cloud-applications/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Auto Scaling</span></a></li>
<li style="font-weight:400;"><span style="font-weight:400;">2019 </span><a href="https://aws.amazon.com/blogs/aws/coming-soon-graviton2-powered-general-purpose-compute-optimized-memory-optimized-ec2-instances/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Graviton 2 support</span></a></li>
<li style="font-weight:400;"><span style="font-weight:400;">2020 </span><a href="https://aws.amazon.com/blogs/opensource/announcing-the-general-availability-of-bottlerocket-an-open-source-linux-distribution-purpose-built-to-run-containers/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">BottleRocket</span></a></li>
<li style="font-weight:400;"><span style="font-weight:400;">2021 </span><a href="https://aws.amazon.com/blogs/containers/new-using-amazon-ecs-exec-access-your-containers-fargate-ec2/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">ECS Exec</span></a></li>
<li style="font-weight:400;"><span style="font-weight:400;">2022 </span><a href="https://aws.amazon.com/blogs/aws/new-amazon-ecs-service-connect-enabling-easy-communication-between-microservices/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">ECS Service connect</span></a></li>
<li style="font-weight:400;"><span style="font-weight:400;">2023 </span><a href="https://aws.amazon.com/blogs/aws/introducing-amazon-guardduty-ecs-runtime-monitoring-including-aws-fargate/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Guard Duty ECS runtime support</span></a></li>
<li style="font-weight:400;"><span style="font-weight:400;">2024 </span><a href="https://aws.amazon.com/blogs/aws/amazon-ecs-supports-a-native-integration-with-amazon-ebs-volumes-for-data-intensive-workloads/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">EBS support</span></a><span style="font-weight:400;"> </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">16:29  Justin – “Despite Kubernetes dominating the market, you know, ECS has continued to get a lot of innovation. I imagine it runs a lot of services under the hood at AWS for their use cases and how they run your services that you consume…Happy birthday, ECS. Stop getting older because I can’t be aging this fast.”</span></i></p>
<p><b>17:54 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/aws-efa-updates-scalability-ai-ml-applications/" target="_blank" rel="noreferrer noopener"><b>AWS announces EFA update for scalability with AI/ML applications</b></a><b>  </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS announces the launch of a new interface type that decouples the EFA and the ENA.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">EFA provides high bandwidth low latency networking crucial for calling AI/ML workloads. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The new interface (EFA-only) allows you to create a standalone EFA device on secondary interfaces. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This allows you to scale your compute clusters to run AI/ML applications without straining private Ipv4 space or encountering IP routing challenges with linux. </span></li>
</ul>
<h2><b>GCP</b></h2>
<p><b>19:35 </b><a href="https://cloud.google.com/blog/products/compute/updates-to-ai-hypercomputer-software-stack/" target="_blank" rel="noreferrer noopener"><b>AI Hypercomputer software updates: Faster training and inference, a new </b></a><a href="https://cloud.google.com/blog/products/compute/updates-to-ai-hypercomputer-software-stack/" target="_blank" rel="noreferrer noopener"><b>resource hub, and more</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is announcing major updates to the </span><a href="https://cloud.google.com/solutions/ai-hypercomputer" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AI Hypercomputer</span></a><span style="font-weight:400;"> software layer for training and inference performance, improved resiliency at scale, as well as centralized hub for hypercomputer resources</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Centralized AI Hypercomputer Resources on GitHub:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Launch of the </span><a href="https://github.com/ai-hypercomputer" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AI Hypercomputer GitHub organization</span></a><span style="font-weight:400;">, a central repository for developers to access reference implementations like </span><a href="https://github.com/AI-Hypercomputer/maxtext" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">MaxText</span></a><span style="font-weight:400;"> and </span><a href="https://github.com/AI-Hypercomputer/maxdiffusion" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">MaxDiffusion</span></a><span style="font-weight:400;">, orchestration tools like </span><a href="https://github.com/AI-Hypercomputer/xpk" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">xpk</span></a><span style="font-weight:400;"> (Accelerated Processing Kit), and performance recipes for </span><a href="https://github.com/AI-Hypercomputer/gpu-recipes" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">GPUs</span></a><span style="font-weight:400;"> on Google Cloud.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Facilitates easier discovery and contribution to AI Hypercomputer’s open-source projects.</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">MaxText Now Supports A3 Mega VMs:</span>
<ul>
<li style="font-weight:400;"><a href="https://github.com/AI-Hypercomputer/maxtext" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">MaxText</span></a><span style="font-weight:400;">, an open-source, high-performance implementation for large language models (LLMs), now optimized for </span><a href="https://cloud.google.com/skus/sku-groups/a3-mega-on-demand-vms" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">A3 Mega VMs</span></a><span style="font-weight:400;"> powered by NVIDIA H100 Tensor Core GPUs.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Offers a 2x improvement in GPU-to-GPU network bandwidth over A3 VMs.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Collaboration with NVIDIA to optimize JAX and </span><a href="https://jax.readthedocs.io/en/latest/xla_flags.html#gpu-xla-flags" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">XLA</span></a><span style="font-weight:400;"> for overlapping communication and computation on GPUs.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Introduction of FP8 mixed-precision training using </span><a href="https://github.com/google/aqt" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Accurate Quantized Training</span></a><span style="font-weight:400;"> (AQT), delivering up to 55% improvement in effective model FLOPS utilization compared to bf16 precision.</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Reference Implementations and Kernels for Mixture of Experts (MoE):</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Expansion of MaxText to include both “capped” and “no-cap” MoE implementations, providing flexibility between predictable performance and dynamic resource allocation.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Open-sourcing of </span><a href="https://jax.readthedocs.io/en/latest/pallas/index.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Pallas kernels</span></a><span style="font-weight:400;"> optimized for</span><a href="https://github.com/jax-ml/jax/blob/f52b016de1f605676a2cab750938f737d2f22b1a/jax/experimental/pallas/ops/tpu/megablox/gmm.py#L314" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;"> block-sparse matrix multiplication</span></a><span style="font-weight:400;"> on Cloud TPUs, compatible with PyTorch and JAX, enhancing MoE model training performance.</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Monitoring Large-Scale Training:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Introduction of a </span><a href="https://github.com/AI-Hypercomputer/cloud-tpu-monitoring-debugging" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">reference monitoring recipe</span></a><span style="font-weight:400;"> to create a </span><a href="https://github.com/AI-Hypercomputer/cloud-tpu-monitoring-debugging?tab=readme-ov-file#monitoring-dashboard" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Cloud Monitoring dashboard</span></a><span style="font-weight:400;"> in Google Cloud projects.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Enables tracking of metrics like CPU utilization and identification of outliers, simplifying MLOps for large-scale training jobs.</span></li>
</ul>
</li>
<li style="font-weight:400;"><a href="https://arxiv.org/pdf/2304.01433" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">SparseCore</span></a><span style="font-weight:400;"> on Cloud TPU v5p Now Generally Available:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">SparseCore, a hardware accelerator for embeddings on Cloud TPU v5p, is now generally available.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Each TPU v5p chip includes four SparseCores, delivering up to 2.5x performance improvement for models like DLRM-V2 compared to previous generations.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Enhances performance for recommender systems and models relying on embeddings.</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Improved LLM Inference Performance:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Introduction of KV cache quantization and ragged attention kernels in </span><a href="https://github.com/AI-Hypercomputer/JetStream" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">JetStream</span></a><span style="font-weight:400;">, an open-source, optimized engine for LLM inference.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">These enhancements improve inference performance by up to 2x on Cloud TPU v5e.</span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">21:02  Ryan – “it really does show how much the IEI branding is taking over everything. Because a lot of these things were the same things we were talking about for machine learning.”</span></i></p>
<p><b>21:44 </b><a href="https://cloud.google.com/blog/products/data-analytics/introducing-ai-driven-bigquery-data-preparation/" target="_blank" rel="noreferrer noopener"><b>BigQuery’s AI-assisted data preparation is now in preview</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Now in preview, </span><a href="https://cloud.google.com/bigquery/docs/data-prep-introduction" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">BigQuery data preparation</span></a><span style="font-weight:400;"> provides a number of capabilities:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AI-powered suggestions: BigQuery data preparation uses Gemini in BigQuery to analyze your data and schema and provide intelligent suggestions for cleaning, transforming, and enriching the data. This significantly reduces the time and effort required for manual data preparation tasks.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Data cleansing and standardization: Easily identify and rectify inconsistencies, missing values, and formatting errors in your data.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Visual data pipelines: The intuitive, low-code visual interface helps both technical and non-technical users easily design complex data pipelines, and leverage BigQuery’s rich and extensible SQL capabilities.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Data pipeline orchestration: Automate the execution and monitoring of your data pipelines. The SQL generated by BigQuery data preparation can become part of a </span><a href="https://cloud.google.com/dataform" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Dataform</span></a><span style="font-weight:400;"> data engineering pipeline that you can deploy and orchestrate with CI/CD, for a shared development experience.</span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">22:12  Justin – “What could go wrong with low code complex data pipeline?”</span></i></p>
<p><b>23:21 </b><a href="https://cloud.google.com/blog/products/api-management/apigee-a-leader-in-2024-gartner-api-management-magic-quadrant/" target="_blank" rel="noreferrer noopener"><b>Google Cloud Apigee named a Leader in the 2024 Gartner® Magic </b></a><a href="https://cloud.google.com/blog/products/api-management/apigee-a-leader-in-2024-gartner-api-management-magic-quadrant/" target="_blank" rel="noreferrer noopener"><b>Quadrant™ for API Management</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">It’s amazing how many companies are in this quadrant but don’t feel like real API gateways.. </span></li>
</ul>
<p><i><span style="font-weight:400;">24:29  Justin – “Amazon web services though, being a very, very good at ability to execute, but not a completeness of vision. they’re in the challenger quadrant, speaks volumes about how little innovation API gateway has gotten.”</span></i></p>
<h2><b>Azure</b></h2>
<p><b>25:42 </b><a href="https://siliconangle.com/2024/10/16/microsofts-financial-disclosures-reveal-azures-market-position/" target="_blank" rel="noreferrer noopener"><b>What Microsoft’s financial disclosures reveal about Azure’s market position</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft will now change the way it reports some Azure metrics to the stock market in their upcoming earnings call (Which we’ll cover next week.)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">MS said the change will align Azure with consumption revenue and by inference more closely aligning how AWS reports its metrics. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The account change removed slower growth revenue streams and raised the growth rates for azure.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It also increased the AI contribution within Azure. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Removed services:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">EMS (Enterprise Mobility and Security) and Power BI</span></li>
</ul>
</li>
</ul>
<p><b>27:17  </b><a href="https://azure.microsoft.com/en-us/blog/azure-at-github-universe-new-tools-to-help-simplify-ai-app-development/" target="_blank" rel="noreferrer noopener"><b>Azure at GitHub Universe: New tools to help simplify AI app development</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Github Copilot for Azure now in Preview, integrating the tools you use your IDE and Azure.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can now use @azure, giving you personalized guidance to learn about services and tools without leaving your code. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This can accelerate and streamline development by provisioning and deploying resources through Azure Developer CLI templates. </span></li>
<li style="font-weight:400;"><a href="https://aka.ms/aiapps" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AI App Templates</span></a><span style="font-weight:400;"> further accelerate your development by helping you get started faster and simplifying evaluation and the path to production.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Using an AI App template directly in your preferred IDE such as Github codespaces, vs code and visual studio.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can even get recommendations for specific templates right from Github Copilot for Azure based on your AI use case or scenario.  </span></li>
<li style="font-weight:400;"><a href="https://github.blog/changelog/2024-10-29-github-models-is-now-available-in-public-preview" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Github Models</span></a><span style="font-weight:400;"> now in preview to give you access to </span><a href="https://azure.microsoft.com/en-us/solutions/ai/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Azure AI</span></a><span style="font-weight:400;">’s leading model garden.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Keeping Java apps up to date can be time consuming, and to help they are giving you </span><a href="https://aka.ms/java-at-GHU2024" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Github CoPilot upgrade assistant for Java</span></a><span style="font-weight:400;"> to offer an approach using AI to simplify this process and allowing you to upgrade your java apps with minimal manual effort. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Scale AI applications with </span><a href="https://learn.microsoft.com/en-us/azure/ai-studio/how-to/develop/evaluate-sdk" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Azure AI evaluation</span></a><span style="font-weight:400;"> and online A/B experimentation using CI/CD workflows</span></li>
</ul>
<p><i><span style="font-weight:400;">28:37 Ryan – “I like all of these, but I really don’t like that they’re keeping the Java apps up to date. Like, they’re just furthering the life of that terrible, terrible language. And one of the things is that they abstract all these simple things away, but it’s like, that’s why I hate it. It shouldn’t exist. It’s terrible. And newer languages have moved on.”</span></i></p>
<p><b>29:21  </b><a href="https://github.blog/news-insights/product-news/universe-2024-previews-releases/" target="_blank" rel="noreferrer noopener"><b>New from Universe 2024: Get the latest previews and releases</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AI-Native = </span><a href="https://github.blog/changelog/2024-10-29-refine-and-validate-code-review-suggestions-with-copilot-workspace-public-preview" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Github Copilot Workspace + Code Review</span></a><span style="font-weight:400;"> + Copilot Autofix to allow you to rapidly refine, validate and land Copilot-generated code suggestions from copilot code review, copilot autofix and third party copilot extensions. </span></li>
<li style="font-weight:400;"><a href="https://gh.io/spark" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Github Spark</span></a><span style="font-weight:400;"> is a new way to start ideas. It’s powered by natural language and it sets the stage for github’s vision to help 1 billion people become developers. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With live history, previews and the ability to edit code directly, Github Spark allows you to create microapps that take that crazy small, fun idea and bring it to life. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Raising the quality of Copilot power experiences, they have added new features such as multi-modal choice, improved code completion, implicit agent selection in github copilot chat, better support for C++ and .Net and expanded availability in Xcode and Windows Terminal. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can now edit multiple lines and files with copilot in </span><a href="https://github.blog/changelog/2024-10-29-multi-file-editing-code-review-custom-instructions-and-more-for-github-copilot-in-vs-code-october-release-v0-22/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">VSCode</span></a><span style="font-weight:400;">, applying edits directly as you iterate on your codebase with natural language.  </span></li>
<li style="font-weight:400;"><a href="https://github.blog/changelog/2024-10-29-github-copilot-code-review-in-github-com-public-preview/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Github Copilot code reviews</span></a><span style="font-weight:400;"> provide copilot powered feedback on your code as soon as you create a pull request. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This means no more waiting for hours to start the feedback loop. Configure rules for your team and keep quality high with the help of your trusted AI pair programmer.  Now supporting C#, Java, Javascript, Python, Typescript, Ruby, Go and Markdown. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Github Copilot extensions allow you or your organization to integrate proprietary tools directly into your IDE via the github marketplace.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Some that we saw in the marketplace were Docker for Github Copilot, Teams toolkit for Github Copilot. Atlassian, New Relic etc. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">For the EU, you now get </span><a href="https://github.blog/changelog/2024-10-29-github-enterprise-cloud-data-residency-in-the-eu-is-generally-available/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Data residency</span></a><span style="font-weight:400;"> for Github Enterprise Cloud. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Github Issues got further improvements with sub issues, issue types, advanced search and increased project item limits </span></li>
</ul>
<p><i><span style="font-weight:400;">28:37 Ryan – “I do like adding the code reviews and feedback ability to GitHub. I think that’s a fantastic thing just to have built in. I hope that that allows some of the finding nine different people to validate my PRs to make sure I can go to production, go away, but we’ll see, doubt it.”</span></i></p>
<p><b>34:06 </b><a href="https://azure.microsoft.com/en-us/blog/accelerate-scale-with-azure-openai-service-provisioned-offering/" target="_blank" rel="noreferrer noopener"><b>Accelerate scale with Azure OpenAI Service Provisioned offering</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://azure.microsoft.com/en-us/products/ai-services/openai-service/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Azure OpenAI Service</span></a><span style="font-weight:400;"> Data Zones allows enterprises to scale AI workloads while maintaining compliance with regional data residency requirements. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It offers flexible, multi-regional data processing within selected data boundaries, eliminating the need to manage multiple resources across regions.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">99% Latency SLA for Token Generation: Ensures faster and more consistent token generation speeds, especially at high volumes, providing predictable performance for mission-critical applications.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Reduced Pricing and Lower Deployment Minimums:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Hourly pricing for Provisioned Global deployments reduced from $2.00 to $1.00 per hour.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Deployment minimums for Provisioned Global reduced by 70%, and scaling increments reduced by up to 90%, lowering the barrier for businesses to start using the Provisioned offering.</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Prompt Caching: Offers a significant cost and performance advantage by caching repetitive API requests. Cached tokens are discounted by 50% for the Standard offering.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Simplified Token Throughput Information: Provides a clear view of input and output tokens per minute for each Provisioned deployment, eliminating the need for detailed conversion tables or calculators.</span></li>
</ul>
<p><i><span style="font-weight:400;">35:36 Justin – “I implemented Claude and my VS code, and when I ask it questions now it tells me how many tokens I used, which has been really helpful to like learn how many tokens and how much that does cost me. You know, especially when you’re paying by the drip now, like I have Claude subscription as well. And that one, just paid 20 bucks a month and I see the value of just paying 20 bucks a month if you’re doing a lot of heavy duty stuff, but if you need to integrate an app, you have to use API’s and that’s where the tokens really kill you.”</span></i></p>
<p><b>36:04 </b><a href="https://techcommunity.microsoft.com/t5/azure-tools-blog/announcing-azapi-2-0/ba-p/4275733" target="_blank" rel="noreferrer noopener"><b>Announcing AzAPI 2.0</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AzAPI provider, designed to expedite the integration of new Azure services with Hashicorp Terraform, has now released 2.0.  This updated version marks a significant step in their goal to provide launch day support for azure services using terraform</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Key Features of the AzAPI include</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Resource Specific versioning allowing users to switch to a new API version without altering provider versions</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Special functions like azapi_update_resource and azapi_resource_action</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Immediate day 0 support for new services.</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Also, all resource properties, outputs and state representation are now handled by Hashicorp configuration language instead of JSON</span></li>
</ul>
<p><i><span style="font-weight:400;">37:15 Justin – “I kind of like the idea of it though, because, you know, if you, if you change the API for the service and now you have to roll a whole brand new provider, you have to maintain a lot of branches of providers. Cause if you push, you know, to a new provider that has different syntax, like that could be a breaking change. So this allows you to take advantage of a newer API without the breaking change potentially.”</span></i></p>
<p><b>38:31 </b><a href="https://techcommunity.microsoft.com/t5/ai-azure-ai-services-blog/announcing-azure-openai-global-batch-general-availability-at/ba-p/4276921" target="_blank" rel="noreferrer noopener"><b>Announcing Azure OpenAI Global Batch General availability: At scale </b></a><a href="https://techcommunity.microsoft.com/t5/ai-azure-ai-services-blog/announcing-azure-openai-global-batch-general-availability-at/ba-p/4276921" target="_blank" rel="noreferrer noopener"><b>processing with 50% less cost!</b></a><b>  </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">GA of Azure OpenAI global batch offering, designed to handle large-scale and high-volume processing tasks efficiently.  Process asynchronous groups of requests with separate quota, a 24 hour turnaround and 50% less cost than global standard.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;"> Why Azure OpenAI Global Batch?</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Benefit 50% lower costs, enabling you to either introduce new workloads or run existing workloads more frequently, thereby increasing overall business value.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Efficiently handle large-scale workloads that would be impractical to process in real-time, significantly reducing processing times.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Minimize engineering overhead for job management with a high resource quota, allowing you to queue and process gigabytes of data with ease. Substantially high </span><a href="https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/batch?tabs=standard-input%2Cpython-secure&amp;pivots=programming-language-ai-studio#global-batch-quota" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">quotas</span></a><span style="font-weight:400;"> for batch.</span></li>
</ul>
<h2><b>Oracle</b></h2>
<p><b>40:09 </b><a href="https://blogs.oracle.com/cloud-infrastructure/post/converged-db-multi-cloud" target="_blank" rel="noreferrer noopener"><b>Create a multi cloud data platform with a converged database</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Oracle Autonomous Database will be available across all major cloud service providers (hyperscalers) by 2025, including Oracle Cloud Infrastructure (OCI), Amazon Web Services (AWS), Google Cloud Platform (GCP), and Microsoft Azure.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Introduction of Oracle’s Converged Database Solution: A single database that manages all data types (structured, unstructured, graph, geospatial, vectors) and can be deployed across private data centers and all major cloud platforms.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">New Features:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Deployment Across Multiple Clouds:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Oracle Autonomous Database on OCI: Offers features like automated security measures, continuous monitoring, and scalability without rearchitecting applications.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Integration with AWS: Strategic partnership enabling deeper analytical insights by combining Oracle Database services with AWS Analytics for near-real-time analytics and machine learning without complex data pipelines.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Oracle Database@Azure: Availability of Oracle Database services within Azure data centers, allowing seamless integration with native Microsoft Azure services for high performance and low latency.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Oracle Database@Google Cloud: Integration of Oracle technologies into Google Cloud, providing services like Oracle Exadata Database Service and Oracle Autonomous Database, fully integrated into Google Cloud networking.</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Converged Database Capabilities:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Unified Data Management: Handles multiple data types within a single database system, reducing the need for multiple specialized databases.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Compliance with Data Residency Regulations: Ensures minimal data replication and consistent data management across geographies to meet stringent regulatory requirements.</span></li>
</ul>
</li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">41:58 Justin – “And it’s kind of interesting, but I can think of really interesting data warehouse use cases. could see some interesting, you know, different global replication needs that you might have that this could be really handy. And so if you’re already sending all the money to Oracle, why not take advantage of something like this? If it makes sense for your solution.”</span></i></p>
<p><b>42:33 </b><a href="https://blogs.oracle.com/cloud-infrastructure/post/migrations-now-migrate-aws-ec2-vm-instances-oci" target="_blank" rel="noreferrer noopener"><b>Oracle Cloud Migrations can now migrate AWS EC2 VM instances to OCI</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Oracle now natively will migrate your EC2 VM to ZOCI.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This fully managed toolset provides you with complete control over the migration workflow while simplifying and automating the process, including:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Automatically discovering VMs in your source environment</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Creating and managing an inventory with OCI of the resource identified in the source environment. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Providing compatibility assessments, metrics, recommendations and cost comparisons</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Creating plans and simplify the deployment of migration targets in OCI</span></li>
</ul>
</li>
</ul>
<h2><span style="font-weight:400;">Closing</span></h2>
<p><span style="font-weight:400;">And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</span></p>
]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/1876998/c1e-1xdxbj6z45f4mx8q-1pdpo2q2t0z3-ztjre5.mp3" length="53545189"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 281 of The Cloud Pod, where the forecast is always cloudy! Justin and Ryan are your hosts as we search the clouds for all the latest news and info. This week we’re talking about ECS turning 10 (yes, we were there when it was announced, and yes, we’re old,) some more drama from the CrowdStrike fiasco, lots of updates to GitHub, plus more. Join us!  
Titles we almost went with this week:

Github Universe full of ECS containers
️Github Universe lives up to the Universal expectations 

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
Follow Up
01:09 Dr. Matt Woods ended up at PWC as chief innovation officer

YAWN
What exactly does a chief innovation officer at PWC do? Is this like a semi-retirement? 

General News 
01:44 TSA silent on CrowdStrike’s claim Delta skipped required security update

Delta isn’t backing down with CrowdStrike, and in a court filing said CrowdStrike should be on the hook for the entire $500M in losses, partly because CrowdStrike has admitted that it should have done more testing and staggered deployments to catch bugs. 
Delta further alleges that CrowdStrike postured as a certified best-in-class security provider who “never cuts corners,” while secretly designing its software to bypass Microsoft security certifications to make changes at the core of Delta’s computer systems without Delta’s knowledge. 
Delta says they would never have agreed to such a dangerous process if it had been disclosed. 
In its testimony to Congress, CrowdStrike said that they follow standard protocols, and that they are protecting against threats as they evolve.
CrowdStrike is also accusing Delta of failing to follow laws, including best practices established by the TSA.
According to CrowdStrike, most customers were up within a day of the issue – while Delta took 5 days. 
Crowdstrike alleges that Delta’s negligence caused this in following the TSA requirements designed to ensure that no major airline ever experiences prolonged system outages. 
CrowdStrike realized Delta failed to follow the requirements when its efforts to help...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/1876998/c1a-k5d5-v6z6jv7nc7k9-m1njzt.jpg"></itunes:image>
                                                                            <itunes:duration>00:44:38</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                            </item>
                    <item>
                <title>
                    <![CDATA[280: Evidently, The Cloud Pod Was Always Right]]>
                </title>
                <pubDate>Thu, 31 Oct 2024 11:12:35 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/1871025</guid>
                                    <link>https://tcpfm.castos.com/episodes/280-evidently-the-cloud-pod-was-always-right</link>
                                <description>
                                            <![CDATA[<p><span style="font-weight:400;">Welcome to episode 280 of The Cloud Pod, where the forecast is always cloudy! This week Justin, Jonathan, Ryan, and Matthew are your hosts as we travel through the latest in cloud news. This week we’re talking more about nuclear power, some additional major employee shakeups, Claude releases, plus saying RIP to CloudWatch Evidently and hello to Azure Cobalt VMs.  </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">☢️The cloud providers are colluding on Nuclear Power</span></li>
<li><span style="font-weight:400;">I fear our AWS AI nightmare might get worse without Dr. Matt Wood.</span></li>
<li><span style="font-weight:400;">I’m a glow with excitement about nuclear cloud power</span></li>
<li><span style="font-weight:400;">⚛️Plainly no one else knew what “CloudWatch Evidently” did either</span></li>
<li><span style="font-weight:400;">We sing a Claude Sonnet about Nuclear Power</span></li>
<li><span style="font-weight:400;">✅</span><a href="https://aws.amazon.com/blogs/aws/cloudwatch-evidently/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Evidently</span></a><span style="font-weight:400;">, The Cloud Pod was always right</span></li>
<li><span style="font-weight:400;">Amazon goes nuclear while their AI VP goes AWOL</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info.</span></h3>
<h2><span style="font-weight:400;">AI Is Going Great – Or How ML Makes All It’s Money  </span></h2>
<p><b>00:53 </b><a href="https://www.anthropic.com/news/3-5-models-and-computer-use" target="_blank" rel="noreferrer noopener"><b>Introducing computer use, a new Claude 3.5 Sonnet, and Claude 3.5 Haiku</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Anthropic is announcing the upgraded </span><a href="https://www.anthropic.com/claude/sonnet" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Claude 3.5 Sonnet</span></a><span style="font-weight:400;"> and a new Model Claude 3.5 Haiku. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Claude 3.5 Sonnet delivers across the board improvements over its predecessor, with particularly significant gains in coding — an area where it already leads the field (per anthropic).  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Claude 3.5 Haiku interestingly matches the performance of Claude 3 Opus, the prior largest model, on many evaluations at the same cost and similar speed to the previous generation of Haiku. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Claude 3.5 Sonnet also includes a groundbreaking new capability in beta: Computer Use.  </span></li>
<li style="font-weight:400;"><a href="https://docs.anthropic.com/en/docs/build-with-claude/computer-use" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Available today as an API</span></a><span style="font-weight:400;">, developers can direct Claude to use computers the way people do – by looking at a screen, moving a cursor, clicking buttons and typing text.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Claude 3.5 is the first frontier AI model to offer this capability. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Anthropic warns the feature is still </span><a href="https://www.anthropic.com/news/developing-computer-use" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">experimental</span></a><span style="font-weight:400;"> – at times cumbersome and error-prone. As well as things that are effortless for a human are still difficult including scrolling, dra...</span></li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 280 of The Cloud Pod, where the forecast is always cloudy! This week Justin, Jonathan, Ryan, and Matthew are your hosts as we travel through the latest in cloud news. This week we’re talking more about nuclear power, some additional major employee shakeups, Claude releases, plus saying RIP to CloudWatch Evidently and hello to Azure Cobalt VMs.  
Titles we almost went with this week:

☢️The cloud providers are colluding on Nuclear Power
I fear our AWS AI nightmare might get worse without Dr. Matt Wood.
I’m a glow with excitement about nuclear cloud power
⚛️Plainly no one else knew what “CloudWatch Evidently” did either
We sing a Claude Sonnet about Nuclear Power
✅Evidently, The Cloud Pod was always right
Amazon goes nuclear while their AI VP goes AWOL

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info.
AI Is Going Great – Or How ML Makes All It’s Money  
00:53 Introducing computer use, a new Claude 3.5 Sonnet, and Claude 3.5 Haiku

Anthropic is announcing the upgraded Claude 3.5 Sonnet and a new Model Claude 3.5 Haiku. 
Claude 3.5 Sonnet delivers across the board improvements over its predecessor, with particularly significant gains in coding — an area where it already leads the field (per anthropic).  
Claude 3.5 Haiku interestingly matches the performance of Claude 3 Opus, the prior largest model, on many evaluations at the same cost and similar speed to the previous generation of Haiku. 
Claude 3.5 Sonnet also includes a groundbreaking new capability in beta: Computer Use.  
Available today as an API, developers can direct Claude to use computers the way people do – by looking at a screen, moving a cursor, clicking buttons and typing text.  
Claude 3.5 is the first frontier AI model to offer this capability. 
Anthropic warns the feature is still experimental – at times cumbersome and error-prone. As well as things that are effortless for a human are still difficult including scrolling, dra...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[280: Evidently, The Cloud Pod Was Always Right]]>
                </itunes:title>
                                    <itunes:episode>280</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p><span style="font-weight:400;">Welcome to episode 280 of The Cloud Pod, where the forecast is always cloudy! This week Justin, Jonathan, Ryan, and Matthew are your hosts as we travel through the latest in cloud news. This week we’re talking more about nuclear power, some additional major employee shakeups, Claude releases, plus saying RIP to CloudWatch Evidently and hello to Azure Cobalt VMs.  </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">☢️The cloud providers are colluding on Nuclear Power</span></li>
<li><span style="font-weight:400;">I fear our AWS AI nightmare might get worse without Dr. Matt Wood.</span></li>
<li><span style="font-weight:400;">I’m a glow with excitement about nuclear cloud power</span></li>
<li><span style="font-weight:400;">⚛️Plainly no one else knew what “CloudWatch Evidently” did either</span></li>
<li><span style="font-weight:400;">We sing a Claude Sonnet about Nuclear Power</span></li>
<li><span style="font-weight:400;">✅</span><a href="https://aws.amazon.com/blogs/aws/cloudwatch-evidently/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Evidently</span></a><span style="font-weight:400;">, The Cloud Pod was always right</span></li>
<li><span style="font-weight:400;">Amazon goes nuclear while their AI VP goes AWOL</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info.</span></h3>
<h2><span style="font-weight:400;">AI Is Going Great – Or How ML Makes All It’s Money  </span></h2>
<p><b>00:53 </b><a href="https://www.anthropic.com/news/3-5-models-and-computer-use" target="_blank" rel="noreferrer noopener"><b>Introducing computer use, a new Claude 3.5 Sonnet, and Claude 3.5 Haiku</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Anthropic is announcing the upgraded </span><a href="https://www.anthropic.com/claude/sonnet" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Claude 3.5 Sonnet</span></a><span style="font-weight:400;"> and a new Model Claude 3.5 Haiku. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Claude 3.5 Sonnet delivers across the board improvements over its predecessor, with particularly significant gains in coding — an area where it already leads the field (per anthropic).  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Claude 3.5 Haiku interestingly matches the performance of Claude 3 Opus, the prior largest model, on many evaluations at the same cost and similar speed to the previous generation of Haiku. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Claude 3.5 Sonnet also includes a groundbreaking new capability in beta: Computer Use.  </span></li>
<li style="font-weight:400;"><a href="https://docs.anthropic.com/en/docs/build-with-claude/computer-use" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Available today as an API</span></a><span style="font-weight:400;">, developers can direct Claude to use computers the way people do – by looking at a screen, moving a cursor, clicking buttons and typing text.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Claude 3.5 is the first frontier AI model to offer this capability. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Anthropic warns the feature is still </span><a href="https://www.anthropic.com/news/developing-computer-use" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">experimental</span></a><span style="font-weight:400;"> – at times cumbersome and error-prone. As well as things that are effortless for a human are still difficult including scrolling, dragging or zooming.   </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The idea is to make Claude complete individual tasks, without always needing to leverage an API, like clicking in a GUI, or uploading a file from a computer.  These types of solutions are typically found in Build and Test like scenarios with tools such as Saucelabs or Browserstack. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">To do this, Claude was built to perceive and interact with computer interfaces. You can use data from my computer to fill out this online form or check a spreadsheet, move the cursor to a web browser, navigate to the relevant web pages, select the data for the spreadsheet and so on. </span></li>
</ul>
<p><i><span style="font-weight:400;">3:06  Jonathan – “</span></i><i><span style="font-weight:400;">If you can take pictures of the screen, then it can identify where buttons and things are without having to know the name of the objects in the DOM and stuff like that. So you could say, give me instructions, click on this, click on this, click on this, do this stuff. It would be really easy to automate tests that way instead of having to know the names of the divs and things on a page, especially for web testing. Because if a developer changes those, then you’ve got to update the tests where if you say click on the button that says do this, then it can. Something I really appreciate about Clawboard, although it won’t generate images, it’s really good at analyzing images and describing exactly what’s on the screen or exactly what things are doing in the image that you give it. I think it’s kind of cool. Looking forward to playing with that. API only though.”</span></i></p>
<h2><span style="font-weight:400;">AWS</span></h2>
<p><b>6:50 </b><a href="https://techcrunch.com/2024/10/16/amazon-jumps-on-nuclear-power-bandwagon-by-investing-in-x-energy-and-promising-small-reactors/" target="_blank" rel="noreferrer noopener"><b>Amazon jumps on nuclear power bandwagon by investing in X-Energy and </b></a></p>
<p><span style="font-weight:400;">         </span><a href="https://techcrunch.com/2024/10/16/amazon-jumps-on-nuclear-power-bandwagon-by-investing-in-x-energy-and-promising-small-reactors/" target="_blank" rel="noreferrer noopener"><b>promising small reactors</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft, then Google and Now AWS…and we’re positively glowing with all this nuclear energy!</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon revealed three deals, including an investment in startup X-Energy and two development agreements (Energy Northwest &amp; Dominion Energy)to add around 300 Megawatts of capacity in the PNW and Virginia. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The agreements include the constructions of several new Small Modular reactors (SMRs).  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">SMRs are an advanced kind of nuclear reactor with a small physical footprint, allowing them to be built closer to the grid. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This comes on top of their agreement to co-locate a data-center facility next to Talon Energy’s nuclear facility in Pennsylvania. </span></li>
</ul>
<p><i><span style="font-weight:400;">7:37  Ryan – “</span></i><i><span style="font-weight:400;">It’s so energy intensive to run AI workloads and you can’t really depend on you know like a cloudy day of ruining solar or non windy day like it’s can augment with that but it’s kind of interesting I’m really curious to see what they’ve done in terms of like nuclear waste and hopefully these smaller footprint reactors make that at least easier to manipulate versus like, you know, the giant amounts of nuclear waste that you have to track or train through towns.”</span></i></p>
<p><b>09:21</b> <a href="https://techcrunch.com/2024/10/16/this-week-in-ai-aws-loses-a-top-ai-exec/?guccounter=1" target="_blank" rel="noreferrer noopener"><b>This Week in AI: AWS loses a top AI exec</b></a><b>  </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Dr. Matt Wood, VP Of AI, </span><a href="https://www.linkedin.com/feed/update/urn:li:activity:7249765555107201025/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">announced</span></a><span style="font-weight:400;"> that he would be leaving AWS after 15 years.  Matt had been long involved in the AI initiatives and was appointed VP in September 2022.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Over the last two years there have been several missteps in AI, with Amazon missing out on </span><a href="https://techcrunch.com/2024/03/27/amazon-doubles-down-on-anthropic-completing-its-planned-4b-investment/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">investments</span></a><span style="font-weight:400;"> in Cohere and Anthropic, and having to do a joint investment with Google in Anthropic. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">AWS CEO Matt Garman is aggressively moving to right the ship, acqui-hiring AI startups such as </span><a href="https://techcrunch.com/2023/03/15/adept-a-startup-training-ai-to-use-existing-software-and-apis-raises-350m/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Adept</span></a><span style="font-weight:400;"> and investing in training systems like </span><a href="https://www.reuters.com/technology/amazon-sets-new-team-trains-ambitious-ai-model-codenamed-olympus-sources-2023-11-08/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Olympus</span></a><span style="font-weight:400;">.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">We’re not really sure if he resigned or was asked to leave. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The silver lining? No more boring keynotes! </span></li>
</ul>
<p><b>10:54</b> <a href="https://aws.amazon.com/blogs/mt/support-for-amazon-cloudwatch-evidently-ending-soon/?ck_subscriber_id=512838477" target="_blank" rel="noreferrer noopener"><b>Support for Amazon CloudWatch Evidently ending soon</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Way Back in December 2021 after Re:invent where it was announced we covered the launch of Evidently.  Our show notes at the time were “</span><span style="font-weight:400;"> AWS releases </span><a href="https://aws.amazon.com/blogs/aws/cloudwatch-evidently/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">CloudWatch Evidently</span></a><span style="font-weight:400;">, a capability that helps developers introduce experiments and feature management in their application code. The team remains confused as to why this is a CloudWatch feature.”</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Evidently no one else knew what Cloudwatch Evidently did either, and it’s being deprecated.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">AWS will discontinue the service on 10/17/2025 (so you have a year), and that’s when support for the service will end. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They’ll still provide critical security patches, but they will no longer support any limit increase requests. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">AWS recommends that you leverage </span><a href="https://aws.amazon.com/systems-manager/features/appconfig/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AppConfig</span></a><span style="font-weight:400;">, a feature of AWS Systems Manager.  Which I think we said you should keep using back then. </span></li>
</ul>
<p><i><span style="font-weight:400;">11:51  Ryan – “</span></i><i><span style="font-weight:400;">I do love that there’s no way you can find evidently, you know, because it’s part of CloudWatch, but you also won’t be able to find AppConfig because it’s buried in nine layers of Smangr.”</span></i></p>
<p><b>12:41</b> <a href="https://www.deeplearning.ai/short-courses/serverless-agentic-workflows-with-amazon-bedrock/" target="_blank" rel="noreferrer noopener"><b>Serverless Agentic Workflows with Amazon Bedrock</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS is launching a new short course developed in collaboration with Dr. Andrew Ng and Deep Learning AI.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This hands-on course taught by Mike Chambers, teaches how to build serverless agents that can handle complex tasks without the hassle of managing infrastructure.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You will learn everything you need to know about integrating tools, automating workflows, and deploying responsible agents with built-in guardrails with AWS and </span><a href="https://docs.aws.amazon.com/bedrock/latest/userguide/what-is-bedrock.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Bedrock</span></a><span style="font-weight:400;">.  </span></li>
</ul>
<p><i><span style="font-weight:400;">13:08  Justin – “</span></i><i><span style="font-weight:400;">I’m very excited about the concept of serverless agentic or even agentic AI in general, but I’m not sure that I would do it on Bedrock.”</span></i></p>
<p><b>13:57</b> <a href="https://aws.amazon.com/about-aws/whats-new/2024/10/aws-lambda-console-key-function-insights-amazon-cloudwatch-dashboard/" target="_blank" rel="noreferrer noopener"><b>AWS Lambda console now surfaces key function insights via built-in </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/aws-lambda-console-key-function-insights-amazon-cloudwatch-dashboard/" target="_blank" rel="noreferrer noopener"><b>Amazon CloudWatch Metrics Insights dashboard</b></a></p>
<p><b>14:13 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/aws-lambda-console-real-time-analytics-amazon-cloudwatch/" target="_blank" rel="noreferrer noopener"><b>AWS Lambda console now supports real-time log analytics via Amazon </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/aws-lambda-console-real-time-analytics-amazon-cloudwatch/" target="_blank" rel="noreferrer noopener"><b>CloudWatch Logs Live Tail</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">The AWS Lambda console now surfaces key metrics about Lambda Functions in your AWS account via a built-in Amazon CloudWatch Metric Insights Dashboard, enabling you to easily identify and troubleshoot the source of errors of performance issues. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Previously you would have to navigate to the </span><a href="https://docs.aws.amazon.com/AmazonCloudWatch/latest/monitoring/WhatIsCloudWatch.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Cloudwatch</span></a><span style="font-weight:400;"> console and query custom metrics or build custom dashboards. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Thank you. We’re honestly shocked this feature took so long to come out. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Not only do they now put some metrics into the </span><a href="https://aws.amazon.com/lambda/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Lambda</span></a><span style="font-weight:400;"> console, but you can also view real-time logs via </span><a href="https://docs.aws.amazon.com/AmazonCloudWatch/latest/logs/CloudWatchLogs_LiveTail.html#CloudWatchLogs_LiveTail_session" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Cloudwatch Logs Live Tail</span></a><span style="font-weight:400;">, an interactive log streaming and analytics capability that provides real-time visibility into logs, making it easier to develop and troubleshoot lambda functions. </span></li>
</ul>
<p><i><span style="font-weight:400;">14:41  Matthew – “</span></i><i><span style="font-weight:400;">I feel like the live tail is fairly recent and I used it a couple of weeks ago in Elastic Beanstalk. Don’t ask questions, but helping out somebody with Elastic Beanstalk, we’ll just move on. And it was a really nice feature of being able to go in there and see real time, hit the API, see the logs on the server, and kind of do it all in there. So I’m looking forward to actually having to be able to grab my lambdas and immediately be able to see the output versus.”</span></i></p>
<p><b>17:34</b> <a href="https://aws.amazon.com/blogs/security/options-for-aws-customers-who-use-entrust-issued-certificates/" target="_blank" rel="noreferrer noopener"><b>Options for AWS customers who use Entrust-issued certificates</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google and Mozilla, as well as the JRE will no longer support Entrust Public TLS certificates after November 2024</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Any certificates issued after November 11 2024 will not be trusted by the browsers. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">If you have imported Entrust certificates via </span><a href="https://aws.amazon.com/certificate-manager/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">ACM</span></a><span style="font-weight:400;"> for </span><a href="https://aws.amazon.com/elasticloadbalancing/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">ELB</span></a><span style="font-weight:400;"> or </span><a href="https://aws.amazon.com/cloudfront/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Cloudfront</span></a><span style="font-weight:400;">, you will need to reissue these certs before November 12th 2024.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The Chrome Security Team wrote in a </span><a href="https://security.googleblog.com/2024/06/sustaining-digital-certificate-security.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">blog post</span></a><span style="font-weight:400;">: “</span><i><span style="font-weight:400;">Over the past several years, publicly disclosed incident reports highlighted a pattern of concerning behaviors by Entrust that fall short of the above expectations, and has eroded confidence in their competence, reliability, and integrity as a publicly-trusted CA Owner.</span></i><span style="font-weight:400;">”</span></li>
</ul>
<p><b>20:46</b> <a href="https://aws.amazon.com/about-aws/whats-new/2024/10/aws-seamless-link-experience-console-mobile-app/" target="_blank" rel="noreferrer noopener"><b>AWS announces a seamless link experience for the AWS Console Mobile </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/aws-seamless-link-experience-console-mobile-app/" target="_blank" rel="noreferrer noopener"><b>App</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">I mean… we’ve wanted this – but we’re also a bit afraid of this feature, as the mobile apps from the cloud providers are pretty limited.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">AWS is announcing a seamless link experience for the AWS console mobile app.  Link to AWS services and resources can now be opened in the AWS Console mobile app when customers have the app installed on their mobile device. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Now AWS customers who are on the go can open links to AWS services and resources from sources like email and chat. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Customers benefit from the mobile apps biometric authentication, and mobile optimized customer experience. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Links to AWS services or resources not available natively, are accessible via an in app browser where customers can deep link to the relevant pages without additional authentication. </span></li>
</ul>
<p><i><span style="font-weight:400;">21:41  Justin – “</span></i><i><span style="font-weight:400;">So this is a nice quality of life improvement. If you’re a heavy user of the mobile app, which as much as I would like to be, I am not because they’re Customers benefit from using the mobile app because it supports bioelectric authentication as well as mobile optimized customer experience. And in the few cases where they don’t have a service that supported, they will apparently now open that experience in a native browser inside of the Amazon console mobile app, which if that works, okay, I’ll accept it, but I’m worried it’s not going to work well, but we’ll see.”</span></i></p>
<p><b>23:47</b> <a href="https://aws.amazon.com/about-aws/whats-new/2024/10/amazon-s3-new-region-bucket-name-filtering-listbuckets-api/" target="_blank" rel="noreferrer noopener"><b>Amazon S3 adds new Region and bucket name filtering for the ListBuckets </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/amazon-s3-new-region-bucket-name-filtering-listbuckets-api/" target="_blank" rel="noreferrer noopener"><b>API</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Stop me if you haven’t had this scenario before, someone needs access to an S3 bucket, you provision them an account, create the IAM policy, and then provide them access.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Next thing you know,  they call you and say they see a ton of buckets in addition to the one you gave them, and they would like to access more buckets… rinse and repeat.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This announcement fixes this problem, and allows you to keep access restricted.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon S3 now supports AWS region and bucket name filters for the ListBuckets API.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In addition, paginated listbuckets requests now return your S3 general purpose buckets and their corresponding AWS regions in the response, helping you simplify apps that need to determine bucket locations across multiple regions. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">To get started, you can specify the AWS region in the query parameter and the bucket name prefixes.  </span></li>
</ul>
<p><i><span style="font-weight:400;">24:56  Matthew – “</span></i><i><span style="font-weight:400;">It’s amazing how many times they’ve had to, somebody’s been like, okay, they just need access to this bucket. And like, someone gave them just access to the bucket and then they’re like, if they can’t, it doesn’t work. And I’d be like, did you do list? And then literally your scenario would come up and it’s amazing. It’s taken 15 years for this to get fixed. Like I understand S3 is in its own world in IAM, cause it pre-exists IAM, but like this feels like it should have been something.”</span></i></p>
<p><b>27:02 </b><a href="https://aws.amazon.com/blogs/aws/upgraded-claude-3-5-sonnet-from-anthropic-available-now-computer-use-public-beta-and-claude-3-5-haiku-coming-soon-in-amazon-bedrock/" target="_blank" rel="noreferrer noopener"><b>Upgraded Claude 3.5 Sonnet from Anthropic (available now), computer use (public beta), and Claude 3.5 Haiku (coming soon) in Amazon Bedrock</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS supports the new </span><a href="https://aws.amazon.com/blogs/aws/anthropics-claude-3-5-sonnet-model-now-available-in-amazon-bedrock-the-most-intelligent-claude-model-yet/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Claude</span></a><span style="font-weight:400;"> libraries. </span><span style="font-weight:400;"> </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This is what happens when you don’t have a copywriter monitoring your releases and writing your posts. You come in second place. </span></li>
</ul>
<h2><span style="font-weight:400;">GCP</span></h2>
<p><b>27:29 </b><a href="https://blog.google/technology/ai/notebooklm-update-october-2024/" target="_blank" rel="noreferrer noopener"><b>New in NotebookLM: Customizing your Audio Overviews and introducing </b></a><a href="https://blog.google/technology/ai/notebooklm-update-october-2024/" target="_blank" rel="noreferrer noopener"><b>NotebookLM Business</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Justin did a thing!</span></li>
<li style="font-weight:400;"><a href="http://notebooklm.google/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Notebook LM</span></a><span style="font-weight:400;"> is a newish tool built with </span><a href="https://deepmind.google/technologies/gemini/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Gemini 1.5</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can upload a set of sources on a topic, and the notebook becomes an expert by grounding its responses in your material and giving you powerful ways to transform information. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can use this to create study guides, quizzes or even an </span><a href="https://blog.google/technology/ai/notebooklm-audio-overviews/%5C" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">audio overview</span></a><span style="font-weight:400;"> of the material. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Now, with this announcement you can guide the conversation by providing instructions like focusing on a specific topic or adjusting the expertise level to suit your audience. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">And it makes impressive podcasts (Demo)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They are also announcing </span><a href="https://notebooklm.google/business" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">NotebookLM Business</span></a><span style="font-weight:400;">, an upcoming version that will be offered via Google Workspace with enhanced features for businesses, universities and organizations. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Note: The Cloud Pod’s female eye candy is the copywriter, not a host. Just FYI. </span></li>
</ul>
<p><i><span style="font-weight:400;">32:05  Justin – “</span></i><i><span style="font-weight:400;">You can definitely tell at different levels of how technical you want it to be. I chose a medium technical ability for it. That’s what I gave in the guidance for this new feature. But it gave me an idea. It’s funny because it has some of the inflections that you would have in a podcast when you’re thinking. We’re not out of a job yet, but maybe someday.”</span></i></p>
<p><b>34:51</b> <a href="https://developers.googleblog.com/en/compare-mode-in-google-ai-studio/" target="_blank" rel="noreferrer noopener"><b>Compare Mode in Google AI Studio: Your Companion for Choosing the </b></a><a href="https://developers.googleblog.com/en/compare-mode-in-google-ai-studio/" target="_blank" rel="noreferrer noopener"><b>Right Gemini Model</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://aistudio.google.com/app/prompts/new_comparison" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Compare Mode</span></a><span style="font-weight:400;"> is a new feature designed to help you make informed decisions about which Gemini model best suits your needs. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Compare Mode simplifies the process of assessing cost, latency, token limits and response quality, allowing you to evaluate responses across the various Gemini and Gemma models available in AI studio, side by side. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With this capability you can provide a prompt, and optional system instructions and compare mode will display the outputs from various models, allowing you to quickly assess the strengths of each of your specific use cases.</span></li>
</ul>
<p><i><span style="font-weight:400;">35:32  Ryan – “</span></i><i><span style="font-weight:400;">I also wonder how much this is going to like, you know, the, the, the, more expensive models are going to perform better in most cases. And so like it’s going to be, it’s going to lean you in that direction, or at least it seems like that’s going to be the case, but it’d be interesting.”</span></i></p>
<p><b>40:06</b> <a href="https://cloud.google.com/blog/products/ai-machine-learning/upgraded-claude-3-5-sonnet-with-computer-use-on-vertex-ai/" target="_blank" rel="noreferrer noopener"><b>Announcing Anthropic’s upgraded Claude 3.5 Sonnet on Vertex AI</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">With the launch of Claude 3.5 partner Google is here to tell you that they have added it to the </span><a href="https://cloud.google.com/model-garden" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Vertex AI Model Garden</span></a><span style="font-weight:400;">.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Including the computer use capability in the public beta.   </span></li>
</ul>
<p><b>40:20</b> <a href="https://cloud.google.com/blog/products/devops-sre/announcing-the-2024-dora-report/" target="_blank" rel="noreferrer noopener"><b>Highlights from the 10th DORA report</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">The </span><a href="https://cloud.google.com/resources/devops/state-of-devops?hl=en" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">2024 Accelerated State of DevOps reporting</span></a><span style="font-weight:400;"> has been published. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">One of the highlights of widespread AI adoption is reshaping software development practices with over 75% of respondents saying they rely on AI for at least one daily professional responsibility. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">More than 1/3rd of the respondents said AI experienced moderate to extreme productivity increases from AI. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">However, AI adoption may negatively impact software delivery performance and a reduction in delivery stability. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Despite the productivity gains, respondents reported little to no trust in AI-generated code. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Platform engineering is another area of increased adoption, per the report.  4 key findings were found</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Increased developer productivity</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Prevalence in larger firms</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Potential performance dip</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Need for user-centeredness and developer independence</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Developer experience is the cornerstone of success</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">I need to read the full report, but I’m not surprised by any of these findings. </span></li>
</ul>
<h2><span style="font-weight:400;">Azure</span></h2>
<p><b>42:48 </b><a href="https://techcommunity.microsoft.com/t5/apps-on-azure-blog/new-secure-sandboxes-at-scale-with-azure-container-apps-dynamic/ba-p/4142148" target="_blank" rel="noreferrer noopener"><b>New: Secure Sandboxes at Scale with Azure Container Apps Dynamic </b></a><a href="https://techcommunity.microsoft.com/t5/apps-on-azure-blog/new-secure-sandboxes-at-scale-with-azure-container-apps-dynamic/ba-p/4142148" target="_blank" rel="noreferrer noopener"><b>Sessions</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Azure is announcing in preview Azure Container Apps dynamic sessions.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Azure Container apps is a serverless platform that enables you to run containerized workloads without managing the underlying infrastructure. Dynamic sessions add the ability to execute untrusted code in secure, sandboxes environments at scale. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Dynamics sessions provide secure, ephemeral sandboxes called “sessions” for running potentially malicious code.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Dynamic sessions are ideal for running untrusted code in hostile multi-tenant scenarios:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Running code generated by a LLM</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Running code or commands submitted by cloud app users</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Running cloud based development environments, terminals and more. </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">43:36  Jonathan – “</span></i><i><span style="font-weight:400;">Imagine you have a service where you want people to be able to define something as code, like a dashboard or some kind of agent for AI or something like that. And you want to test it in a sandbox where it’s not going to have any production impact if it fails or goes into some infinite loop or something. It’s great. It’s really nice to an isolated place to go and test things.”</span></i></p>
<p><b>44:42</b> <a href="https://techcrunch.com/2024/10/17/microsoft-said-it-lost-weeks-of-security-logs-for-its-customers-cloud-products/" target="_blank" rel="noreferrer noopener"><b>Microsoft said it lost weeks of security logs for its customers’ cloud </b></a><a href="https://techcrunch.com/2024/10/17/microsoft-said-it-lost-weeks-of-security-logs-for-its-customers-cloud-products/" target="_blank" rel="noreferrer noopener"><b>products</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">I mean why does anyone trust Microsoft for anything related to security?  This week’s nonsense…</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft has notified customers that it’s missing more than two weeks of security logs for some of its cloud products, leaving network defenders without critical data for detecting possible intrusions. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Per the note sent to customers “</span><i><span style="font-weight:400;">a bug in internal monitoring agents resulted in a malfunction in some of the agents when uploading log data to their internal logging platform</span></i><span style="font-weight:400;">.”</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The notification assures you that it was not caused by a security incident and only affected the collection of log events. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Products impacted included: Entra, Sentinel, Defender for Cloud and Purview. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This comes a year after Federal Investigators complained that Microsoft was withholding security logs from certain US federal government departments that house their emails on the company’s hardened, government-only cloud.  </span></li>
</ul>
<p><i><span style="font-weight:400;">45:54  Matthew – “…</span></i><i><span style="font-weight:400;">there’s only so many hits before people really start. You know yelling at Microsoft being like guys, you can’t lose our security logs that feels like 101 Mike. These systems need to be tested through and through before we promote it, especially for things like your DLP, your AD, your, your SIEM software. Like you can’t be missing these things.”</span></i></p>
<p><b>47:54</b> <a href="https://azure.microsoft.com/en-us/blog/leverage-microsoft-azure-tools-to-navigate-nis2-compliance/" target="_blank" rel="noreferrer noopener"><b>Leverage Microsoft Azure tools to navigate NIS2 compliance </b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Robust cybersecurity measures are vital for organizations to address evolving cyberthreats and navigate regulatory requirements and their impact on compliance strategies.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">NIS 2 is a European Union set of security measures to mitigate risk of cyberthreats and overall levels of cyber securities…</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">But we can’t… how do you explain to the EU that your missing security logs for 2 weeks? </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">WHAT THE HECK. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Leverage tools to maintain compliance – sure Microsoft. Sure. </span></li>
</ul>
<p><b>50:34</b> <a href="https://azure.microsoft.com/en-us/blog/azure-cobalt-100-based-virtual-machines-are-now-generally-available/" target="_blank" rel="noreferrer noopener"><b>Azure Cobalt 100-based Virtual Machines are now generally available</b></a></p>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">If you’ve been anxiously waiting for some </span><a href="https://azure.microsoft.com/get-started/azure-portal/resource-manager" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">ARM</span></a><span style="font-weight:400;"> based virtual machines on </span><a href="https://azure.microsoft.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Azure</span></a><span style="font-weight:400;">, they are pleased to announce the </span><a href="https://azure.microsoft.com/en-us/blog/azure-cobalt-100-based-virtual-machines-are-now-generally-available/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Azure Cobalt 100-based VM’s</span></a><span style="font-weight:400;"> are now GA.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">These VM’s leverage Microsoft’s first 64 bit Arm-Based Azure Cobalt 100 CPU, which has been fully designed in-house. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The new Cobalt 100 instances are in 2 varieties, a general purpose Dpsv6-series and a memory-optimized Epsv6-series VM Series. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Dpsv6 and Dpdsv6 vms offer up to 96 vCPUs and 384gb of memory. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The Dplsv6 series and dpldsv6 series up to 96 vcpus and 192gb of memory</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Epsv6 and epdsv6 series offer up to 96 vcpus and 672 gib of Ram. </span></li>
</ul>
</li>
<li style="font-weight:400;"><i><span style="font-weight:400;"> “We are really excited about the new Cobalt 100 VMs. We are making them the primary platform for our Databricks SQL Serverless offering on Azure, as they offer outstanding efficiency and allow us to deliver significant price-performance improvements to our customers. Customers using our Azure Databricks classic Jobs offering will also greatly benefit from Cobalt VMs by selecting them for their Jobs cluster nodes, achieving noticeable performance improvements while keeping operating costs down.” —</span></i><b><i>Michael Kiermaier, VP of Business Strategy and Operations, Databricks</i></b></li>
</ul>
<p><i><span style="font-weight:400;">52:05  Matthew – “</span></i><i><span style="font-weight:400;">I remember playing with the the Gravitons when they first came out and they were pretty nice. And so it is something that I kind of will throw into some dev and other environments to see how well they are. And what’s nice is they’re actually pretty well available. Like I’m looking at it and it’s a good chunk of reasons that are available day one.”</span></i></p>
<p><b>53:23</b> <a href="https://blogs.microsoft.com/blog/2024/10/21/new-autonomous-agents-scale-your-team-like-never-before/" target="_blank" rel="noreferrer noopener"><b>New autonomous agents scale your team like never before</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Azure is announcing two new agentic capabilities that will accelerate the gains and bring AI-first business process to every organization</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">First the ability to create autonomous agents with CoPilot Studio will be in public preview next month</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Second, they have introduced ten new autonomous agents in D365 to build capacity for sales, service, finance and supply chain teams.</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Earlier this year they announced the copilot studio in private beta, and it will be shifting to public preview, allowing more customers to reimagine critical business processes with AI.  Agents draw on the context of your work data in M365 Graph, system of record, dataverse, and fabric. They can support everything from your IT help desk to employee onboarding and act as personal concierges for sales and service.   </span></li>
</ul>
<p><i><span style="font-weight:400;">54:48  Jonathan – “…</span></i><i><span style="font-weight:400;">they’re not just agents, they’re AI workers for hire.”</span></i></p>
<h2><span style="font-weight:400;">Closing</span></h2>
<p><span style="font-weight:400;">And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</span></p>
]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/1871025/c1e-jkjkuqr847up3nr6-jpjx7v7gtg6x-i2l8m0.mp3" length="67055197"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 280 of The Cloud Pod, where the forecast is always cloudy! This week Justin, Jonathan, Ryan, and Matthew are your hosts as we travel through the latest in cloud news. This week we’re talking more about nuclear power, some additional major employee shakeups, Claude releases, plus saying RIP to CloudWatch Evidently and hello to Azure Cobalt VMs.  
Titles we almost went with this week:

☢️The cloud providers are colluding on Nuclear Power
I fear our AWS AI nightmare might get worse without Dr. Matt Wood.
I’m a glow with excitement about nuclear cloud power
⚛️Plainly no one else knew what “CloudWatch Evidently” did either
We sing a Claude Sonnet about Nuclear Power
✅Evidently, The Cloud Pod was always right
Amazon goes nuclear while their AI VP goes AWOL

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info.
AI Is Going Great – Or How ML Makes All It’s Money  
00:53 Introducing computer use, a new Claude 3.5 Sonnet, and Claude 3.5 Haiku

Anthropic is announcing the upgraded Claude 3.5 Sonnet and a new Model Claude 3.5 Haiku. 
Claude 3.5 Sonnet delivers across the board improvements over its predecessor, with particularly significant gains in coding — an area where it already leads the field (per anthropic).  
Claude 3.5 Haiku interestingly matches the performance of Claude 3 Opus, the prior largest model, on many evaluations at the same cost and similar speed to the previous generation of Haiku. 
Claude 3.5 Sonnet also includes a groundbreaking new capability in beta: Computer Use.  
Available today as an API, developers can direct Claude to use computers the way people do – by looking at a screen, moving a cursor, clicking buttons and typing text.  
Claude 3.5 is the first frontier AI model to offer this capability. 
Anthropic warns the feature is still experimental – at times cumbersome and error-prone. As well as things that are effortless for a human are still difficult including scrolling, dra...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/1871025/c1a-k5d5-wwm4r9ora7gp-h8cznn.jpg"></itunes:image>
                                                                            <itunes:duration>00:55:53</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                            </item>
                    <item>
                <title>
                    <![CDATA[279: The Cloud Pod Glows With Excitement Over Google Nuclear Deal]]>
                </title>
                <pubDate>Wed, 23 Oct 2024 09:30:14 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/1865911</guid>
                                    <link>https://tcpfm.castos.com/episodes/279-google-nuclear</link>
                                <description>
                                            <![CDATA[<p><span style="font-weight:400;">Welcome to episode 279 of The Cloud Pod, where the forecast is always cloudy! This week Justin, Jonathan and Matthew are your guide through the Cloud. We’re talking about everything from BigQuery to Google Nuclear power plans, and everything in between! Welcome to episode 279! </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">AWS SKYNET (Q) now controls the supply chain</span></li>
<li><span style="font-weight:400;">⛓️AWS Supply Chain: Where skynet meets your shopping list</span></li>
<li><span style="font-weight:400;">Digital Ocean follows Azure with the Premium everything</span></li>
<li><span style="font-weight:400;">⛰️EKS mounts S3 </span></li>
<li><span style="font-weight:400;">GCP now a nuclear</span></li>
<li><span style="font-weight:400;">Big query don’t hit that iceberg </span></li>
<li><span style="font-weight:400;">Big Query Yells: “ICEBERG AHEAD” </span></li>
<li><span style="font-weight:400;">The Cloud Pod: Now with 50% more meltdown protection</span></li>
<li><span style="font-weight:400;">☢️The Cloud Pod radiates excitement over Google’s nuclear deal</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><span style="font-weight:400;">Follow Up</span></h2>
<p><b>00:46 </b><a href="https://www.theinformation.com/articles/openais-newest-possible-threat-ex-cto-murati-googles-mini-chatgpt-moment?rc=3t8xtd" target="_blank" rel="noreferrer noopener"><b>OpenAI’s Newest Possible Threat: Ex-CTO Murati</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Apologies listeners – paywall article. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Given the recent departure of Ex-CTO Mira Murati from OpenAI, we speculated that she might be starting something new…and the rumors are rumorin’. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Rumors have been running wild since her last day on October 4th, with several people reporting that there has been a lot of churn. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Speculation is that Murati may join former Open AI VP Bret Zoph at his new startup.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It may be easy to steal some people, as the research organization at Open AI is reportedly in upheaval after Liam Fedus’s promotion to lead post-training – several researchers have asked to switch teams. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In addition, Ilya Sutskever, an Open AI co-founder and former chief scientist, also has a new startup.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">We’ll definitely be keeping an eye on this particular soap opera. </span></li>
</ul>
<p><i><span style="font-weight:400;">2:00  Jonathan – “</span></i><i><span style="font-weight:400;">I kind wonder what will these other startups bring that’s different than what OpenAI are doing or Anthropic or anybody else. mean, they’re all going to be taking the same training data sets because that’s what’s available. It’s not like they’re going to invent some data from somewhere else and have an edge. I mean, I guess they could do different things like be mindful about licensing.”</span></i></p>
<h2><span style="font-weight:400;">General News</span></h2>
<p><b>4:41 </b><a href="https://www.digitalocean.com/blog/new-larger-droplet-sizes" target="_blank" rel="noreferrer noopener"><b>Introducing New 48vCPU and 60vCPU Optimized Premium Droplets on </b></a><a href="https://www.digitalocean.com/blog/new-larger-droplet-sizes" target="_blank" rel="noreferrer noopener"></a></p>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 279 of The Cloud Pod, where the forecast is always cloudy! This week Justin, Jonathan and Matthew are your guide through the Cloud. We’re talking about everything from BigQuery to Google Nuclear power plans, and everything in between! Welcome to episode 279! 
Titles we almost went with this week:

AWS SKYNET (Q) now controls the supply chain
⛓️AWS Supply Chain: Where skynet meets your shopping list
Digital Ocean follows Azure with the Premium everything
⛰️EKS mounts S3 
GCP now a nuclear
Big query don’t hit that iceberg 
Big Query Yells: “ICEBERG AHEAD” 
The Cloud Pod: Now with 50% more meltdown protection
☢️The Cloud Pod radiates excitement over Google’s nuclear deal

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
Follow Up
00:46 OpenAI’s Newest Possible Threat: Ex-CTO Murati

Apologies listeners – paywall article. 
Given the recent departure of Ex-CTO Mira Murati from OpenAI, we speculated that she might be starting something new…and the rumors are rumorin’. 
Rumors have been running wild since her last day on October 4th, with several people reporting that there has been a lot of churn. 
Speculation is that Murati may join former Open AI VP Bret Zoph at his new startup.  
It may be easy to steal some people, as the research organization at Open AI is reportedly in upheaval after Liam Fedus’s promotion to lead post-training – several researchers have asked to switch teams. 
In addition, Ilya Sutskever, an Open AI co-founder and former chief scientist, also has a new startup.  
We’ll definitely be keeping an eye on this particular soap opera. 

2:00  Jonathan – “I kind wonder what will these other startups bring that’s different than what OpenAI are doing or Anthropic or anybody else. mean, they’re all going to be taking the same training data sets because that’s what’s available. It’s not like they’re going to invent some data from somewhere else and have an edge. I mean, I guess they could do different things like be mindful about licensing.”
General News
4:41 Introducing New 48vCPU and 60vCPU Optimized Premium Droplets on ]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[279: The Cloud Pod Glows With Excitement Over Google Nuclear Deal]]>
                </itunes:title>
                                    <itunes:episode>279</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p><span style="font-weight:400;">Welcome to episode 279 of The Cloud Pod, where the forecast is always cloudy! This week Justin, Jonathan and Matthew are your guide through the Cloud. We’re talking about everything from BigQuery to Google Nuclear power plans, and everything in between! Welcome to episode 279! </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">AWS SKYNET (Q) now controls the supply chain</span></li>
<li><span style="font-weight:400;">⛓️AWS Supply Chain: Where skynet meets your shopping list</span></li>
<li><span style="font-weight:400;">Digital Ocean follows Azure with the Premium everything</span></li>
<li><span style="font-weight:400;">⛰️EKS mounts S3 </span></li>
<li><span style="font-weight:400;">GCP now a nuclear</span></li>
<li><span style="font-weight:400;">Big query don’t hit that iceberg </span></li>
<li><span style="font-weight:400;">Big Query Yells: “ICEBERG AHEAD” </span></li>
<li><span style="font-weight:400;">The Cloud Pod: Now with 50% more meltdown protection</span></li>
<li><span style="font-weight:400;">☢️The Cloud Pod radiates excitement over Google’s nuclear deal</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><span style="font-weight:400;">Follow Up</span></h2>
<p><b>00:46 </b><a href="https://www.theinformation.com/articles/openais-newest-possible-threat-ex-cto-murati-googles-mini-chatgpt-moment?rc=3t8xtd" target="_blank" rel="noreferrer noopener"><b>OpenAI’s Newest Possible Threat: Ex-CTO Murati</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Apologies listeners – paywall article. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Given the recent departure of Ex-CTO Mira Murati from OpenAI, we speculated that she might be starting something new…and the rumors are rumorin’. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Rumors have been running wild since her last day on October 4th, with several people reporting that there has been a lot of churn. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Speculation is that Murati may join former Open AI VP Bret Zoph at his new startup.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It may be easy to steal some people, as the research organization at Open AI is reportedly in upheaval after Liam Fedus’s promotion to lead post-training – several researchers have asked to switch teams. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In addition, Ilya Sutskever, an Open AI co-founder and former chief scientist, also has a new startup.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">We’ll definitely be keeping an eye on this particular soap opera. </span></li>
</ul>
<p><i><span style="font-weight:400;">2:00  Jonathan – “</span></i><i><span style="font-weight:400;">I kind wonder what will these other startups bring that’s different than what OpenAI are doing or Anthropic or anybody else. mean, they’re all going to be taking the same training data sets because that’s what’s available. It’s not like they’re going to invent some data from somewhere else and have an edge. I mean, I guess they could do different things like be mindful about licensing.”</span></i></p>
<h2><span style="font-weight:400;">General News</span></h2>
<p><b>4:41 </b><a href="https://www.digitalocean.com/blog/new-larger-droplet-sizes" target="_blank" rel="noreferrer noopener"><b>Introducing New 48vCPU and 60vCPU Optimized Premium Droplets on </b></a><a href="https://www.digitalocean.com/blog/new-larger-droplet-sizes" target="_blank" rel="noreferrer noopener"><b>DigitalOcean</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Those raindrops are getting pretty heavy as Digital Ocean announces their new 48vCPU Memory and storage optimized premium droplets, and 60vcpu general purpose and CPU optimized premium droplets. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Droplets are DO’s Linux-based virtual machines.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Premium Optimized Droplets are dedicated CPU instances with access to the full hyperthread, as well as 10GBps of outbound data transfer.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The 48vCPU boxes have 384GB of memory, and the 60vCPU boxes have 160gb.</span></li>
</ul>
<p><i><span style="font-weight:400;">6:02  Justin – “</span></i><i><span style="font-weight:400;">I’ve been watching the CloudPod hosting bill slowly creep up over the years as we get more and more data into S3 and we have logs that we store and things like that for the website. And I have other websites that I host there too. it originally started on DigitalOcean and it was a very flat rate for that VM that I need. You start sort of thinking like, maybe Amazon is great for this use case.”</span></i></p>
<h2><span style="font-weight:400;">AWS</span></h2>
<p><b>19:31 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/cross-zone-network-load-balancer-zonal-shift-autoshift/" target="_blank" rel="noreferrer noopener"><b>Cross-zone enabled Network Load Balancer now supports zonal shift and </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/cross-zone-network-load-balancer-zonal-shift-autoshift/" target="_blank" rel="noreferrer noopener"><b>zonal autoshift</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS NLB now supports Amazon Application Recovery Controllers’ zonal shift and zonal auto-shift features on load balancers enabled across zones.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Zonal shift allows you to quickly shift traffic away from an impaired availability zone and recover from events such as bad application deployment and gray failures. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Zonal autoshift safely and automatically shifts your traffic away from an AZ when AWS identifies a potential impact to it. </span></li>
</ul>
<p><i><span style="font-weight:400;">19:57  Justin – “</span></i><i><span style="font-weight:400;">I like just to do that off my health checks, not off AWS telling them, but I appreciate the effort because when you do run into these type of AZ specific issues, they can be a bit of a pain to identify quickly. If Amazon can identify they have a problem and route your traffic for you, that is a great upgrade.”</span></i></p>
<p><b>21:23 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/amazon-memorydb-valkey/" target="_blank" rel="noreferrer noopener"><b>Announcing Amazon MemoryDB for Valkey </b></a></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/amazon-elasticache-valkey/" target="_blank" rel="noreferrer noopener"><b>Announcing Amazon ElastiCache for Valkey</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/memorydb/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon MemoryDB</span></a><span style="font-weight:400;"> and Elasticache have both announced support for </span><a href="https://valkey.io/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Valkey</span></a><span style="font-weight:400;"> with 30% and 33% lower costs than Memory DB and</span><a href="https://aws.amazon.com/elasticache/redis/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;"> Elasticache for Redis OSS</span></a><span style="font-weight:400;">.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Ironically, we saved you 50% by reducing these two stories into one. You’re welcome. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In addition, they give you a nice free tier, where with MemoryDB, you are not charged for up to 10TB of data written per month. Any data over 10TB a month is billed at 0.04 GB, which is 80% lower than MemoryDB for Redis OSS.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">For Elasticache, serverless is 33% lower and Node based pricing is 20% lower than the supported engines. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Nice move passing on some savings to customers to drive Valkey adoption, and probably improve their margin as well by not having to pay </span><a href="https://redis.io/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Redis</span></a><span style="font-weight:400;">. </span></li>
</ul>
<p><i><span style="font-weight:400;">22:54  Matthew – “</span></i><i><span style="font-weight:400;">10 terabytes for a month on the free tier is a ton too. Like, I know a lot of apps that use Redis that honestly probably don’t even hit that in a production workload. So this is great. And I think I’m just more mad that when Redis forked or changed license, they were like, Azure stay with us. And now I’m just mad at everyone with all these improvements.”</span></i></p>
<p><b>24:16</b> <a href="https://aws.amazon.com/about-aws/whats-new/2024/10/organization-wide-views-agreements-spend-aws-marketplace/" target="_blank" rel="noreferrer noopener"><b>Access organization-wide views of agreements and spend in AWS </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/organization-wide-views-agreements-spend-aws-marketplace/" target="_blank" rel="noreferrer noopener"><b>Marketplace</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/organization-wide-views-agreements-spend-aws-marketplace/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Marketplace</span></a><span style="font-weight:400;"> announces the GA of a new procurement insights dashboard, helping you manage your organization’s renewals and optimize your AWS marketplace spend. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The new dashboard gives you detailed visibility into your organization’s AWS marketplace agreements and associated spend across the AWS accounts in your organization.  </span></li>
</ul>
<p><i><span style="font-weight:400;">24:40  Justin – “</span></i><i><span style="font-weight:400;">…this is actually an interesting challenge, because if you’re buying your cloud solutions, you typically have a reseller or you’re going direct with AWS. And in the event that you’re doing marketplace, just it’s part of your cloud spend. And so you can commit a lot of money through marketplace without going through proper procurement cycles and without proper governance. And so by giving this now a consistent single dashboard, you can now hopefully start keeping track of where things are being spent.”</span></i></p>
<p><b>26:10 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/mountpoint-amazon-s3-csi-driver-access-controls-kubernetes-pods/" target="_blank" rel="noreferrer noopener"><b>Mountpoint for Amazon S3 CSI driver introduces new access controls for </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/mountpoint-amazon-s3-csi-driver-access-controls-kubernetes-pods/" target="_blank" rel="noreferrer noopener"><b>individual Kubernetes pods</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Mountpoint for S3 Container Storage Interface now supports configuring distinct </span><a href="https://aws.amazon.com/iam/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS identity and access management (IAM)</span></a><span style="font-weight:400;"> roles for individual </span><a href="https://kubernetes.io/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">K8</span></a><span style="font-weight:400;"> pods. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Built on top of </span><a href="https://aws.amazon.com/s3/features/mountpoint/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Mountpoint for S3</span></a><span style="font-weight:400;">, the CSI driver presents an </span><a href="https://aws.amazon.com/s3/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">S3</span></a><span style="font-weight:400;"> bucket as volume accessible by containers in </span><a href="https://aws.amazon.com/pm/eks/?trk=43cd0cbf-ec64-42e8-a02d-24318978ccbe&amp;sc_channel=ps&amp;s_kwcid=AL!4422!10!72086967750141!72087494887246&amp;ef_id=e83081162a50159a58e8e60e5db3e2aa:G:s" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon EKS</span></a><span style="font-weight:400;"> and self-managed K8 clusters. </span></li>
</ul>
<p><i><span style="font-weight:400;">26:51  Jonathan – “</span></i><i><span style="font-weight:400;">I thought pods had the ability to have their own roles that they can assume for a long time, so I was surprised that this wasn’t already inherited from that existing functionality.”</span></i></p>
<p><b>27:19 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/amazon-opensearch-serverless-suite-features-enhancements/" target="_blank" rel="noreferrer noopener"><b>Amazon OpenSearch Serverless introduces a suite of new features and </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/amazon-opensearch-serverless-suite-features-enhancements/" target="_blank" rel="noreferrer noopener"><b>enhancements </b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/opensearch-service/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Open Search</span></a><span style="font-weight:400;"> serverless has several new features this week.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">A new flat object type has been introduced, which allows for more efficient storage and searching of nested data. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Support for enhanced geospatial features, providing users with the ability to uncover valuable insights from location data.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Expanded field types, including support for unsigned long, and doc count mapper. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The multi-term aggregation feature enables you to perform complex aggregations and gain deeper insights into your data. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Furthermore, serverless Opensearch has seen a significant reduction in indexing latencies and faster ascending/descending search sorts, improving efficiency and performance overall. </span></li>
</ul>
<p><i><span style="font-weight:400;">29:09  Justin – “</span></i><i><span style="font-weight:400;">new features are always a bit delayed. Like they would announce it with a blog post and the blog post all you get for like two or three weeks. I mean, if you look back next week, I bet there’s updated documentation. So there’s a disconnect between the announcement and the documentation team and when they publish things.”</span></i></p>
<p><b>29:34 </b><a href="https://aws.amazon.com/blogs/aws/convert-aws-console-actions-to-reusable-code-with-aws-console-to-code-now-generally-available/" target="_blank" rel="noreferrer noopener"><b>Convert AWS console actions to reusable code with AWS Console-to-Code, </b></a><a href="https://aws.amazon.com/blogs/aws/convert-aws-console-actions-to-reusable-code-with-aws-console-to-code-now-generally-available/" target="_blank" rel="noreferrer noopener"><b>now generally available</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS is announcing the General Availability of AWS Console-to-Code which makes it easy to convert AWS console actions to reusable code. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can use AWS Console-to-code to record your actions and workflows in the console, such as launching an </span><a href="https://aws.amazon.com/ec2/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">EC2</span></a><span style="font-weight:400;"> instance, reviewing the</span><a href="https://aws.amazon.com/cli/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;"> AWS CLI</span></a><span style="font-weight:400;"> for your console actions.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With just a few clicks more, </span><a href="https://aws.amazon.com/q/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Q</span></a><span style="font-weight:400;"> can generate code for you using IaC format of your change including </span><a href="https://aws.amazon.com/cloudformation/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Cloudformation</span></a><span style="font-weight:400;"> YAML or JSON (does anyone still do Cloudformation in JSON?) and </span><a href="https://aws.amazon.com/cdk/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS CDK </span></a><span style="font-weight:400;">Typescript, Python or Java.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This can be used as a starting point for infrastructure automation and further customized for your production workloads, included in pipelines and more. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">For GA it has several new features:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Support for more services including </span><a href="https://console.aws.amazon.com/ec2" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">EC2</span></a><span style="font-weight:400;">, </span><a href="https://aws.amazon.com/rds/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">RDS</span></a><span style="font-weight:400;"> and</span><a href="https://aws.amazon.com/vpc/%5C" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;"> VPC</span></a></li>
<li style="font-weight:400;"><span style="font-weight:400;">Simplified experience in managing the prototyping, recording and code generation workflows. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Preview code</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Advanced code generation </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">31:07  Matthew – “</span></i><i><span style="font-weight:400;">Well, the problem with CDK was, especially – granted this was years ago – you tried to do anything too fancy with it and it just kind of tried to do too many things and then CloudFormation would barf…I’m sure it’s exponentially better now, like five years later, or might be more than that at this point. I don’t really want to do that math.’</span></i></p>
<h2><span style="font-weight:400;">GCP</span></h2>
<p><b>31:58 </b><a href="https://blog.google/outreach-initiatives/sustainability/google-kairos-power-nuclear-energy-agreement/" target="_blank" rel="noreferrer noopener"><b>New nuclear clean energy agreement with Kairos Power</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google sees MS restarting 3 Mile Island, and raises you by building new small modular reactors developed by Kairos Power. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This is the first corporate agreement to purchase nuclear energy from multiple small modular reactors (SMR) to be developed by Kairos Power. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The initial phase of work is intended to bring Kairos powers first SMR online quickly and safely by 2030, followed by additional reactor deployments through 2035.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The deal should enable up to 500 MW of new 24/7 carbon-free power to US electricity grids and help more communities benefit from clean and affordable nuclear power. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Kairos power technology uses a molten-salt cooling system combined with ceramic, pebble-type fuel, to efficiently transport heat to a steam turbine to generate power. This passively safe system allows the reactors to operate at low pressure, enabling a simple, more affordable nuclear design. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Using an iterative development approach, Kairos power will complete multiple successive hardware demonstrations ahead of its first commercial plant.  This will enable critical learnings and efficiency improvements that accelerate reactor deployments, as well as greater cost certainty for google and other customers. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Kairos has been at this for a while, having received over the summer a construction permit from the Nuclear Regulatory Commission to build their first power-producing reactor with the Hermes non-powered demonstration reactor in Tennessee. </span></li>
</ul>
<p><i><span style="font-weight:400;">35:04  Matthew – “</span></i><i><span style="font-weight:400;">I’m waiting for these cloud providers to vertically aggregate now and become power companies for their own things and their own like little generators now they have five little nuclear sites on each data center and that’s their power. And they’re essentially off grid except for the internet.”</span></i></p>
<p><b>37:46</b> <a href="https://blog.google/technology/google-deepmind/google-deepmind-demis-hassabis-john-jumper-nobel-prize-chemistry-alphafold/" target="_blank" rel="noreferrer noopener"><b>Google DeepMind’s Demis Hassabis &amp; John Jumper awarded Nobel Prize in Chemistry</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Co-Founder and CEO of Google Deepmind and Isomorphic labs Sir Demis Hassabis and Google DeepMind Director Dr. John Jumper were co-awarded the 2024 Nobel prize in chemistry for their work developing AlphaFold, a groundbreaking AI system that predicts the 3D structure of proteins from their amino acid sequences. David Baker was also co-awarded for his work on computational protein design. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Before AlphaFold, predicting the structure of a protein was a complex and time-consuming process. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">AlphaFolds predictions are freely available through the AlphaFold protein structure database and have given more than 2 million scientists and researchers from 190 countries a powerful tool for making new discoveries. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">We’re just really excited to see AI being used for something other than cat memes. </span></li>
</ul>
<p><b>40:02</b> <a href="https://blog.google/technology/safety-security/the-new-global-signal-exchange-will-help-fight-scams-and-fraud/" target="_blank" rel="noreferrer noopener"><b>The new Global Signal Exchange will help fight scams and fraud</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Scams have had a huge impact on people’s lives, with people losing their life savings in some instances.  Keeping people safe from scammers is core to the work of many teams at Google. And they are excited to share information about a new partnership and how Cross-Account protection is actively protecting 3.2billion users. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The partnership is with the </span><a href="https://www.gasa.org/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Global Anti-Scam Alliance (GASA)</span></a><span style="font-weight:400;">, and </span><a href="https://dnsrf.org/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">DNS Research Federation (DNSRF)</span></a><span style="font-weight:400;"> to launch the </span><a href="https://globalsignalexchange.org/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Global Signal Exchange (GSE)</span></a><span style="font-weight:400;">. The GSE is a new project with the ambition to be a global clearinghouse for online scams and fraud bad actor signals with google becoming the first founding member. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In May, they announced </span><a href="https://support.google.com/accounts/answer/112802?hl=en" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Cross-Account Protection</span></a><span style="font-weight:400;">, a tool which enables ongoing cooperation between platforms in the fight against abuse. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Now they’re sharing that Cross-Account Protection is actively protecting 3.2 billion users across sites and apps where they sign in with their Google Account. </span></li>
</ul>
<p><i><span style="font-weight:400;">41:05  Matthew – “</span></i><i><span style="font-weight:400;">This is great, you know, the amount of people I know that have been scammed from, you know, one thing or another, or, you know, one of my friends, friends, grandparent got scammed a few weeks ago. It was, you know, messaged me to help. when I’m like, there’s not much you can do, you know, we can solve this in the world, you know, hopefully the world becomes a better place.</span></i></p>
<p><a href="https://cloud.google.com/blog/products/databases/database-center-preview-now-open-to-all-customers/" target="_blank" rel="noreferrer noopener"><b>Database Center — your AI-powered, unified fleet management solution</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Organizations are grappling with an explosion of operational data spread across an increasingly diverse and complex database landscape.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This complexity often results in costly outages, performance bottlenecks, security vulnerabilities, and compliance gaps, hindering your ability to extract valuable insights and deliver exceptional customer experiences. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">To help address this google earlier announced the preview of </span><a href="https://cloud.google.com/database-center/docs/overview" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Database Cente</span></a><span style="font-weight:400;">r, an AI-powered, unified fleet management solution. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Database Center is now GA to all customers, empowering you to monitor and operate database fleets at scale with a single unified solution.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They have also now added support for spanner, in addition to the previously supported CloudSQL and AlloyDB deployments, with support for more databases on the way. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Database center has the key features available in a unified interface where you can:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Gain a comprehensive view of our entire database fleet</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Proactively de-risk your fleet with intelligent performance and security recommendations</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Optimize your database fleet with AI-powered assistance. </span></li>
</ul>
</li>
</ul>
<p><b>43:51 </b><a href="https://cloud.google.com/blog/products/data-analytics/announcing-bigquery-tables-for-apache-iceberg/" target="_blank" rel="noreferrer noopener"><b>BigQuery tables for Apache Iceberg: optimized storage for the open </b></a><a href="https://cloud.google.com/blog/products/data-analytics/announcing-bigquery-tables-for-apache-iceberg/" target="_blank" rel="noreferrer noopener"><b>lakehouse</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is announcing in preview BigQuery Tables for Apache Iceberg, a fully managed, Apache Iceberg-compatible storage engine from BQ with features such as autonomous storage optimizations, clustering, and high-throughput streaming ingestion.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">BigQuery tables for Apache Iceberg uses the Iceberg format to store data in customer owned cloud storage buckets while providing a similar customer experience and feature set as BigQuery native tables.  </span></li>
</ul>
<p><i><span style="font-weight:400;">45:17  Justin – “</span></i><i><span style="font-weight:400;">So one of my secret tricks to figuring out AWS predictions is go look at all the Apache projects that have gotten popular in the last six months. So I’m giving away trade secrets here, that is, yeah, there’s a lot of Apache projects. There’s a lot of Open Cloud Foundation projects. There’s a bunch of things, and those are all definitely ripe for opportunities.”</span></i></p>
<p><b>46:58  </b><a href="https://cloud.google.com/blog/topics/cost-management/introducing-the-google-cloud-cost-attribution-solution/" target="_blank" rel="noreferrer noopener"><b>Gain control of your Google Cloud costs: Introducing the Cost Attribution </b></a><a href="https://cloud.google.com/blog/topics/cost-management/introducing-the-google-cloud-cost-attribution-solution/" target="_blank" rel="noreferrer noopener"><b>Solution</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">As you drive </span><a href="https://www.finops.org/introduction/what-is-finops/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">FinOps</span></a><span style="font-weight:400;"> adoption in your organization (which we’re hoping you all are) identifying which teams, projects and services are driving your expenses is essential.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">To help ease this Google is introducing the Google Cloud </span><a href="https://github.com/google/cost-attribution-solution/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Cost Attribution Solution</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This is a comprehensive set of tools and best practices designed to improve your cost metadata and labeling governance processes, enabling data-driven decisions so you can ultimately optimize your cloud spending. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Cost Attribution Solution leverages a fundamental google cloud feature that often goes underutilized: labels.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">These simply yet powerful key-value pairs act as metadata tags that you can attach to your google cloud resources. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">By applying the labels you can get:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Granular Cost Breakdowns</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Data-Driven Decisions</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Customizable Reporting</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Google understands that your environment is unique and that you may have different levels of maturity, which is why they are giving you proactive and reactive governance approaches for labels;</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Proactive Governance (enforcement); Start on the right foot by enforcing consistent and accurate labeling from when you provision resources. Terraform Policy Validation integrates into your IAC workflow, helping ensure that every new resource is tagged correctly per the organization’s labeling policies. This prevents cost tracking gaps and improves accuracy from data 1. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Reactive governance (reporting, alerting and reconciliation) for existing resources they offer a dual approach</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Reporting: the tool identifies unlabeled resources, providing a clear picture of where you may have gaps in cost visibility down to individual projects and resources</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Alerting: Receive near real-time alerts when resources are created or modified without the proper labels, enabling you to quickly rectify any issues and maintain control over your cloud costs</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Reconciliation: go beyond just reporting by actively enforcing your labeling policies on existing projects. This empowers you to automate the application of correct labels to unlable or mislabeled resources, for comprehensive cost visibility and data accuracy across your entire Google Cloud landscape.  </span></li>
</ul>
</li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">49:46  Justin – “Y</span></i><i><span style="font-weight:400;">our pipeline has to be using the G cloud beta Terraform provider to do this. And so basically you, you know, it’s a G cloud beta Terraform vet command you run basically to do your policy validation. And so there are some pretty easy ways to bypass that for the Terraform code. So I would like the other option as well to basically post creation, which they kind of say they have in the reactive side with the alerting. But yeah, it’s still better. And if you are doing a lot of Terraform work on Google, you’re probably looking at this Terraform feature anyways, because it’s pretty powerful. But they’re providing basically a Terraform cloud implementation for Google that you don’t have to pay for, which is a plus.”</span></i></p>
<h2><span style="font-weight:400;">Azure</span></h2>
<p><b>51:31 </b><a href="https://github.blog/news-insights/product-news/code-referencing-now-generally-available-in-github-copilot-and-with-microsoft-azure-ai/" target="_blank" rel="noreferrer noopener"><b>Code referencing now generally available in GitHub Copilot and with </b></a><a href="https://github.blog/news-insights/product-news/code-referencing-now-generally-available-in-github-copilot-and-with-microsoft-azure-ai/" target="_blank" rel="noreferrer noopener"><b>Microsoft Azure AI</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">What’s being announced: GitHub is announcing the general availability of code referencing in GitHub Copilot Chat and GitHub Copilot code completions. This feature allows developers to see information about code suggestions that match existing public code.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Key features:</span></li>
</ul>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Option to block or allow suggestions containing matching code</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">For allowed suggestions, information is provided about the matches</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Notifications in the editor showing:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">The matching code</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The file where the code appears</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Licensing information (if detected) for the relevant repository</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Available in VS Code, with wider availability coming soon</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Partnership with Microsoft Azure to make the code referencing API available on Azure AI Content Safety</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">How it’s different from previous methods: Previously, GitHub Copilot had a filter to prevent suggestions matching public code, but lacked transparency about the origins of suggested code. The new code referencing feature:</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Provides transparency about code origins within Copilot suggestions</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Allows developers to make more informed decisions about using suggested code</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Extends GitHub’s indemnification commitment to include the use of code referencing for Copilot Business and Enterprise customers who comply with cited licenses</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Balances the benefits of AI-assisted coding with the values of the open source community, such as transparency and knowledge sharing</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Makes code referencing capabilities available to other AI development tools through the Azure AI Content Safety API</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This new feature aims to address concerns about the use of public code in AI-generated suggestions while maintaining the efficiency benefits of using GitHub Copilot. It provides developers and businesses with more control and information about the code they’re using, aligning with open source values of transparency and community knowledge sharing.</span></li>
</ul>
<p><i><span style="font-weight:400;">49:46  Jonathan – “Well, </span></i><i><span style="font-weight:400;">AI generated content still isn’t copyrightable, so I’d be surprised if anyone actually admits that something was written by AI.”</span></i></p>
<h2><span style="font-weight:400;">Closing</span></h2>
<p><span style="font-weight:400;">And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</span></p>
]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/1865911/c1e-gdkdf3jq0rsxk2o6-34gmvkd8c702-5rt32o.mp3" length="65745418"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 279 of The Cloud Pod, where the forecast is always cloudy! This week Justin, Jonathan and Matthew are your guide through the Cloud. We’re talking about everything from BigQuery to Google Nuclear power plans, and everything in between! Welcome to episode 279! 
Titles we almost went with this week:

AWS SKYNET (Q) now controls the supply chain
⛓️AWS Supply Chain: Where skynet meets your shopping list
Digital Ocean follows Azure with the Premium everything
⛰️EKS mounts S3 
GCP now a nuclear
Big query don’t hit that iceberg 
Big Query Yells: “ICEBERG AHEAD” 
The Cloud Pod: Now with 50% more meltdown protection
☢️The Cloud Pod radiates excitement over Google’s nuclear deal

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
Follow Up
00:46 OpenAI’s Newest Possible Threat: Ex-CTO Murati

Apologies listeners – paywall article. 
Given the recent departure of Ex-CTO Mira Murati from OpenAI, we speculated that she might be starting something new…and the rumors are rumorin’. 
Rumors have been running wild since her last day on October 4th, with several people reporting that there has been a lot of churn. 
Speculation is that Murati may join former Open AI VP Bret Zoph at his new startup.  
It may be easy to steal some people, as the research organization at Open AI is reportedly in upheaval after Liam Fedus’s promotion to lead post-training – several researchers have asked to switch teams. 
In addition, Ilya Sutskever, an Open AI co-founder and former chief scientist, also has a new startup.  
We’ll definitely be keeping an eye on this particular soap opera. 

2:00  Jonathan – “I kind wonder what will these other startups bring that’s different than what OpenAI are doing or Anthropic or anybody else. mean, they’re all going to be taking the same training data sets because that’s what’s available. It’s not like they’re going to invent some data from somewhere else and have an edge. I mean, I guess they could do different things like be mindful about licensing.”
General News
4:41 Introducing New 48vCPU and 60vCPU Optimized Premium Droplets on ]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/1865911/c1a-k5d5-6zwq7wo6annz-gbdfvl.jpg"></itunes:image>
                                                                            <itunes:duration>00:54:48</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                            </item>
                    <item>
                <title>
                    <![CDATA[278: Azure is on a Bender: Bite my Shiny Metal FXv2-series VMs]]>
                </title>
                <pubDate>Wed, 16 Oct 2024 16:47:59 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/1860372</guid>
                                    <link>https://tcpfm.castos.com/episodes/278-fxv2-bender</link>
                                <description>
                                            <![CDATA[<p><span style="font-weight:400;">Welcome to episode 278 of The Cloud Pod, where the forecast is always cloudy! When Justin’s away, the guys will… maybe get a show recorded? This week, we’re talking OpenAI, another service scheduled for the grave over at AWS, saying goodbye to pesky IPv4 fees, Azure FXv2 VMs, Valkey 8.0 and so much more! Thanks for joining us, here in the cloud! </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">Another One Bites the Dust</span></li>
<li><span style="font-weight:400;">Peak AI reached: OpenAI Now Puts Print Statements in Code to Help You Debug</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor: <a href="https://shortclick.link/uthdi1" target="_blank" rel="noreferrer noopener">Archera</a></b></h3>
<p>There are a lot of cloud cost management tools out there. But only Archera provides cloud commitment insurance. It sounds fancy but it’s really simple. Archera gives you the cost savings of a 1 or 3 year AWS Savings Plan with a commitment as short as 30 days. If you don’t use all the cloud resources you’ve committed to, they will literally put money back in your bank account to cover the difference. Other cost management tools may say they offer “commitment insurance”, but remember to ask: will you actually give me my money back? Archera will. <a href="https://shortclick.link/uthdi1" target="_blank" rel="noreferrer noopener">Click this link to check them out</a></p>
<h2><span style="font-weight:400;">AI Is Going Great – Or How ML Makes All It’s Money</span></h2>
<p><b>00:59 </b><a href="https://openai.com/index/introducing-vision-to-the-fine-tuning-api/" target="_blank" rel="noreferrer noopener"><b>Introducing vision to the fine-tuning API</b></a><span style="font-weight:400;">.</span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">OpenAI has announced the integration of vision capabilities into its fine-tuning API, allowing developers to enhance the GPT-4o model to analyze and interpret images alongside text and audio inputs. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This update broadens the scope of applications for AI, enabling more multimodal interactions.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The fine-tuning API now supports image inputs, which means developers can train models to understand and generate content based on visual data in conjunction with text and audio.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">After October 31, 2024, training for fine-tuning will cost $25 per 1 million tokens, with inference priced at $3.75 per 1 million input tokens and $15 per 1 million output tokens. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Images are tokenized based on size before pricing. The introduction of prompt caching and other efficiency measures could lower the operational costs for businesses deploying AI solutions.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The API is also being enhanced to include features like epoch-based checkpoint creation, a comparative playground for model evaluation, and integration with third-party platforms like Weights and Biases for detailed fine-tuning data management.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">What does it mean? Admit it – you’re dying to know. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Developers can now create applications that not only process text or voice but also interpret and generate responses based on visual cues, and importantly fine tuned for domain specific applications, and this update could lead to more intuitive user interfaces in applications, where users can interact with services using images as naturally as they do with text or speech, potentially expanding the user base to those less tech-savvy or in fields where visual data is crucial...</span></li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 278 of The Cloud Pod, where the forecast is always cloudy! When Justin’s away, the guys will… maybe get a show recorded? This week, we’re talking OpenAI, another service scheduled for the grave over at AWS, saying goodbye to pesky IPv4 fees, Azure FXv2 VMs, Valkey 8.0 and so much more! Thanks for joining us, here in the cloud! 
Titles we almost went with this week:

Another One Bites the Dust
Peak AI reached: OpenAI Now Puts Print Statements in Code to Help You Debug

A big thanks to this week’s sponsor: Archera
There are a lot of cloud cost management tools out there. But only Archera provides cloud commitment insurance. It sounds fancy but it’s really simple. Archera gives you the cost savings of a 1 or 3 year AWS Savings Plan with a commitment as short as 30 days. If you don’t use all the cloud resources you’ve committed to, they will literally put money back in your bank account to cover the difference. Other cost management tools may say they offer “commitment insurance”, but remember to ask: will you actually give me my money back? Archera will. Click this link to check them out
AI Is Going Great – Or How ML Makes All It’s Money
00:59 Introducing vision to the fine-tuning API.

OpenAI has announced the integration of vision capabilities into its fine-tuning API, allowing developers to enhance the GPT-4o model to analyze and interpret images alongside text and audio inputs. 
This update broadens the scope of applications for AI, enabling more multimodal interactions.
The fine-tuning API now supports image inputs, which means developers can train models to understand and generate content based on visual data in conjunction with text and audio.
After October 31, 2024, training for fine-tuning will cost $25 per 1 million tokens, with inference priced at $3.75 per 1 million input tokens and $15 per 1 million output tokens. 
Images are tokenized based on size before pricing. The introduction of prompt caching and other efficiency measures could lower the operational costs for businesses deploying AI solutions.
The API is also being enhanced to include features like epoch-based checkpoint creation, a comparative playground for model evaluation, and integration with third-party platforms like Weights and Biases for detailed fine-tuning data management.
What does it mean? Admit it – you’re dying to know. 
Developers can now create applications that not only process text or voice but also interpret and generate responses based on visual cues, and importantly fine tuned for domain specific applications, and this update could lead to more intuitive user interfaces in applications, where users can interact with services using images as naturally as they do with text or speech, potentially expanding the user base to those less tech-savvy or in fields where visual data is crucial...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[278: Azure is on a Bender: Bite my Shiny Metal FXv2-series VMs]]>
                </itunes:title>
                                    <itunes:episode>278</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p><span style="font-weight:400;">Welcome to episode 278 of The Cloud Pod, where the forecast is always cloudy! When Justin’s away, the guys will… maybe get a show recorded? This week, we’re talking OpenAI, another service scheduled for the grave over at AWS, saying goodbye to pesky IPv4 fees, Azure FXv2 VMs, Valkey 8.0 and so much more! Thanks for joining us, here in the cloud! </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">Another One Bites the Dust</span></li>
<li><span style="font-weight:400;">Peak AI reached: OpenAI Now Puts Print Statements in Code to Help You Debug</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor: <a href="https://shortclick.link/uthdi1" target="_blank" rel="noreferrer noopener">Archera</a></b></h3>
<p>There are a lot of cloud cost management tools out there. But only Archera provides cloud commitment insurance. It sounds fancy but it’s really simple. Archera gives you the cost savings of a 1 or 3 year AWS Savings Plan with a commitment as short as 30 days. If you don’t use all the cloud resources you’ve committed to, they will literally put money back in your bank account to cover the difference. Other cost management tools may say they offer “commitment insurance”, but remember to ask: will you actually give me my money back? Archera will. <a href="https://shortclick.link/uthdi1" target="_blank" rel="noreferrer noopener">Click this link to check them out</a></p>
<h2><span style="font-weight:400;">AI Is Going Great – Or How ML Makes All It’s Money</span></h2>
<p><b>00:59 </b><a href="https://openai.com/index/introducing-vision-to-the-fine-tuning-api/" target="_blank" rel="noreferrer noopener"><b>Introducing vision to the fine-tuning API</b></a><span style="font-weight:400;">.</span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">OpenAI has announced the integration of vision capabilities into its fine-tuning API, allowing developers to enhance the GPT-4o model to analyze and interpret images alongside text and audio inputs. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This update broadens the scope of applications for AI, enabling more multimodal interactions.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The fine-tuning API now supports image inputs, which means developers can train models to understand and generate content based on visual data in conjunction with text and audio.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">After October 31, 2024, training for fine-tuning will cost $25 per 1 million tokens, with inference priced at $3.75 per 1 million input tokens and $15 per 1 million output tokens. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Images are tokenized based on size before pricing. The introduction of prompt caching and other efficiency measures could lower the operational costs for businesses deploying AI solutions.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The API is also being enhanced to include features like epoch-based checkpoint creation, a comparative playground for model evaluation, and integration with third-party platforms like Weights and Biases for detailed fine-tuning data management.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">What does it mean? Admit it – you’re dying to know. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Developers can now create applications that not only process text or voice but also interpret and generate responses based on visual cues, and importantly fine tuned for domain specific applications, and this update could lead to more intuitive user interfaces in applications, where users can interact with services using images as naturally as they do with text or speech, potentially expanding the user base to those less tech-savvy or in fields where visual data is crucial.</span></li>
</ul>
<p><i><span style="font-weight:400;">03:53  Jonathan – “</span></i><i><span style="font-weight:400;">I mean, I think it’s useful for things like quality assurance in manufacturing, for example. You know, could, you could tune it on what your nuts and bolts are supposed to look like and what a good bolt looks like and what a bad bolt looks like coming out of the factory. You just stream the video directly to, to an AI, AI like this and have it kick out all the bad ones. It’s kind of, kind of neat.”</span></i></p>
<p><b>04:41  </b><a href="https://openai.com/index/introducing-the-realtime-api/" target="_blank" rel="noreferrer noopener"><b>Introducing the Realtime API</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">OpenAI has launched its Realtime API in public beta, designed to enable developers to create applications with real-time, low-latency, multimodal interactions. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This API facilitates speech-to-speech conversations, making user interactions more natural and engaging.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The Realtime API uses WebSockets for maintaining a persistent connection, allowing for real-time input and output of both text and audio. This includes function calling capabilities, making it versatile for various applications.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It leverages the new GPT-4o model, which supports multimodal inputs (text, audio, and now with vision capabilities in fine-tuning).</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Use Cases include:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;"> Interactive applications: Developers can now build apps where users can have back-and-forth voice conversations or even integrate visual data for a more comprehensive interaction.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Customer Service: The API can revolutionize customer service with real-time voice interactions that feel more human-like.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Voice Assistants: Healthify already uses the API for natural, conversational interactions with its AI coach, Ria.</span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">5:54  Matthew – “</span></i><i><span style="font-weight:400;">Just think about how much time you’ll have left in your life when you don’t actually have to attend the meetings. You train a model, you fine-tune it based on Ryan’s level of sassiness and how crabby he is that day. And you just put in the meeting so you can actually do work.”</span></i></p>
<p><b>09:58  </b><a href="https://openai.com/index/introducing-canvas/" target="_blank" rel="noreferrer noopener"><b>Introducing Canvas</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">OpenAI’s Canvas is an innovative interface designed to enhance collaboration with ChatGPT for </span><a href="https://openai.com/chatgpt/use-cases/writing-with-ai/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">writing</span></a><span style="font-weight:400;"> and coding projects, moving beyond the traditional chat format to offer a more interactive and dynamic workspace – a similar idea to </span><a href="https://www.anthropic.com/claude" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Anthropic Claude’s</span></a><span style="font-weight:400;"> Projects and artifacts.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">From drafting emails to writing articles, Canvas can assist in creating content, adjusting tone, length, or style, and providing real-time edits.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Developers can write, debug, and document code. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Canvas supports creating an API web server, adding comments, explaining code sections, and reviewing code for improvements.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Best of all it can recommend where to place print statements for debugging!</span></li>
</ul>
<p><i><span style="font-weight:400;">11:18  Jonathan – “</span></i><i><span style="font-weight:400;">I got my Pixel 9 phone, which comes with Gemini Pro for the year. And I noticed a shift kind of in the way AI is kind of being integrated with things. used to be, do you me to write the message for you? They’ve moved away from that now, I think, there’s a little pushback against that. People want to feel like they’re still authentic. So now instead, once you’ve finished writing the message, it’s like, would you like us to refine this for you? Like, yes, please, make it sound more professional.”</span></i></p>
<h2><span style="font-weight:400;">AWS</span></h2>
<p><b>13:01 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/09/aws-re-post-agent-generative-ai-powered-virtual-assistant/" target="_blank" rel="noreferrer noopener"><b>AWS Announces AWS re:Post Agent, a Generative AI-powered virtual </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2024/09/aws-re-post-agent-generative-ai-powered-virtual-assistant/" target="_blank" rel="noreferrer noopener"><b>assistant</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS is starting to leverage Gen AI to auto respond to post on </span><a href="https://repost.aws/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">re:Post</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Jonathan is especially looking forward to seeing the hallucinations that it posts. </span></li>
</ul>
<p><b>14:06</b> <a href="https://aws.amazon.com/blogs/machine-learning/maintain-access-and-consider-alternatives-for-amazon-monitron/" target="_blank" rel="noreferrer noopener"><b>Maintain access and consider alternatives for Amazon Monitron</b></a></p>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon Monitron is being shut down.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It will no longer be available for new customers after October 31st, 2024. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Existing customers will be able to purchase devices and continue utilizing the service as normal until July 2025. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Customers will be considered an existing customer if they have commissioned an Amazon Monitron sensor through a project any time in the 30 days prior to October 31, 2024</span></li>
</ul>
</li>
</ul>
<ul>
<li style="font-weight:400;"><i><span style="font-weight:400;">“For existing Amazon business customers, we will allowlist your account with the existing Amazon Monitron devices. For existing Amazon.com retail customers, the Amazon Monitron team will provide specific ordering instructions according to individual request.”</span></i></li>
<li style="font-weight:400;"><span style="font-weight:400;">Alternative for your condition monitoring needs, we recommend exploring alternative solutions provided by AWS Partners: </span><a href="https://aws.amazon.com/marketplace/pp/prodview-gbsaknotwzjdi?sr=0-12&amp;ref_=beagle&amp;applicationId=AWSMPContessa" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Tactical Edge</span></a><span style="font-weight:400;">, </span><a href="https://aws.amazon.com/marketplace/seller-profile?id=seller-mbnj2ll267gim" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">IndustrAI</span></a><span style="font-weight:400;">, and </span><a href="https://aws.amazon.com/marketplace/seller-profile?id=seller-2zki35h4u25wu" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Factory AI</span></a><span style="font-weight:400;">.</span></li>
</ul>
<p><i><span style="font-weight:400;">15:11  Jonathan – “</span></i><i><span style="font-weight:400;">That’s a weird one, because I think they talked about this on stage at re.Invent a few years ago. It was a whole big industrial IoT thing. We have these devices that monitor the unique vibrations from each machine, and we can tell weeks in advance if some part’s going to fail or not. So it’s kind of weird that they’re killing it, but I guess the functionality can be built with other primitives that they have, and it doesn’t need to be its own service.”</span></i></p>
<p><b>17:05 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/amazon-virtual-private-cloud-byoip-byoasn-local-zones/" target="_blank" rel="noreferrer noopener"><b>Amazon Virtual Private Cloud (VPC) now supports BYOIP and BYOASN in </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/amazon-virtual-private-cloud-byoip-byoasn-local-zones/" target="_blank" rel="noreferrer noopener"><b>all AWS Local Zones</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Now you can BYOIP  and ASNS to </span><a href="https://aws.amazon.com/about-aws/global-infrastructure/localzones/locations/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">local zones</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Huzzah</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It *should* save you all the pesky IPv4 fees that you were paying. </span></li>
</ul>
<p><b>18:19</b> <a href="https://aws.amazon.com/about-aws/whats-new/2024/10/amazon-ec2-optimize-cpus-post-instance-launch/" target="_blank" rel="noreferrer noopener"><b>Amazon EC2 now supports Optimize CPUs post instance launch</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon EC2 now allows customers to modify an instance’s CPU options after launch. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can modify the number of vCPUs and/or disable the hyperthreading of a stopped EC2 instance to save on vCPU-based licensing costs. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In addition, an instance’s CPU options are now maintained when changing its instance type.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This is beneficial to customers who have a Bring-Your-Own-license (BYOL) for commercial database workloads, like Microsoft SQL Server.</span></li>
</ul>
<p><i><span style="font-weight:400;">18:53  Ryan – “</span></i><i><span style="font-weight:400;">Yeah, this is one of those things where it’s a giant pain if you have to completely relaunch your instance. Or when you’re trying to upscale your instance to a new instance type to get more memory or what have you, and having that completely reset. so then not only are you trying to scale this, probably to avoid an outage, now it’s taking twice as long because you’re going to do a thing. So this is one of those really beneficial features that no one will ever mention again.”</span></i></p>
<p><b>21:36</b> <a href="https://aws.amazon.com/about-aws/whats-new/2024/10/amazon-workspaces-file-transfer-sessions-local-devices/" target="_blank" rel="noreferrer noopener"><b>Amazon WorkSpaces now supports file transfer between WorkSpaces </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2024/10/amazon-workspaces-file-transfer-sessions-local-devices/" target="_blank" rel="noreferrer noopener"><b>sessions and local devices</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon WorkSpaces now supports file transfers between Personal sessions and local computers. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Administrators can control file upload/download permissions to safeguard data.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Infosec is just going to love all the data loss options. </span></li>
</ul>
<p><i><span style="font-weight:400;">22:07  Jonathan – “</span></i><i><span style="font-weight:400;">So they re-implement RDP, they take out the feature, then they add it again, and then they give you a switch, which everyone’s going to switch on to stop you from using it. That’s fantastic.”</span></i></p>
<p><i><span style="font-weight:400;">22:17  Matthew – “</span></i><i><span style="font-weight:400;">But they can check the box now saying it exists, which means they’ll pass some RFP. So now they’re more likely to be able to be considered.”</span></i></p>
<h2><span style="font-weight:400;">GCP</span></h2>
<p><b>25:30 </b><a href="https://cloud.google.com/blog/products/databases/memorystore-launches-valkey-8-0-on-google-cloud/" target="_blank" rel="noreferrer noopener"><b>Introducing Valkey 8.0 on Memorystore: unmatched performance and fully </b></a><a href="https://cloud.google.com/blog/products/databases/memorystore-launches-valkey-8-0-on-google-cloud/" target="_blank" rel="noreferrer noopener"><b>open-source</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google Cloud has introduced </span><a href="https://cloud.google.com/memorystore" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Memorystore</span></a><span style="font-weight:400;"> for </span><a href="https://cloud.google.com/blog/products/databases/announcing-memorystore-for-valkey" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Valkey 8.0</span></a><span style="font-weight:400;">, marking it as the first major cloud platform to offer Valkey 8.0 as a fully managed service. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This launch signifies Google Cloud’s commitment to supporting open-source technologies by providing a high-performance, in-memory key-value store alternative to Redis, with enhancements in performance, reliability, and compatibility.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Compared to Redis, Valkey aims to maintain full compatibility while offering improvements in performance and community governance but has changes and features like-</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Better data availability during failover events.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Support for vector search, which is beneficial for AI and machine learning applications requiring similarity searches.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Improved concurrency allows for parallel processing of commands, reducing bottlenecks.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">and some other great performance improvements</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Valkey 8.0 on Memorystore offers up to twice the Queries Per Second (QPS) compared to Memorystore for Redis Cluster at microsecond latency, enabling higher throughput with similarly sized clusters.</span></li>
</ul>
<p><i><span style="font-weight:400;">26:53  Ryan – “ …</span></i><i><span style="font-weight:400;">when you see this type of change, but you know, especially right after a license kerfuffle, right? That, you know, because Valkey to come into existence. Like it’s kind of like, wow, the power of open search is really there. And now, why wasn’t this, you know, part of  the Redis thing, it’s because people weren’t going through it, you know, when it was that license. So it’s kind of a good thing in a lot of sense.”</span></i></p>
<p><b>29:56 </b><a href="https://cloud.google.com/blog/products/storage-data-transfer/gemini-insights-about-cloud-storage/" target="_blank" rel="noreferrer noopener"><b>Understand your Cloud Storage footprint with AI-powered queries and insights</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Managing millions or billions of objects across numerous projects and with hundreds of Cloud engineers is fun right?</span></li>
</ul>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google Cloud is the first hyperscale cloud provider to generate storage insights specific to an environment by querying object metadata and using the power of large language models (LLMs). (Although AWS has had a similar feature for quite a bit.. But it wasn’t AI.)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">After the initial setup, you’ll be able to access the enhanced user experience, which includes a short summary of your dataset. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Bonus! Pre-curated set of prompts with validated responses. </span><i><span style="font-weight:400;">“We selected these prompts based on customers’ most common questions.”</span></i></li>
<li style="font-weight:400;"><span style="font-weight:400;">To combat hallucinations there are multiple informational indicators: </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Every response includes the SQL query for easy validation, </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Curated prompts show a ‘high accuracy’ tag</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">And helpful information displays data freshness metadata.</span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">31:42  Ryan – “…</span></i><i><span style="font-weight:400;">it’s insights into your storage data. There’s performance tiers, the ability to migrate it to lower performance tier for cost savings. There’s the insights on the access model and insecure sort of attack vectors that you could have. Like if it’s a publicly exposed bucket and it has excessive permissions or it has sensitive content in it, it’ll sort of provide that level of insight.”</span></i></p>
<h2><span style="font-weight:400;">Azure</span></h2>
<p><b>32:51 </b><a href="https://techcommunity.microsoft.com/t5/azure-high-performance-computing/announcing-the-general-availability-of-azure-cyclecloud/ba-p/4252607" target="_blank" rel="noreferrer noopener"><b>Announcing the General Availability of Azure CycleCloud Workspace for </b></a><a href="https://techcommunity.microsoft.com/t5/azure-high-performance-computing/announcing-the-general-availability-of-azure-cyclecloud/ba-p/4252607" target="_blank" rel="noreferrer noopener"><b>Slurm</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Let’s deconstruct this title:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Azure CycleCloud is an enterprise-friendly tool for orchestrating and managing High Performance Computing (HPC) environments on Azure.</span></li>
<li style="font-weight:400;"><a href="https://aka.ms/ccw4slurm" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Slurm</span></a><span style="font-weight:400;"> is a scheduler.</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">So really, what is this? It’s the ability to buy and launch from the marketplace an orchestrating and managing High Performance Computing (HPC) environments that leverages Slurm as a scheduler.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">When Matthew doesn’t know what the Azure thing is, we’re all in trouble. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">And yes, this is where the Futurama references originated. Are we proud of it? At the risk of sounding negative, no.</span></li>
</ul>
<p><b>35:33 </b><a href="https://techcommunity.microsoft.com/t5/azure-compute-blog/announcing-the-public-preview-of-the-new-azure-fxv2-series/ba-p/4255196" target="_blank" rel="noreferrer noopener"><b>Announcing the public preview of the new Azure FXv2-series Virtual </b></a><a href="https://techcommunity.microsoft.com/t5/azure-compute-blog/announcing-the-public-preview-of-the-new-azure-fxv2-series/ba-p/4255196" target="_blank" rel="noreferrer noopener"><b>Machines</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Shut up and take our money – new shiny machines!</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Best-suited to provide a balanced solution for compute-intensive workloads such as databases, data analytics workloads and EDA workloads, that also require large amounts of memory and high-performance, storage, I/O bandwidth.</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">up to 1.5x CPU performance</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">2x vCPUs, with 96 vCPU as the largest VM size</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">1.5x+ Network bandwidth, and offers up to 70 Gbps</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">up to 2x local storage (Read) IOPS and offers up to 5280 GiB local SSD capacity</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">up to 2x IOPS and up to 5x throughput in remote storage performance  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">up to 400k IOPS and up to 11 GBps throughput with Premium v2/ Ultra Disk support</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">up to 1800 GiB memory</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">FXv2-series VMs feature an all-core-turbo frequency up to 4.0 GHz</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">21:1 memory-to-vCPU ratio with the base sizes</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">The blog states that the FXv2-series </span><a href="https://azure.microsoft.com/pricing/details/virtual-machines" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Azure Virtual Machine</span></a><span style="font-weight:400;"> is best-suited to provide a balanced solution for compute-intensive workloads but then goes on to the real answer: That it is purpose-built, to address several requirements of SQL Server workloads. </span></li>
</ul>
<p><i><span style="font-weight:400;">37:00  Ryan – “…</span></i><i><span style="font-weight:400;">you can deploy these where you these VMs where you get a 21 to one ratio of memory to PCP. Yeah, it’s cool. So while they do go out, they tell their best suited for balance and compute intensive workloads. But if you read further down the post, they get to the real answer, which is this is purpose built to address several requirements for Microsoft SQL Server, which totally makes sense.”</span></i></p>
<p><b>38:42 </b><a href="https://techcommunity.microsoft.com/t5/azure-confidential-computing/general-availability-azure-confidential-vms-with-nvidia-h100/ba-p/4242644" target="_blank" rel="noreferrer noopener"><b>General Availability: Azure confidential VMs with NVIDIA H100 Tensor Core </b></a><a href="https://techcommunity.microsoft.com/t5/azure-confidential-computing/general-availability-azure-confidential-vms-with-nvidia-h100/ba-p/4242644" target="_blank" rel="noreferrer noopener"><b>GPUs</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">These are on AMD EPYC with H100</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Setup securely</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">ideal for inferencing, fine-tuning or training small-to-medium sized models such as Whisper, Stable diffusion and its variants (SDXL, SSD), and language models.</span></li>
</ul>
<p><i><span style="font-weight:400;">39:19  Jonathan – “</span></i><i><span style="font-weight:400;">How weird though. The point of a confidential VM is that it has one hole that you put something in. It does some magic work on it and then spits an answer out, but you don’t get to see the sausage being made inside. the fact that they’re selling this for training or inference is really interesting.”</span></i></p>
<p><b>42:08</b> <a href="https://techcommunity.microsoft.com/t5/finops-blog/what-s-new-in-finops-toolkit-0-5-august-2024/ba-p/4254148" target="_blank" rel="noreferrer noopener"><b>What’s new in FinOps toolkit 0.5 – August 2024</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">The FinOps Toolkit 0.5, released in August 2024, introduces several enhancements aimed at improving cloud financial management through Microsoft’s FinOps framework. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This update focuses on simplifying the process of cost management and optimization for Azure users, with new features for reporting, data analysis, and integration with Power BI for better financial analytics.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Key Updates in FinOps Toolkit 0.5:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Users can now connect Power BI reports directly to raw cost data exports in storage without needing FinOps hubs, simplifying the setup for cost analysis.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The toolkit now supports the </span><a href="https://microsoft.github.io/finops-toolkit/focus/mapping" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">FOCUS 1.0 schema</span></a><span style="font-weight:400;"> for cost and usage data, which aims to standardize FinOps data across platforms for easier analysis and comparison.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The update includes improvements in the Azure Optimization Engine for better custom recommendations on cost savings and performance  enhancements.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">There are new tools and updates for reporting, including a guide on how to </span><a href="https://microsoft.github.io/finops-toolkit/focus/convert" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">compare FOCUS data with actual or amortized cost data</span></a><span style="font-weight:400;">, aiding in more accurate financial reporting.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Expanded scenario-based documentation helps users update existing reports to use FOCUS and understand how to leverage the new data schema effectively.</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Organizations have the choice to use the latest toolkit with existing FinOps hubs or upgrade to gain access to new features while maintaining compatibility with previous report versions.</span></li>
</ul>
<p><b>47:11 </b><a href="https://azure.microsoft.com/en-us/blog/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities/" target="_blank" rel="noreferrer noopener"><b>GPT-4o-Realtime-Preview with audio and speech capabilities</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Woohoo! it released on Azure too now </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The guys may have officially lost the plot at this point. </span></li>
</ul>
<h2><span style="font-weight:400;">Closing</span></h2>
<p><span style="font-weight:400;">And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</span></p>
]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/1860372/c1e-xm8mbmqgw6ar8nm5-dm5z929ds3zd-hgzo79.mp3" length="56065483"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 278 of The Cloud Pod, where the forecast is always cloudy! When Justin’s away, the guys will… maybe get a show recorded? This week, we’re talking OpenAI, another service scheduled for the grave over at AWS, saying goodbye to pesky IPv4 fees, Azure FXv2 VMs, Valkey 8.0 and so much more! Thanks for joining us, here in the cloud! 
Titles we almost went with this week:

Another One Bites the Dust
Peak AI reached: OpenAI Now Puts Print Statements in Code to Help You Debug

A big thanks to this week’s sponsor: Archera
There are a lot of cloud cost management tools out there. But only Archera provides cloud commitment insurance. It sounds fancy but it’s really simple. Archera gives you the cost savings of a 1 or 3 year AWS Savings Plan with a commitment as short as 30 days. If you don’t use all the cloud resources you’ve committed to, they will literally put money back in your bank account to cover the difference. Other cost management tools may say they offer “commitment insurance”, but remember to ask: will you actually give me my money back? Archera will. Click this link to check them out
AI Is Going Great – Or How ML Makes All It’s Money
00:59 Introducing vision to the fine-tuning API.

OpenAI has announced the integration of vision capabilities into its fine-tuning API, allowing developers to enhance the GPT-4o model to analyze and interpret images alongside text and audio inputs. 
This update broadens the scope of applications for AI, enabling more multimodal interactions.
The fine-tuning API now supports image inputs, which means developers can train models to understand and generate content based on visual data in conjunction with text and audio.
After October 31, 2024, training for fine-tuning will cost $25 per 1 million tokens, with inference priced at $3.75 per 1 million input tokens and $15 per 1 million output tokens. 
Images are tokenized based on size before pricing. The introduction of prompt caching and other efficiency measures could lower the operational costs for businesses deploying AI solutions.
The API is also being enhanced to include features like epoch-based checkpoint creation, a comparative playground for model evaluation, and integration with third-party platforms like Weights and Biases for detailed fine-tuning data management.
What does it mean? Admit it – you’re dying to know. 
Developers can now create applications that not only process text or voice but also interpret and generate responses based on visual cues, and importantly fine tuned for domain specific applications, and this update could lead to more intuitive user interfaces in applications, where users can interact with services using images as naturally as they do with text or speech, potentially expanding the user base to those less tech-savvy or in fields where visual data is crucial...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/1860372/c1a-k5d5-34gd07pnf5zg-texujb.jpg"></itunes:image>
                                                                            <itunes:duration>00:46:44</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                            </item>
                    <item>
                <title>
                    <![CDATA[277: Class E IPs, so now you can procrastinate IPv6 even longer]]>
                </title>
                <pubDate>Thu, 10 Oct 2024 11:04:34 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/1856367</guid>
                                    <link>https://tcpfm.castos.com/episodes/277-class-e-ip</link>
                                <description>
                                            <![CDATA[<p><span style="font-weight:400;">​Welcome to episode 277 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Matthew are your hosts this week for a news packed show. This week we dive into the latest in cloud computing with announcements from Google’s new AI search tools, Meta’s open-sourced AI models, and Microsoft Copilot’s expanded capabilities. We’ve also got Oracle releases, and some non-liquid Java on the agenda (but also the liquid kind, too) and Class E IP addresses. Plus, be sure to stay tuned for the aftershow! </span></p>
<p> </p>
<h3><b>Titles we almost went with this week:</b></h3>
<p><span style="font-weight:400;">Which cloud provider does not have llama 3.2</span></p>
<p><span style="font-weight:400;">Vmware says we will happily help you support your old Microsoft OS’s for $$$$</span></p>
<p><span style="font-weight:400;">Class E is the best kind of IP Space</span></p>
<p><span style="font-weight:400;">Microsoft says trust AI, and so does Skynet</span></p>
<p><span style="font-weight:400;">3.2 Llama’s walked into an AI bar… </span></p>
<p><span style="font-weight:400;">Google gets cranky about MS Licensing, join the club</span></p>
<p><span style="font-weight:400;">✍️Write Your Prompts, Optimize them with Vertex Prompts Analyzer, rinse repeat into a  </span></p>
<p><span style="font-weight:400;">     vortex of optimization</span></p>
<p><span style="font-weight:400;">️Oracle releases Java 23, Cloud Pod Uses Amazon Corretto 23 instead</span></p>
<p><span style="font-weight:400;">Oracle releases Java 23, Cloud Pod still says run! MK </span></p>
<p> </p>
<h3><b>A big thanks to this week’s sponsor: <a href="https://shortclick.link/uthdi1" target="_blank" rel="noreferrer noopener">Archera</a></b></h3>
<h3>There are a lot of cloud cost management tools out there. But only Archera provides cloud commitment insurance. It sounds fancy but it’s really simple. Archera gives you the cost savings of a 1 or 3 year AWS Savings Plan with a commitment as short as 30 days. If you don’t use all the cloud resources you’ve committed to, they will literally put money back in your bank account to cover the difference. Other cost management tools may say they offer “commitment insurance”, but remember to ask: will you actually give me my money back? Archera will. <a href="https://shortclick.link/uthdi1" target="_blank" rel="noreferrer noopener">Click this link to check them out</a></h3>
<p><a href="https://shortclick.link/uthdi1" target="_blank" rel="noreferrer noopener">https://shortclick.link/uthdi1</a></p>
<h2><span style="font-weight:400;">AI Is Going Great – Or How ML Makes All It’s Money</span></h2>
<p> </p>
<p><b>01:06 </b><a href="https://www.businessinsider.com/mira-murati-openai-cto-leaving-executive-memo-staff-2024-9?utm_source=Iterable&amp;utm_medium=email&amp;utm_campaign=campaign_Insider%20Today%20-%20Thu,%20Sep%2026,%202024&amp;Utm_content=LimitedPromo" target="_blank" rel="noreferrer noopener"><b>OpenAI CTO Mira Murati, 2 other execs announce they’re leaving</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Listener Note: paywall article </span></li>
<li style="font-weight:400;"><a href="https://openai.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OpenAI</span></a><span style="font-weight:400;"> Chief Technology Officer Mira Murati is leaving, and within hours, two more OpenAI executives joined the list of high-profile departures.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Mira Murati spent 6.5 years at the company, and was named CEO temporarily when the board ousted co-founder Sam Altman.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">“</span><i><span style="font-weight:400;">It’s hard to overstate how much Mira has meant to OpenAI, our mission, and to us all personally</span></i><span style="font-weight:400;">,” Altman wrote. “</span><i><span style="font-weight:400;">I feel tremend...</span></i></li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[​Welcome to episode 277 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Matthew are your hosts this week for a news packed show. This week we dive into the latest in cloud computing with announcements from Google’s new AI search tools, Meta’s open-sourced AI models, and Microsoft Copilot’s expanded capabilities. We’ve also got Oracle releases, and some non-liquid Java on the agenda (but also the liquid kind, too) and Class E IP addresses. Plus, be sure to stay tuned for the aftershow! 
 
Titles we almost went with this week:
Which cloud provider does not have llama 3.2
Vmware says we will happily help you support your old Microsoft OS’s for $$$$
Class E is the best kind of IP Space
Microsoft says trust AI, and so does Skynet
3.2 Llama’s walked into an AI bar… 
Google gets cranky about MS Licensing, join the club
✍️Write Your Prompts, Optimize them with Vertex Prompts Analyzer, rinse repeat into a  
     vortex of optimization
️Oracle releases Java 23, Cloud Pod Uses Amazon Corretto 23 instead
Oracle releases Java 23, Cloud Pod still says run! MK 
 
A big thanks to this week’s sponsor: Archera
There are a lot of cloud cost management tools out there. But only Archera provides cloud commitment insurance. It sounds fancy but it’s really simple. Archera gives you the cost savings of a 1 or 3 year AWS Savings Plan with a commitment as short as 30 days. If you don’t use all the cloud resources you’ve committed to, they will literally put money back in your bank account to cover the difference. Other cost management tools may say they offer “commitment insurance”, but remember to ask: will you actually give me my money back? Archera will. Click this link to check them out
https://shortclick.link/uthdi1
AI Is Going Great – Or How ML Makes All It’s Money
 
01:06 OpenAI CTO Mira Murati, 2 other execs announce they’re leaving

Listener Note: paywall article 
OpenAI Chief Technology Officer Mira Murati is leaving, and within hours, two more OpenAI executives joined the list of high-profile departures.
Mira Murati spent 6.5 years at the company, and was named CEO temporarily when the board ousted co-founder Sam Altman.  
“It’s hard to overstate how much Mira has meant to OpenAI, our mission, and to us all personally,” Altman wrote. “I feel tremend...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[277: Class E IPs, so now you can procrastinate IPv6 even longer]]>
                </itunes:title>
                                    <itunes:episode>277</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p><span style="font-weight:400;">​Welcome to episode 277 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Matthew are your hosts this week for a news packed show. This week we dive into the latest in cloud computing with announcements from Google’s new AI search tools, Meta’s open-sourced AI models, and Microsoft Copilot’s expanded capabilities. We’ve also got Oracle releases, and some non-liquid Java on the agenda (but also the liquid kind, too) and Class E IP addresses. Plus, be sure to stay tuned for the aftershow! </span></p>
<p> </p>
<h3><b>Titles we almost went with this week:</b></h3>
<p><span style="font-weight:400;">Which cloud provider does not have llama 3.2</span></p>
<p><span style="font-weight:400;">Vmware says we will happily help you support your old Microsoft OS’s for $$$$</span></p>
<p><span style="font-weight:400;">Class E is the best kind of IP Space</span></p>
<p><span style="font-weight:400;">Microsoft says trust AI, and so does Skynet</span></p>
<p><span style="font-weight:400;">3.2 Llama’s walked into an AI bar… </span></p>
<p><span style="font-weight:400;">Google gets cranky about MS Licensing, join the club</span></p>
<p><span style="font-weight:400;">✍️Write Your Prompts, Optimize them with Vertex Prompts Analyzer, rinse repeat into a  </span></p>
<p><span style="font-weight:400;">     vortex of optimization</span></p>
<p><span style="font-weight:400;">️Oracle releases Java 23, Cloud Pod Uses Amazon Corretto 23 instead</span></p>
<p><span style="font-weight:400;">Oracle releases Java 23, Cloud Pod still says run! MK </span></p>
<p> </p>
<h3><b>A big thanks to this week’s sponsor: <a href="https://shortclick.link/uthdi1" target="_blank" rel="noreferrer noopener">Archera</a></b></h3>
<h3>There are a lot of cloud cost management tools out there. But only Archera provides cloud commitment insurance. It sounds fancy but it’s really simple. Archera gives you the cost savings of a 1 or 3 year AWS Savings Plan with a commitment as short as 30 days. If you don’t use all the cloud resources you’ve committed to, they will literally put money back in your bank account to cover the difference. Other cost management tools may say they offer “commitment insurance”, but remember to ask: will you actually give me my money back? Archera will. <a href="https://shortclick.link/uthdi1" target="_blank" rel="noreferrer noopener">Click this link to check them out</a></h3>
<p><a href="https://shortclick.link/uthdi1" target="_blank" rel="noreferrer noopener">https://shortclick.link/uthdi1</a></p>
<h2><span style="font-weight:400;">AI Is Going Great – Or How ML Makes All It’s Money</span></h2>
<p> </p>
<p><b>01:06 </b><a href="https://www.businessinsider.com/mira-murati-openai-cto-leaving-executive-memo-staff-2024-9?utm_source=Iterable&amp;utm_medium=email&amp;utm_campaign=campaign_Insider%20Today%20-%20Thu,%20Sep%2026,%202024&amp;Utm_content=LimitedPromo" target="_blank" rel="noreferrer noopener"><b>OpenAI CTO Mira Murati, 2 other execs announce they’re leaving</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Listener Note: paywall article </span></li>
<li style="font-weight:400;"><a href="https://openai.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OpenAI</span></a><span style="font-weight:400;"> Chief Technology Officer Mira Murati is leaving, and within hours, two more OpenAI executives joined the list of high-profile departures.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Mira Murati spent 6.5 years at the company, and was named CEO temporarily when the board ousted co-founder Sam Altman.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">“</span><i><span style="font-weight:400;">It’s hard to overstate how much Mira has meant to OpenAI, our mission, and to us all personally</span></i><span style="font-weight:400;">,” Altman wrote. “</span><i><span style="font-weight:400;">I feel tremendous gratitude towards her for what she has helped us build and accomplish, but most of all, I feel personal gratitude towards her for her support and love during all the hard times. I am excited for what she’ll do next</span></i><span style="font-weight:400;">.”</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Mira oversaw the development of ChatGPT and image generator Dall-E. She was also a pretty public face for the company, appearing in its videos and interviewing journalists.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The other two departures were Barret Zoph, who was the company’s Vice President of Research and Chief Research officer Bob McGrew.</span></li>
</ul>
<p> </p>
<p><i><span style="font-weight:400;">02:26  Ryan – “</span></i><i><span style="font-weight:400;">Her reason for leaving is, you know, to take some time and space to explore and, you know, be more creative. I’m like, yeah, okay. they’re starting copy. Yeah. Yeah. Leaving for health reasons. You got fired.” </span></i></p>
<p> </p>
<p><span style="font-weight:400;">-Copywriter Note: this is 100% copywriter speak for you either got fired – or will be soon and decide to step down.</span></p>
<p> </p>
<p><b>03:38 </b><a href="https://ai.meta.com/blog/llama-3-2-connect-2024-vision-edge-mobile-devices/" target="_blank" rel="noreferrer noopener"><b>Llama 3.2: Revolutionizing edge AI and vision with open, customizable </b></a></p>
<p><a href="https://ai.meta.com/blog/llama-3-2-connect-2024-vision-edge-mobile-devices/" target="_blank" rel="noreferrer noopener"><b>models</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://about.meta.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Meta</span></a><span style="font-weight:400;"> is releasing</span><a href="https://www.llama.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;"> Llama 3.2</span></a><span style="font-weight:400;">, which includes small and medium sized vision LLM’s (11B and 90B) and lightweight, text only models (1B and 3B) that fit on edge and mobile devices, including pre-trained and instruction tuned versions. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The 1B and 3B models support context length of 128k tokens and are state of the art in their class for on-device use cases like summarization, instruction following, and rewriting tasks running locally at the edge. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The models are enabled on </span><a href="https://www.qualcomm.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Qualcomm</span></a><span style="font-weight:400;"> and </span><a href="https://www.mediatek.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">MediaTek</span></a><span style="font-weight:400;"> hardware, and optimized for ARM Processors. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Llama 3.2 11B and 90B vision models are drop-in replacements for their text model equivalents, while exceeding on image understanding tasks compared to closed models, such as </span><a href="https://www.anthropic.com/news/claude-3-haiku" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Claude 3 Haiku</span></a><span style="font-weight:400;">.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Unlike other </span><a href="https://euneedsai.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">multi-modal models</span></a><span style="font-weight:400;">, both pre-trained and aligned models are available to be fine-tuned for custom applications using torchtune and deployed locally using torchchat. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In addition, they are launching Llama Stack distributions, which greatly simplify the way developers work with Llama models in different environments from single node, on-prem, cloud and on device, enabling turnkey RAG and tooling-enabled applications with integrated safety.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Models are available on </span><a href="http://llama.com" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Llama.com</span></a><span style="font-weight:400;"> and </span><a href="https://huggingface.co/meta-llama" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Hugging Face</span></a><span style="font-weight:400;"> and various partner platforms.  </span></li>
</ul>
<p><i><span style="font-weight:400;">04:58  Ryan – “</span></i><i><span style="font-weight:400;">I’m excited about the stack distributions just because it’s, you know, makes using these things a lot easier. I love the idea of having a turnkey rag and, you know, being able to sort of create that more dynamically without going too deep into AI and knowing how, you know, the sausage is made. And then, you know, the fact that they’re making models small enough to fit on edge and mobile devices is just great.”</span></i></p>
<p> </p>
<p><b>07:06 </b><a href="https://www.databricks.com/blog/introducing-meta-llama-32-databricks-faster-language-models-and-powerful-multi-modal-models" target="_blank" rel="noreferrer noopener"><b>Introducing Meta Llama 3.2 on Databricks: faster language models and </b></a></p>
<p><a href="https://www.databricks.com/blog/introducing-meta-llama-32-databricks-faster-language-models-and-powerful-multi-modal-models" target="_blank" rel="noreferrer noopener"><b>powerful multi-modal models</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Databricks now supports Meta Llama 3.2 </span></li>
</ul>
<p> </p>
<h2><span style="font-weight:400;">AWS</span></h2>
<p> </p>
<p><b>07:35 </b><a href="https://aws.amazon.com/blogs/aws/run-your-compute-intensive-and-general-purpose-workloads-sustainably-with-the-new-amazon-ec2-c8g-m8g-instances/" target="_blank" rel="noreferrer noopener"><b>Run your compute-intensive and general purpose workloads sustainably </b></a></p>
<p><a href="https://aws.amazon.com/blogs/aws/run-your-compute-intensive-and-general-purpose-workloads-sustainably-with-the-new-amazon-ec2-c8g-m8g-instances/" target="_blank" rel="noreferrer noopener"><b>with the new Amazon EC2 C8g, M8g instances</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Last week, we talked about the new C8g instances, but alongside those, Amazon has launched the Graviton 4-powered M8g instances with even more CPU and memory. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">M8g instances can have up to 192 VCPu, 768 GB of memory, 50 Gbps of network bandwidth, and 40 GB of </span><a href="https://aws.amazon.com/ebs/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">EBS</span></a><span style="font-weight:400;"> bandwidth. </span></li>
<li style="font-weight:400;"><a href="https://www.aboutamazon.com/news/aws/graviton4-aws-cloud-computing-chip" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Graviton 4</span></a><span style="font-weight:400;"> processors offer enhanced security with always-on encryption, dedicated caches for every vCPU and support for pointer authentication.  </span></li>
</ul>
<p> </p>
<p><i><span style="font-weight:400;">08:58  Ryan – “</span></i><i><span style="font-weight:400;">I don’t know why you guys are more concerned about the headline because I was like, what is a sustainable workload when you’re talking about 192 vcpu and all the gobs of memory and you go through the entire blog post, they don’t mention it. They don’t mention anything about the power or the CO2 or anything. And so you’re just less to assume that because it’s Graviton, it’s more energy efficient. But I am claiming clickbait. I call bullshit.”</span></i></p>
<p> </p>
<p><b>10:19 </b><a href="https://aws.amazon.com/blogs/aws/introducing-llama-3-2-models-from-meta-in-amazon-bedrock-a-new-generation-of-multimodal-vision-and-lightweight-models/" target="_blank" rel="noreferrer noopener"><b>Introducing Llama 3.2 models from Meta in Amazon Bedrock: A new </b></a></p>
<p><a href="https://aws.amazon.com/blogs/aws/introducing-llama-3-2-models-from-meta-in-amazon-bedrock-a-new-generation-of-multimodal-vision-and-lightweight-models/" target="_blank" rel="noreferrer noopener"><b>generation of multimodal vision and lightweight models</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS gets Lama 3.2 90B &amp; 11 B vision, 3B and 1B text only models in </span><a href="https://aws.amazon.com/sagemaker/jumpstart/?p=pm&amp;c=sm&amp;z=2" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">SageMaker</span></a><span style="font-weight:400;">.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Woohoo.</span></li>
</ul>
<p> </p>
<p><b>28:31 </b><a href="https://aws.amazon.com/blogs/containers/migrating-from-aws-app-mesh-to-amazon-ecs-service-connect/" target="_blank" rel="noreferrer noopener"><b>Migrating from AWS App Mesh to Amazon ECS Service Connect</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS has decided to deprecate </span><a href="https://aws.amazon.com/app-mesh/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS App Mesh</span></a><span style="font-weight:400;"> effective September 30th, 2026.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Until this date, AWS App Mesh customers will be able to use the service as normal, including creating new resources and onboarding new accounts via the AWS CLI and AWS Cloudformation. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">However, new customers will no longer be able to onboard to AWS App Mesh starting on September 24th, 2024. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">This blog post walks you through the differences of the two solutions and how to migrate to the new solution. This is the way all deprecations should be done on AWS. </span></li>
</ul>
<p> </p>
<p><i><span style="font-weight:400;">11:09  Justin – “</span></i><i><span style="font-weight:400;">Thank you, Amazon, for writing a thorough blog post detailing how to get this done versus just silently canceling a service in the community post. I appreciate it.”</span></i></p>
<p> </p>
<p><b>14:34 </b><a href="https://aws.amazon.com/blogs/storage/switch-your-file-share-access-from-amazon-fsx-file-gateway-to-amazon-fsx-for-windows-file-server/" target="_blank" rel="noreferrer noopener"><b>Switch your file share access from Amazon FSx File Gateway to Amazon </b></a></p>
<p><a href="https://aws.amazon.com/blogs/storage/switch-your-file-share-access-from-amazon-fsx-file-gateway-to-amazon-fsx-for-windows-file-server/" target="_blank" rel="noreferrer noopener"><b>FSx for Windows File Server</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">While the use of App Mesh is a bit of a big deal, this one feels a bit more like a yawn to us. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">As of October 28th, 2024 new customers will no longer be able to deploy </span><a href="https://aws.amazon.com/storagegateway/file/fsx/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon FSX File Gateways</span></a><span style="font-weight:400;">.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">FSX File Gateway is a type of </span><a href="https://aws.amazon.com/storagegateway/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS storage gateway</span></a><span style="font-weight:400;">, with local caching designed to be deployed on premises.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">FSX File gateway optimizes on-premise access to fully managed file shares in </span><a href="https://aws.amazon.com/fsx/windows/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon FSX for Windows FIle Server</span></a><span style="font-weight:400;">.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With the drop in bandwidth costs and increasing availability, many clients can access FSX for Windows File Server in the cloud from their on-premise location without the need for a gateway or local cache.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Those who still need a local cache will find that Amazon </span><a href="https://aws.amazon.com/fsx/netapp-ontap/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">FSX for Netapp Ontap</span></a><span style="font-weight:400;"> using </span><a href="https://aws.amazon.com/blogs/storage/caching-data-using-amazon-fsx-for-netapp-ontap/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">FlexCache or Global File Cache</span></a><span style="font-weight:400;"> can serve their needs.  </span></li>
</ul>
<p><span style="font-weight:400;"> </span></p>
<p> </p>
<p><i><span style="font-weight:400;">15:49  Matthew – “</span></i><i><span style="font-weight:400;">It’s more interesting that this is the first one they decided to kill off, not the other services that have been around. Because years ago when they first had all the storage gateway, there were like the three types they had. And obviously they had the fourth, but like they didn’t kill off any of the S3 ones that were related. If you’re talking about things like network latency and everything else, where blob storage is meant to kind of handle that, where Samba shares, SIF shares.”</span></i></p>
<p> </p>
<h2><span style="font-weight:400;">GCP</span></h2>
<p> </p>
<p><b>17:54 </b><a href="https://siliconangle.com/2024/09/25/google-files-eu-antitrust-complaint-microsoft-software-licensing/" target="_blank" rel="noreferrer noopener"><b>Google files EU antitrust complaint against Microsoft over software </b></a></p>
<p><a href="https://siliconangle.com/2024/09/25/google-files-eu-antitrust-complaint-microsoft-software-licensing/" target="_blank" rel="noreferrer noopener"><b>licensing</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google has filed an antitrust complaint against Microsoft corp within the European commission. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The move has to do with Windows Server. Per Google, a set of licensing terms that MS applied to the OS in 2019 harmed competition and raised costs for its customers. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Under the revised usage terms, customers must pay additional fees if they wish to move their windows server licenses from Azure to rival platforms such as Google Cloud. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google claims that this can result in a 400% increase to run Windows on rival clouds. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google wasn’t done, complaining that companies that run windows servers on third party cloud platforms get limited access to security patches, compared to Azure users and the search giant argues there are other “interoperability barriers”</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This complaint comes two years after CISPE filed a similar complaint, but they </span><a href="https://cispe.cloud/cispe-and-microsoft-agree-settlement-in-fair-software-licensing-case/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">withdrew</span></a><span style="font-weight:400;"> it after reaching an </span><a href="https://www.reuters.com/technology/microsoft-clinches-deal-settle-cispe-antitrust-complaint-2024-07-10/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">agreement</span></a><span style="font-weight:400;"> with Microsoft. </span></li>
</ul>
<p> </p>
<p><i><span style="font-weight:400;">18:52  Ryan – “</span></i><i><span style="font-weight:400;">The Microsoft press releases for this have been worded very differently in the sense of like, it’s features built into the Azure workloads. And so it’s like, while you say that, they’re not granting the ability to Windows servers to get security patches on other clouds. The reality is, it’s only because they have the workloads running in Azure that they can offer the enhanced security patches, or at least I presume that. I guess I don’t know that. But yeah, and then the Windows licensing, it’s a service. Your licensing fees are built into using this service. yeah, competitive advantage.”</span></i></p>
<p> </p>
<p><b>20:11 </b><a href="https://cloud.google.com/blog/products/data-analytics/bigquery-vector-search-is-now-ga/" target="_blank" rel="noreferrer noopener"><b>BigQuery vector search now GA, setting the stage for a new class of </b></a></p>
<p><a href="https://cloud.google.com/blog/products/data-analytics/bigquery-vector-search-is-now-ga/" target="_blank" rel="noreferrer noopener"><b>AI-powered analytics</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">BigQuery </span><a href="https://cloud.google.com/bigquery/docs/vector-search-intro" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Vector Search</span></a><span style="font-weight:400;"> is now generally available, enabling vector similarity search on BigQuery data. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This functionality, also commonly referred to as approximate nearest-neighbor search, is the key to empowering numerous new data and AI use cases such as semantic search, similarity detection, and retrieval-augmented generation (RAG) with large language models. </span></li>
<li style="font-weight:400;"><a href="https://cloud.google.com/blog/products/data-analytics/introducing-new-vector-search-capabilities-in-bigquery" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Initially announced in February,</span></a><span style="font-weight:400;"> BigQuery vector search integrates generation, management and search of embeddings within the data platform to provide a serverless and integrated vector analytics solution for use cases such as </span><a href="https://cloud.google.com/blog/products/data-analytics/bigquery-vector-search-for-log-analysis?e=48754805" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">anomaly detection</span></a><span style="font-weight:400;">, multi-modal search, </span><a href="https://medium.com/google-cloud/getting-ice-cream-recommendations-at-scale-with-gemini-embeddings-and-vector-search-cf1f61a3d55b" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">product recommendations</span></a><span style="font-weight:400;">, drug discovery and more. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In addition, </span><a href="https://cloud.google.com/bigquery/docs/vector-index#ivf-index" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">IVF or Inverted File Index</span></a><span style="font-weight:400;"> for BigQuery vector search is also GA, this index uses a k-means algorithm to cluster the vector data and combines it with an inverted row locator in a two-piece index in order to efficiently search similar embedding representations of your data. IVF includes several new enhancements:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Improved scalability</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Managed index with guaranteed correctness</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Stored Columns</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Pre-filters</span></li>
</ul>
</li>
</ul>
<p> </p>
<p><i><span style="font-weight:400;">22:15  Justin – “…</span></i><i><span style="font-weight:400;">so my experience so far with costing of AI things is that it’s not as expensive as people fear it is. If you’re building a foundational model, 100%, it’s expensive. need lots of Nvidia GPUs, you know, that kind of stuff. But, know, if you’re using like inference nodes and you’re doing, you know, you’re using an LLM to respond or using rag to augment, like it isn’t as expensive as you might think it is to do those things, at least at some scale. you know, not as much as you might fear.”</span></i></p>
<p> </p>
<p><b>24:47 </b><a href="https://cloud.google.com/blog/products/databases/google-cloud-database-news-for-september-2024/" target="_blank" rel="noreferrer noopener"><b>Google Cloud database news roundup, September 2024 edition</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google summarizes a busy month of announcements for September 2024.</span>
<ul>
<li style="font-weight:400;"><a href="https://www.googlecloudpresscorner.com/2024-09-09-Oracle-and-Google-Cloud-Announce-the-General-Availability-of-Oracle-Database-Google-Cloud" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Oracle Database GA in Google Cloud</span></a><span style="font-weight:400;"> (see last week’s show)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">New </span><a href="https://cloud.google.com/blog/products/databases/announcing-spanner-editions?e=48754805" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Spanner Editions</span></a><span style="font-weight:400;"> are now generally available across Standard, Enterprise and Enterprise Plus. (also last week)</span></li>
<li style="font-weight:400;"><a href="https://cloud.google.com/sql/?hl=en" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Cloud SQL</span></a><span style="font-weight:400;"> has three new features that improve the cloud sql enterprise plus postgres and mysql capabilities </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Edition Upgrades for </span><a href="https://cloud.google.com/sql/docs/postgres/upgrade-cloud-sql-instance-to-enterprise-plus-in-place" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">in place upgrades</span></a></li>
<li style="font-weight:400;"><span style="font-weight:400;">MySQL </span><a href="https://cloud.google.com/sql/docs/mysql/upgrade-minor-db-version" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">minor version</span></a><span style="font-weight:400;"> upgrades</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Zonal (ie standalone) instances.</span></li>
</ul>
</li>
<li style="font-weight:400;"><a href="https://cloud.google.com/alloydb/?hl=en" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Alloy DB</span></a><span style="font-weight:400;"> now supports </span><a href="https://cloud.google.com/sql/docs/postgres/release-notes#September_12_2024" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">PostgreSQL</span></a><span style="font-weight:400;"> 16 in preview</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Node-level metrics on </span><a href="https://cloud.google.com/memorystore/docs/cluster/memorystore-for-redis-cluster-overview" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Memorystore for Redis Clusters</span></a></li>
<li style="font-weight:400;"><a href="https://cloud.google.com/memorystore/docs/valkey/product-overview" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Memorystore for Valkey</span></a><span style="font-weight:400;"> support </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">And </span><a href="https://cloud.google.com/firestore/docs/vector-search" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">KNN Vector searches</span></a><span style="font-weight:400;"> for </span><a href="https://cloud.google.com/firestore/?hl=en" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Firestore</span></a><span style="font-weight:400;"> as Generally available</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Busy month covered here at the cloud pod (didn’t talk about that because justin refuses to discuss Firestore.)</span></li>
</ul>
<p> </p>
<p><b>26:18 </b><a href="https://cloud.google.com/blog/products/ai-machine-learning/announcing-vertex-ai-prompt-optimizer/" target="_blank" rel="noreferrer noopener"><b>Announcing Public Preview of Vertex AI Prompt Optimizer</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Prompt design and engineering stands out as one of the most approachable methods to drive meaningful output from LLM. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">However, prompting large language models can feel like navigating a complex maze. You must experiment with various combinations of instructions and examples to achieve the desired output.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Taking a prompt and moving it from one LLM to another is challenging because different language models behave differently. Simply reusing a prompt is ineffective, so users need an intelligent prompt optimizer to generate useful usps.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">To help solve this problem google is announcing </span><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/learn/prompts/prompt-optimizer" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Vertex AI Prompt Optimizer</span></a><span style="font-weight:400;"> in public preview. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Prompt optimizer makes it easy to optimize, handles versatile tasks and expanded support for multi-modal tasks, comprehensive evaluations and flexible and customizable. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Built for data driven optimization and built for Gemini.  </span></li>
</ul>
<p> </p>
<p><i><span style="font-weight:400;">27:48 Ryan – “I </span></i><i><span style="font-weight:400;">feel like I’m ahead of my time because I have not retrained my brain. But what I have learned to do is just ask AI how I should ask it. then, so I feel like this is basically just service-flying my normal use case, which is like, hey, I want to do a thing. How do I ask you to do a thing? And then it asks itself much better than I would have.”</span></i></p>
<p> </p>
<p><b>29:22 </b><a href="https://cloud.google.com/blog/products/databases/vector-search-for-memorystore-for-valkey-and-redis-cluster/" target="_blank" rel="noreferrer noopener"><b>From millions to billions: Announcing vector search in Memorystore for </b></a></p>
<p><a href="https://cloud.google.com/blog/products/databases/vector-search-for-memorystore-for-valkey-and-redis-cluster/" target="_blank" rel="noreferrer noopener"><b>Valkey and Redis Cluster</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is announcing </span><a href="https://cloud.google.com/blog/products/databases/memorystore-for-redis-vector-search-and-langchain-integration?e=48754805" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">vector search</span></a><span style="font-weight:400;"> on both the </span><a href="https://cloud.google.com/blog/products/databases/announcing-memorystore-for-valkey?e=48754805" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Memorystore for Valkey</span></a><span style="font-weight:400;"> and Memorystore for Redis Clusters.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Combining ultra-low latency in-memory vector search with zero-downtime scalability and high performance vector search across millions or billions of vectors. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Currently in preview, vector support for these Memorystore offerings mean you can now scale out your cluster by scaling out to 250 shards, storing billions of vectors in a single instance. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Vector search with Redis can produce single-millisecond latency on over a billion vectors with greater than 99% recall. </span></li>
</ul>
<p> </p>
<p><i><span style="font-weight:400;">29:57  Justin – “</span></i><i><span style="font-weight:400;">I don’t know if I would say that Redis or Valkey is, you know, zero downtime, but sure, okay.”</span></i></p>
<p> </p>
<p><b>31:53 </b><a href="https://cloud.google.com/blog/products/containers-kubernetes/how-class-e-addresses-solve-for-ip-address-exhaustion-in-gke/" target="_blank" rel="noreferrer noopener"><b>Leveraging Class E IPv4 Address space to mitigate IPv4 exhaustion issues </b></a></p>
<p><a href="https://cloud.google.com/blog/products/containers-kubernetes/how-class-e-addresses-solve-for-ip-address-exhaustion-in-gke/" target="_blank" rel="noreferrer noopener"><b>in GKE</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">As most technologists know, we are rapidly running out of IPV4 space, and the number of applications and services hosted on GKE continues to grow consuming even more private Ipv4 address space.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">For many large organizations, the </span><a href="https://simple.wikipedia.org/wiki/Private_network" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">RFC 1918</span></a><span style="font-weight:400;"> address space is becoming increasingly scarce, leading to IP Address Exhaustion challenges that impact their applications at scale. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Ipv6 solves this exact issue by providing more addresses but not all enterprises or applications are ready for IPv6 yet.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Bringing Class E IPV4 address space (240.0.0.0/4) can address the challenges as you continue to grow. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Class E addresses are reserved for future use, as noted in RFC5735 and RFC 1112, however, that doesn’t mean you can’t use them today in certain circumstances. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This blog post goes into the details of how to do this, which I found pretty interesting.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The following are some common objections or misconceptions about using Class E addresses:</span></li>
</ul>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Class E addresses do not work with other Google services. This is not true. Google Cloud VPC includes class E addresses as part of its </span><a href="https://cloud.google.com/vpc/docs/subnets#valid-ranges" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">valid address ranges for IPV4</span></a><span style="font-weight:400;">. Further, many Google managed services can be accessed using private connectivity methods with Class E addresses.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Using Class E addresses limits communicating with services outside Google (internet / Interconnect to on-prem/other cloud). Misleading. Given that Class E addresses are non-routable and not advertised over the internet or outside of Google Cloud, you can use  </span><a href="https://cloud.google.com/nat/docs/overview" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">NAT</span></a><span style="font-weight:400;"> or </span><a href="https://cloud.google.com/kubernetes-engine/docs/concepts/ip-masquerade-agent" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">IP masquerading</span></a><span style="font-weight:400;"> to translate Class E addresses to public or private IPv4 addresses to reach destinations outside of Google Cloud. In addition, </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">With the notable exception of Microsoft Windows, many operating systems now support Class E addresses.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Many on-prem vendors (Cisco, Juniper, Arista) support routing Class E addresses for private DC use. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Class E addresses have performance/scale limitations. This is not true. There is no performance difference for Class E addresses from other address ranges used in Google Cloud. Even with NAT/IP Masquerade, agents can scale to support a large number of connections without impacting performance. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">So while Class E addresses are reserved for future use, not routable over the Internet, and should not be advertised over the public Internet, you can use them for private use within Google Cloud VPCs, for both Compute Engine instances and Kubernetes pods/services in GKE. </span></li>
</ul>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">There are several benefits of leveraging the Class E address space:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">It’s very large, while RFC 1918 has 17.9 million addresses, Class E has 268.4 million addresses. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Scalability and growth</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Efficient resource utilization</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Future-proofing</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">There are sharp edges, though. Not all OSs will support Class E addressing, and networking equipment and software such as routers and firewalls need to be able to support Class E addresses. Transitioning from RFC 1918 to Class E requires careful planning and execution. </span></li>
</ul>
<p> </p>
<p><i><span style="font-weight:400;">35:55  Justin – “</span></i><i><span style="font-weight:400;">I did do a quick Google search, does Windows support Class E addresses? And no, it does not. Windows blocks Class E addresses and doesn’t allow them to be assigned to a NIC through DHCP. Apparently though, you can set one up in Azure as your VPC virtual network, but they say it will not work for your Windows boxes and it may have compatibility issues with your Linux boxes. Which, yeah, cool, cool, cool. But you know.”</span></i></p>
<p> </p>
<p><b>37:47 </b><a href="https://cloud.google.com/blog/products/ai-machine-learning/llama-3-2-metas-new-generation-models-vertex-ai/" target="_blank" rel="noreferrer noopener"><b>Meta’s Llama 3.2 is now available on Google Cloud</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Meta Llama 3.2 is on Google Cloud in the </span><a href="https://cloud.google.com/model-garden" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Vertex AI Model Garden</span></a></li>
<li style="font-weight:400;"><span style="font-weight:400;">By using Llama 3.2 on Vertex AI, you can:</span></li>
</ul>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Experiment with confidence: Explore Llama 3.2 capabilities through simple API calls and our comprehensive generative AI evaluation service within Vertex AI’s intuitive environment, without worrying about complex deployment processes.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Tailor Llama 3.2 to your exact needs: Fine-tune the model using your own data to build bespoke solutions tailored to your unique needs. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Ground your AI in truth: Make sure your AI outputs are reliable, relevant, and trustworthy with Vertex AI’s multiple options for grounding and RAG. For example, you can connect your models to enterprise systems, use Vertex AI Search for enterprise information retrieval, leverage Llama for generation, and more.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Craft intelligent agents: Create and orchestrate agents powered by Llama 3.2, using Vertex AI’s comprehensive set of tools, including LangChain on Vertex AI. Integrate Llama 3.2 into your AI experiences with </span><a href="https://firebase.google.com/docs/genkit" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Genkit</span></a><span style="font-weight:400;">’s Vertex AI plugin.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Deploy without overheads: Simplify deployment and scaling Llama 3.2 applications with flexible auto-scaling, pay-as-you-go pricing, and world-class infrastructure designed for AI.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Operate within your enterprise guardrails: Deploy with confidence with not only support for Meta’s Llama Guard for the models, but also Google Cloud’s built-in security, privacy, and compliance measures. Moreover, enterprise controls, such as Vertex AI Model Garden’s </span><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/control-model-access" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">new organization policy</span></a><span style="font-weight:400;">, provide the right access controls to make sure only approved models are accessed by users.</span></li>
</ul>
<p> </p>
<p><b>38:36 </b><a href="https://cloud.google.com/blog/products/databases/database-migration-service-for-sql-server/" target="_blank" rel="noreferrer noopener"><b>Migrate your SQL Server databases using Database Migration Service, now </b></a></p>
<p><a href="https://cloud.google.com/blog/products/databases/database-migration-service-for-sql-server/" target="_blank" rel="noreferrer noopener"><b>GA</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">DMS For SQL Server Databases is now Generally Available. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Database migrations are often challenging and require scarce expertise.</span></li>
<li style="font-weight:400;"><a href="https://cloud.google.com/database-migration" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Database Migration Service</span></a><span style="font-weight:400;"> has a unique approach to SQL Server database migrations:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Minimal Downtime and System Overhead</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Serverless Simplicity</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Security at the forefront</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">No additional charge</span></li>
</ul>
</li>
</ul>
<p> </p>
<p><i><span style="font-weight:400;">39:13  Ryan – “</span></i><i><span style="font-weight:400;">I like the service. I really just wish it would work server to server in the Cloud, but, cause then I could use it…It just, it doesn’t because they restricted it so that you have to define your endpoint as a Cloud SQL box.”</span></i></p>
<p> </p>
<h2><span style="font-weight:400;">Azure</span></h2>
<p> </p>
<p><b>40:20 </b><a href="https://azure.microsoft.com/en-us/blog/developer-insights-building-resilient-end-to-end-security/" target="_blank" rel="noreferrer noopener"><b>Developer insights: Building resilient end-to-end security</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">This is the first in a new series that will be on the Azure Blog on their </span><a href="https://azure.microsoft.com/en-us/explore/security/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">end-to-end approach</span></a><span style="font-weight:400;"> to cybersecurity. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The purpose of this series is to highlight how </span><a href="https://www.microsoft.com/en-us/security" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Microsoft Security</span></a><span style="font-weight:400;"> is transforming security platforms with practical, end-to-end security solutions for developers.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It’s a lot of fluffy overview in this first in the series, but we’ll keep an eye on it as it evolves to see what else Microsoft reveals. You’re welcome. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Unless you’re not familiar with a platform approach to security, then you should check it out in our show notes. </span></li>
</ul>
<p> </p>
<p><i><span style="font-weight:400;">41:22  Matthew – “</span></i><i><span style="font-weight:400;">I think it’s a good start to try to get people to think about security day one. There’s so many people think about security. They were ready to go to production. wait, this thing has to be. So doc comply or GDPR, whatever it is, you know, so I feel like it’s a good way to try to get developers to think security at the beginning versus security at the end. And if I have to say shift left, I might vomit a little.”</span></i></p>
<p> </p>
<p><b>42:39 </b><a href="https://azure.microsoft.com/en-us/blog/run-vcf-private-clouds-in-azure-vmware-solution-with-support-for-portable-vcf-subscriptions/" target="_blank" rel="noreferrer noopener"><b>Run VCF private clouds in Azure VMware Solution with support for portable </b></a></p>
<p><a href="https://azure.microsoft.com/en-us/blog/run-vcf-private-clouds-in-azure-vmware-solution-with-support-for-portable-vcf-subscriptions/" target="_blank" rel="noreferrer noopener"><b>VCF subscriptions</b></a><b>.</b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">For those of you who are paying for VMware cloud foundation bundles from Broadcom Vmware, you can now port those subscriptions to </span><a href="https://azure.microsoft.com/en-us/products/azure-vmware" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Microsoft’s Azure VMware Solution (AVS)</span></a><span style="font-weight:400;"> in a fast and easy way using familiar VMWare tools and skills. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">If you don’t have a VCF subscription, but want to take advantage of VCF and AVS you can buy your solution from Microsoft directly. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This may be a benefit for you as it includes the fully managed and maintained cloud and vmware infrastructure.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The VMWare Cloud Foundation stack which includes vSphere, vSAN, NSX and HCX as well as VCF Operations and VCF Automation (formerly the Aria Suite)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You also get extended security updates for Windows Server 2012 and SQL Server 2012 and 2014.  </span></li>
</ul>
<p> </p>
<p><b>43:53 </b><a href="https://blogs.microsoft.com/blog/2024/09/24/microsoft-trustworthy-ai-unlocking-human-potential-starts-with-trust/" target="_blank" rel="noreferrer noopener"><b>Microsoft Trustworthy AI: Unlocking human potential starts with trust</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft is focused on helping customers use and build AI that is trustworthy, meaning that it is secure, safe and private.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Security is the top priority, and their expanded </span><a href="https://aka.ms/SFIwebsite" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Secure Future Initiatives</span></a><span style="font-weight:400;"> underscore the company’s commitment and responsibility to make customers more secure. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">To enhance security with AI, they are launching </span><a href="https://aka.ms/XPIA-Evaluations" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Evaluations</span></a><span style="font-weight:400;"> in Azure AI Studio to support proactive risk assessments. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft 365 Copilot will provide </span><a href="https://aka.ms/IntroducingWebQueryTransparency" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">transparency into web queries</span></a><span style="font-weight:400;"> to help admins and users better understand how web search enhances the Copilot response. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In terms of Safety, they have several new features to ensure that the AI is safe and several new capabilities to mitigate risks. </span>
<ul>
<li style="font-weight:400;"><a href="https://aka.ms/GroundednessCorrection" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Correction capability</span></a><span style="font-weight:400;"> in Azure AI Content Safety Groundedness detection feature that helps fix hallucination issues in real-time before users see them. </span></li>
<li style="font-weight:400;"><a href="https://aka.ms/EmbeddedContentSafety" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Embedded content safety</span></a><span style="font-weight:400;"> allows customers to embed Azure AI content safety on devices. This is important for on-device scenarios where connectivity could be unavailable or intermittent. </span></li>
<li style="font-weight:400;"><a href="https://aka.ms/XPIA-Evaluations" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">New evaluations</span></a><span style="font-weight:400;"> in Azure AI studio to help customers assess the quality and relevancy of outputs and how often their AI application outputs protected material</span></li>
<li style="font-weight:400;"><a href="https://aka.ms/ProtectedMaterial-CodeReferencing" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Protected material detection</span></a><span style="font-weight:400;"> for code is now in preview in Azure AI content safety to help detect pre-existing content and code. This feature helps developers explore public source code in GitHub repos, fostering collaboration and transparency while enabling more informed coding decisions. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">And finally, in privacy, they are announcing:</span>
<ul>
<li style="font-weight:400;"><a href="https://aka.ms/ConfidentialInferencingBlog" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Confidential inference</span></a><span style="font-weight:400;"> in preview in the Azure OpenAI service whisper model, so customers can develop generative AI applications that support verifiable end-to-end privacy. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">General Availability of </span><a href="https://aka.ms/CVM-H100-GA" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Confidential VMs with NVIDIA h100 tensor core GPU</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><a href="https://azure.microsoft.com/en-us/blog/enterprise-trust-in-azure-openai-service-strengthened-with-data-zones/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Azure Open Data Zones</span></a><span style="font-weight:400;"> for the EU and US are coming soon and build on existing data residency provided by the Azure OpenAI service by making it easier to manage the data processing and storage of generative AI applications. This new functionality offers customers the flexibility of scaling generative AI applications across all Azure regions with a geography while giving them control of data processing and storage with the EU or US. </span></li>
</ul>
</li>
</ul>
<p> </p>
<p><i><span style="font-weight:400;">45:55  Ryan – </span></i><i><span style="font-weight:400;">“That’s an interesting wrinkle that I hadn’t thought of before. You know, the computation of these AI models and having that all be within specific regions for, I guess, GDPR reasons.”</span></i></p>
<h2><span style="font-weight:400;">Oracle</span></h2>
<p><b>47:51 </b><a href="https://www.oracle.com/news/announcement/oracle-releases-java-23-2024-09-17/" target="_blank" rel="noreferrer noopener"><b>Oracle Releases Java 23</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Oracle is launching </span><a href="https://www.oracle.com/java/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Java 23</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">We still don’t know how we got from 8 to 23, but here we are. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Java 23 is supported by the recent GA of </span><a href="https://docs.oracle.com/en-us/iaas/jms/index.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Java Management Service</span></a><span style="font-weight:400;"> 9.0, an </span><a href="https://www.oracle.com/cloud/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OCI</span></a><span style="font-weight:400;"> Native Service that provides a unified console to help organizations manage Java runtimes and applications on-premise or in the cloud.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">JSM 9 includes usability improvements and JDK 23 provides more options for fine-tune and improve peak performance with the addition of the Graal compiler, a dynamic just-in-time compilation written in Java that transforms bytecode into optimized machine code. </span></li>
</ul>
<p> </p>
<p><i><span style="font-weight:400;">48:40  Justin – </span></i><i><span style="font-weight:400;">“…if you’re paying Oracle’s ridiculous Java fees and not using Coretto or any of the other numerous Java ports that have happened, you can get this from Oracle for Java 23.”</span></i></p>
<p> </p>
<p><b>51:06 </b><a href="https://siliconangle.com/2024/09/09/oracles-stock-pops-strong-earnings-beat-driven-cloud-growth-new-partnerships/" target="_blank" rel="noreferrer noopener"><b>Oracle’s stock pops on strong earnings beat, driven by cloud growth and </b></a></p>
<p><a href="https://siliconangle.com/2024/09/09/oracles-stock-pops-strong-earnings-beat-driven-cloud-growth-new-partnerships/" target="_blank" rel="noreferrer noopener"><b>new partnerships</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Oracle’s recent quarter was good with earnings per share of $1.39 vs the target of 1.32. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Revenue for the quarter rose 8% from a year before, to 13.31 billion, better than wall street estimates.  Net income rose to 2.93 B up from 2.42 billion in the same period. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Cloud service and license support revenue rose 10% from a year earlier to 10.52 billion. Whereas cloud infrastructure grew 45% to 2.2 billion up from 2.42 billion in the same period a year earlier. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Catz said that demand is outstripping supply and she is ok with that. </span></li>
</ul>
<p> </p>
<p><i><span style="font-weight:400;">51:40  Justin – </span></i><i><span style="font-weight:400;">“I don’t really understand if cloud service and licensing is like Oracle licensing and cloud OCI revenue shoved together. And then they also break out cloud infrastructure into its own number, but like 2.2 billion is not a lot of money for a cloud.”</span></i></p>
<p> </p>
<p><b>52:48 </b><a href="https://blogs.oracle.com/cloud-infrastructure/post/announcing-ga-oci-compute-amd-mi300x-gpus" target="_blank" rel="noreferrer noopener"><b>Announcing General Availability of OCI Compute with AMD MI300X GPUs</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">OCI is announcing the GA of bare metal instances with the AMD Instinct MI300X GPU. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">OCI Supercluster with AMD instinct MI300x accelerators provide high-throughput, ultra-low latency RDMA cluster network architecture for up to 16,384 MI300X GPUs. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">A single instance will be 6.00 per hour, and include 8 AMD Instinct Mi300X accelerator. 1.5TB of memory, Intel Sapphire Rapids CPU, and 2TB of DDR 5 memory, and 8×3.84 TB NVME drives with frontend network support 100G.   </span></li>
</ul>
<p><i><span style="font-weight:400;">53:35  Matthew- </span></i><i><span style="font-weight:400;">“</span></i><i><span style="font-weight:400;">I still say you’re dong the cloud wrong.”</span></i></p>
<p> </p>
<h2><span style="font-weight:400;">Aftershow</span></h2>
<p> </p>
<p><b>54:48 </b><a href="https://www.systeminit.com/blog-system-initiative-is-the-future" target="_blank" rel="noreferrer noopener"><b>System Initiative is the Future</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Adam Jacobs has announced his new startup, System Initiative. Jacobs is a well-known DevOps founder who was one of the engineers behind Chef.</span></li>
</ul>
<ul>
<li style="font-weight:400;"><b>Revolutionary DevOps Technology</b><span style="font-weight:400;">: System Initiative is introduced as a game-changing DevOps automation tool. It offers a fresh approach that addresses long-standing industry issues, such as slow feedback loops and complex infrastructure challenges.</span></li>
<li style="font-weight:400;"><b>Building What You Believe In</b><span style="font-weight:400;">: The founder emphasizes the importance of building products you are passionate about. This project is the culmination of five years of work, but feels like the culmination of a career in DevOps tools.</span></li>
<li style="font-weight:400;"><b>The Problem with Infrastructure as Code</b><span style="font-weight:400;">: While functional, infrastructure as code is limited. It locks systems in static representations of dynamic environments, causing inefficiencies. The founder believes the industry is stuck and needs new solutions.</span></li>
<li style="font-weight:400;"><b>Digital Twins &amp; Simulation</b><span style="font-weight:400;">: A key innovation in System Initiative is using 1:1 digital twins of cloud infrastructure, decoupling real and hypothetical states. This solves the feedback loop problem by simulating infrastructure changes without deploying them.</span></li>
<li style="font-weight:400;"><b>200% Problem Solved</b><span style="font-weight:400;">: System Initiative simplifies automation by eliminating the need to understand the underlying domain and the tool itself. Its digital twins offer a 1:1 translation with no loss of fidelity.</span></li>
<li style="font-weight:400;"><b>Complexity in DevOps</b><span style="font-weight:400;">: The founder reflects on working with major enterprises and the complexity inherent in all infrastructure. System Initiative embraces this complexity with a platform designed to be powerful, flexible, and expressive.</span></li>
<li style="font-weight:400;"><b>Reactive Programming for Flexibility</b><span style="font-weight:400;">: System Initiative’s infrastructure is based on a reactive graph of functions, making it easier to create, modify, and automate complex environments dynamically.</span></li>
<li style="font-weight:400;"><b>Multiplayer Collaboration</b><span style="font-weight:400;">: System Initiative enables real-time collaboration, allowing multiple users to work on the same infrastructure and see changes instantly. This drastically improves communication and productivity in DevOps teams.</span></li>
<li style="font-weight:400;"><b>Open Source &amp; Community Focus</b><span style="font-weight:400;">: The project is 100% open source, inviting contributions and fostering a collaborative community to build and extend the platform.</span></li>
<li style="font-weight:400;"><b>Future of DevOps Automation</b><span style="font-weight:400;">: The System Initiative aims to replace Infrastructure as Code today and transform how teams work together in complex environments in the future. It’s presented as the next step in the evolution of DevOps.</span></li>
</ul>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">These points should frame your conversation around the key innovations, the philosophical drive behind the project, and the technology’s transformative potential.</span></li>
</ul>
<p> </p>
<h2><span style="font-weight:400;">Closing</span></h2>
<p><span style="font-weight:400;">And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</span></p>
]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/1856367/c1e-2okob8zmg6um9dd7-6zw11orvu555-p2siwp.mp3" length="68421802"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[​Welcome to episode 277 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan, and Matthew are your hosts this week for a news packed show. This week we dive into the latest in cloud computing with announcements from Google’s new AI search tools, Meta’s open-sourced AI models, and Microsoft Copilot’s expanded capabilities. We’ve also got Oracle releases, and some non-liquid Java on the agenda (but also the liquid kind, too) and Class E IP addresses. Plus, be sure to stay tuned for the aftershow! 
 
Titles we almost went with this week:
Which cloud provider does not have llama 3.2
Vmware says we will happily help you support your old Microsoft OS’s for $$$$
Class E is the best kind of IP Space
Microsoft says trust AI, and so does Skynet
3.2 Llama’s walked into an AI bar… 
Google gets cranky about MS Licensing, join the club
✍️Write Your Prompts, Optimize them with Vertex Prompts Analyzer, rinse repeat into a  
     vortex of optimization
️Oracle releases Java 23, Cloud Pod Uses Amazon Corretto 23 instead
Oracle releases Java 23, Cloud Pod still says run! MK 
 
A big thanks to this week’s sponsor: Archera
There are a lot of cloud cost management tools out there. But only Archera provides cloud commitment insurance. It sounds fancy but it’s really simple. Archera gives you the cost savings of a 1 or 3 year AWS Savings Plan with a commitment as short as 30 days. If you don’t use all the cloud resources you’ve committed to, they will literally put money back in your bank account to cover the difference. Other cost management tools may say they offer “commitment insurance”, but remember to ask: will you actually give me my money back? Archera will. Click this link to check them out
https://shortclick.link/uthdi1
AI Is Going Great – Or How ML Makes All It’s Money
 
01:06 OpenAI CTO Mira Murati, 2 other execs announce they’re leaving

Listener Note: paywall article 
OpenAI Chief Technology Officer Mira Murati is leaving, and within hours, two more OpenAI executives joined the list of high-profile departures.
Mira Murati spent 6.5 years at the company, and was named CEO temporarily when the board ousted co-founder Sam Altman.  
“It’s hard to overstate how much Mira has meant to OpenAI, our mission, and to us all personally,” Altman wrote. “I feel tremend...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/1856367/c1a-k5d5-pk4njd1obnk9-qc0zep.jpg"></itunes:image>
                                                                            <itunes:duration>01:09:20</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                            </item>
                    <item>
                <title>
                    <![CDATA[276: New from AWS – Elastic Commute – Flex Your Way to an Empty Office]]>
                </title>
                <pubDate>Mon, 30 Sep 2024 20:27:14 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/1852219</guid>
                                    <link>https://tcpfm.castos.com/episodes/276-elastic-commute</link>
                                <description>
                                            <![CDATA[<p><span style="font-weight:400;">Welcome to episode 276 of The Cloud Pod, where the forecast is always cloudy! This week, our hosts Justin, Matthew, and Jonathan do a speedrun of OpenWorld news, talk about energy needs and the totally not controversial decision to reopen 3 Mile Island, a “managed” exodus from cloud, and Kubernetes news. As well as Amazon’s RTO we are calling “Elastic Commute”. All this and more, right now on The Cloud Pod. </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">The Cloud Pod Hosts don’t own enough pants for five days a week</span></li>
<li><span style="font-weight:400;">IBM thinks it can contain the cost of K8s</span></li>
<li><span style="font-weight:400;">Microsoft loves nuclear energy</span></li>
<li><span style="font-weight:400;">The Cloudpod tries to give Oracle some love and still does not care</span></li>
<li><span style="font-weight:400;">The cloud pod goes nuclear on k8s costs</span></li>
<li><span style="font-weight:400;">⛽Can IBM contain the costs of Kubernetes and Nuclear Power? </span></li>
<li><span style="font-weight:400;">Google takes on take over while microsoft takes on nuclear</span></li>
<li><span style="font-weight:400;">AWS Launches ‘Managed Exodus’: Streamline Your Talent Drain</span></li>
<li><span style="font-weight:400;">Introducing Amazon WorkForce Alienation™: Scale Your Employee Discontent to the </span><span style="font-weight:400;">Cloud</span></li>
<li><span style="font-weight:400;">Amazon SageMaker Studio Lab: Now with Real-Time Resignation Prediction</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><span style="font-weight:400;">General News</span></h2>
<p><b>01:08 </b><a href="https://techcrunch.com/2024/09/17/ibm-acquires-kubernetes-cost-optimization-startup-kubecost/" target="_blank" rel="noreferrer noopener"><b>IBM acquires Kubernetes cost optimization startup Kubecost</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">IBM is quickly becoming the place where cloud cost companies go to assimilate? Or Die? Rebirthed mabe? Either way, it’s not a great place to end up. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">On Tuesday they announced the acquisition of </span><a href="https://www.kubecost.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Kubecost</span></a><span style="font-weight:400;">, a </span><a href="https://www.finops.org/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">FinOps</span></a><span style="font-weight:400;"> startup that helps teams monitor and optimize their K8 clusters, with a focus on efficiency – and ultimately cost. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This acquisition follows the acquisitions of Apptio, </span><a href="https://techcrunch.com/2021/04/29/ibm-is-acquiring-turbonomic-valued-at-963m-in-2019/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Turbonomic</span></a><span style="font-weight:400;">, and </span><a href="https://techcrunch.com/2020/11/18/ibm-is-acquiring-apm-startup-instana-as-it-continues-to-expand-hybrid-cloud-vision/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Instana</span></a><span style="font-weight:400;"> over the years. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Kubecost is the company behind </span><a href="https://www.opencost.io/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OpenCost</span></a><span style="font-weight:400;">; a vendor-neutral open source project that forms part of the core Kubecost...</span></li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 276 of The Cloud Pod, where the forecast is always cloudy! This week, our hosts Justin, Matthew, and Jonathan do a speedrun of OpenWorld news, talk about energy needs and the totally not controversial decision to reopen 3 Mile Island, a “managed” exodus from cloud, and Kubernetes news. As well as Amazon’s RTO we are calling “Elastic Commute”. All this and more, right now on The Cloud Pod. 
Titles we almost went with this week:

The Cloud Pod Hosts don’t own enough pants for five days a week
IBM thinks it can contain the cost of K8s
Microsoft loves nuclear energy
The Cloudpod tries to give Oracle some love and still does not care
The cloud pod goes nuclear on k8s costs
⛽Can IBM contain the costs of Kubernetes and Nuclear Power? 
Google takes on take over while microsoft takes on nuclear
AWS Launches ‘Managed Exodus’: Streamline Your Talent Drain
Introducing Amazon WorkForce Alienation™: Scale Your Employee Discontent to the Cloud
Amazon SageMaker Studio Lab: Now with Real-Time Resignation Prediction

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
General News
01:08 IBM acquires Kubernetes cost optimization startup Kubecost 

IBM is quickly becoming the place where cloud cost companies go to assimilate? Or Die? Rebirthed mabe? Either way, it’s not a great place to end up. 
On Tuesday they announced the acquisition of Kubecost, a FinOps startup that helps teams monitor and optimize their K8 clusters, with a focus on efficiency – and ultimately cost. 
This acquisition follows the acquisitions of Apptio, Turbonomic, and Instana over the years. 
Kubecost is the company behind OpenCost; a vendor-neutral open source project that forms part of the core Kubecost...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[276: New from AWS – Elastic Commute – Flex Your Way to an Empty Office]]>
                </itunes:title>
                                    <itunes:episode>276</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p><span style="font-weight:400;">Welcome to episode 276 of The Cloud Pod, where the forecast is always cloudy! This week, our hosts Justin, Matthew, and Jonathan do a speedrun of OpenWorld news, talk about energy needs and the totally not controversial decision to reopen 3 Mile Island, a “managed” exodus from cloud, and Kubernetes news. As well as Amazon’s RTO we are calling “Elastic Commute”. All this and more, right now on The Cloud Pod. </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">The Cloud Pod Hosts don’t own enough pants for five days a week</span></li>
<li><span style="font-weight:400;">IBM thinks it can contain the cost of K8s</span></li>
<li><span style="font-weight:400;">Microsoft loves nuclear energy</span></li>
<li><span style="font-weight:400;">The Cloudpod tries to give Oracle some love and still does not care</span></li>
<li><span style="font-weight:400;">The cloud pod goes nuclear on k8s costs</span></li>
<li><span style="font-weight:400;">⛽Can IBM contain the costs of Kubernetes and Nuclear Power? </span></li>
<li><span style="font-weight:400;">Google takes on take over while microsoft takes on nuclear</span></li>
<li><span style="font-weight:400;">AWS Launches ‘Managed Exodus’: Streamline Your Talent Drain</span></li>
<li><span style="font-weight:400;">Introducing Amazon WorkForce Alienation™: Scale Your Employee Discontent to the </span><span style="font-weight:400;">Cloud</span></li>
<li><span style="font-weight:400;">Amazon SageMaker Studio Lab: Now with Real-Time Resignation Prediction</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><span style="font-weight:400;">General News</span></h2>
<p><b>01:08 </b><a href="https://techcrunch.com/2024/09/17/ibm-acquires-kubernetes-cost-optimization-startup-kubecost/" target="_blank" rel="noreferrer noopener"><b>IBM acquires Kubernetes cost optimization startup Kubecost</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">IBM is quickly becoming the place where cloud cost companies go to assimilate? Or Die? Rebirthed mabe? Either way, it’s not a great place to end up. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">On Tuesday they announced the acquisition of </span><a href="https://www.kubecost.com/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Kubecost</span></a><span style="font-weight:400;">, a </span><a href="https://www.finops.org/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">FinOps</span></a><span style="font-weight:400;"> startup that helps teams monitor and optimize their K8 clusters, with a focus on efficiency – and ultimately cost. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This acquisition follows the acquisitions of Apptio, </span><a href="https://techcrunch.com/2021/04/29/ibm-is-acquiring-turbonomic-valued-at-963m-in-2019/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Turbonomic</span></a><span style="font-weight:400;">, and </span><a href="https://techcrunch.com/2020/11/18/ibm-is-acquiring-apm-startup-instana-as-it-continues-to-expand-hybrid-cloud-vision/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Instana</span></a><span style="font-weight:400;"> over the years. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Kubecost is the company behind </span><a href="https://www.opencost.io/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OpenCost</span></a><span style="font-weight:400;">; a vendor-neutral open source project that forms part of the core Kubecost commercial offering.  </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">OpenCost is part of the Cloud Native Computing Foundations cohort of </span><a href="https://www.opencost.io/blog/introducing-opencost" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">sandbox projects.</span></a></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Kubecost is expected to be integrated into IBM’s </span><a href="https://www.ibm.com/blog/announcement/finops-for-all-cloud-costs-and-stakeholders/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">FinOps Suite</span></a><span style="font-weight:400;">, which combines Cloudability and Turbonomic. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">There is also speculation that it might make its way to OpenShift, too.</span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">02:26  Jsutin- “…so KubeCost lives inside of Kubernetes, and basically has the ability to see how much CPU, how much memory they’re using, then calculate basically the price of the EC2 broken down into the different pods and services.”</span></i></p>
<h2><span style="font-weight:400;">AI Is Going Great – Or How ML Makes All It’s Money</span></h2>
<p><b>05:03 </b><a href="https://openai.com/index/introducing-openai-o1-preview/" target="_blank" rel="noreferrer noopener"><b>Introducing OpenAI o1-preview</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Reasoning LLM’s have arrived this week. Dun Dun Dun…</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The idea behind reasoning models is to take more time to “think” before they respond to you. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This allows them to reason through complex tasks. and solve harder problems than previous models in science, coding, and math. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">ChatGPT is releasing the first with OpenAI o1-preview, which they expect to ship regular updates and improvements.  </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Alongside the release, they are considering evaluations for the next updates, which are in development. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">In ChatGPT’s tests they said the model performs similarly to PhD students on benchmark tasks in physics, chemistry and biology. It also excels in math and coding.  </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">In a qualifying exam for the International Mathematics Olympiad (IMO), GPT-4o correctly solved only 13% of the problems, while the reasoning model scored 83%. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">As part of the development of these models, Open AI has come up with a new safety training approach that harnesses the reasoning capabilities to make them adhere to safety and alignment guidelines. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">One way they measure safety is by testing how well the model continues its safety rules after a user bypasses them (jailbreaking). On one of their hardest tests, GPT-4o scored 22 out of 100, whereas o1-preview scored 84.</span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">07:12  Jonathan – “</span></i><i><span style="font-weight:400;">I have not played with the O1 preview. I’ve been all in on Claude lately. I’ve been playing around with different system prompts to promote the whole chain of thought thing. I know opening ISA, the reasoning engine is not just a chain of thought under the hood. But I’m curious to know what it was you asked it. And I’ll run your prompts through what I’ve got. Because I do a similar thing where I evaluate</span></i></p>
<p><i><span style="font-weight:400;">evaluate what was asked and then sort of like almost fan out ideas from the central topic. In a way, just having like other ideas be present as tokens in the context window gives the LLM the opportunity to kind of explore more options in the answers that it gives. And so, yeah, it’ll be interesting.”</span></i></p>
<h2><span style="font-weight:400;">AWS</span></h2>
<p><b>28:31 </b><a href="https://www.theregister.com/2024/09/17/aws_cma_investigation/?ck_subscriber_id=512838477" target="_blank" rel="noreferrer noopener"><b>AWS Claims Customers are Packing Bags and Heading Back On-Prem</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS says its facing stiff competition from on-premises infrastructure, which is an about face after saying that all workloads would eventually move to the cloud.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This is from a summary of evidence given to UK Watchdog, The Competition and Markets Authority. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">AWS listed several examples of customers returning to their data centers, and AWS said “Building a datacenter requires significant effort, so the fact that customers are doing it highlights the level of flexibility that they have and the attractiveness of moving back to on-premises.”</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">AWS points that 29% of cloud customers (across all providers) have switched to on-premises services. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">A convenient lawyer-y case against being a monopoly? </span></li>
</ul>
<p><i><span style="font-weight:400;">10:41  Matthew – “</span></i><i><span style="font-weight:400;">I wouldn’t say it’s played as aggressive, but I’ve definitely started to see more articles and I’ve talked with a few companies in the last couple of years that are really kind of evaluating whether their cloud moves were the right moves and whether to move back or not. And the other piece of it is these companies either are using highly specialized workloads that don’t really fit the cloud or they’re large enough. That makes sense to keep them running, but the majority of customers are doing a simple app, and the cloud makes more sense.”</span></i></p>
<p><b>16:21 </b><a href="https://www.aboutamazon.com/news/company-news/ceo-andy-jassy-latest-update-on-amazon-return-to-office-manager-team-ratio" target="_blank" rel="noreferrer noopener"><b>Message from CEO Andy Jassy: Strengthening our culture and teams</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">It’s time to return to the office full time, says Andy Jassy in his September 16th memo to Amazon Employees. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Bye bye, meetings in sweatpants – and bye bye, A LOT of Amazon employees. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Jassy says he feels good about the progress they are making together across stores, AWS and advertising, as well as prime video expansion, and investment areas like GenAI, Kuiper and Healthcare, and several others evolving nicely. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">He talks about his start at the company 27 years ago and their plan to stay for a few years before moving back to NYC. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">He then goes on to discuss Amazon’s “unique” culture and how it is a key part of its success.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The S-team (Amazon’s executive team) wants Amazon to operate like the “world’s largest startup,” which means it has a passion for constantly inventing for customers, a strong urgency for big opportunities, high ownership, fast decision-making and scrappiness. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">As part of these questions, the S-Team has been thinking about 1) whether they have the right organizational structure to drive the ownership and speed they desire and 2) whether they are set up to invent, collaborate and be connected to each other (and the culture) to deliver the absolute best for the customers and the business. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">They concluded they could do better on both. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">To do this, they decided they have too much management, and this is slowing down and causing bureaucracy.  </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">To solve this they plan to increase the ration of individual contributors to managers by 15% by the end of Q1 2025. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Fewer managers will remove layers and flatten the organization.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">If it’s done well, it will improve the ability to move fast, clarify and invigorate a sense of ownership and drive decision-making closer to the front lines where it most impacts customers. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">He points out that he created a bureaucratic mailbox so that people could send emails about needless processes or red tape, and he would read them. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">We call BS. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">The controversial part is that it’s time to return to the office five days a week. They want to return to the pre-pandemic days when being out of the office was an exception. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They will bring back assigned desks in the US. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Because they know many of their employees will need to make accommodations, this new way of working will start on January 2nd, 2025. </span></li>
</ul>
<p><i><span style="font-weight:400;">18:43  Justin – “</span></i><i><span style="font-weight:400;">I don’t know how well you can innovate and do the right things for your customers. If you lose all of your senior talent, to attrition. So, I’m definitely a little concerned about maybe what I would call 25, I’m maybe the lost year for Amazon.”</span></i></p>
<p><i><span style="font-weight:400;">19:02  Jonathan – “…</span></i><i><span style="font-weight:400;">they may have had that culture before, but then the pandemic happened and people realize that things didn’t have to be that way and things could be different and they see the benefits. And I don’t think he’s going to make the change that he thinks he is by doing this. I think it’ll demotivate people. You can’t force culture change through policy. That’s not what the culture is. Culture is the result of all the things that you do, including those policies.”</span></i></p>
<p><b>25:29 </b><a href="https://aws.amazon.com/blogs/aws/amazon-rds-for-mysql-zero-etl-integration-with-amazon-redshift-now-generally-available-enables-near-real-time-analytics/" target="_blank" rel="noreferrer noopener"><b>Amazon RDS for MySQL zero-ETL integration with Amazon Redshift, now </b></a><a href="https://aws.amazon.com/blogs/aws/amazon-rds-for-mysql-zero-etl-integration-with-amazon-redshift-now-generally-available-enables-near-real-time-analytics/" target="_blank" rel="noreferrer noopener"><b>generally available, enables near real-time analytics</b></a><b>.</b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Zero ETL Integrations make it easy to unify your data across applications and data sources for holistic insights and breaking data silos. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">AWS is announcing </span><a href="https://aws.amazon.com/rds/mysql/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon RDS for MySQL</span></a><span style="font-weight:400;"> zero-ETL with </span><a href="https://aws.amazon.com/redshift/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Redshift</span></a><span style="font-weight:400;"> is now GA.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This release also includes new features such as data filtering, support for multiple integrations, and the ability to configure zero-ETL integrations in your </span><a href="https://aws.amazon.com/cloudformation/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS CloudFormation template</span></a><span style="font-weight:400;">.</span></li>
</ul>
<p><i><span style="font-weight:400;">26:12  Jonathan – “</span></i><i><span style="font-weight:400;">What’s more painful is having somebody click it in a console and then lose it and then have no commit history to refer back to if they need to rebuild it again. So at least it’s a manageable artifact.”</span></i></p>
<p><b>26:54 </b><a href="https://aws.amazon.com/blogs/opensource/aws-welcomes-the-opensearch-foundation/" target="_blank" rel="noreferrer noopener"><b>AWS Welcomes the OpenSearch Software Foundation</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS is transferring </span><a href="https://opensearch.org/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OpenSearch</span></a><span style="font-weight:400;"> to the OpenSearch Software Foundation, a community-driven initiative under the </span><a href="https://www.linuxfoundation.org/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Linux Foundation</span></a><span style="font-weight:400;">.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This announcement follows the leadership expansion of the project shared earlier this year. </span></li>
</ul>
<p><b>29:54 </b><a href="https://techcrunch.com/2024/09/17/aws-shuts-down-deepcomposer-its-midi-keyboard-for-ai-music/" target="_blank" rel="noreferrer noopener"><b>AWS shuts down DeepComposer, its MIDI keyboard for AI music</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">The AWS cloud service killing AI has found another victim in </span><a href="https://techcrunch.com/2019/12/02/aws-announces-deepcomposer-a-machine-learning-keyboard-for-developers/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">DeepComposer</span></a><span style="font-weight:400;">, their AI powered keyboard experiment. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The DeepComposer project just reached its 5 year milestone, and the physical MIDI piano and AWS service let users compose songs with the help of generative AI. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You have until September 17th 2025 to download your data stored there before the service will end. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">AWS has also announced that the DeepRacer league is ending after this year, and we assume that means the </span><a href="https://techcrunch.com/2019/07/31/why-aws-is-building-tiny-ai-race-cars-to-teach-machine-learning/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">DeepRacer</span></a><span style="font-weight:400;"> will be defunct soon as well. </span></li>
</ul>
<p><i><span style="font-weight:400;">30:49  Matthew – “</span></i><i><span style="font-weight:400;">It’s so funny to look back and think that Deep Compose was five years ago. They had AI in the palm of their hands and let it go.”</span></i></p>
<p><b>37:25 </b><a href="https://aws.amazon.com/blogs/aws/amazon-s3-express-one-zone-now-supports-aws-kms-with-customer-managed-keys/" target="_blank" rel="noreferrer noopener"><b>Amazon S3 Express One Zone now supports AWS KMS with customer managed keys</b></a><b>  </b></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/AmazonS3/latest/userguide/s3-express-one-zone.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon S3 Express One Zone</span></a><span style="font-weight:400;">, a high-performance, single availability zone </span><a href="https://aws.amazon.com/s3/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">S3 storage</span></a><span style="font-weight:400;">, now supports server-side encryption.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Thanks – but why didn’t this exist before? </span></li>
</ul>
<p><b>37:58 </b><a href="https://aws.amazon.com/blogs/aws/now-available-graviton4-powered-memory-optimized-amazon-ec2-x8g-instances/" target="_blank" rel="noreferrer noopener"><b>Now available: Graviton4-powered memory-optimized Amazon EC2 X8g instances</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/blogs/aws/category/compute/graviton/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Graviton</span></a><span style="font-weight:400;">-4-powered, memory-optimized x8g instances are now available in ten virtual sizes and two bare metal sizes, with up to 3TiB of DDR5 memory and up to 192 vCPU’s.</span></li>
</ul>
<p><i><span style="font-weight:400;">38:33  Justin – “</span></i><i><span style="font-weight:400;">I think the limitation on CPU is 64 or 96 before. Like, this is doubling or tripling the number of CPUs too, which wasn’t typically the Graviton runs so well, but I don’t see the CPU being my problem. It’s really when I want to run a database in the memory.”</span></i></p>
<h2><span style="font-weight:400;">GCP</span></h2>
<p><b>39:37 </b><a href="https://cloud.google.com/blog/products/identity-security/automate-access-control-with-sensitive-data-protection-and-conditional-iam/" target="_blank" rel="noreferrer noopener"><b>Safer by default: Automate access control with Sensitive Data Protection </b></a><a href="https://cloud.google.com/blog/products/identity-security/automate-access-control-with-sensitive-data-protection-and-conditional-iam/" target="_blank" rel="noreferrer noopener"><b>and conditional IAM</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Safer by default, now automated!</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google Cloud’s </span><a href="https://cloud.google.com/security/products/sensitive-data-protection?hl=en" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Sensitive Data Protection</span></a><span style="font-weight:400;"> can automatically discover sensitive data assets and attach tags to your data assets based on sensitivity.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Using </span><a href="https://cloud.google.com/iam/docs/conditions-overview" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">IAM conditions</span></a><span style="font-weight:400;">, you can grant or deny access to the data based on the presence or absence of a sensitivity level tag key or tag value. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This has several use cases including:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Automate access control across various supported resources based on attributes and classifications. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Restrict access to the supported resources like Cloud Storage, BigQuery and CloudSQL until those resources are profiled and classified by sensitive data protection</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Change access to a resource automatically as the data sensitivity level for that resource changes. </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">40:57  Justin – “</span></i><i><span style="font-weight:400;">I would hope this is something you wouldn’t necessarily use for machine accounts or service to service accounts. This to me is a person who’s getting this type of access. This is where you care about the primitives and the context and those things. And this is a context that you are caring about based on the data sensitivity and the context is important to the end user, not necessarily to the machine.”</span></i></p>
<p><b>41:26 </b><a href="https://cloud.google.com/blog/products/identity-security/how-to-prevent-account-takeovers-with-new-certificate-based-access/" target="_blank" rel="noreferrer noopener"><b>How to prevent account takeovers with new certificate-based access</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Stolen credentials are one of the most </span><a href="https://cloud.google.com/blog/topics/threat-intelligence/m-trends-2024?e=48754805" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">common attack vectors</span></a><span style="font-weight:400;"> used by attackers to gain unauthorized access to user accounts and steal information. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google is providing certificate-based access in the </span><a href="https://cloud.google.com/iam/docs/overview" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">IAM</span></a><span style="font-weight:400;"> portfolio to help combat stolen credentials, cookie theft, and accidental credential loss. </span></li>
<li style="font-weight:400;"><a href="https://cloud.google.com/beyondcorp-enterprise/docs/securing-resources-with-certificate-based-access" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Certificate-based access (CBA)</span></a><span style="font-weight:400;"> uses mutual TLS to ensure that users’ credentials are bound to a device certificate before authorizing access to cloud resources.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">CBA provides strong protection requiring the X.509 certificate as a device identifier, and verifies devices with user context for every access request to cloud resources. Even if an attacker compromises a user’s credentials, account access will remain blocked as they do not have the corresponding certificate. Rendering the stolen credentials useless</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This is a lot of words to say they support X.509 certificates – but we still appreciate it. </span></li>
</ul>
<p><i><span style="font-weight:400;">42:21  Matthew- “</span></i><i><span style="font-weight:400;">It’s a great level up though to protect because you see all these articles online of like, somebody got in and 12 things went wrong or in someone’s personal account, somebody launched $30,000 worth of Bitcoin miners. So a really good level up to see.”</span></i></p>
<p><b>42:43 </b><a href="https://cloud.google.com/blog/products/identity-security/new-ciem-support-in-security-command-center-can-help-reduce-risk/" target="_blank" rel="noreferrer noopener"><b>Announcing expanded CIEM support to reduce multi cloud risk in Security </b></a><a href="https://cloud.google.com/blog/products/identity-security/new-ciem-support-in-security-command-center-can-help-reduce-risk/" target="_blank" rel="noreferrer noopener"><b>Command Center</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Identities can be a major source of cloud risk when they are not properly managed. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Compromised credentials are frequently used to gain unauthorized access to cloud environments, which often magnifies that risk since many user and service accounts are granted access to cloud services and assets beyond their required scope. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This means that if just one credential is stolen, or abused, companies may be at risk of data exfiltration and resource compromise.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">To make this easier, Google is integrating </span><a href="https://cloud.google.com/security-command-center/docs/ciem-overview" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Cloud Infrastructure Entitlement Management (CIEM)</span></a><span style="font-weight:400;"> into </span><a href="https://cloud.google.com/security/products/security-command-center" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Security Command Center</span></a><span style="font-weight:400;">, their multi-cloud security and risk tool, and are announcing GA of expanded CIEM support for additional clouds and identify providers. (AWS and Entra ID and Okta)   </span></li>
</ul>
<h2><span style="font-weight:400;">Azure</span></h2>
<p><b>42:43 </b><a href="https://azure.microsoft.com/en-us/blog/introducing-o1-openais-new-reasoning-model-series-for-developers-and-enterprises-on-azure/" target="_blank" rel="noreferrer noopener"><b>Introducing o1: OpenAI’s new reasoning model series for developers and </b></a><a href="https://azure.microsoft.com/en-us/blog/introducing-o1-openais-new-reasoning-model-series-for-developers-and-enterprises-on-azure/" target="_blank" rel="noreferrer noopener"><b>enterprises on Azure</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">When OpenAI announces new models, Azure, their closest frenemies, follows closely with the new capability on Microsoft Azure Open AI services. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Both the </span><a href="https://aka.ms/gh-models-blog" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">o1-preview</span></a><span style="font-weight:400;"> and the o1-mini are now available in Azure Open AI service, Azure AI Studio and Github Models.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The o1 series enables complex coding, math reasoning, brainstorming and comparative analysis capabilities, setting a new benchmark for AI powered solutions. </span></li>
</ul>
<p><i><span style="font-weight:400;">44:36  Jonathan – “A model garden. </span></i><i><span style="font-weight:400;">It sounds so beautiful until you realize it’s just a concrete building that uses millions of gallons of water a day.”</span></i></p>
<p><b>44:49 </b><a href="https://azure.microsoft.com/en-us/blog/azure-public-ips-are-now-zone-redundant-by-default/" target="_blank" rel="noreferrer noopener"><b>Azure Public IPs are now zone-redundant by default</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Azure is making their </span><a href="https://learn.microsoft.com/en-us/azure/virtual-network/ip-services/public-ip-addresses#availability-zone" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Public IP’s</span></a><span style="font-weight:400;"> redundant by default. This means that unless you specifically select a single zone when deploying your Microsoft Azure Standard Public IPs, they will automatically be zone-redundant without any extra steps on your part. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This zone redundancy will be at no-extra cost. A zone-redundant IP is created in three zones for a region and can survive any single zone failure, improving the resiliency of your application using the public IP.</span></li>
</ul>
<p><i><span style="font-weight:400;">45:20  Matthew – </span></i><i><span style="font-weight:400;">“So when I started in Azure, I realized that these weren’t set up. If you try to attach a non multizonal IP address to a multi zonal service, it just yells at you. So to me, this is like one of those EIPs that are all multizonal by default. You don’t even think about what zone…so you don’t have to think about it. Where here you used to think about it and then there was no migration path to say, hey, take this single zone IP address and move it to be multi-zone. Even if you charge me more for it, there was nothing. So you would have to completely change your IP address, which we all know customers never whitelist specific IP addresses. They never caused the problems. You do that change never.”</span></i></p>
<p><b>46:53 </b><a href="https://azure.microsoft.com/en-us/blog/microsoft-and-oracle-enhance-oracle-databaseazure-with-data-and-ai-integration/" target="_blank" rel="noreferrer noopener"><b>Microsoft and Oracle enhance Oracle Database@Azure with data and AI </b></a><a href="https://azure.microsoft.com/en-us/blog/microsoft-and-oracle-enhance-oracle-databaseazure-with-data-and-ai-integration/" target="_blank" rel="noreferrer noopener"><b>integration </b></a></p>
<ul>
<li style="font-weight:400;"><a href="http://www.azure.com/oracle" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Oracle Database@Azure</span></a><span style="font-weight:400;"> got some updates from Open World including:</span>
<ul>
<li style="font-weight:400;"><a href="https://www.microsoft.com/en-us/microsoft-fabric" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Fabric</span></a><span style="font-weight:400;"> integration</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Integration with Sentinel and </span><a href="https://techcommunity.microsoft.com/t5/oracle-on-azure-blog/oracle-database-azure-achieves-extensive-certifications/ba-p/4196383" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">compliance certifications</span></a><span style="font-weight:400;"> to provide “industry leading” security and compliance. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Sure, Jan. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Plans to expand to a total of 21 primary regions, each with at least two availability zones and support for Oracle’s Maximum Availability Architecture. </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">47:36  Jonathan – </span></i><i><span style="font-weight:400;">“Out of all the companies who build Oracle database and re -select the cloud providers, Oracle is most definitely the industry leader.”</span></i></p>
<p><b>47:57 </b><a href="https://azure.microsoft.com/en-us/blog/advanced-container-networking-services-enhancing-security-and-observability-in-aks/" target="_blank" rel="noreferrer noopener"><b>Advanced Container Networking Services: Enhancing security and </b></a><a href="https://azure.microsoft.com/en-us/blog/advanced-container-networking-services-enhancing-security-and-observability-in-aks/" target="_blank" rel="noreferrer noopener"><b>observability in AKS</b></a><b>   </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft Azure Container network team is giving out gifts this week. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Following the success of </span><a href="https://learn.microsoft.com/en-us/azure/aks/advanced-network-observability-concepts?tabs=non-cilium" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">advanced network observability</span></a><span style="font-weight:400;">, which provides deep insights in network traffic within AKS clusters, they now introduce fully qualified domain name filtering as a new security feature.</span></li>
</ul>
<p><b>48:26 </b><a href="https://www.theinformation.com/briefings/microsoft-deal-will-reopen-three-mile-island-nuclear-plant-to-power-ai?rc=3t8xtd" target="_blank" rel="noreferrer noopener"><b>Microsoft Deal Will Reopen Three Mile Island Nuclear Plant to Power AI</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Listener note: paywall article</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft signed a deal to restart a power plan on Three Mile Island in Pennsylvania to power its rapidly expanding data center footprint for AI. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The plan TMI Unit 1, which shut down in 2019 for economic reasons will be producing the energy for Microsoft.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">A separate reactor at the site partially melted down in 1979. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The reactor generated more than 800 megawatts of power, and constellation energy, </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The plant owner said it would be ready for MS by 2028. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">We expect protests. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Green energy? AI? GPU’s? This all needs (carbon free) power. </span></li>
</ul>
<p><i><span style="font-weight:400;">49:39  Justin – </span></i><i><span style="font-weight:400;">“And they’re willing to wait for it till 2028. So they have expectations that not only is this plausible and something they can get the Nuclear Energy Commission to approve, but that they will still have AI dominating this much of their power consumption that they need 800 megawatts.”</span></i></p>
<p><b>50:29 </b><a href="https://techcommunity.microsoft.com/t5/azure-sql-blog/elastic-pools-for-azure-sql-database-hyperscale-now-generally/ba-p/4242658" target="_blank" rel="noreferrer noopener"><b>Elastic pools for Azure SQL Database Hyperscale now Generally Available!</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Azure is announcing GA for Azure SQL Database Hyperscale </span><a href="https://aka.ms/hsep" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">elastic pools</span></a><span style="font-weight:400;">.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">While you may start with a standalone hyper-scale database, chances are that as your fleet of databases grows, you want to optimize price and performance across a set of hyper-scale databases. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Elastic pools offer the convenience of pooling resources like CPU, memory, and IO while ensuring strong security isolation between those databases. </span></li>
</ul>
<p><i><span style="font-weight:400;">51:02  Justin – </span></i><i><span style="font-weight:400;">“Yeah, I mean it’s no different than back in the day when you would take all your VM’s on Prem and say OK cool, we had 100 gigabytes memory. I’m going to allocate 200 gigabytes of memory to all your servers and hope that none of them, not all of them, blow up at once. Because you know your workloads. So now you’re able to do this. With hyper scale, which is equivalent to Aurora, but is actually with Microsoft SQL engine and it also gets rid of the increased storage price, but they’ve gotten rid of the SQL licensing.”</span></i></p>
<h2><span style="font-weight:400;">Oracle</span></h2>
<p><b>Hold onto your butts – it’s time for OpenWorld news. </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">There are a lot of things to cover this week from OpenWorld, to save our hosts’ sanity we won’t get too deep into all of these, but will try to highlight key things. If you REALLY care about Oracle, well – that’s why you’re here, deep into the show notes, where the show note editor has done **all** the work to arrange and manage the chaos for you. You’re welcome. </span></li>
</ul>
<p><b>55:12 </b><a href="https://blogs.oracle.com/cloud-infrastructure/post/the-best-platform-for-ai-workloads-oke" target="_blank" rel="noreferrer noopener"><b>Introducing the best platform for AI workloads – OCI Kubernetes Engine </b></a></p>
<p><a href="https://blogs.oracle.com/cloud-infrastructure/post/the-best-platform-for-ai-workloads-oke" target="_blank" rel="noreferrer noopener"><b>(OKE)</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">OKE or </span><a href="https://www.oracle.com/cloud/cloud-native/container-engine-kubernetes/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Oracle Cloud Infrastructure Kubernetes Engine</span></a><span style="font-weight:400;"> gets new capabilities to let customers meet their AI and ML workload needs. Cool! </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Optimized for AI workloads: OKE offers built-in observability and health checks for your container environment and now includes the capability to track current and historical RDMA and GPU errors to improve operability for customers using GPUs.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Now supports Ubuntu for GPU workloads and worker nodes.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Lots of noise about containers for training and security by design that isn’t new. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">There’s a Steinbeck joke here (Get it? Grapes of Wrath? Okie? Ok, so maybe that one doesn’t work.) </span></li>
</ul>
</li>
</ul>
<p><b>55:28</b> <a href="https://blogs.oracle.com/cloud-infrastructure/post/announcing-oracle-cloud-guard-container-security" target="_blank" rel="noreferrer noopener"><b>Announcing Oracle Cloud Guard Container Security</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">OCI is announcing a limited availability (</span><a href="https://apexadb.oracle.com/ords/f?p=108:30:204907759122819::::P30_SELF_NOMINATION:Self-Nomination" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">beta) release of Container Governance</span></a><span style="font-weight:400;"> through the Oracle Cloud Guard’s container security. This single pane of glass experience for managing your large scale containerized workload compliance. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Key features include:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Ready to go recipes to give you secure and compliant baseline configurations for container security</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Single Pane of Glass</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Robust exception management</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Remote monitoring </span></li>
</ul>
</li>
</ul>
<p><b>55:38</b> <a href="https://blogs.oracle.com/cloud-infrastructure/post/enhancing-oke-with-integrated-logging-analytics" target="_blank" rel="noreferrer noopener"><b>Enhanced monitoring in OKE with Logging Analytics</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><a href="https://www.oracle.com/cloud/cloud-native/container-engine-kubernetes/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OCI OKE integration</span></a><span style="font-weight:400;"> with </span><a href="https://www.oracle.com/manageability/logging-analytics/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OCI Logging Analytic</span></a><span style="font-weight:400;">s to give you high availability into your K8 environment.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">OCI logging analytics gives you a comprehensive ML/AI-powered monitoring solution across all environments, including OKE monitoring. </span></li>
</ul>
<p><b>55:50 </b><a href="https://blogs.oracle.com/cloud-infrastructure/post/oci-kubernetes-engine-openid-connect-oidc" target="_blank" rel="noreferrer noopener"><b>OCI Kubernetes Engine supports OpenId Connect (OIDC)</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">OKE supports </span><a href="https://go.oracle.com/LP=144411?elqCampaignId=581643" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OIDC</span></a><span style="font-weight:400;"> or OpenID Connect allowing you a secure and flexible way to authenticate and authorize users within applications and systems. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With this OKE capability you can authorize kubernetes pods to access non-OCI resources using third party security token services. </span></li>
</ul>
<p><b>55:55 </b><a href="https://blogs.oracle.com/cloud-infrastructure/post/simplify-operations-oci-kubernetes-add-ons" target="_blank" rel="noreferrer noopener"><b>Simplify operations with OCI Kubernetes Engine (OKE) add-ons</b></a><span style="font-weight:400;">  </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Take advantage of your K* operations with </span><a href="https://blogs.oracle.com/cloud-infrastructure/post/kubernetes-add-on-lifecycle-management" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OCI OKE</span></a><span style="font-weight:400;"> add-ons.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">These add ons cover 4 key container areas today with more coming in the future</span>
<ul>
<li style="font-weight:400;"><a href="https://docs.oracle.com/en-us/iaas/Content/ContEng/Tasks/contengusingclusterautoscaler_topic-Working_with_Cluster_Autoscaler_as_Cluster_Add-on.htm#contengusingclusterautoscaler_topic-Working_with_Cluster_Autoscaler_as_Cluster_Add-on" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Kubernetes Cluster Autoscaler</span></a></li>
<li style="font-weight:400;"><a href="https://docs.oracle.com/en-us/iaas/Content/ContEng/Tasks/contengistio-cluster-add-on.htm" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Istio service Mesh</span></a></li>
<li style="font-weight:400;"><a href="https://docs.oracle.com/en-us/iaas/Content/ContEng/Tasks/contengsettingupnativeingresscontroller-cluster-addon-top-level.htm#contengsettingupnativeingresscontroller-cluster-addon-top-level" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OCI Native Ingress controller</span></a></li>
<li style="font-weight:400;"><a href="https://docs.oracle.com/en-us/iaas/Content/ContEng/Tasks/contengworkingwithmetricsserver_cluster-add-on.htm#contengdeployingmetricsserver_cluster-add-on" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Kubernetes Metrics Server</span></a></li>
</ul>
</li>
</ul>
<p><span style="font-weight:400;">Please note – someone write down the date and time – Jonathan is impressed with something from Oracle. </span></p>
<p><b>56:49 </b><a href="https://blogs.oracle.com/cloud-infrastructure/post/announcing-oci-streaming-with-apache-kafka-in-la" target="_blank" rel="noreferrer noopener"><b>Announcing OCI Streaming with Apache Kafka, in limited availability</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">OCI is launching a new managed kafka service, currently in beta with GA in the fall.  </span></li>
</ul>
<p><b>57:00 </b><a href="https://blogs.oracle.com/cloud-infrastructure/post/oci-database-with-postgresql-release-new-features" target="_blank" rel="noreferrer noopener"><b>OCI Database with PostgreSQL release new features</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://www.oracle.com/cloud/postgresql/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OCI Database for PostgreSQL</span></a><span style="font-weight:400;"> released several new features including support for versions 13, 14, and 15. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Extension support. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">New Vertical scaling and flexible shapes, Network security group integration.  </span></li>
</ul>
<p><b>57:09 </b><a href="https://blogs.oracle.com/cloud-infrastructure/post/streamline-your-it-management-with-oci-resource-analytics" target="_blank" rel="noreferrer noopener"><b>Streamline your IT management with OCI Resource Analytics</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Asset and Resource management for your OCI environment. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Simplify centralized inventory, advanced troubleshooting and glean insights from the built in dashboards and reports. </span></li>
</ul>
<p><b>57:23  </b><a href="https://blogs.oracle.com/machinelearning/post/announcing-gpu-support-for-oml-notebooks-on-adb" target="_blank" rel="noreferrer noopener"><b>Announcing GPU support for Oracle Machine Learning Notebooks on Autonomous Database</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Autonomous Database Serverless now provides integrated access to OCI GPU instances through Oracle Machine Learning (OML) notebooks.</span></li>
</ul>
<p><b>57:30 </b><a href="https://blogs.oracle.com/cloud-infrastructure/post/oracle-code-assist-beta-and-netsuite-suitescript" target="_blank" rel="noreferrer noopener"><b>Announcing Oracle Code Assist beta and NetSuite SuiteScript support</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Last year they </span><a href="https://blogs.oracle.com/cloud-infrastructure/post/oracle-code-assist-ai-companion-boost-velocity" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">announced Code Assist</span></a><span style="font-weight:400;"> and AI code companion. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Now they have released and optimized the Java version of Code Assist available in beta for developers to help build new applications faster and quickly update code written in older Java versions. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">While it was optimized for Java, it does work with most modern languages including Python, Javascript, suitescript, rust, ruby, go, pl/sql, C# and C. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Suitescript is a custom javascript language for Netsuite to enable customization of their SaaS ERP. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Future updates will be coming out for code-assist to further enhance this experience for suitescript. </span></li>
</ul>
<p><b>58:19  </b><a href="https://blogs.oracle.com/cloud-infrastructure/post/private-ip-address-support-oci-object-storage" target="_blank" rel="noreferrer noopener"><b>Announcing private IP address support for OCI Object Storage using </b></a><a href="https://blogs.oracle.com/cloud-infrastructure/post/private-ip-address-support-oci-object-storage" target="_blank" rel="noreferrer noopener"><b>private endpoints</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">OCI announced GA of private endpoints for OCI object storage in all commercial regions. Private endpoints help enable secure, private connectivity using a private IP address to access OCI object storage from your VCN or on-premise network. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This will be fun to manage. We’ll pass. </span></li>
</ul>
<p><b>58:48 </b><a href="https://blogs.oracle.com/cloud-infrastructure/post/oci-fleet-application-management-ga" target="_blank" rel="noreferrer noopener"><b>OCI Fleet Application Management is now generally available – simplifying </b></a><a href="https://blogs.oracle.com/cloud-infrastructure/post/oci-fleet-application-management-ga" target="_blank" rel="noreferrer noopener"><b>full-stack patching and compliance management at scale</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Given the name of this, you’d think this would have to do with managing your applications – but you’d be wrong. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">GA of OCI Fleet Application Management was announced </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The new service simplifies centralized management and patch operations across your entire cloud stack for any software or technology deployed on OCI. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">We see what you’ve done here Oracle. We don’t like it. </span></li>
</ul>
<p><b>59:15 </b><a href="https://blogs.oracle.com/cloud-infrastructure/post/building-storage-systems-for-future-oci-roadmap" target="_blank" rel="noreferrer noopener"><b>Building storage systems for the future: The OCI roadmap</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Other storage things announced other than HPMT</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Object STorage HDFS connector for your </span><a href="https://docs.oracle.com/en-us/iaas/Content/Resources/Assets/whitepapers/hdfs-connector-oci-object-storage-performance-big-data.pdf" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Hadoop</span></a><span style="font-weight:400;"> based needs</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Coming in the next few months</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Scale 10x</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">File storage with Lustre</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">File Storage usage quotas</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Object storage support for multiple checksums</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">File storage 30 minute RPO</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Block Volumes support for customer managed keys for cross-region replication. </span></li>
</ul>
</li>
</ul>
<p><b>1:00:23 </b><a href="https://blogs.oracle.com/cloud-infrastructure/post/new-standardized-oci-landing-zones-framework" target="_blank" rel="noreferrer noopener"><b>Introducing the new standardized OCI Landing Zones framework for an </b></a><a href="https://blogs.oracle.com/cloud-infrastructure/post/new-standardized-oci-landing-zones-framework" target="_blank" rel="noreferrer noopener"><b>even easier onboarding to OCI</b></a><b> </b></p>
<p><a href="https://blogs.oracle.com/cloud-infrastructure/post/accelerating-zero-trust-journey-on-oci-with-landing-zones" target="_blank" rel="noreferrer noopener"><b>Accelerating your zero trust journey on OCI with zero trust landing zone</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Oracle announced the early preview of OCIE zero trust landing zones, a new solution that enables a one-click provisioning for both secure, high performing architecture for your cloud tenancy, with deployment and hardened configuration of key services in need to meet certain requirements. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This is based on recommendations from CISA and UK governments National Cyber Secure Centre. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The following is provisioned with OCI Zero Trust Landing zones</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Base tenancy = IAM, KMS, Cloud Guard, Vulnerability Scanning, bastion, logging, events (auditing), notification and security zones.  As well as you can enable ZTNA around applications and workloads, devices and visibility into marketplace partners. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">OCI Access Governance for Attribute and policy based access controls. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">OCI zero trust packet routing for network microsegmentation. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Networking enhancements with next-gen firewalls like fortinet’s fortigate. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Observability enhancements.</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">A reference architecture. </span></li>
</ul>
<p><b>1:01:20 </b><a href="https://www.oracle.com/news/announcement/ocw24-oracle-expands-multicloud-capabilities-with-aws-google-cloud-and-microsoft-azure-2024-09-11/" target="_blank" rel="noreferrer noopener"><b>Oracle Expands Multi Cloud Capabilities with AWS, Google Cloud, and </b></a><a href="https://www.oracle.com/news/announcement/ocw24-oracle-expands-multicloud-capabilities-with-aws-google-cloud-and-microsoft-azure-2024-09-11/" target="_blank" rel="noreferrer noopener"><b>Microsoft Azure</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Oracle has partnerships now with AWS, Azure and Google to run Oracle Database@ services. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS launched, and we talked about hs last week</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Oracle Database@Azure is now in Six Azure Datacenters with that increasing to 15 soon</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Oracle Database@Google is now GA in 4 google cloud regions, with expansion to additional regions in process.  </span></li>
</ul>
</li>
</ul>
<p><b>1:01:45 </b><a href="https://www.oracle.com/news/announcement/ocw24-oracle-offers-first-zettascale-cloud-computing-cluster-2024-09-11/" target="_blank" rel="noreferrer noopener"><b>Oracle Offers First Zettascale Cloud Computing Cluster</b></a></p>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Oracle announced the first zettascale cloud computing cluster accelerated by NVIDIA blackwell platform. OCI is now taking orders for the largest AI supercomputer in the cloud available with up to 131,072 NVIDIA blackwell GPUs. (do you think they built this for Elon?)</span></li>
</ul>
</li>
</ul>
<ul>
<li style="font-weight:400;"><i><span style="font-weight:400;">“We have one of the broadest AI infrastructure offerings and are supporting customers that are running some of the most demanding AI workloads in the cloud,” </span></i><b><i>said Mahesh Thiagarajan, executive vice president, Oracle Cloud Infrastructure. </i></b><i><span style="font-weight:400;">“With Oracle’s distributed cloud, customers have the flexibility to deploy cloud and AI services wherever they choose while preserving the highest levels of data and AI sovereignty.”</span></i></li>
<li style="font-weight:400;"><span style="font-weight:400;">The 131,072 NVIDIA blackwell GPus deliver 2.4 zettaFLOPs of peak performance. The maximum scale of the OCI supercluster offers more than three times as many GPUs as the frontier supercomputer and more than six times that of other hyperscalers. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">OCI superclusters can be powered by either the H100 or H200 tensor core GPUs or NVIDIA blackwell GPUs. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Was this built for Elon first? Just curious…</span></li>
</ul>
<p><b>1:04:49 </b><a href="https://www.oracle.com/news/announcement/ocw24-oracle-introduces-an-ai-centric-generative-development-infrastructure-for-enterprises-2024-09-10/" target="_blank" rel="noreferrer noopener"><b>Oracle Introduces an AI-centric Generative Development Infrastructure for </b></a><a href="https://www.oracle.com/news/announcement/ocw24-oracle-introduces-an-ai-centric-generative-development-infrastructure-for-enterprises-2024-09-10/" target="_blank" rel="noreferrer noopener"><b>Enterprises</b></a></p>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Oracle announced Generative development (gendev) for enterprises, a groundbreaking AI-centric application development infrastructure. It provides innovative development technologies that enables developers to rapidly generate sophisticated applications and make it easy for applications to use AI-powered natural language interfaces and human-centric data. Gendev combines technologies in Oracle Database 23ai, including JSON relational duality views, AI vector search and APEX to facilitate development using Gen AI. </span></li>
</ul>
</li>
</ul>
<ul>
<li style="font-weight:400;"><i><span style="font-weight:400;">“Just as paved roads had to be built for us to get the full benefit of cars, we have to change the application development infrastructure to get the full benefit of AI app generation. GenDev enables developers to harness AI to swiftly generate modular, evolvable enterprise applications that are understandable and safe. Users can interact with data and applications using natural language and find data based on its semantic content,”</span></i><b><i> said Juan Loaiza, executive vice president, Mission-Critical Database Technologies, Oracle</i></b><i><span style="font-weight:400;">. “Oracle Database 23ai provides the AI-centric infrastructure needed to dramatically accelerate generative development for enterprise apps.”  </span></i></li>
</ul>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Autonomous Database further simplifies and accelerates GenDev with these new key features:</span></li>
</ul>
<ul>
<li style="font-weight:400;"><b>Oracle Autonomous Database Select AI with RAG and other enhancements:</b><span style="font-weight:400;"> Enables customers to reduce the risk of hallucinations by leveraging retrieval-augmented generation (RAG) and AI Vector Search to provide more precise responses to natural language questions when using large language models (LLMs) with enterprise data. Autonomous Database also eliminates the need for expertise in creating AI pipelines to generate and populate vector embeddings.</span></li>
<li style="font-weight:400;"><b>Broader support for LLMs:</b><span style="font-weight:400;"> Helps organizations gain more value from generative AI with built-in integration from Autonomous Database to additional LLMs: Google Gemini, Anthropic Claude, and Hugging Face. Autonomous Database integrates with 35 different LLMs across seven providers to give customers a wide choice in building GenDev applications.</span></li>
<li style="font-weight:400;"><b>Autonomous Database NVIDIA GPU support:</b><span style="font-weight:400;"> Enables customers to access NVIDIA GPUs to accelerate performance of certain AI data operations without having to worry about provisioning or managing GPU servers. Initially, customers can take advantage of Oracle Machine Learning Notebooks that use GPU-enabled Python packages for resource-intensive workloads, such as generating vector embeddings using transformer models and building deep learning models.</span></li>
<li style="font-weight:400;"><b>Data Studio AI enhancements:</b><span style="font-weight:400;"> Enable customers to prepare and load data using natural language, as well as use a visual “drag and drop” tool to create AI pipelines with text and image vector embeddings.</span></li>
<li style="font-weight:400;"><b>Graph Studio enhancements:</b><span style="font-weight:400;"> Enable users to build Operational Property Graph models without code, new in Oracle Database 23ai, using the built-in self-service tool.</span></li>
<li style="font-weight:400;"><b>Autonomous Database for Developers:</b><span style="font-weight:400;"> Enables users to access the rich set of features and tools provided by Autonomous Database at a flat hourly rate. This provides a lower and more predictable entry point ($0.039/hour = $28.54/month) for development use cases with a simple upgrade path to production deployment.</span></li>
<li style="font-weight:400;"><b>Autonomous Database for Developers Container Image:</b><span style="font-weight:400;"> Gives customers the same fixed shape, flat hourly rate, and capabilities of Autonomous Database for Developers in the cloud, but in a convenient downloadable image. Developers continue to have a fully-managed database with a full suite of built-in tools but can run it directly on their laptops and conveniently use it in their CI/CD pipelines.</span></li>
<li style="font-weight:400;"><b>Autonomous Database Select AI—Synthetic Data Creation:</b><span style="font-weight:400;"> Enables customers to simplify and accelerate building development and test instances of Autonomous Database by enabling them to clone a production database and replace the data with realistic test data generated through AI.</span></li>
</ul>
<p><i><span style="font-weight:400;">1:06:49  Jonathan – </span></i><i><span style="font-weight:400;">“</span></i><i><span style="font-weight:400;">I can see how there’s value in things like, you know, document stores, reference information, technical decisions, like a way of organizing the structure of projects so that the developer can better use the tool to reach the end goal. So I actually think this is probably a really good product aimed at helping kind of organize and like shepherd the whole process through because I mean, sure you can sit down in front of chat GPT and ask you to write some code, but with limited context window, you have to kind of keep copying stuff out or restarting the chat. You have to keep referring back to original design documents, which is kind of cumbersome. And so solving the usability of these systems to actually deliver applications is great. And I wish them well with it. I’d really like to play with it.”</span></i></p>
<h2><span style="font-weight:400;">Closing</span></h2>
<p><span style="font-weight:400;">And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</span></p>
]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/1852219/c1e-rodobj78v6fg5264-6zwdgkzgb8xv-w0fohl.mp3" length="84270414"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 276 of The Cloud Pod, where the forecast is always cloudy! This week, our hosts Justin, Matthew, and Jonathan do a speedrun of OpenWorld news, talk about energy needs and the totally not controversial decision to reopen 3 Mile Island, a “managed” exodus from cloud, and Kubernetes news. As well as Amazon’s RTO we are calling “Elastic Commute”. All this and more, right now on The Cloud Pod. 
Titles we almost went with this week:

The Cloud Pod Hosts don’t own enough pants for five days a week
IBM thinks it can contain the cost of K8s
Microsoft loves nuclear energy
The Cloudpod tries to give Oracle some love and still does not care
The cloud pod goes nuclear on k8s costs
⛽Can IBM contain the costs of Kubernetes and Nuclear Power? 
Google takes on take over while microsoft takes on nuclear
AWS Launches ‘Managed Exodus’: Streamline Your Talent Drain
Introducing Amazon WorkForce Alienation™: Scale Your Employee Discontent to the Cloud
Amazon SageMaker Studio Lab: Now with Real-Time Resignation Prediction

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
General News
01:08 IBM acquires Kubernetes cost optimization startup Kubecost 

IBM is quickly becoming the place where cloud cost companies go to assimilate? Or Die? Rebirthed mabe? Either way, it’s not a great place to end up. 
On Tuesday they announced the acquisition of Kubecost, a FinOps startup that helps teams monitor and optimize their K8 clusters, with a focus on efficiency – and ultimately cost. 
This acquisition follows the acquisitions of Apptio, Turbonomic, and Instana over the years. 
Kubecost is the company behind OpenCost; a vendor-neutral open source project that forms part of the core Kubecost...]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/1852219/c1a-k5d5-471gnvrdsd3o-mmuhkf.jpg"></itunes:image>
                                                                            <itunes:duration>01:10:14</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                            </item>
                    <item>
                <title>
                    <![CDATA[275: I SQream, You SQream, We All SQream for  AI Ice Cream]]>
                </title>
                <pubDate>Wed, 18 Sep 2024 10:17:15 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/1837493</guid>
                                    <link>https://tcpfm.castos.com/episodes/275-i-sqream-you-sqream-we-all-sqream-for-ai-ice-cream</link>
                                <description>
                                            <![CDATA[<p><span style="font-weight:400;">Welcome to episode 275 of The Cloud Pod, where the forecast is always cloudy! Justin, Matthew and Ryan are awake and ready to bring you all the latest and greatest in cloud news, including SQream, a new partnership between OCI and AWS (yes, really) Azure Linux, and a lot of updates over at AWS. Get comfy and we’ll see you all in the cloud! </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">I SQream, You SQream, The CloudPod SQreams for AI Ice Cream</span></li>
<li><span style="font-weight:400;">️AWS East gets Stability, but only for AI.</span></li>
<li><span style="font-weight:400;">AWS has some Lofty Goals</span></li>
<li><span style="font-weight:400;">️Claude Learns BigQuery</span></li>
<li><span style="font-weight:400;">✅Azure now Securely Checks the Prompts from the cloud pod</span></li>
<li><span style="font-weight:400;">Azure find out about Linux</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><span style="font-weight:400;">AWS</span></h2>
<p><b>00:28 </b><a href="https://aws.amazon.com/blogs/aws/stability-ais-best-image-generating-models-now-in-amazon-bedrock/" target="_blank" rel="noreferrer noopener"><b>Stability AI’s best image generating models now in Amazon Bedrock</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">If you are like The CloudPod hosts, the part you care most about AI is the rapid ability to create graphics for any meme-worthy moment or funny pictures for that group chat. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Luckily AWS has access to the latest image generation capability with 3 models from </span><a href="https://stability.ai/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Stability AI</span></a><span style="font-weight:400;">.</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Stable Image Ultra – Produces the highest quality, photorealistic outputs perfect for professional print media and large format applications. Stable image Ultra excels at rendering exceptional detail and realism. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Stable diffusion 3 large – strikes a balance between generation speed and output quality. Ideal for creating high-volume, high-quality digital assets for websites, newsletters and marketing materials. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Stable Image Core – Optimized for fast and affordable image generation, great for rapidly iterating on concepts during ideation. </span></li>
</ul>
</li>
</ul>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">One of the key improvements of Stable Image Ultra and Stable Diffusion 3 large compared to Stable Diffusion XL (SDXL) is text quality in generated images, with fewer errors in spelling and typography thanks to innovation diffusion transformer architecture, which implements two separate sets of weights for image and text but enables information flow between the two modalities. </span></li>
</ul>
<p><i><span style="font-weight:400;">02:46  Justin – “I do notice more and more that, you get it, you get the typical product shot on Amazon, but then like they’ll insert the product into different backgrounds and scenes. Like, it’s a, it’s a lamp and all of a sudden it’s on a thing and they’re like, Hmm, that doesn’t look like a real photo though. It looks like AI. So you do notice it more and more.”</span></i></p>
<p><b>04:13</b> <a href="https://aws.amazon.com/about-aws/whats-new/2024/09/aws-network-load-balancer-tcp-idle-timeout/" target="_blank" rel="noreferrer noopener"></a></p>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 275 of The Cloud Pod, where the forecast is always cloudy! Justin, Matthew and Ryan are awake and ready to bring you all the latest and greatest in cloud news, including SQream, a new partnership between OCI and AWS (yes, really) Azure Linux, and a lot of updates over at AWS. Get comfy and we’ll see you all in the cloud! 
Titles we almost went with this week:

I SQream, You SQream, The CloudPod SQreams for AI Ice Cream
️AWS East gets Stability, but only for AI.
AWS has some Lofty Goals
️Claude Learns BigQuery
✅Azure now Securely Checks the Prompts from the cloud pod
Azure find out about Linux

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
AWS
00:28 Stability AI’s best image generating models now in Amazon Bedrock 

If you are like The CloudPod hosts, the part you care most about AI is the rapid ability to create graphics for any meme-worthy moment or funny pictures for that group chat. 
Luckily AWS has access to the latest image generation capability with 3 models from Stability AI.

Stable Image Ultra – Produces the highest quality, photorealistic outputs perfect for professional print media and large format applications. Stable image Ultra excels at rendering exceptional detail and realism. 
Stable diffusion 3 large – strikes a balance between generation speed and output quality. Ideal for creating high-volume, high-quality digital assets for websites, newsletters and marketing materials. 
Stable Image Core – Optimized for fast and affordable image generation, great for rapidly iterating on concepts during ideation. 




One of the key improvements of Stable Image Ultra and Stable Diffusion 3 large compared to Stable Diffusion XL (SDXL) is text quality in generated images, with fewer errors in spelling and typography thanks to innovation diffusion transformer architecture, which implements two separate sets of weights for image and text but enables information flow between the two modalities. 

02:46  Justin – “I do notice more and more that, you get it, you get the typical product shot on Amazon, but then like they’ll insert the product into different backgrounds and scenes. Like, it’s a, it’s a lamp and all of a sudden it’s on a thing and they’re like, Hmm, that doesn’t look like a real photo though. It looks like AI. So you do notice it more and more.”
04:13 ]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[275: I SQream, You SQream, We All SQream for  AI Ice Cream]]>
                </itunes:title>
                                    <itunes:episode>275</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p><span style="font-weight:400;">Welcome to episode 275 of The Cloud Pod, where the forecast is always cloudy! Justin, Matthew and Ryan are awake and ready to bring you all the latest and greatest in cloud news, including SQream, a new partnership between OCI and AWS (yes, really) Azure Linux, and a lot of updates over at AWS. Get comfy and we’ll see you all in the cloud! </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">I SQream, You SQream, The CloudPod SQreams for AI Ice Cream</span></li>
<li><span style="font-weight:400;">️AWS East gets Stability, but only for AI.</span></li>
<li><span style="font-weight:400;">AWS has some Lofty Goals</span></li>
<li><span style="font-weight:400;">️Claude Learns BigQuery</span></li>
<li><span style="font-weight:400;">✅Azure now Securely Checks the Prompts from the cloud pod</span></li>
<li><span style="font-weight:400;">Azure find out about Linux</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><span style="font-weight:400;">AWS</span></h2>
<p><b>00:28 </b><a href="https://aws.amazon.com/blogs/aws/stability-ais-best-image-generating-models-now-in-amazon-bedrock/" target="_blank" rel="noreferrer noopener"><b>Stability AI’s best image generating models now in Amazon Bedrock</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">If you are like The CloudPod hosts, the part you care most about AI is the rapid ability to create graphics for any meme-worthy moment or funny pictures for that group chat. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Luckily AWS has access to the latest image generation capability with 3 models from </span><a href="https://stability.ai/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Stability AI</span></a><span style="font-weight:400;">.</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Stable Image Ultra – Produces the highest quality, photorealistic outputs perfect for professional print media and large format applications. Stable image Ultra excels at rendering exceptional detail and realism. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Stable diffusion 3 large – strikes a balance between generation speed and output quality. Ideal for creating high-volume, high-quality digital assets for websites, newsletters and marketing materials. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Stable Image Core – Optimized for fast and affordable image generation, great for rapidly iterating on concepts during ideation. </span></li>
</ul>
</li>
</ul>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">One of the key improvements of Stable Image Ultra and Stable Diffusion 3 large compared to Stable Diffusion XL (SDXL) is text quality in generated images, with fewer errors in spelling and typography thanks to innovation diffusion transformer architecture, which implements two separate sets of weights for image and text but enables information flow between the two modalities. </span></li>
</ul>
<p><i><span style="font-weight:400;">02:46  Justin – “I do notice more and more that, you get it, you get the typical product shot on Amazon, but then like they’ll insert the product into different backgrounds and scenes. Like, it’s a, it’s a lamp and all of a sudden it’s on a thing and they’re like, Hmm, that doesn’t look like a real photo though. It looks like AI. So you do notice it more and more.”</span></i></p>
<p><b>04:13</b> <a href="https://aws.amazon.com/about-aws/whats-new/2024/09/aws-network-load-balancer-tcp-idle-timeout/" target="_blank" rel="noreferrer noopener"><b>AWS Network Load Balancer now supports configurable TCP idle timeout </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2024/09/aws-gateway-load-balancer-tcp-idle-timeout/" target="_blank" rel="noreferrer noopener"><b>AWS Gateway Load Balancer now supports configurable TCP idle timeout</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">We see you Amazon – trying to get two press releases for basically the same thing, not today sir! </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Both the AWS Network Load Balancer and Gateway Load Balancer have received a configurable TCP Idle timeout. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">AWS Network load balancer had a fixed value of 350 seconds, which could cause TCP handshake retries for long-lived traffic flows of some apps and add latency.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Now you can configure it between 60 seconds and 6000 seconds, with the default remaining at 350. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The Gateway also has a 350 second fixed value, and also gets the 60-6000 second range.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Want more info on these totally different and not at all the same announcements? Check it out </span><a href="https://aws.amazon.com/about-aws/global-infrastructure/regional-product-services/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">here</span></a><span style="font-weight:400;">. </span></li>
</ul>
<p><i><span style="font-weight:400;">04:53  Ryan – “Yeah, we’ve all worked at that company with that one ancient app that, you know, couldn’t handle retries.</span></i></p>
<p><b>05:44</b> <a href="https://aws.amazon.com/about-aws/whats-new/2024/09/aws-fault-injection-service-additional-safety-control/" target="_blank" rel="noreferrer noopener"><b>AWS Fault Injection Service introduces additional safety control</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Fault Injection Service now provides additional safety control with a safety lever that, when engaged, stops all running experiments and prevents new experiments from starting. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can also prevent fault injection during certain time periods, such as sales events or product launches, or in response to application health alarms. </span></li>
</ul>
<p><i><span style="font-weight:400;">06:22  Ryan – “ …in my head I immediately went to like, something bad happened that caused this feature to exist. Like, I feel bad for whoever that was. Because you know it wasn’t good.”</span></i></p>
<p><b>07:14</b> <a href="https://aws.amazon.com/about-aws/whats-new/2024/09/apache-spark-amazon-emr-serverless-sagemaker-studio/" target="_blank" rel="noreferrer noopener"><b>Use Apache Spark on Amazon EMR Serverless directly from Amazon </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2024/09/apache-spark-amazon-emr-serverless-sagemaker-studio/" target="_blank" rel="noreferrer noopener"><b>Sagemaker Studio</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">You can now run petabyte-scale data analytics and machine learning in </span><a href="https://aws.amazon.com/blogs/big-data/run-apache-spark-3-5-1-workloads-4-5-times-faster-with-amazon-emr-runtime-for-apache-spark/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">EMR</span></a><span style="font-weight:400;"> Serverless direction from </span><a href="https://aws.amazon.com/blogs/machine-learning/use-langchain-with-pyspark-to-process-documents-at-massive-scale-with-amazon-sagemaker-studio-and-amazon-emr-serverless/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">SageMaker Studio</span></a><span style="font-weight:400;"> notebooks. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Serverless automatically provisions and scales the needed resources, allowing you to focus on data and models without having to configure, optimize, tune or manage your clusters. </span></li>
</ul>
<p><i><span style="font-weight:400;">07:40  Ryan – “Yeah, is it the query that’s terrible or the underlying data? The world may never know. Or both. It’s both.”</span></i></p>
<p><b>07:57 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/09/bedrock-agents-sonnet-3-5/" target="_blank" rel="noreferrer noopener"><b>Bedrock Agents on Sonnet 3.5</b></a><b>  </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Agents for </span><a href="https://aws.amazon.com/bedrock/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Bedrock</span></a><span style="font-weight:400;"> enable developers to create generative AI-based applications that can complete complex tasks for a wide range of use cases, and deliver answers based on company knowledge sources. </span></li>
</ul>
<p><i><span style="font-weight:400;">08:32  Justin – “It’s just an AI bot you put onto your Slack team that, you know, answers questions based on data you’ve fed it basically. Yeah. Agents is really just a chat interface to an AI of some kind that you’ve fed data to.”</span></i></p>
<p><b>08:58 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/09/amazon-workspaces-pools-windows-10-11-licenses/" target="_blank" rel="noreferrer noopener"><b>Amazon WorkSpaces Pools now allows you to bring your Windows 10 or 11 </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2024/09/amazon-workspaces-pools-windows-10-11-licenses/" target="_blank" rel="noreferrer noopener"><b>licenses</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">If you are leveraging Amazon Workspace Pools powered by Windows 10 or 11, you can now Bring your own License (assuming you meet microsoft requirements) to support your eligible M365 apps for enterprise, providing a consistent desktop experience to their users when they switch between on-premise and virtual desktops. </span></li>
</ul>
<p><i><span style="font-weight:400;">09:28  Ryan – “I doubt they’re talking about a single user. I think it’s like if you’re an IT department, you have to manage both..”</span></i></p>
<p><b>10:45</b> <a href="https://aws.amazon.com/about-aws/whats-new/2024/09/amazon-ecs-graviton-based-spot-compute-fargate/" target="_blank" rel="noreferrer noopener"><b>Amazon ECS now supports AWS Graviton-based Spot compute with AWS </b></a></p>
<p><a href="https://aws.amazon.com/about-aws/whats-new/2024/09/amazon-ecs-graviton-based-spot-compute-fargate/" target="_blank" rel="noreferrer noopener"><b>Fargate</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://aws.amazon.com/ecs/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon ECS</span></a><span style="font-weight:400;"> now supports AWS Graviton-based compute with AWS Fargate Spot. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This capability helps you run fault-tolerant arm-based applications with up to a 70% discount compared to fargate prices. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">And yes, this is as complicated as it seems. </span></li>
</ul>
<p><i><span style="font-weight:400;">11:13  Ryan – “All this means is that they finally got their inventory up on Graviton hardware in the data centers where they can start allowing it to work.”</span></i></p>
<p><b>12:33</b> <a href="https://aws.amazon.com/startups/lp/aws-gen-ai-lofts" target="_blank" rel="noreferrer noopener"><b>AWS GenAI Lofts</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS pre-pandemic (in the “before times”) used to have AWS Lofts, where you could go and hang out with experts, community events would be held and overall you could pop in to get 1:1 assistance on your cloud project.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">After the pandemic, however,  they sort of disappeared – but AWS has brought them back as the Gen AI Lofts. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Unfortunately they’re not permanent lofts; they’re just pop-up events.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Currently the lofts are located in </span><a href="https://aws.amazon.com/startups/lp/aws-gen-ai-loft-san-francisco" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">San Francisco</span></a><span style="font-weight:400;"> and </span><a href="https://aws.amazon.com/startups/lp/aws-gen-ai-loft-sao-paulo" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">São Paulo</span></a><span style="font-weight:400;">, with </span><a href="https://aws.amazon.com/startups/lp/aws-gen-ai-loft-london" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">London</span></a><span style="font-weight:400;">, </span><a href="https://aws.amazon.com/startups/lp/aws-gen-ai-loft-paris" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Paris</span></a><span style="font-weight:400;">, and </span><a href="https://aws.amazon.com/startups/lp/aws-gen-ai-loft-seoul" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Seoul</span></a><span style="font-weight:400;"> opening in October. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The SF one is being held in the AWS office in downtown San Francisco. </span></li>
</ul>
<p><i><span style="font-weight:400;">14:36  Justin – “I think it’s nice to be able to go someplace and get, you know, A) talk to people who are trying to do the same thing you’re trying to do. And number two, if they don’t know, then you can ask the expert who’s there and you can, then he can get the answer for you. Because they’re the experts and they have access to the product managers and different things.</span></i></p>
<p><b>15:31</b> <a href="https://aws.amazon.com/about-aws/whats-new/2024/09/amazon-msk-cross-cluster-replication-identical-topic-names/" target="_blank" rel="noreferrer noopener"><b>Amazon MSK enhances cross-cluster replication with support for identical topic names</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://docs.aws.amazon.com/msk/latest/developerguide/msk-replicator.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon MSK replicator</span></a><span style="font-weight:400;"> now supports a new configuration that enables you to preserve original Kafka topic names while replicating streaming data across Amazon Managed Streaming for Kafka Clusters. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Amazon MSK replicator is a feature of </span><a href="https://aws.amazon.com/msk/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon MSK</span></a><span style="font-weight:400;"> that lets you reliably replicate data across MSK clusters in the same or different AWS regions with just a few clicks. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Let’s be real. The fact that you couldn’t use the same topic name between clusters in different regions was a *problem*. We’re really glad they fixed this one. </span></li>
</ul>
<p><i><span style="font-weight:400;">15:56  Ryan – “I’m sure people have just been working around this with application config, based on where the workload is hosted.”</span></i></p>
<p><b>17:22</b> <a href="https://aws.amazon.com/blogs/aws/amazon-sagemaker-hyperpod-introduces-amazon-eks-support/" target="_blank" rel="noreferrer noopener"><b>Amazon SageMaker HyperPod introduces Amazon EKS support</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS is announcing that EKS is now supported in </span><a href="https://aws.amazon.com/sagemaker/hyperpod/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Sagemaker Hyperpods</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This purpose built infrastructure is engineering with resilience at its core for foundation model development. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This allows customers to orchestrate hyperpod clusters using </span><a href="https://docs.aws.amazon.com/sagemaker/latest/dg/sagemaker-hyperpod-eks.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">EKS</span></a><span style="font-weight:400;">, combining the power of </span><a href="https://kubernetes.io/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">K8</span></a><span style="font-weight:400;"> with Hyperpods resilient environment designed for training large models.</span></li>
</ul>
<p><i><span style="font-weight:400;">18:00  Ryan – “Historically these, types of jobs haven’t been really designed with resilience, right? It’s like, it could have a failure and then you have to restart a job or a series of jobs. going to take hours to complete. So it is kind of nice to see this…but it is kind of funny.”</span></i></p>
<h2><span style="font-weight:400;">GCP</span></h2>
<p><b>18:41 </b><a href="https://cloud.google.com/blog/products/ai-machine-learning/google-cloud-named-a-leader-in-forrester-wave-for-ai-platforms/" target="_blank" rel="noreferrer noopener"><b>Google named a leader in the Forrester Wave: AI/ML Platforms, Q3 2024</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is named a leader in the Forrester Wave… which is cool and we wouldn’t have even mentioned, but the Top Current offering was Palantir? </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Should we be concerned? </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Palantir apparently has one of the strongest offerings in the AI/ML space, with a vision and roadmap to create a platform that brings together humans and machines in a joint-decision making model. Uh huh…</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">But back to Google… </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google is the best positioned hyperscaler for AI. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google </span><a href="https://cloud.google.com/vertex-ai" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Vertex AI</span></a><span style="font-weight:400;"> is thoughtfully designed to simplify access to Google’s portfolio of AI infrastructure at planet scale, AI models, and complementary data services. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The company continues to outpace competitors in AI innovation, especially in </span><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/models/evaluation-overview" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">genAI</span></a><span style="font-weight:400;">, and has a strong roadmap to expand tooling for multirole AI teams. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google has also worked hard to nurture a large set of well-incented partners that is likely to help it increase adoption of Google Vertex AI. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google has enough differentiation in AI from other hyperscalers that enterprises may decide to migrate from their existing hyperscaler to Google – or at least start a new relationship with Google Cloud.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Want your own copy of the Forrester Wave? Find it </span><a href="https://reprint.forrester.com/reports/the-forrester-wavetm-ai-ml-platforms-q3-2024-e8e56c78/index.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">here</span></a><span style="font-weight:400;">. </span></li>
</ul>
<p><i><span style="font-weight:400;">20:20  Justin – “Apparently Google is the best positioned hyperscaler for AI. Take that Azure.”</span></i></p>
<p><i><span style="font-weight:400;">20:55  Matthew – “Okay, so C3AI, I haven’t actually done any research, but their stock symbol is just AI. I think they win… just hands down they win. Like game over, everyone else should just not be on the leaderboard.”</span></i></p>
<p><b>22:00</b> <a href="https://cloud.google.com/blog/products/data-analytics/integrating-bigquery-with-anthropics-claude/" target="_blank" rel="noreferrer noopener"><b>BigQuery and Anthropic’s Claude: A powerful combination for data-driven </b></a><a href="https://cloud.google.com/blog/products/data-analytics/integrating-bigquery-with-anthropics-claude/" target="_blank" rel="noreferrer noopener"><b>insights</b></a> <span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google Cloud is extending their Open Platform with the preview of </span><a href="https://cloud.google.com/bigquery?utm_source=google&amp;utm_medium=cpc&amp;utm_campaign=na-US-all-en-dr-bkws-all-all-trial-e-dr-1707554&amp;utm_content=text-ad-none-any-DEV_c-CRE_665665924750-ADGP_Hybrid+%7C+BKWS+-+MIX+%7C+Txt-Data+Analytics-BigQuery-KWID_43700077225652815-kwd-47616965283&amp;utm_term=KW_bigquery-ST_bigquery&amp;gad_source=1&amp;gclid=CjwKCAjw8rW2BhAgEiwAoRO5rDdEcPlaUa4K_VA_jnW9XH5dvtdGzCdH8w3L-cL8SBFomlDu-SQOyBoCi_IQAvD_BwE&amp;gclsrc=aw.ds&amp;e=48754805&amp;hl=en" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">BigQuery</span></a><span style="font-weight:400;">’s new integration with </span><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/partner-models/use-claude" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Anthropic Claude models on Vertex AI</span></a><span style="font-weight:400;"> that connects your data in BigQuery with powerful intelligence capabilities of Claude models. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">BigQueries integration with Anthropic Claude models allows organizations to reimagine data driven decision making and boost productivity across a variety of tasks including:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Analyzing log data for enhanced security</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Marketing optimization</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Document summarization</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Content localization</span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">20:27  Justin – “If Jonathan were here – and not sleeping / napping – he would tell you that cloud’s pretty darn good. And so, this is actually pretty nice to get an alternative that’s pretty decent to Gemini, to give you some additional BigQuery options for your summarization and advanced logging analytics. Apparently.”</span></i></p>
<p><b>23:50</b> <a href="https://cloud.google.com/blog/products/management-tools/introducing-log-scopes-in-cloud-observability/" target="_blank" rel="noreferrer noopener"><b>Cut through the noise with new log scopes for Cloud Observability</b></a> <span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">GCP is introducing </span><a href="https://cloud.google.com/logging/docs/log-scope/create-and-manage" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">log scopes</span></a><span style="font-weight:400;"> for cloud logging – a significant advancement in managing and analyzing your orgs logs. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Log scopes are a named collection of logs of interest within the same or different projects. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They are groups of </span><a href="https://cloud.google.com/logging/docs/routing/overview#log-views" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">log views</span></a><span style="font-weight:400;"> that control and grant permissions to a subset of logs in a </span><a href="https://cloud.google.com/logging/docs/routing/overview#buckets" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">log bucket</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Combined with </span><a href="https://cloud.google.com/monitoring/settings" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">metric scopes</span></a><span style="font-weight:400;">, log scopes let you define a set of correlated telemetry for your application, which can then be used for faster troubleshooting or referencing for insights. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Some example use cases from the press release:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Use Case 1: Correlating metrics with logs from the same application when an organization uses a centralized log storage architecture. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Use Case 2: Correlating metrics with logs for isolated environments such as development, staging and production across projects. </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">24:35  Ryan – “ …that second one is the one I’m most interested in just because it’s, you know, for all kinds of reasons, we’ve separated workloads out and put them into different projects and for blast radius and security concerns and all those things, but it becomes much more challenging to sort of correlate a transaction through many, many different services spread out through multiple projects. And so there’s sort of two ways you tackle that. One is just re-consolidate all the logs together, and that can get expensive and generate this condition where you’re sorting through a whole bunch of noise. Or it’s like you just look it up everywhere and you manually construct it back together, which just doesn’t work and no one does. That’s what we used to do when all the logs were on server hard disks. So this is really neat to be able to tag them all together, really, and then search on them from that tag, which I think is pretty neat.”</span></i></p>
<p><b>25:59 </b><a href="https://cloud.google.com/blog/products/storage-data-transfer/backup-and-dr-service-adds-immutable-indelible-backups/" target="_blank" rel="noreferrer noopener"><b>Introducing backup vaults for cyber resilience and simplified Compute </b></a></p>
<p><a href="https://cloud.google.com/blog/products/storage-data-transfer/backup-and-dr-service-adds-immutable-indelible-backups/" target="_blank" rel="noreferrer noopener"><b>Engine backups</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is enhancing Google Cloud Backup and DR service with some new capabilities:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">New Backup Vault storage feature, which delivers immutable (preventing modification) and indelible (preventing deletion) backups, securing your backups against tampering and unauthorized deletion</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">A centralized backup management experience, which delivers a fully managed end-to-end solution, making data protection effortless, and supporting direct integration into resource management flows</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Integration within the compute engine vm creation experience, empowering application owners to apply backup policies when VMs are initially created. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">These are all good quality of life improvements. </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">26:26  Ryan – “Yeah, I mean, the backup policy is specifically when VMs are created is definitely something that, you know, I would like to see more features in that direction.”</span></i></p>
<h2><span style="font-weight:400;">Azure</span></h2>
<p><b>28:31 </b><a href="https://techcommunity.microsoft.com/t5/azure-tools-blog/azure-cli-docker-container-base-linux-image-is-now-azure-linux/ba-p/4236248" target="_blank" rel="noreferrer noopener"><b>Azure CLI docker container base Linux image is now Azure Linux</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Starting with version 2.64.0 of Azure CLI, the base linux distribution of Azure CLI is now Azure Linux. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">There is no impact to your az commands; shell commands specific to alpine will not work (apk) and Github actions that use specific alpine components or commands. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You also have to trust that Microsoft Azure Linux is secure and as great as Alpine. Insert your favorite side eye meme here. </span></li>
</ul>
<p><i><span style="font-weight:400;">30:05  Justin – “…it’s a supply chain problem. It’s – how do you tell the government that you’re sure that nothing in your, you know, in your Linux operating system is compromised by a third party nation state? The answer is, well, we own all of the source and we build our own version of Linux from that source and we review it all. And that’s how you solve this problem.”</span></i></p>
<p><b>33:45</b> <a href="https://techcommunity.microsoft.com/t5/ai-azure-ai-services-blog/general-availability-of-prompt-shields-in-azure-ai-content/ba-p/4235560" target="_blank" rel="noreferrer noopener"><b>General availability of Prompt Shields in Azure AI Content Safety and Azure </b></a></p>
<p><a href="https://techcommunity.microsoft.com/t5/ai-azure-ai-services-blog/general-availability-of-prompt-shields-in-azure-ai-content/ba-p/4235560" target="_blank" rel="noreferrer noopener"><b>OpenAI Service</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Azure is announcing the GA of </span><a href="https://www.youtube.com/watch?v=-vgLI5OSm0w" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Prompt Shields</span></a><span style="font-weight:400;"> in Azure AI Content safety and Azure OpenAI service, a robust AI security feature </span><a href="https://techcommunity.microsoft.com/t5/ai-azure-ai-services-blog/azure-ai-announces-prompt-shields-for-jailbreak-and-indirect/ba-p/4099140" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">announced in March 2024</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Prompt Shields seamlessly integrate with Azure OpenAI service content filters and are available in Azure AI content safety, providing a robust defense against different types of prompt injection attacks. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">By leveraging advanced machine learning algorithms and natural language processing, prompt shields effectively identify and mitigate potential threats in user prompts and third party data. </span></li>
</ul>
<p><b>34:15</b> <a href="https://techcommunity.microsoft.com/t5/ai-azure-ai-services-blog/ga-release-of-protected-material-detection-in-azure-ai-content/ba-p/4235577" target="_blank" rel="noreferrer noopener"><b>GA release of Protected Material Detection in Azure AI Content Safety and </b></a></p>
<p><a href="https://techcommunity.microsoft.com/t5/ai-azure-ai-services-blog/ga-release-of-protected-material-detection-in-azure-ai-content/ba-p/4235577" target="_blank" rel="noreferrer noopener"><b>Azure OpenAI Service</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Protected material detection is an additional GA feature of AI content safety and Azure Open AI service.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This feature addresses outputs that could potentially violate copyright.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Many customers and end users are apprehensive about the risk of IP Infringement claims when integrating and using generative AI.  </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">To address this, the feature specifically targets model completions and scans for matches against an index of third party text content to detect the usage of third-party text content, including songs, news articles, recipes and selected web content. </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">34:33  Ryan – “I mean, it’s not really for its accuracy. It’s about the mitigation of risk when you get sued. Like, you can say, well, I tried, I turned all the checkboxes… I do think these kinds of features… will be in every product eventually.”</span></i></p>
<p><b>37:02 </b><a href="https://techcommunity.microsoft.com/t5/running-sap-applications-on-the/m-series-announcements-ga-of-mv3-high-memory-and-details-on-mv3/ba-p/4235719" target="_blank" rel="noreferrer noopener"><b>M-Series announcements – GA of Mv3 High Memory and details on Mv3 </b></a></p>
<p><a href="https://techcommunity.microsoft.com/t5/running-sap-applications-on-the/m-series-announcements-ga-of-mv3-high-memory-and-details-on-mv3/ba-p/4235719" target="_blank" rel="noreferrer noopener"><b>Very High Memory virtual machines</b></a><b>   </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft has released the third version of the M-Series (Mv#) powered by 4th generation Intel Xeon processors (Sapphire Rapids) across the board. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">These high memory VMs give customers faster insights, more uptime, lower total cost of ownership and improved price-performance for their most demanding workloads. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">What workloads do you ask? SAP Hana. Duh. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">The systems can scale for workloads from 6TB to 16TB, with up to 40% throughput over the Mv2 high memory. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">416 VCPU, 6tb of memory and a max of 64 data disks.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The largest configuration is 832 VCPU and 16TB of memory. </span></li>
</ul>
<h2><span style="font-weight:400;">Oracle</span></h2>
<p><b>39:00 </b><a href="https://blogs.oracle.com/cloud-infrastructure/post/breaking-boundaries-ml-development-sqream-on-oci" target="_blank" rel="noreferrer noopener"><b>Breaking boundaries in ML development: SQream on OCI</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Oracle says that now is an exciting time to be developing AI and ML solutions.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With investors and customers expecting AI and ML innovation at a dizzying pace, companies struggle moving from AI Proof of Concept to Production, with the issue quite often being the efficient handling and preparing of massive amounts of data – a critical step that bottlenecks everything else in the dev process. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Oracle is pleased to share breakthrough technologies like SQream on OCI to improve the outcomes by transforming legacy processes by accelerating data preparation and reducing development cycles by over 90%.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With these advancements, organizations can streamline their workflows and expedite AI deployments, ultimately enabling them to achieve their strategic objectives more effectively. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Data Preparation:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">It involves labor-intensive, manual processes that are time-consuming, prone to errors, and often require multiple iterations, from manual scripting for data collection to painstaking efforts in data cleaning and complex custom scripting for integrating and transforming disparate datasets, manual processes can lead to significant delays. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">SQream on OCI dramatically impacts these tasks, streamlining and automating the processes by leveraging GPU-accelerated technology.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Also with SQream, data scientists can quickly experiment with different feature sets and validate their effectiveness faster. </span></li>
</ul>
</li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">SQream on OCI revolutionizes your team dynamics by enhancing collaboration, boosting morale and productivity, and optimizing human resource allocation. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">SQream also optimizes your hardware utilization leading to reduced operational costs. </span></li>
</ul>
<p><i><span style="font-weight:400;">40:46  Ryan – “I also think that every one of their claims is complete nonsense. I, cause it’s Oracle and it’s like, there’s no way.”</span></i></p>
<p><b>42:11</b> <a href="https://www.oracle.com/news/announcement/ocw24-oracle-and-amazon-web-services-announce-strategic-partnership-2024-09-09/" target="_blank" rel="noreferrer noopener"><b>Oracle and Amazon Web Services Announce Strategic Partnership</b></a></p>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Open World is happening this week, and they dropped a ton of announcements today, which we’ll cover next week.</span></li>
<li style="font-weight:400;"><b>*But*</b><span style="font-weight:400;"> Sometimes a story is so important we must talk about it now.  </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Folks, hell has not frozen over, nor are pigs flying. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Oracle and AWS today announced the launch of Oracle Database@AWS, a new offering that allows customers to access Oracle Autonomous Database service within AWS. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Oracle Database@AWS will provide customers with a unified experience between OCI and AWS offering a simplified database administration, billing, and unified customer support system. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In addition, customers will be able to seamlessly connect enterprise data in their Oracle Database to apps running on Ec2, AWS analytics services, or AI and ML services including Bedrock. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With direct access to Oracle Exadata database services on AWS, including Oracle Autonomous database on dedicated infrastructure and workloads running on RAC clusters, Oracle Database@AWS allows customers to bring together all of their enterprise data to drive breakthrough innovations. </span></li>
</ul>
</li>
</ul>
<ul>
<li style="font-weight:400;"><i><span style="font-weight:400;">“We are seeing huge demand from customers that want to use multiple clouds,” </span></i><b><i>said Larry Ellison, Oracle Chairman and CTO</i></b><i><span style="font-weight:400;">. “To meet this demand and give customers the choice and flexibility they want, Amazon and Oracle are seamlessly connecting AWS services with the very latest Oracle Database technology, including the Oracle Autonomous Database. With Oracle Cloud Infrastructure deployed inside of AWS data centers, we can provide customers with the best possible database and network performance.”</span></i></li>
</ul>
<ul>
<li style="font-weight:400;"><i><span style="font-weight:400;">“As far back as 2008, customers could run their Oracle workloads in the cloud, and since then, many of the world’s largest and most security-sensitive organizations have chosen to deploy their Oracle software on AWS,” </span></i><b><i>said Matt Garman, CEO at AWS</i></b><i><span style="font-weight:400;">. “This new, deeper partnership will provide Oracle Database services within AWS to allow customers to take advantage of the flexibility, reliability, and scalability of the world’s most widely adopted cloud alongside enterprise software they rely on.”</span></i></li>
</ul>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Customers can also benefit from the following with Oracle Database@AWS</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Zero-ETL integration between Oracle Database services and AWS Analytics services. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Customers will be able to seamlessly and securely connect and analyze data across Oracle Database services and applications they already have running on AWS to get faster, deeper insights without having to build pipelines.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Flexible options to simplify and accelerate migrating their Oracle databases to the cloud, including compatibility with proven migration tools such as Oracle Zero Downtime Migration.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">A simplified procurement experience via AWS Marketplace that enables customers to purchase Oracle Database services using their existing AWS commitments and use their existing Oracle license benefits, including Bring Your Own License (BYOL) and discount programs such as Oracle Support Rewards (OSR).</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">A fully unified support experience from both AWS and Oracle as well as guidance through reference architectures, landing zones, and other collateral for customers to successfully build and run their most trusted enterprise applications in the cloud.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Seamless integration with Amazon Simple Storage Service (Amazon S3) for an easy and secure way to perform database backups and restoration, and to aid with disaster recovery.</span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">44:47  Matthew- “Half of these features already existed between just RDS Oracle and AWS I feel like, and the other half just use are a good way to kill all your EDP pricing – EDP that you have to finish by the end of the year.”</span></i></p>
<h2><span style="font-weight:400;">Closing</span></h2>
<p><span style="font-weight:400;">And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</span></p>
]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/1837493/c1e-jkjkuqk90qcp3nr6-6zdv07jmup-znkwhd.mp3" length="56455230"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 275 of The Cloud Pod, where the forecast is always cloudy! Justin, Matthew and Ryan are awake and ready to bring you all the latest and greatest in cloud news, including SQream, a new partnership between OCI and AWS (yes, really) Azure Linux, and a lot of updates over at AWS. Get comfy and we’ll see you all in the cloud! 
Titles we almost went with this week:

I SQream, You SQream, The CloudPod SQreams for AI Ice Cream
️AWS East gets Stability, but only for AI.
AWS has some Lofty Goals
️Claude Learns BigQuery
✅Azure now Securely Checks the Prompts from the cloud pod
Azure find out about Linux

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
AWS
00:28 Stability AI’s best image generating models now in Amazon Bedrock 

If you are like The CloudPod hosts, the part you care most about AI is the rapid ability to create graphics for any meme-worthy moment or funny pictures for that group chat. 
Luckily AWS has access to the latest image generation capability with 3 models from Stability AI.

Stable Image Ultra – Produces the highest quality, photorealistic outputs perfect for professional print media and large format applications. Stable image Ultra excels at rendering exceptional detail and realism. 
Stable diffusion 3 large – strikes a balance between generation speed and output quality. Ideal for creating high-volume, high-quality digital assets for websites, newsletters and marketing materials. 
Stable Image Core – Optimized for fast and affordable image generation, great for rapidly iterating on concepts during ideation. 




One of the key improvements of Stable Image Ultra and Stable Diffusion 3 large compared to Stable Diffusion XL (SDXL) is text quality in generated images, with fewer errors in spelling and typography thanks to innovation diffusion transformer architecture, which implements two separate sets of weights for image and text but enables information flow between the two modalities. 

02:46  Justin – “I do notice more and more that, you get it, you get the typical product shot on Amazon, but then like they’ll insert the product into different backgrounds and scenes. Like, it’s a, it’s a lamp and all of a sudden it’s on a thing and they’re like, Hmm, that doesn’t look like a real photo though. It looks like AI. So you do notice it more and more.”
04:13 ]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/1837493/c1a-k5d5-kp2051djuqrk-q0t6o9.jpg"></itunes:image>
                                                                            <itunes:duration>00:47:03</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                            </item>
                    <item>
                <title>
                    <![CDATA[274: The Cloud Pod is Still Not Open Source]]>
                </title>
                <pubDate>Wed, 11 Sep 2024 12:05:22 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/1833347</guid>
                                    <link>https://tcpfm.castos.com/episodes/274-not-open-source</link>
                                <description>
                                            <![CDATA[<p><span style="font-weight:400;">Welcome to episode 274 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan and Matthew are your hosts this week as we explore the world of SnapShots, Maia, Open Source, and VMware – just to name a few of the topics. And stay tuned for an installment of our continuing Cloud Journey Series to explore ways to decrease tech debt, all this week on The Cloud Pod.  </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">The Cloud Pod in Parallel Cluster</span></li>
<li><span style="font-weight:400;">The Cloud Pod cringes at managing 1000 aws accounts</span></li>
<li><span style="font-weight:400;">The Cloud Pod welcomes Imagen 3 with less Wokeness</span></li>
<li><span style="font-weight:400;">️The Cloud Pod wants to be instantly snapshotted</span></li>
<li><span style="font-weight:400;">The Cloud pod hates tech debt</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><span style="font-weight:400;">General News</span></h2>
<p><b>00:32 </b><a href="https://www.elastic.co/blog/elasticsearch-is-open-source-again" target="_blank" rel="noreferrer noopener"><b>Elasticsearch is Open Source, Again</b></a></p>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Shay Banon is pleased to call </span><a href="https://www.elastic.co/elasticsearch" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">ElasticSearc</span></a><span style="font-weight:400;">h and </span><a href="https://www.elastic.co/kibana" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Kibana</span></a><span style="font-weight:400;"> “open source” again.  He says everyone at Elastic is ecstatic to be open source again, it’s part of his and “Elastics DNA.” </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They’re doing this by adding AGPL as another license option next to ELv2 and SSPL in the coming weeks. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They never stopped believing or behaving like an OSS company after they changed the license, but by being able to use the term open source and by using AGPL – an OSI approved license – removes any questions or fud people might have. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Shay says the change 3 years ago was because they had </span><a href="https://www.elastic.co/blog/why-license-change-aws" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">issues with AWS</span></a><span style="font-weight:400;"> and the market confusion their offering was causing. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">So, after trying all the other options, changing the license – all while knowing it would result in a fork with a different name – was the path they took. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">While it was painful, they said it worked. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">3 years later, Amazon is fully invested in their OpenSearch fork, the market confusion has mostly gone, and their partnership with AWS is stronger than ever.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They are even being named partner of the year with AWS. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">They want to “make life of our users as simple as possible,” so if you’re ok with the ELv2 or the SSPL, then you can keep using that license. They aren’t removing anything, just giving you another option with AGPL.</span></li></ul></li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 274 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan and Matthew are your hosts this week as we explore the world of SnapShots, Maia, Open Source, and VMware – just to name a few of the topics. And stay tuned for an installment of our continuing Cloud Journey Series to explore ways to decrease tech debt, all this week on The Cloud Pod.  
Titles we almost went with this week:

The Cloud Pod in Parallel Cluster
The Cloud Pod cringes at managing 1000 aws accounts
The Cloud Pod welcomes Imagen 3 with less Wokeness
️The Cloud Pod wants to be instantly snapshotted
The Cloud pod hates tech debt

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
General News
00:32 Elasticsearch is Open Source, Again



Shay Banon is pleased to call ElasticSearch and Kibana “open source” again.  He says everyone at Elastic is ecstatic to be open source again, it’s part of his and “Elastics DNA.” 
They’re doing this by adding AGPL as another license option next to ELv2 and SSPL in the coming weeks. 
They never stopped believing or behaving like an OSS company after they changed the license, but by being able to use the term open source and by using AGPL – an OSI approved license – removes any questions or fud people might have. 
Shay says the change 3 years ago was because they had issues with AWS and the market confusion their offering was causing. 

So, after trying all the other options, changing the license – all while knowing it would result in a fork with a different name – was the path they took. 


While it was painful, they said it worked. 

3 years later, Amazon is fully invested in their OpenSearch fork, the market confusion has mostly gone, and their partnership with AWS is stronger than ever.
They are even being named partner of the year with AWS. 


They want to “make life of our users as simple as possible,” so if you’re ok with the ELv2 or the SSPL, then you can keep using that license. They aren’t removing anything, just giving you another option with AGPL.]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[274: The Cloud Pod is Still Not Open Source]]>
                </itunes:title>
                                    <itunes:episode>274</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p><span style="font-weight:400;">Welcome to episode 274 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan and Matthew are your hosts this week as we explore the world of SnapShots, Maia, Open Source, and VMware – just to name a few of the topics. And stay tuned for an installment of our continuing Cloud Journey Series to explore ways to decrease tech debt, all this week on The Cloud Pod.  </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">The Cloud Pod in Parallel Cluster</span></li>
<li><span style="font-weight:400;">The Cloud Pod cringes at managing 1000 aws accounts</span></li>
<li><span style="font-weight:400;">The Cloud Pod welcomes Imagen 3 with less Wokeness</span></li>
<li><span style="font-weight:400;">️The Cloud Pod wants to be instantly snapshotted</span></li>
<li><span style="font-weight:400;">The Cloud pod hates tech debt</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><span style="font-weight:400;">General News</span></h2>
<p><b>00:32 </b><a href="https://www.elastic.co/blog/elasticsearch-is-open-source-again" target="_blank" rel="noreferrer noopener"><b>Elasticsearch is Open Source, Again</b></a></p>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Shay Banon is pleased to call </span><a href="https://www.elastic.co/elasticsearch" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">ElasticSearc</span></a><span style="font-weight:400;">h and </span><a href="https://www.elastic.co/kibana" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Kibana</span></a><span style="font-weight:400;"> “open source” again.  He says everyone at Elastic is ecstatic to be open source again, it’s part of his and “Elastics DNA.” </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They’re doing this by adding AGPL as another license option next to ELv2 and SSPL in the coming weeks. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They never stopped believing or behaving like an OSS company after they changed the license, but by being able to use the term open source and by using AGPL – an OSI approved license – removes any questions or fud people might have. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Shay says the change 3 years ago was because they had </span><a href="https://www.elastic.co/blog/why-license-change-aws" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">issues with AWS</span></a><span style="font-weight:400;"> and the market confusion their offering was causing. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">So, after trying all the other options, changing the license – all while knowing it would result in a fork with a different name – was the path they took. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">While it was painful, they said it worked. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">3 years later, Amazon is fully invested in their OpenSearch fork, the market confusion has mostly gone, and their partnership with AWS is stronger than ever.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They are even being named partner of the year with AWS. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">They want to “make life of our users as simple as possible,” so if you’re ok with the ELv2 or the SSPL, then you can keep using that license. They aren’t removing anything, just giving you another option with AGPL.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">He calls out trolls and people who will pick at this announcement, so they are attempting to address the trolls in advance. </span></li>
</ul>
</li>
</ul>
<ul>
<li style="font-weight:400;"><i><span style="font-weight:400;">“Changing the license was a mistake, and Elastic now backtracks from it”. We removed a lot of market confusion when we changed our license 3 years ago. And because of our actions, a lot has changed. It’s an entirely different landscape now. We aren’t living in the past. We want to build a better future for our users. It’s because we took action then, that we are in a position to take action now.</span></i></li>
</ul>
<ul>
<li style="font-weight:400;"><i><span style="font-weight:400;">“AGPL is not true open source, license X is”: AGPL is an OSI approved license, and it’s a widely adopted one. For example, MongoDB used to be AGPL and Grafana is AGPL. It shows that AGPL doesn’t affect usage or popularity. We chose AGPL because we believe it’s the best way to start to pave a path, with OSI, towards more Open Source in the world, not less.”</span></i></li>
<li style="font-weight:400;"><i><span style="font-weight:400;">“Elastic changes the license because they are not doing well” – I will start by saying that I am as excited today as ever about the future of Elastic. I am tremendously proud of our products and our team’s execution. We shipped Stateless Elasticsearch, ES|QL, and tons of vector database/hybrid search improvements for GenAI use cases. We are leaning heavily into OTel in logging and Observability. And our SIEM product in Security keeps adding amazing features and it’s one of the fastest growing in the market. Users’ response has been humbling. The stock market will have its ups and downs. What I can assure you, is that we are always thinking long term, and this change is part of it.”</span></i></li>
</ul>
<p><i><span style="font-weight:400;">03:03  Ryan – “ I have a hard time thinking that this has nothing to do with performance and you know, there was quite the reputation hit when they changed the license before and Since you can do open search now, which is truly open search open source. I imagine there’s a lot of people that are sort of adopting that instead.”</span></i></p>
<h2><span style="font-weight:400;">AI Is Going Great – Or How ML Makes All It’s Money </span></h2>
<p><b>06:28 </b><a href="https://www.digitalocean.com/blog/nvidia-h100-now-available-on-digitalocean-kubernetes" target="_blank" rel="noreferrer noopener"><b>Nvidia H100 now available on DigitalOcean Kubernetes (EA)</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Digital Ocean is making Nvidia’s latest H100 GPU’s available on DigitalOcean Kubernetes (DOKS).  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Early access customers have the choice of 1 x H100 or 8 x H100 nodes. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">H100 nodes are of course in high demand for building and training your AI workloads, and so this is a great alternative option to other cloud providers.  </span></li>
</ul>
<p><i><span style="font-weight:400;">06:51  Ryan – “I wonder how many people are actually because of the capacity constraints are having to utilize multiple clouds for this. Like it’s kind of crazy if you think about, you know, people using capacity across DigitalOcean, GCP, Azure, and AWS trying to get model training done, but it’s possible.”</span></i></p>
<h2><span style="font-weight:400;">AWS</span></h2>
<p><b>08:06 </b><a href="https://aws.amazon.com/blogs/aws/how-aws-powered-prime-day-2024-for-record-breaking-sales/" target="_blank" rel="noreferrer noopener"><b>How AWS powered Prime Day 2024 for record-breaking sales</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS is here to tell us how they powered the mighty </span><a href="https://www.aboutamazon.com/news/retail/when-is-prime-day-2024" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Prime Day</span></a><span style="font-weight:400;"> from July 17-18th in their annual recap blog post.</span></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/ec2/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Amazon Ec2</span></a><span style="font-weight:400;"> Services such as </span><a href="https://www.aboutamazon.com/news/retail/amazon-rufus" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Rufus</span></a><span style="font-weight:400;"> and Search use AWS Artificial Intelligence chips under the hood, and Amazon deployed a cluster of over 80,000 I</span><a href="https://aws.amazon.com/machine-learning/inferentia/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">nferentia</span></a><span style="font-weight:400;"> and </span><a href="https://aws.amazon.com/machine-learning/trainium/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Trainum</span></a><span style="font-weight:400;"> chips for Prime Day.  </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">They used over 250k </span><a href="https://aws.amazon.com/ec2/graviton/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Graviton</span></a><span style="font-weight:400;"> chips to power more than 5800 distinct Amazon.com services (double that of 2023.)</span></li>
</ul>
</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/ebs" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">EBS</span></a><span style="font-weight:400;"> used 264 PiB of storage, 62% more than the year before. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">With 5.6 trillion read/write operations, they transferred 444 Petabytes of data during the event, an 81% increase. </span></li>
</ul>
</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/aurora" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Aurora</span></a><span style="font-weight:400;"> had 6,311 database instances running Postgres and Mysql compatible editions, processed 376 billion transactions, stored 2,978 terabytes of data and transferred 913 terabytes of data.</span></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/dynamodb" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">DynamoDB</span></a><span style="font-weight:400;"> powers many things, including Alexa, Amazon.com sites and Amazon fulfillment centers. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Over the course of the Prime Days, they made tens of trillions of calls to the DynamoDB API.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">DynamoDB maintained high availability while delivering single-digit millisecond responses, and peaking at 146m requests per second.</span></li>
</ul>
</li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/elasticache" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Elasticache</span></a><span style="font-weight:400;"> served more than a quadrillion requests on a single day, with a peak of over 1 trillion requests per minute.</span></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/quicksight" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">QuickSight</span></a><span style="font-weight:400;"> dashboards saw 107k unique hits, 1300+ unique visitors and delivered over 1.6M queries.</span></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/sagemaker" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Sagemaker</span></a><span style="font-weight:400;"> processed 145B inference requests.</span></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/ses" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">SES</span></a><span style="font-weight:400;"> sent 30 percent more emails than the prior year.</span></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/guardduty" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Guard Duty</span></a><span style="font-weight:400;"> monitored nearly 6 trillion log events per hour, a 31.9% increase.</span></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/cloudtrail" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Cloudtrail</span></a><span style="font-weight:400;"> processed over 976 Billion events in support of PD. </span></li>
<li style="font-weight:400;"><a href="https://aws.amazon.com/cloudfront" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">CloudFront</span></a><span style="font-weight:400;"> had a peak load of over 500M http requests per minute, for a total of over 1.3 Trillion HTTP requests during prime day, 30% more than the year prior.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Rigorous preparation is the key, for example 733 </span><a href="https://aws.amazon.com/fis" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Fault Injection Service</span></a><span style="font-weight:400;"> experiments were run to test resilience and ensure Amazon.com remains highly available. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With the rebranded </span><a href="https://aws.amazon.com/premiumsupport/aws-countdown/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Countdown</span></a><span style="font-weight:400;"> support program your organization can handle these big events using tried and true methods.  </span></li>
</ul>
<p><i><span style="font-weight:400;">13:47  Matthew – “ I would love to be at a company where I’m running something at this scale. I feel like, you know, they’re like, cool, come have us do it. But the amount of companies that run stuff at this insane scale is going to be in the single digits.”</span></i></p>
<p><b>16:48 </b><a href="https://aws.amazon.com/blogs/aws/announcing-aws-parallel-computing-service-to-run-hpc-workloads-at-virtually-any-scale/" target="_blank" rel="noreferrer noopener"><b>Announcing AWS Parallel Computing Service to run HPC workloads at </b></a><a href="https://aws.amazon.com/blogs/aws/announcing-aws-parallel-computing-service-to-run-hpc-workloads-at-virtually-any-scale/" target="_blank" rel="noreferrer noopener"><b>virtually any scale </b></a><a href="https://kill-the-newsletter.com/feeds/wmrx6rsab4dbqgph/entries/i90dbpj5bfm1ce1z91gq.html" target="_blank" rel="noreferrer noopener"><b>AWS Parallel Computing Service is Now Generally Available, Designed to </b></a><a href="https://kill-the-newsletter.com/feeds/wmrx6rsab4dbqgph/entries/i90dbpj5bfm1ce1z91gq.html" target="_blank" rel="noreferrer noopener"><b>Accelerate Scientific Discovery</b></a></p>
<ul>
<li style="list-style-type:none;">
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS is announcing </span><a href="https://aws.amazon.com/pcs" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Parallel Computing Services</span></a><span style="font-weight:400;"> (AWS PCS), a new managed service that helps customers set up and manage </span><a href="https://aws.amazon.com/hpc/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">HPC clusters</span></a><span style="font-weight:400;"> so they seamlessly run their simulations at virtually any scale on AWS.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Using the </span><a href="https://slurm.schedmd.com/man_index.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Slurm</span></a><span style="font-weight:400;"> Scheduler, you can work in a familiar HCP environment, accelerating your time to results instead of worrying about infrastructure. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This is a managed service of an open source tool they provided in November 2018.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This open source tool allowed you to build and deploy POC and production HPC environments, and you could take advantage of a CLI, API and Python libraries.  </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">But you were responsible for the updates, as well as tearing down and redeploying clusters. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">The Managed services makes everything available via the AWS Management Console, AWS SDK and AWS CLI.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Your system administrators can create managed Slurm clusters that use their compute and storage configs, identity and job allocation preferences. </span></li>
</ul>
</li>
</ul>
<ul>
<li style="font-weight:400;"><i><span style="font-weight:400;">“Developing a cure for a catastrophic disease, designing novel materials, advancing renewable energy, and revolutionizing transportation are problems that we just can’t afford to have waiting in a queue,” </span></i><b><i>said Ian Colle, director, advanced compute and simulation at AWS</i></b><i><span style="font-weight:400;">. “Managing HPC workloads, particularly the most complex and challenging extreme-scale workloads, is extraordinarily difficult. Our aim is that every scientist and engineer using AWS Parallel Computing Service, regardless of organization size, is the most productive person in their field because they have the same top-tier HPC capabilities as large enterprises to solve the world’s toughest challenges, any time they need to, and at any scale.”</span></i></li>
<li style="font-weight:400;"><span style="font-weight:400;">Maxar Intelligence provides secure, precise geospatial intelligence, enabling government and commercial customers to monitor, understand, and navigate our changing planet. “</span><i><span style="font-weight:400;">As a long-time user of AWS HPC solutions, we were excited to test the service-driven approach from AWS Parallel Computing Service,” </span></i><b><i>said Travis Hartman, director of Weather and Climate at Maxar Intelligence.</i></b><i><span style="font-weight:400;"> “We found great potential for AWS Parallel Computing Service to bring better cluster visibility, compute provisioning, and service integration to Maxar Intelligence’s WeatherDesk platform, which would enable the team to make their time-sensitive HPC clusters more resilient and easier to manage.”</span></i></li>
</ul>
<p><b>18:31 </b><a href="https://siliconangle.com/2024/08/26/exclusive-inside-mind-aws-ceo-matt-garman-aims-shape-future-cloud-ai/" target="_blank" rel="noreferrer noopener"><b>Exclusive: Inside the mind of AWS CEO Matt Garman and how he aims to </b></a><a href="https://siliconangle.com/2024/08/26/exclusive-inside-mind-aws-ceo-matt-garman-aims-shape-future-cloud-ai/" target="_blank" rel="noreferrer noopener"><b>shape the future of cloud and AI</b></a><b>   </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Silicon Angle’s John Furrier got an exclusive with new AWS CEO Matt Garman, and they chatted about how he plans to shape the future of cloud and AI. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Garman was a key architect of the AWS EC2 computing service.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Now, as the new CEO, he faces leading AWS into the future – and this is a future dominated by generative AI. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">On Generative AI Garman says that their job at AWS is to help customers and companies take advantage of A in a secure, reliable and performant platform that allows them to innovate in ways never imagined before. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Garman sees AI as a transformative force that could redefine the AWS trajectory.   </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Garman asserts that they never obsess about their competitors, instead they obsess about their customers. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">He says AWS is focused on customers by focusing on the future and not dwelling on the past.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In the interview Garman stressed the importance of inference, which is leveraging the knowledge of the AI to generate insights or perform tasks, as the true killer app of generative AI. </span>
<ul>
<li style="font-weight:400;"><i><span style="font-weight:400;">“All the money and effort that people are spending on building these large training models don’t make sense if there isn’t a huge amount of inference on the backend to build interesting things</span></i><span style="font-weight:400;">,” Garman notes. He sees inference not just as a function but as an integral building block that will be embedded in every application.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">“</span><i><span style="font-weight:400;">Inference is where the real value of AI is realized</span></i><span style="font-weight:400;">,” Garman adds, signaling that AWS is not just participating in the AI revolution but is engineering the very infrastructure that will define its future.</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Garman believes Generative AI could unlock new dimensions for AS, enabling it to maintain its dominance while expanding into new areas of growth. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Garman views developers and startups as the lifeblood of AWS.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">AWS is not just a cloud provider; it’s an enabler of innovation at all levels, from the smallest startups to the largest enterprises. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Garmin isn’t just investing in silicone with Trainium and Inferentia chips, but in the whole ecosystem by betting on open, scalable technologies. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Their investments in ethernet networking, for example, has allowed them to outperform traditional Infiniband networks in terms of scalability and reliability. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Garman is confident that AWS is up to the task in AI and cloud and continues to innovate.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">AWS offers not just the best technology, but a partnership that is focused on helping customers succeed. </span></li>
</ul>
<p><i><span style="font-weight:400;">21:15  Justin – “Well, I feel like we’re reaching the point when AI has already been shoved in at the low hanging fruit for things. We were like, cool. You know, EBS is AI. Cool. That doesn’t really help me. And I don’t really care about it. I feel like now you’re starting to hit those higher level services. You’ve done the building blocks and now hopefully they can start to piece things together to be useful AI versus just everyone raising their hands and say, I have AI and things, you know, and I think that’s what’s going to be interesting a to those higher level services the same way they’ve done with S3 &amp; EC2?”</span></i></p>
<p><b>23:55 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/08/amazon-ec2-status-checks-reachability-health-ebs-volume/" target="_blank" rel="noreferrer noopener"><b>Amazon EC2 status checks now support the reachability health of attached </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2024/08/amazon-ec2-status-checks-reachability-health-ebs-volume/" target="_blank" rel="noreferrer noopener"><b>EBS volumes</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">You can now leverage EC2 status checks to directly monitor if the EBS volumes attached to your instance are reachable and able to complete I/O operations. You can use the new status check to quickly detect attachment issues or volume impairments that may impact the scaling of your apps running on Ec2. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can further integrate these status checks with auto-scaling groups to monitor the health of Ec2 instances and replace impacted instances to ensure high availability and reliability of your applications.  Attached EBS status checks can be used along with the instance status and system status checks to monitor the health of your instances. </span></li>
</ul>
<p><i><span style="font-weight:400;">24:37  Justin – “And this one’s like, I get it. It’s nice that this is there. It seems straightforward that you’d want to know that your EBS volume is attached. But really the reason why people typically don’t like an EBS volume is because of its performance, not because of its attachment status. So they do their own set of custom checks typically on the EBS volume to make sure it’s getting the expected IO throughput, which I do not believe is part of this particular status check.”</span></i></p>
<p><b>29:16 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/08/organizational-units-aws-control-tower-1000-accounts/" target="_blank" rel="noreferrer noopener"><b>Organizational Units in AWS Control Tower can now contain up to 1,000 </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2024/08/organizational-units-aws-control-tower-1000-accounts/" target="_blank" rel="noreferrer noopener"><b>accounts</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS Control Tower can now support OU’s with 1,000 accounts. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can now implement governance best practices and standardize configurations across the accounts in your OU at greater scale. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">When registering an OU or enabling the AWS control tower baseline on an OU, member accounts receive best practice configurations, controls, and baseline resources such as AWS IAM roles, AWS CloudTrail, AWS Config, AWS Identity Center. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Previously you could only register OU’s with 300 or less accounts, so this is a 3x increase. </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">30:07  Justin – “Every time I see things that support this number of accounts, I’m like, okay, it’s great. When everybody wants to say the base costs for there is a base cost for an AWS account by the time you implement. That trail and guard duty and config and all those, and you have to enable some of those services here. And I’m like, okay, the base costs are just writing. Those are going to be a lot, but then again, if you have a thousand accounts, you probably don’t care about a single, a couple hundred dollars.”</span></i></p>
<h2><span style="font-weight:400;">GCP</span></h2>
<p><b>31:33 </b><a href="https://cloud.google.com/blog/products/data-analytics/gemini-in-bigquery-features-are-now-ga/" target="_blank" rel="noreferrer noopener"><b>Get started with the new generally available features of Gemini in BigQuery</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Several </span><a href="https://cloud.google.com/gemini/docs/bigquery/overview" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">BigQuery Gemini</span></a><span style="font-weight:400;"> features are now generally available:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">SQL Code Generation and explanation</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Python code generation</span></li>
<li style="font-weight:400;"><a href="https://cloud.google.com/bigquery/docs/data-canvas" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Data Canvas</span></a></li>
<li style="font-weight:400;"><a href="https://cloud.google.com/dataplex/docs/data-insights" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Data Insights</span></a><span style="font-weight:400;"> and Partitioning</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Cluster Recommendations</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Data insights starts with data discovery and assessing which insights you can get from our data assets.  </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Imagine having a library of insightful questions tailored specifically to your data questions you didn’t even know how you should ask. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Data Insights eliminates the guesswork with pre-validated, ready-to-run queries offering immediate insights.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">For instance, if you are working with a table containing customer churn data, Data Insights might prompt you to explore the factors contributing to churn within specific customer segments. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Gemini for BigQuery now helps you </span><a href="https://cloud.google.com/bigquery/docs/write-sql-gemini" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">write and modify SQL or Python code</span></a><span style="font-weight:400;"> using straightforward natural language prompts, referencing relevant schemas and metadata.  </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">This helps reduce errors and inconsistencies in your code while empowering users to craft complex, accurate queries, even if they have limited coding experience. </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">32:44  Ryan – “Yeah, I mean, that’s the cool thing about BigQuery and Gemini is that they’ve just built it right into the console.”</span></i></p>
<p><b>34:07 </b><a href="https://blog.google/products/gemini/google-gemini-update-august-2024/" target="_blank" rel="noreferrer noopener"><b>New in Gemini: Custom Gems &amp; improved image generation with Imagen 3</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is rolling Gems, first previewed at </span><a href="https://blog.google/products/gemini/google-gemini-update-may-2024/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Google I/O</span></a><span style="font-weight:400;">.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Gems is a new feature that lets you customize Gemini to create your own personal AI experts on any topic you want. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">They are now available for </span><a href="http://gemini.google.com/advanced" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Gemini Advanced</span></a><span style="font-weight:400;">, Business and Enterprise users. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Their new image generation model, </span><a href="https://deepmind.google/technologies/imagen-3/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Imagen 3</span></a><span style="font-weight:400;">, will be rolling out across Gemini, Gemini Advanced, Business and Enterprise in the coming days. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Gems allow you to create a team of experts to help you think through a challenging project, brainstorm ideas for an upcoming event or write the perfect caption for a social media post. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Some of the premade gems available for you:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Learning Coach</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Brainstormer</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Career Guide</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Writing Editor</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Coding Partner</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Imagen 3 sets a new high watermark for image quality. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Gems have built-in safeguards and adherence to product design </span><a href="https://gemini.google/our-approach/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">principles</span></a><span style="font-weight:400;">, across a wide range of benchmarks, Imagen 3 performs favorably compared to other image generation models available. </span></li>
</ul>
<p><i><span style="font-weight:400;">35:00  Matthew – “Yeah, it’s kind of cool. I was wondering if I could get all of those pre -made gems at the same time. Like I’m going to do a brainstorming session with a career coach and the coding partner and the brainstormer. then like the career guides, like you should really think about getting a new job. I like to use SQL server on Kubernetes and it’s like, yeah, I think you should update your resume. That’s what that should see.”</span></i></p>
<p><b>39:11 </b><a href="https://cloud.google.com/blog/products/compute/introducing-compute-engine-instant-snapshots/" target="_blank" rel="noreferrer noopener"><b>Instant snapshots: protect Compute Engine workloads from errors and </b></a><a href="https://cloud.google.com/blog/products/compute/introducing-compute-engine-instant-snapshots/" target="_blank" rel="noreferrer noopener"><b>corruption</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is introducing </span><a href="https://cloud.google.com/compute/docs/disks/instant-snapshots" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">instant snapshots</span></a><span style="font-weight:400;"> for Compute Engine, which provides near instantaneous, high-frequency, point in time checkpoints of a disk that can be rapidly restored as needed. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Instant snapshots have a RPO of seconds and a RTO in the tens of seconds.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google cloud is the only hyperscale to provide high-performance checkpointing that allows you to recover in seconds. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Common use cases for this feature include:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Enabling rapid recovery from user error, application software failures, and file system corruptions</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Backup verification workflows, such as for database workloads, that create periodic snapshots and immediately restore them to run data consistency checks.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Taking restore points before an application upgrade to enable rollback in the event that maintenance fails. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Improving developer productivity.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Verify state before backups</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Increase backup frequencies</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Some additional benefits over traditional snapshots:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">In Place backups at the zonal or regional disk level</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Fast and incremental</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Fast restore</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Convertible to backup or archive (second point of presence for long term, geo redundant storage)</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">I supposed this could save you in a crowdstrike even too…..</span></li>
</ul>
<p><i><span style="font-weight:400;">40:22  Justin – “Ryan, I’d like you to get this set up on all of our operating system drives for CrowdStrike as soon as possible.”</span></i></p>
<p><b>44:29 </b><a href="https://cloud.google.com/blog/products/databases/announcing-memorystore-for-valkey/" target="_blank" rel="noreferrer noopener"><b>Google Cloud launches Memorystore for Valkey, a 100% open-source </b></a><a href="https://cloud.google.com/blog/products/databases/announcing-memorystore-for-valkey/" target="_blank" rel="noreferrer noopener"><b>key-value service</b></a> <span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">The Memorystore team is announcing the preview of </span><a href="https://cloud.google.com/memorystore/docs/valkey/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Valkey 7.2 support.</span></a><span style="font-weight:400;"> </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Memorystore for ValKey joins </span><a href="https://cloud.google.com/memorystore/docs/cluster/memorystore-for-redis-cluster-overview" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Memorystore for Redis Cluster</span></a><span style="font-weight:400;"> and </span><a href="https://cloud.google.com/memorystore/docs/redis/memorystore-for-redis-overview" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Memory store for Redis</span></a><span style="font-weight:400;"> as a direct response to customer demand, and is a game-changer for organizations seeking high-performance data management solutions relying on 100% open source software.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Maybe soon Redis can be open source again too (but we won’t hold our breath.)</span></li>
</ul>
<p><i><span style="font-weight:400;">45:12  Justin – “I haven’t heard much about Valkey since they forked. I assume people are adopting it, but I didn’t hear much about Open Tofu for quite a while. Then everyone started talking about Open Tofu, so I assume it’s one of those things. As the cloud providers get support for it, I do think Valkey was already supported on AWS ElastiCache, and I think Microsoft was supporting it earlier as well. So I think Google is kind of late to the party on supporting Valkey, but we’ll see.”</span></i></p>
<p><b>45:46 </b><a href="https://cloud.google.com/blog/products/compute/a-simpler-way-to-plan-and-manage-block-storage-performance/" target="_blank" rel="noreferrer noopener"><b>A radically simpler way to plan and manage block storage performance</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Earlier this year, Google announced the GA of Hyperdisk storage pools with advanced capacity, that helps you simplify management and lower the TCO of your block storage capacity. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Today, we are bringing that same innovation to block storage performance through hyperdisk storage pools with advanced performance. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can now provision IOPS and throughput in aggregate which hyperdisk storage pools will dynamically allocate as your app read and write data, allowing you to increase resource utilization and radically simplify performance planning and management. </span></li>
</ul>
<p><i><span style="font-weight:400;">46:18  Justin – “I mean, it’s just basically taking a pool of IOPS and you’re allocating it to different disks dynamically through ML or AI, similar to what you’re doing for the capacity of your disk. It makes it nice, I appreciate it. I don’t know that I use it, but I like that it’s there.”</span></i></p>
<h2><span style="font-weight:400;">Azure</span></h2>
<p><b>47:07 </b><a href="https://techcommunity.microsoft.com/t5/azure-infrastructure-blog/inside-maia-100-revolutionizing-ai-workloads-with-microsoft-s/ba-p/4229118" target="_blank" rel="noreferrer noopener"><b>Inside Maia 100: Revolutionizing AI Workloads with Microsoft’s Custom AI </b></a><a href="https://techcommunity.microsoft.com/t5/azure-infrastructure-blog/inside-maia-100-revolutionizing-ai-workloads-with-microsoft-s/ba-p/4229118" target="_blank" rel="noreferrer noopener"><b>Accelerator</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">At </span><a href="https://www.hotchips.org/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Hotchips 2024</span></a><span style="font-weight:400;">, Microsoft initially shared some specs on Maia 100, Microsoft’s first-gen custom AI accelerator designed specifically for large scale AI workloads deployed in Azure. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Maia 100 accelerator is purpose built for a wide range of cloud based AI workloads, and utilizes TSMC’s N5 process with COWOS-S interpose technology. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Equipped with large on-die SRAM, Maia 100’s reticle-size SOC die, combined with four HBM2e die, provide a total of 1.8 TB per second of bandwidth and 64gb of capacity to accommodate AI scale data handling requirements. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The chip architecture includes a high-speed tensor unit for training and inference, while supporting a wide range of data types, including low precision data types such as the MX data format. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Vector processor is a loosely coupled superscalar engine built with custom instruction set architecture (ISA) to support a wide range of data types, including F32 and BF16.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">A direct memory access engine supports different tensor sharding schemes. And a Hardware semaphore enables asynchronous programming on the MAIA systems. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Maia 100 supports up to 4800 gbps all gather and scatter reduced bandwidth, and 1200 gbps all to all bandwidth. </span></li>
</ul>
<p><i><span style="font-weight:400;">49:05  Ryan – “I’m just, not sure whether or not like I’m just too far gone into the managed services part where I don’t really want this level of detail anymore. Like just, do the thing I’m paying to do the thing and all the type of processor with this type of chip and you know, these types of things are irrelevant, but also like maybe, maybe in that space, if you’re deep in it, you need that performance. It’s really hard to say.”</span></i></p>
<p><b>50:29 </b><a href="https://techcommunity.microsoft.com/t5/azure-sql-blog/introducing-simplified-subscription-limits-for-sql-database-and/ba-p/4219102" target="_blank" rel="noreferrer noopener"><b>Introducing Simplified Subscription Limits for SQL Database and Synapse </b></a><a href="https://techcommunity.microsoft.com/t5/azure-sql-blog/introducing-simplified-subscription-limits-for-sql-database-and/ba-p/4219102" target="_blank" rel="noreferrer noopener"><b>Analytics Dedicated SQL Pool</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Azure is introducing new and simplified subscription limits for Azure SQL Database and Azure Synapse analytics dedicated SQL Pool (Formerly SQL DW). </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">What’s changing:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">New vCore based limits, which will be directly equivalent to DTU and DWU</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Default logical server limits</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Configurable vCore limits</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">New Portal Experience</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">All subscriptions will have a default limit of 250 logical servers. </span></li>
</ul>
<p><i><span style="font-weight:400;">51:23  Matthew – “They went from one metric, which was their original metric of a weird combination of memory and CPU and maximum storage allocation to the newer one. Which is supposed to simplify it.”</span></i></p>
<p><b>54:21 </b><a href="https://azure.microsoft.com/en-us/blog/check-out-whats-new-in-azure-vmware-solution/" target="_blank" rel="noreferrer noopener"><b>Check out what’s new in Azure VMware Solution</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Azure is pleased to announce several enhancements to their </span><a href="https://azure.microsoft.com/en-us/products/azure-vmware" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">VMWare solution for Azure</span></a><span style="font-weight:400;">:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Azure VMWare solution is now in 33 </span><a href="https://azure.microsoft.com/en-us/explore/global-infrastructure/products-by-region/?products=azure-vmware&amp;rar=true&amp;regions=all" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">regions</span></a></li>
<li style="font-weight:400;"><span style="font-weight:400;">Azure VMware solution has been added to the DoD SRG impact level 4 provisional authorization in </span><a href="https://learn.microsoft.com/en-us/azure/azure-government/compliance/azure-services-in-fedramp-auditscope#azure-government-services-by-audit-scope" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Azure Governmen</span></a><span style="font-weight:400;">t.</span></li>
<li style="font-weight:400;"><a href="https://www.netapp.com/newsroom/press-releases/news-rel-20240828-201926/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Expanded support for FCF with Netapp and VMware</span></a><span style="font-weight:400;"> being able to simplify their FCF hybrid environment by leveraging Netapp Ontap software. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can now leverage Spot Eco by Netapp with your Vsphere VM’s in the cloud.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Collaborations with Jetrsteam enhance DR and Ransomware protection. Jetrsteam delivers advanced DR that offers near zero RPO and Instant RTO. </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">55:04  Matthew – “Can I translate this? How to burn a;; your capital and piss off your CFO in 15 minutes or less.”</span></i></p>
<h2><span style="font-weight:400;">Cloud Journey Series</span></h2>
<p><b>55:52 </b><a href="https://seroter.com/2024/08/27/4-ways-to-pay-down-tech-debt-by-ruthlessly-removing-stuff-from-your-architecture/" target="_blank" rel="noreferrer noopener"><b>4 ways to pay down tech debt by ruthlessly removing stuff from your </b></a><a href="https://seroter.com/2024/08/27/4-ways-to-pay-down-tech-debt-by-ruthlessly-removing-stuff-from-your-architecture/" target="_blank" rel="noreferrer noopener"><b>architecture</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Richard Seroter from google had a great blog post about paying down tech debt by ruthlessly removing stuff from your architecture. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">We thought we’d pass some of these along to my co hosts to get their take on Richard’s advice. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">He starts out covering debt and really architectural debt from carrying 8 products that do the same thing in every category, Brittle automation that only partially works or still requires manual workarounds and black magic. Unique customizations to package software that prevents upgrades to modern versions. Or half-finished “ivory tower” designs where the complex distributed system isn’t fully in place and may never be.  Too much coupling, too little coupling, unsupported frameworks and on and on. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">To help eliminate some of this debt, he breaks it down into 4 ways. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">#1 Stop moving so much data around</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">How many components do you have that get data from point A to point B.  How many ETL pipelines to consolidate or hydrate data, messaging and event processing solutions to send this data around or even API calls that suck data from system A to system b. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Can you dump some of this?  Here are some of examples to help you</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Perform analytics queries against data sitting in different places by leveraging BigQuery omni, query your data that runs in AWS, Azure or GCP and stop consolidating it to a single data lake. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Enrich your data from outside the database. You might have ETL jobs in place to bring reference data into your data warehouse to supplement whats is already there, but with things like BigQuery federated queries, you can reach live into PostgreSQL, Mysql, Spanner and Even SAP Datasphere</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Perform complex SQL analytics against log data instead of copying and sending logs to online systems. </span></li>
</ul>
</li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">58:39  Justin- “I was thinking about this is a great pitch for Google because I don’t think I could do this on AWS because all the data storage is separate for every product because of their isolation model. Where on GCP I can do these things because they have one data layer.”</span></i></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">#2 Compress the stack by removing duplicative components</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Break out the chainsaw, time to kill duplicated products.  Or too many best-of breeds.  A rule of thumb from Richard‘s colleague Josh McKenty “if it’s emerging, buy a few; if it’s mature, no more than two.”</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You don’t need multiple database platforms or project management solutions. Or leverage multi-purpose services and embrace “good enough.”</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Do you have multiple databases? Maybe you should wait 15 days before you buy a specialized vector database. You can use Postgres or any number of existing databases that now support vectors. You can also have multiple messaging buses and stream processors, consolidate to Pub-Sub, etc. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">(underneath this one is really just use managed services)</span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">1:00:08  Ryan – “I’m sort of like… the trick of this is the replacing it, right? This is still identification of tech debt. I actually don’t know if that’s really the problem to be solved. I think the problem is like, how do you prioritize and change these? And I thought that, you know, the article, it sort of references offhand, but you know, the reality is you have to be constantly making changes.”</span></i></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">#3 Replace hyper-customized software and automation with managed services and vanilla infrastructure. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">You are not Google or that unique. Your company likely does a few things that are “secret sauce,” but the rest is identical. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Fit the team to the software, not the other way around. This customization leads to lock-in, and you get stuck in upgrade purgatory. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">No one gets rewarded for their super highly customized K8 cluster. Use GKE autopilot, pay per pod, or find some other way to not have to manage something highly customized to your org. </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">1:03:23  Matthew – “Yeah; most of the time you don’t need that extra performance that you’re squeezing out of it, but adding complexity – and honestly, most likely the cause of many underlying outages whether you want to believe it or not.”</span></i></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">#4 Tone it down on microservices and distributed systems</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">People have gone overkill on microservices. You don’t need dozens of serverless functions to serve a static web app or a big, complex Javascript framework for two pages. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Tech debt often comes from overengineering the system when you’d be better off smashing it back into an “app” hosted in a cloud run. There would be fewer moving parts and all the agility you want. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">He doesn’t advocate going full DHH, but most folks would be better off defaulting to more monolith systems running on a server or two. </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">1:04:54  Ryan – “ It’s a common fallacy that you want to develop everything as a microservice so that you can manage them and update them separately. But really, if you only have a single customer of that API or your microservice, it shouldn’t be separate. And so it’s really about understanding the contracts and ins and outs and who needs to use the service.”</span></i></p>
<h2><span style="font-weight:400;">Closing</span></h2>
<p><span style="font-weight:400;">And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</span></p>
]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/1833347/c1e-jkjkuqkvdjcp3nr6-ok4o2zx2inz6-58x1bp.mp3" length="81626822"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 274 of The Cloud Pod, where the forecast is always cloudy! Justin, Ryan and Matthew are your hosts this week as we explore the world of SnapShots, Maia, Open Source, and VMware – just to name a few of the topics. And stay tuned for an installment of our continuing Cloud Journey Series to explore ways to decrease tech debt, all this week on The Cloud Pod.  
Titles we almost went with this week:

The Cloud Pod in Parallel Cluster
The Cloud Pod cringes at managing 1000 aws accounts
The Cloud Pod welcomes Imagen 3 with less Wokeness
️The Cloud Pod wants to be instantly snapshotted
The Cloud pod hates tech debt

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
General News
00:32 Elasticsearch is Open Source, Again



Shay Banon is pleased to call ElasticSearch and Kibana “open source” again.  He says everyone at Elastic is ecstatic to be open source again, it’s part of his and “Elastics DNA.” 
They’re doing this by adding AGPL as another license option next to ELv2 and SSPL in the coming weeks. 
They never stopped believing or behaving like an OSS company after they changed the license, but by being able to use the term open source and by using AGPL – an OSI approved license – removes any questions or fud people might have. 
Shay says the change 3 years ago was because they had issues with AWS and the market confusion their offering was causing. 

So, after trying all the other options, changing the license – all while knowing it would result in a fork with a different name – was the path they took. 


While it was painful, they said it worked. 

3 years later, Amazon is fully invested in their OpenSearch fork, the market confusion has mostly gone, and their partnership with AWS is stronger than ever.
They are even being named partner of the year with AWS. 


They want to “make life of our users as simple as possible,” so if you’re ok with the ELv2 or the SSPL, then you can keep using that license. They aren’t removing anything, just giving you another option with AGPL.]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/1833347/c1a-k5d5-8dr896k1bpro-hgtzb8.jpg"></itunes:image>
                                                                            <itunes:duration>01:08:02</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                            </item>
                    <item>
                <title>
                    <![CDATA[273: Phi-fi-fo-fum, I Smell the Bones of The Cloud Pod Hosts]]>
                </title>
                <pubDate>Wed, 04 Sep 2024 10:39:08 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/1828808</guid>
                                    <link>https://tcpfm.castos.com/episodes/273-phi-fi-fo-fum</link>
                                <description>
                                            <![CDATA[<p><span style="font-weight:400;">Welcome to episode 273 of The Cloud Pod, where the forecast is always cloudy! Hold onto your butts – this week your hosts Justin, Ryan, Matthew and (eventually) Jonathan are bringing you two weeks worth of cloud and AI news. We’ve got Karpenter, Kubernetes, and Secrets, plus news from OpenAI, MFA changes that are going to be super fun for Matthew, and Azure Phi. Get comfy – it’s going to be a doozy!</span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">The Cloud Pod Teaches Azure-normalized Camel Casing</span></li>
<li><span style="font-weight:400;">The Cloud Pod Travels to Malaysia</span></li>
<li><span style="font-weight:400;">⚖️Azure Detaches Itself From its Own Scale Sets</span></li>
<li><span style="font-weight:400;">✍️The Cloud Pod Conditionally Writes Show Notes </span></li>
<li><span style="font-weight:400;">You got MFA!</span></li>
<li><span style="font-weight:400;">⛔The Cloud Pod Delays Deleting Itself</span></li>
<li><span style="font-weight:400;">The Cloud Pod is Now the Cloud Pod Podcast!</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><span style="font-weight:400;">General News</span></h2>
<p><b>01:37 </b><a href="https://www.hashicorp.com/blog/terraform-azurerm-provider-4-0-adds-provider-defined-functions" target="_blank" rel="noreferrer noopener"><b>Terraform AzureRM provider 4.0 adds provider-defined functions</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Terraform is announcing the GA of </span><a href="https://registry.terraform.io/providers/hashicorp/azurerm/latest" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Terraform AzureRM provider 4.0</span></a><span style="font-weight:400;">.  The new version improves the extensibility and flexibility in the provider. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Since the Providers’ Last major release in March 2022, Hashi has added support for some 340 resources and 120 data sources, bringing the total Azure resources to 1,101 resources and almost 360 data sources. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The provider has topped 660M downloads, MS and Hashi continue to develop new, innovative integrations that further ease the cloud adoption journey to enterprise organizations. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With </span><a href="https://www.hashicorp.com/blog/terraform-1-8-improves-extensibility-with-provider-defined-functions" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Terraform 1.8</span></a><span style="font-weight:400;">, providers can implement custom functions that you can call from the Terraform configuration. The new provider adds two Azure-specific provider functions to let users correct the casing of their resource IDs or access the individual components of it. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Previously, the Azure RM provider took an all-or-nothing approach to Azure resource provider registration, where the Terraform provider would either attempt to register a fixed set of 68 providers upon initialization or registration or be skipped. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This didn’t match Microsoft’s recommendations, which are to register resource providers only as needed, and to enable the services you’re actively using. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With adding two new feature flags, </span><a href="https://registry.terraform.io/provi..."></a></li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 273 of The Cloud Pod, where the forecast is always cloudy! Hold onto your butts – this week your hosts Justin, Ryan, Matthew and (eventually) Jonathan are bringing you two weeks worth of cloud and AI news. We’ve got Karpenter, Kubernetes, and Secrets, plus news from OpenAI, MFA changes that are going to be super fun for Matthew, and Azure Phi. Get comfy – it’s going to be a doozy!
Titles we almost went with this week:

The Cloud Pod Teaches Azure-normalized Camel Casing
The Cloud Pod Travels to Malaysia
⚖️Azure Detaches Itself From its Own Scale Sets
✍️The Cloud Pod Conditionally Writes Show Notes 
You got MFA!
⛔The Cloud Pod Delays Deleting Itself
The Cloud Pod is Now the Cloud Pod Podcast!

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
General News
01:37 Terraform AzureRM provider 4.0 adds provider-defined functions 

Terraform is announcing the GA of Terraform AzureRM provider 4.0.  The new version improves the extensibility and flexibility in the provider. 
Since the Providers’ Last major release in March 2022, Hashi has added support for some 340 resources and 120 data sources, bringing the total Azure resources to 1,101 resources and almost 360 data sources. 
The provider has topped 660M downloads, MS and Hashi continue to develop new, innovative integrations that further ease the cloud adoption journey to enterprise organizations. 
With Terraform 1.8, providers can implement custom functions that you can call from the Terraform configuration. The new provider adds two Azure-specific provider functions to let users correct the casing of their resource IDs or access the individual components of it. 
Previously, the Azure RM provider took an all-or-nothing approach to Azure resource provider registration, where the Terraform provider would either attempt to register a fixed set of 68 providers upon initialization or registration or be skipped. 
This didn’t match Microsoft’s recommendations, which are to register resource providers only as needed, and to enable the services you’re actively using. 
With adding two new feature flags, ]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[273: Phi-fi-fo-fum, I Smell the Bones of The Cloud Pod Hosts]]>
                </itunes:title>
                                    <itunes:episode>273</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p><span style="font-weight:400;">Welcome to episode 273 of The Cloud Pod, where the forecast is always cloudy! Hold onto your butts – this week your hosts Justin, Ryan, Matthew and (eventually) Jonathan are bringing you two weeks worth of cloud and AI news. We’ve got Karpenter, Kubernetes, and Secrets, plus news from OpenAI, MFA changes that are going to be super fun for Matthew, and Azure Phi. Get comfy – it’s going to be a doozy!</span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">The Cloud Pod Teaches Azure-normalized Camel Casing</span></li>
<li><span style="font-weight:400;">The Cloud Pod Travels to Malaysia</span></li>
<li><span style="font-weight:400;">⚖️Azure Detaches Itself From its Own Scale Sets</span></li>
<li><span style="font-weight:400;">✍️The Cloud Pod Conditionally Writes Show Notes </span></li>
<li><span style="font-weight:400;">You got MFA!</span></li>
<li><span style="font-weight:400;">⛔The Cloud Pod Delays Deleting Itself</span></li>
<li><span style="font-weight:400;">The Cloud Pod is Now the Cloud Pod Podcast!</span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><span style="font-weight:400;">General News</span></h2>
<p><b>01:37 </b><a href="https://www.hashicorp.com/blog/terraform-azurerm-provider-4-0-adds-provider-defined-functions" target="_blank" rel="noreferrer noopener"><b>Terraform AzureRM provider 4.0 adds provider-defined functions</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Terraform is announcing the GA of </span><a href="https://registry.terraform.io/providers/hashicorp/azurerm/latest" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Terraform AzureRM provider 4.0</span></a><span style="font-weight:400;">.  The new version improves the extensibility and flexibility in the provider. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Since the Providers’ Last major release in March 2022, Hashi has added support for some 340 resources and 120 data sources, bringing the total Azure resources to 1,101 resources and almost 360 data sources. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The provider has topped 660M downloads, MS and Hashi continue to develop new, innovative integrations that further ease the cloud adoption journey to enterprise organizations. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With </span><a href="https://www.hashicorp.com/blog/terraform-1-8-improves-extensibility-with-provider-defined-functions" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Terraform 1.8</span></a><span style="font-weight:400;">, providers can implement custom functions that you can call from the Terraform configuration. The new provider adds two Azure-specific provider functions to let users correct the casing of their resource IDs or access the individual components of it. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Previously, the Azure RM provider took an all-or-nothing approach to Azure resource provider registration, where the Terraform provider would either attempt to register a fixed set of 68 providers upon initialization or registration or be skipped. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This didn’t match Microsoft’s recommendations, which are to register resource providers only as needed, and to enable the services you’re actively using. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With adding two new feature flags, </span><a href="https://registry.terraform.io/providers/hashicorp/azurerm/latest/docs/guides/4.0-upgrade-guide#resource_provider_registrations" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">resource_provider_registrations</span></a><span style="font-weight:400;"> and </span><a href="https://registry.terraform.io/providers/hashicorp/azurerm/latest/docs/guides/4.0-upgrade-guide#resource_providers_to_register" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">resource_providers_to_register</span></a><span style="font-weight:400;">, users now have more control over which providers to register automatically or whether to continue managing a subscription resources provider. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">AzureRM has removed a number of deprecated items, and it is recommended that you look at the removed resources/data sources and the </span><a href="https://registry.terraform.io/providers/hashicorp/azurerm/latest/docs/guides/4.0-upgrade-guide#behaviour-changes-and-removed-properties-in-resources" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">4.0 upgrade guide</span></a><span style="font-weight:400;">.</span></li>
</ul>
<p><i><span style="font-weight:400;">03:50  Justin – “Okay, so it doesn’t have anything really to do with Terraform. It has to do with Azure and enabling and disabling resource types that they can monkey with, basically, with configuration code.”</span></i></p>
<p><b>06:12</b> <a href="https://www.nextplatform.com/2024/08/22/rackspace-goes-all-in-again-on-openstack/" target="_blank" rel="noreferrer noopener"><b>Rackspace Goes All In – Again – On OpenStack</b></a><span style="font-weight:400;"> </span></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Rackspace hasn’t been very vocal about </span><a href="https://www.nextplatform.com/2019/04/18/openstack-follows-the-datacenter-out-to-the-edge/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OpenStack</span></a><span style="font-weight:400;"> – which they launched in 2010 – out of a collaboration between NASA and Rackspace. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Rackspace didn’t turn their back per say, contributing over 5.6M lines of code to it, and it is one of the largest OpenStack cloud providers. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In recent years, however, they have withdrawn to some extent from commitments to OpenStack</span><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Recently they reaffirmed their commitment, with the launch of OpenStack Enterprise, a fully managed cloud offering aimed at critical workloads that run at scale and that brings enhanced security and efficiency. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The only thing we can think is… you wanted to make an alternative to VMWare. Got it. Good luck. </span></li>
</ul>
<p><i><span style="font-weight:400;">07:35  Ryan – “I think there should be something like OpenStack for, you know, being able to run your own hardware and, know, still get a lot of the benefits of compute in a cloud ecosystem, hardware that you control and ecosystems that maybe you don’t want being managed by a third party vendor. So happy to see OpenStack continue to gain support even though I haven’t touched it in years.”</span></i></p>
<h2><span style="font-weight:400;">AWS</span></h2>
<p><b>08:39 </b><a href="https://aws.amazon.com/blogs/containers/announcing-karpenter-1-0/" target="_blank" rel="noreferrer noopener"><b>Announcing Karpenter 1.0</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Karpenter is an open source K8 cluster autoscaling project, created by AWS. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The project has been adopted for mission-critical use cases by industry leaders.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">It’s been adding key features over the years, like </span><a href="https://aws.amazon.com/about-aws/whats-new/2022/08/workload-consolidation-karpenter/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">workload consolidation</span></a><span style="font-weight:400;">, </span><a href="https://aws.amazon.com/about-aws/whats-new/2024/02/disruption-controls-karpenter/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">disruption controls</span></a><span style="font-weight:400;"> and more. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Now it has reached 1.0, and is no longer considered beta by AWS.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This new release includes the Stable Karpenter API’s </span><span style="font-weight:400;">NodePool </span><span style="font-weight:400;">and</span><span style="font-weight:400;"> EC2NodeClass.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">As part of this release, the custom resource definition (CRD) API groups and kind name remain unchanged.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">AWS has also created conversion webhooks to make migrating from beta to stable more seamless.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Karpenter </span><a href="https://github.com/kubernetes-sigs/karpenter/releases/tag/v1.0.0" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">V1</span></a><span style="font-weight:400;"> adds support for disruption budgets by reason. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The supported reasons are Underutilized, Empty and Drifted. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">This will enable the user to have finer-grained control of the disruption budgets that apply to specific disruption reasons. </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">09:28  Ryan – “See, this is how I know Kubernetes is too complex. I feel like every other week there’s some sort of announcement of some other project that controls like the allocation of resources or the scaling of resources or the something something of pods. And I’m just like, okay, cool.”</span></i></p>
<p><b>11:26  </b><a href="https://aws.amazon.com/blogs/aws/add-macos-to-your-continuous-integration-pipelines-with-aws-codebuild/" target="_blank" rel="noreferrer noopener"><b>Add macOS to your continuous integration pipelines with AWS CodeBuild</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">What took you so long? </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Now, you can build applications on </span><a href="https://en.wikipedia.org/wiki/MacOS" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">MacOS</span></a><span style="font-weight:400;"> with </span><a href="https://aws.amazon.com/codebuild/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS CodeBuild</span></a><span style="font-weight:400;">.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can build artifacts on managed Apple M2 machines that run </span><a href="https://www.apple.com/macos/sonoma/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">MacOS 14 Sonoma</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">AWS Codebuild is a fully managed continuous integration service that compiles source code, runs tests, and produces ready-to-deploy software packages. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">CodeBuild for MacOS is based on a recently introduced reserved capacity fleet containing instances powered by </span><a href="https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/ec2-mac-instances.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Ec2</span></a><span style="font-weight:400;"> but maintained by CodeBuild.  </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">With reserved capacity fleets, you configure a set of dedicated instances for your build environment. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">These machines remain idle, ready to process builds or tests immediately, which reduces build durations. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Codebuild provides a standard disk image to your build. It contains pre-installed versions of Xcode, </span><a href="https://fastlane.tools/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Fastlane</span></a><span style="font-weight:400;">, </span><a href="https://www.ruby-lang.org/en/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Ruby</span></a><span style="font-weight:400;">, Python and Nodej, as well as codebuild manages autoscaling of the fleet. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">CodeBuild for macOS works with reserved fleets. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Contrary to on-demand fleets, where you pay per minute of build, </span><a href="https://docs.aws.amazon.com/codebuild/latest/userguide/fleets.html" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">reserved fleets</span></a><span style="font-weight:400;"> are charged for the time the build machines are reserved for your exclusive usage, even when no builds are running. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The capacity reservation follows the Amazon EC2 Mac 24-hour minimum allocation period, as required by the </span><a href="https://www.apple.com/legal/sla/docs/macOSSonoma.pdf" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Software License Agreement for macOS</span></a><span style="font-weight:400;"> (article 3.A.ii).</span></li>
</ul>
<p><i><span style="font-weight:400;">09:28  Justin- “You’re not spin up, so the key thing is that you don’t wanna spin up additional Mac OS’s every time you wanna do this because then you’re paying for every one of those for 24 hours. So because you have a reserved fleet, you’re using the same Mac OS that’s in the fleet and you don’t have to worry about auto scaling it up and down.”</span></i></p>
<p><b>15:00 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/08/amazon-ec2-g6e-instances/" target="_blank" rel="noreferrer noopener"><b>Announcing general availability of Amazon EC2 G6e instances</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS announced the general availability of EC2 G6e instances powered by NVIDIA L40S Tensor Core GPUs. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">G6e instances can be used for a wide range of ML and Spatial computing use cases. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">G6e instances deliver up to 2.5x better performance compared to G5 instances and up to 20% lower inference costs than p4d instances.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Customers can use G6e instances to deploy LLMs with up to 13B parameters and diffusion models for generating images, video and audio. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">G6e instances feature up to 8 NVIDIA L40s Tensor Core GPUs with 384 GB of GPU memory (48GB per GPU) and third generation AMD EPYC processors.  192vCPUs, 400Gbps of network bandwidth, up to 1.536 TB of system memory and up to 7.6 TB of NVMe SSD storage. </span></li>
</ul>
<p><i><span style="font-weight:400;">15:56  Ryan – “My initial reaction was like, got to figure out like a modern workload where I care about these types of specs on these specific servers. And then I remember I provide cloud platforms to the rest of the business and I go, no, this is going to be expensive. How am I going to justify all this… pass.”</span></i></p>
<p><b>16:56 </b><a href="https://aws.amazon.com/blogs/aws/now-open-aws-asia-pacific-malaysia-region/" target="_blank" rel="noreferrer noopener"><b>Now open — AWS Asia Pacific (Malaysia) Region</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">The </span><a href="https://aws.amazon.com/blogs/aws/in-the-works-aws-region-in-malaysia/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">AWS Malaysia region</span></a><span style="font-weight:400;"> with three Availability Zones is now Open, with the API name of ap-southeast-5</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This is the first infra region in Malaysia, and the 13th in Asia Pacific joining Hong Kong, Hyderabad, Jakarta, Melbourne, Mumbai, Osaka, Seoul, Singapore, Sydney, Tokyo and China. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The new AWS region will support the Malaysian Government’s strategic Madani economic framework.</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">The initiative aims to improve the living standards for all Malaysians by 2023 while supporting innovation in Malaysia and across ASEAN. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The new region will add about 12.1 B to Malaysia’s GDP and will support more than 3,500 full-time jobs at external businesses throughout 2038. </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">15:56  Justin – “The forecast models all die at 2038. We didn’t really understand why. We just assumed that’s when the jobs run out. No, no, that’s a different problem.”</span></i></p>
<p><b>19:52 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/08/cloudformation-resource-discovery-template-review-iac-generator/" target="_blank" rel="noreferrer noopener"><b>CloudFormation simplifies resource discovery and template review in the </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2024/08/cloudformation-resource-discovery-template-review-iac-generator/" target="_blank" rel="noreferrer noopener"><b>IaC Generator</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS Cloudformation now includes enhancements to the IaC generator, which customers use to create IaC from existing resources. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Now, after the IaC generator finishes scanning the resources in an account, it presents a graphical summary of the different resource types to help customers find the resources they want to include in their template more quickly. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">After selecting resources, customers can preview their template in AWS application composer, visualizing the entire application architecture with the resources and their relationships. </span></li>
</ul>
<p><i><span style="font-weight:400;">20:20  Ryan- “This is how I do all of my deployment architectures. Now I just deploy everything and then I generate the picture, screenshot that and then document. Ta -da!”</span></i></p>
<p><b>21:19 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/08/amazon-documentdb-mongodb-compatibility-global-clusters-failover/" target="_blank" rel="noreferrer noopener"><b>Amazon DocumentDB (with MongoDB Compatibility) Global Clusters </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2024/08/amazon-documentdb-mongodb-compatibility-global-clusters-failover/" target="_blank" rel="noreferrer noopener"><b>introduces Failover</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">DocumentDB now supports</span><a href="https://docs.aws.amazon.com/documentdb/latest/developerguide/global-clusters.html%5C" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;"> global cluster</span></a><span style="font-weight:400;"> failover, a fully managed experience for performing a cross-region failover to respond to unplanned events such as regional outages.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With Global Cluster Failover, you can convert a secondary region into the new primary region in typically a minute and also maintain the multi-region global cluster configuration. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">An Amazon DocumentDB Global Cluster is a single cluster that can span up to 6 AWS regions, enabling DR from region wide outages and low latency global reads.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Combined with Global Cluster Switchover, you can easily promote a secondary region to primary for both planned and unplanned events.  </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Switchover is a managed failover experience meant for planned events such as regional rotations. </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">22:25  Ryan – “I mean, anytime you can do this type of like a DR and failover at the data layer, I’m, I’m in love with, because it’s so difficult to orchestrate on your own. And so that’s a huge value from using a cloud provider. Like I would like to just click some boxes and make, and it will just work. Awesome.“</span></i></p>
<p><b>22:46 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/08/amazon-s3-conditional-writes/" target="_blank" rel="noreferrer noopener"><b>Amazon S3 now supports conditional writes</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">S3 adds support for conditional writes that check for the existence of an object before creating it. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This allows you to prevent applications from overwriting existing objects when uploading data. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">You can perform conditional writes using putobject or completemultipartupload API requests in both general-purpose and directory buckets. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">This makes it easier for distributed applications with multiple clients concurrently updating data in parallel across shared datasets.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This allows you to no longer write client side consensus mechanisms to coordinate updates or use additional API requests to check for the presence of an object before uploading data. </span></li>
</ul>
<p><i><span style="font-weight:400;">23:28  Justin – “…either you would have to do an API call to verify if the file was there before, which you’re not paying for, and then you can do your write, or you get to do this. And if you have all your apps trying to do this all at the same time, the milliseconds of latency can kill you on this type of thing. So having the ability is very nice.”</span></i></p>
<p><b>25:10 </b><a href="https://aws.amazon.com/about-aws/whats-new/2024/08/aws-lambda-supports-function-level-configuration-recursive-loop-detection/" target="_blank" rel="noreferrer noopener"><b>AWS Lambda now supports function-level configuration for recursive loop </b></a><a href="https://aws.amazon.com/about-aws/whats-new/2024/08/aws-lambda-supports-function-level-configuration-recursive-loop-detection/" target="_blank" rel="noreferrer noopener"><b>detection</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">AWS Lambda now supports function-level configuration which allows you to disable or enable recursive loop detection. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Lambda recursive loop detection, enabled by default, is a preventative guard rail that automatically detects and stops recursive invocations between Lambda and other supported services, preventing runaway workloads. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Customers running intentionally recursive patterns could turn off recursive loop detection on a per account basis through support. Now customers can disable or enable recursive loop detection on a per function basis, allowing them to run their intentionally recursive workflows while protecting the remaining functions in their account from runaway workloads caused by unintended loops. </span></li>
</ul>
<p><i><span style="font-weight:400;">25:44  Justin – “I remember when they first added this several years ago, we were like, this is amazing. Thank God they finally did this. But then I forgot about the support part that you had to reach out to support if you didn’t want your attention to your cursive pattern. And I, if I was going to go down that path, I’d just say, don’t – I’ve done something wrong. But, apparently if I think I’m actually right – which is a problem, I think I’m right all the time – it can now cost myself some money. So do be careful with this feature. It’s a gun that can shoot you in the foot very quickly.”</span></i></p>
<h2><span style="font-weight:400;">GCP</span></h2>
<p><b>27:58 </b><a href="https://cloud.google.com/blog/products/business-intelligence/opening-up-the-looker-semantic-layer/" target="_blank" rel="noreferrer noopener"><b>Looker opens semantic layer via new SQL Interface and connectors for </b></a><a href="https://cloud.google.com/blog/products/business-intelligence/opening-up-the-looker-semantic-layer/" target="_blank" rel="noreferrer noopener"><b>Tableau &amp; others</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google says that Data is the driving force of innovation in business, especially in the world of accelerating AI adoption. But data driven organizations struggle with inconsistent or unreliable metrics. Without a single source of truth for data definitions, metrics can have a different logic depending on what tool or team they come from. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Teams that can’t trust data go back to their gut, a risky strategy. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google designed </span><a href="https://cloud.google.com/looker" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Looker</span></a><span style="font-weight:400;"> with a </span><a href="https://cloud.google.com/blog/products/data-analytics/lookers-universal-semantic-model" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">semantic model</span></a><span style="font-weight:400;"> to let you define metrics once and use them everywhere, for better governance, security and overall trust in your data. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">So to live up to that vision, they are releasing BI connectors, including GA of their </span><a href="https://cloud.google.com/looker/docs/tableau-connector" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">custom-built connector for Tableau</span></a><span style="font-weight:400;">, which will make it easier to use Looker’s metrics layer within the broader ecosystem of SQL based tools, with an integration layer for lookerML models based on BigQuery, plus connectors for popular products. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This integration layer is the </span><a href="https://cloud.google.com/looker/docs/sql-interface" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">OpenSQL Interface</span></a><span style="font-weight:400;"> and gives Looker customers more options for how they deploy governed analytics.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">They are also releasing a general purpose JDBC driver for connecting the interface, and partners including thoughtspot, mode and APOS systems have already integrated their products with Looker’s semantic layer. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The connectors for Looker now include:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google Sheets</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Looker Studio</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Power BI</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Tableau</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Thoughtspot</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Mode</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">APOS Systems</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Custom JDBC</span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">29:48  Ryan- “…these types of connectors and stuff offer great amount of flexibility because these BI tools are so complex that people sort of develop their favorite and don’t want to use another one.”</span></i></p>
<p><b>31:10 </b><a href="https://cloud.google.com/blog/products/compute/c4-machine-series-is-now-ga/" target="_blank" rel="noreferrer noopener"><b>C4 VMs now GA: Unmatched performance and control for your enterprise </b></a><a href="https://cloud.google.com/blog/products/compute/c4-machine-series-is-now-ga/" target="_blank" rel="noreferrer noopener"><b>workloads</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is pleased to release the GA of the </span><a href="https://cloud.google.com/compute/docs/general-purpose-machines#c4_series" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">C4 Machine series</span></a><span style="font-weight:400;">, the most performant general-purpose VM for Compute Engine and GKE customers. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">C4 VM’s are engineered from the ground up and fine-tuned to deliver industry-leading performance, with up to 20% better price-performance for general-purpose workloads, and 45% better price performance for CPU based inference versus comparable GA VMs from other hyperscalers. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Together with the </span><a href="https://cloud.google.com/compute/docs/general-purpose-machines#n4_series" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">N4 machines</span></a><span style="font-weight:400;">, C4 VMs provide the performance and flexibility you need to handle the majority of workloads, all powered by Google’s </span><a href="https://cloud.google.com/titanium?hl=en" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Titanium</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With Titanium offload technology, C4 provides high performance connectivity with up 20 Gbps of networking bandwidth and scalable storage with up to 5</span><a href="https://cloud.google.com/compute/docs/disks/hyperdisks#hyperdisk-extreme" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">00k iops and 10GB throughput on Hyperdisk Extreme</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">C4 instances scale up to 192vCPU and 1.5TB of DDR5 memory and feature the latest generation performance with Intel’s 5th Gen XEON processors. </span></li>
</ul>
<p><i><span style="font-weight:400;">32:42  Matthew – “…the specs on this is outstanding. Like the 20 gigabytes of networking, like they really put a lot into this and it really feels like it’s going to be a good workhorse for people in the future.”</span></i></p>
<p><b>33:19 </b><a href="https://cloud.google.com/blog/products/containers-kubernetes/introducing-new-gke-custom-compute-class-api/" target="_blank" rel="noreferrer noopener"><b>Containers &amp; Kubernetes Your infrastructure resources, your way, with new </b></a><a href="https://cloud.google.com/blog/products/containers-kubernetes/introducing-new-gke-custom-compute-class-api/" target="_blank" rel="noreferrer noopener"><b>GKE custom compute class API</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google is launching a new custom compute class API in GKE.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Imagine that your sales platform is working great, and despite surging demand, your K8 infrastructure is seamlessly adapting to handle the traffic.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">GKE cluster autoscaler is intelligently selecting the best resources from a range of options you’ve defined. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">No pages for being out of resources, or capacity issues. All powered by the custom compute class API.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google is providing you fine-grained control over our infrastructure choices, GKE can now prioritize and utilize a variety of compute and accelerator options based on specific needs ensuring that your apps, including AI workloads, always have the resources they need to thrive. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">GKE custom compute classes maximize obtainability and reliability by providing fall-back compute priorities as a list of candidate node characteristics or statically defined node pools. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This increases the chances of successful autoscaling while giving you control over the resources that get spun up. If your first priority resource is unable to scale up, GKE will automatically try the second priority node selection, and then continue to other lower priorities on the list. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">For example, n2d is preferred, falls back to c2d, then n2d, and then a nodepool. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Scaling events for top-priority nodes may not be available without custom compute classes, so pods land on lower-priority instances and require manual intervention, but with active migration for workloads to preferential node shape is available. </span></li>
</ul>
<p><i><span style="font-weight:400;">34:51  Ryan – “Kubernetes is really complicated, huh?”</span></i></p>
<p><i><span style="font-weight:400;">38:50  Matthew – “I do want to point out that they had to say in this article – because this article has absolutely nothing to do with AI in any way shape or form, but it includes AI workloads because for some reason it wouldn’t have been known. and I actually checked the article because I saw it in the note or show notes, but I literally had to go into the article to be like why is that commentary necessary? Did somebody miss their AI quota for the day so they just threw it in?”</span></i></p>
<p><b>40:21 </b><a href="https://cloud.google.com/blog/products/identity-security/introducing-delayed-destruction-a-new-way-to-protect-your-secrets/" target="_blank" rel="noreferrer noopener"><b>Introducing delayed destruction for Secret Manager, a new way to protect </b></a></p>
<p><a href="https://cloud.google.com/blog/products/identity-security/introducing-delayed-destruction-a-new-way-to-protect-your-secrets/" target="_blank" rel="noreferrer noopener"><b>your secrets</b></a><b>   </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Destroying your secrets just got a lot safer with the new </span><a href="https://cloud.google.com/secret-manager/docs/delay-destruction-of-secret-versions" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">delayed destruction of secret versions</span></a><span style="font-weight:400;"> for </span><a href="https://cloud.google.com/secret-manager/docs/overview%5C" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Secrets Manager</span></a><span style="font-weight:400;">. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This new capability helps to ensure that secret material cannot be erroneously deleted—either by accident or as part of an intended malicious attack. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">While managing secrets and secret versions was possible before, it had some challenges/risks.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Destruction of a secret version is an irreversible step, meaning there is no way to recover your secret once destroyed – nor was there actionable alerting if there was an attempt to destroy any of your critical secrets, which reduces the chance of timely intervention from an administrator. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">With the customizable delay duration, you can prevent immediate destruction of secret versions as well as fire a </span><a href="https://cloud.google.com/secret-manager/docs/event-notifications#events" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">new pub/sub event notification</span></a><span style="font-weight:400;"> that alerts you when a destroy action is attempted. </span></li>
</ul>
<p><i><span style="font-weight:400;">41:13  Ryan – “I mean, this is a good feature. AWS has it by default from the, from the rollout where there’s, takes seven days for a secret to actually go away and you can restore it up until then. The monitoring is the bigger one for me, like being able to configure a notification without trying to like, you know, scout through all the API logs for the delete secret API method. So this is nice. I like that.”</span></i></p>
<p><b>44:09 </b><a href="https://cloud.google.com/blog/products/application-development/run-your-ai-inference-applications-on-cloud-run-with-nvidia-gpus/" target="_blank" rel="noreferrer noopener"><b>Run your AI inference applications on Cloud Run with NVIDIA GPUs</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">You can now run your AI Inference jobs on Cloud Run with NVIDIA GPUs.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">This allows you to perform real-time inference with lightweight open models such as </span><a href="https://ai.google.dev/gemma/?utm_source=keyword&amp;utm_medium=referral&amp;utm_campaign=gemma_cta&amp;utm_content=" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Gemma</span></a><span style="font-weight:400;"> 2B/7B or Meta Llama 3 (8B) or your own custom models. </span></li>
</ul>
<p><i><span style="font-weight:400;">44:33  Ryan – “No, I mean, this is a great example of how to use serverless in the right way, right? These scales down, you’re doing lightweight transactions on those inference jobs. And then you’re not running dedicated hardware or maintaining an environment, which, you know, basically means that you keep warm.”</span></i></p>
<p><b>45:08  </b><a href="https://cloud.google.com/blog/products/serverless/google-cloud-functions-is-now-cloud-run-functions/" target="_blank" rel="noreferrer noopener"><b>Cloud Functions is now Cloud Run functions — event-driven programming </b></a><a href="https://cloud.google.com/blog/products/serverless/google-cloud-functions-is-now-cloud-run-functions/" target="_blank" rel="noreferrer noopener"><b>in one unified serverless platform</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Cloud Functions is now Cloud Run Functions, which is stupid.  This goes beyond a simple name change, though, as they have unified cloud function infrastructure with cloud run, and the developers of cloud function 2nd gen get immediate access to all new cloud run features, including NVIDIA GPUs. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In addition, Google Cloud Function Gen customers have access to all cloud run capabilities, including:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Multi-event triggers</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">High-performance direct VPC egress</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Ability to mount cloud storage volumes (So Justin can run SQL ♥️) </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Google Managed language run times</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Traffic splitting</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Managed Prometheus and OpenTelemetry</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Inference Functions with NVIDIA GPUS</span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">46:56  Justin – “Yeah, I started to wonder why you would just use Cloud Run. Unless you’re getting some automation with Cloud Run functions that I’m not familiar enough with. But the fact that you get all the Cloud Run benefits with Cloud Functions, and if I get some advantage using functions, I guess it’s a win.”</span></i></p>
<p><b>47:57 </b><a href="https://cloud.google.com/blog/products/identity-security/whats-new-in-assured-workloads-enable-updates-and-new-control-packages/" target="_blank" rel="noreferrer noopener"><b>What’s New in Assured Workloads: Enable updates and new control </b></a><a href="https://cloud.google.com/blog/products/identity-security/whats-new-in-assured-workloads-enable-updates-and-new-control-packages/" target="_blank" rel="noreferrer noopener"><b>packages</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Compliance isn’t a one time job, and so Google is releasing several updates to Assured Workloads which helps your organization meet compliance requirements. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Compliance Updates feature, allows you to evaluate if your current assured workloads folder configuration differs from the latest available configuration, and can enable you to upgrade previously created AW folders to the latest. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Expanded regional controls with Assured workloads now in over 30 regions and 20 countries. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Regional controls now support over 50 of the most popular Google Cloud Services (45% more than the year prior)</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">And they now have over 100 new fedramp high authorized services including Vertex AI, Cloud Build and Cloud Run, Cloud Filestore, as well as powerful security controls on their secure by design, secure by default cloud platform such as VPC Service Controls, Cloud Armor, Cloud Load Balancing and reCaptcha. </span></li>
</ul>
<p><i><span style="font-weight:400;">48:36  Justin – “Which means AI is coming to the government.” </span></i></p>
<p><b>50:22 </b><a href="https://cloud.google.com/blog/products/data-analytics/new-managed-service-for-apache-kafka/" target="_blank" rel="noreferrer noopener"><b>Try the new Managed Service for Apache Kafka and take cluster </b></a><a href="https://cloud.google.com/blog/products/data-analytics/new-managed-service-for-apache-kafka/" target="_blank" rel="noreferrer noopener"><b>management off your todo list</b></a><b>  </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Running distributed event processing and storage systems like Apache Kafka can push your ops team to the bring.  There are tons of ways to secure, network and autoscale your clusters. But Google is pleased to now offer you a shortcut with the new Google Cloud Managed Service for Apache Kafka.  This service takes care of the high-stakes, sometimes tedious work of running infra.  This is an alternative to cloud pub/sub. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">You can have Kafka clusters in 10 different VPC networks. </span></li>
</ul>
<p><i><span style="font-weight:400;">51:13  Justin – “There was no mention about region support, which is really what I need out of this service, versus in region support. But if they can make this multi -region over time, I’m sort of in on this one.”</span></i></p>
<p><b>52:57 </b><a href="https://cloud.google.com/blog/products/management-tools/announcing-terraform-google-provider-6-0-0/" target="_blank" rel="noreferrer noopener"><b>Announcing Terraform Google Provider 6.0.0: More Flexibility, Better </b></a><a href="https://cloud.google.com/blog/products/management-tools/announcing-terraform-google-provider-6-0-0/" target="_blank" rel="noreferrer noopener"><b>Control</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Like Azure, Google is also getting a new provider – the 6.00 is now GA, the combined Hashicorp/Google provider team has listened closely to the feedback from customers. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Some of the key notable (but somehow also not very notable) changes</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Opt-out default label “goog-terraform-provisioned” (which isn’t helpful)</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">As a follow up to the addition of provider level default labels in 5.16, this now gives an opt out of the default label.  The tag was added automatically to anything created by the terraform provider.  Previously you had to opt in to the label, now you have to opt out. </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Deletion protection fields added to multiple resources</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Google_domain, google_cloud_run_v2_job, google_cloud_run_v2_service, google_folder and google_project. (Which should have delete protection before this, but what do we know.)   </span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">Allows reducing the suffix length in “name_prefix”.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The max length for the user defined name prefix has increased from 37 characters to 54. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">There is an upgrade guide available and I’m sure more will be coming out. </span></li>
</ul>
</li>
</ul>
<h2><span style="font-weight:400;">Azure</span></h2>
<p><b>55:19 </b><a href="https://azure.microsoft.com/en-us/blog/elevate-your-ai-deployments-more-efficiently-with-new-deployment-and-cost-management-solutions-for-azure-openai-service-including-self-service-provisioned/" target="_blank" rel="noreferrer noopener"><b>Elevate your AI deployments more efficiently with new deployment and cost </b></a><a href="https://azure.microsoft.com/en-us/blog/elevate-your-ai-deployments-more-efficiently-with-new-deployment-and-cost-management-solutions-for-azure-openai-service-including-self-service-provisioned/" target="_blank" rel="noreferrer noopener"><b>management solutions for Azure OpenAI Service including self-service </b></a><a href="https://azure.microsoft.com/en-us/blog/elevate-your-ai-deployments-more-efficiently-with-new-deployment-and-cost-management-solutions-for-azure-openai-service-including-self-service-provisioned/" target="_blank" rel="noreferrer noopener"><b>Provisioned</b></a></p>
<ul>
<li style="font-weight:400;"><a href="https://azure.microsoft.com/en-us/products/ai-services/openai-service" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Azure OpenAI Service</span></a><span style="font-weight:400;">, designed to help their 60,000 plus customers manage their AI deployments is announcing significant updates to make AI more cost efficient and effective. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">So What’s New?</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Self Service Provisioning and Model independent quota requests allowing you to request Provisioned Throughput Units (PTUs) more flexibly and efficiently. This new feature empowers you to manage your Azure OpenAI Service quota deployments independently without relying on support from your account team.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">By decoupling quota requests from specific models, you can now allocate resources based on your immediate needs and adjust as your requirements evolve.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Visibility to service capacity and availability.  Now know in real-time about service capacity in different regions, ensuring that you plan and manage your deployment effectively. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Provisioned hourly pricing and reservations</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Hourly no-commit purchasing</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Monthly and yearly azure reservations for provisioned deployments</span></li>
</ul>
</li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">56:22  Matthew – “These are, while they sound crazy, extremely useful because as soon as like, was it 4.0 came out, we had to go like. Boy them because otherwise we were worried we were locked out of the region. So even though we weren’t using them yet, our accounting was like, make sure you deploy them as soon as you see the announcement that may or may not be coming out in a very, in the next couple of days and, and do the units that you’re going to need for production, even though you, didn’t know what we needed yet.”</span></i></p>
<p><b>58:32 </b><a href="https://techcommunity.microsoft.com/t5/azure-compute-blog/announcing-general-availability-of-attach-amp-detach-of-virtual/ba-p/4215560" target="_blank" rel="noreferrer noopener"><b>Announcing General Availability of Attach &amp; Detach of Virtual Machines on </b></a><a href="https://techcommunity.microsoft.com/t5/azure-compute-blog/announcing-general-availability-of-attach-amp-detach-of-virtual/ba-p/4215560" target="_blank" rel="noreferrer noopener"><b>Virtual Machine Scale Sets</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Azure is thrilled to announce you can attach or detach VMs to and from a Virtual Machine Scale Set (VMSS) with no downtime is GA. This functionality is available for scale sets with Flexible Orchestration Mode with a Fault Domain Count of 1</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Benefits:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Let Azure do the work</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Easy to Scale</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">No Downtime</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Isolated Troubleshooting</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Easily Move VMs</span></li>
</ul>
</li>
<li style="font-weight:400;"><span style="font-weight:400;">And yes – Azure is thrilled. That’s in the announcement. Really. </span></li>
</ul>
<p><i><span style="font-weight:400;">59:10  Matthew – “And this is only for flexible, so if you’re not using flexible, which has other issues already with it, like and you are you have to be in a fault counts, you actually have more than capacity than you need. So there’s very specific ways that you can leverage this.”</span></i></p>
<p><b>1:04:29 </b><a href="https://azure.microsoft.com/en-us/blog/announcing-mandatory-multi-factor-authentication-for-azure-sign-in/" target="_blank" rel="noreferrer noopener"><b>Announcing mandatory multi-factor authentication for Azure sign-in</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Cyberattacks are becoming more frequent and so Microsoft is now forcing you to MFA for all Azure Sign Ons as part of their $20 Billion dollar investment in security. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Starting in October, MFA will be required to sign in to </span><a href="https://portal.azure.com/#home" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Azure Portal</span></a><span style="font-weight:400;">, </span><a href="https://entra.microsoft.com/#home" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Microsoft Entra admin center</span></a><span style="font-weight:400;"> and </span><a href="https://intune.microsoft.com/#home" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Intune Admin center</span></a><span style="font-weight:400;">.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">In early 2025 gradual enforcement for MFA for sign in will occur for the </span><a href="https://azure.microsoft.com/en-us/get-started/azure-portal/mobile-app/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Azure CLI</span></a><span style="font-weight:400;">, </span><a href="https://learn.microsoft.com/en-us/powershell/azure/?view=azps-12.2.0" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Azure Powershell</span></a><span style="font-weight:400;">, </span><a href="https://azure.microsoft.com/en-us/get-started/azure-portal/mobile-app/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Azure Mobile App</span></a><span style="font-weight:400;"> and </span><a href="https://learn.microsoft.com/en-us/devops/deliver/what-is-infrastructure-as-code" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">IaC tools</span></a><span style="font-weight:400;"> will commence. </span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">We have 6 months to stockpile alcohol to help Matthew manage the IaC tools situation. </span></li>
</ul>
</li>
</ul>
<p><i><span style="font-weight:400;">1:06:18  Matthew – “Or you just run your worker nodes inside and use the, whatever they call it, service principal to, which is like an IAM role to handle the authentication for you, which definitely works great with Atlantis.”</span></i></p>
<p><b>1:06:47</b> <a href="https://azure.microsoft.com/en-us/blog/boost-your-ai-with-azures-new-phi-model-streamlined-rag-and-custom-generative-ai-models/" target="_blank" rel="noreferrer noopener"><b>Boost your AI with Azure’s new Phi model, streamlined RAG, and custom </b></a><a href="https://azure.microsoft.com/en-us/blog/boost-your-ai-with-azures-new-phi-model-streamlined-rag-and-custom-generative-ai-models/" target="_blank" rel="noreferrer noopener"><b>generative AI models</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Azure is announcing several updates to help developers quickly create AI solutions with greater choice and flexibility leveraging the </span><a href="https://azure.microsoft.com/en-us/solutions/ai" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Azure AI</span></a><span style="font-weight:400;"> toolchain:</span>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Improvements to the Phi family of models, including a new Mixture of Experts (MoE) model and 20+ languages</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">AI21 Jamba 1.5 Large and Jamba 1.5 on Azure AI models as a service</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Integrated vectorization in Azure AI search to create a streamlined retrieval augmented generation (RAG) pipeline with the integrated data prep and embedding</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Custom generative extraction model in Azure AI Document Intelligence, so you can now extract custom fields for unstructured documents with high accuracy. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The GA of </span><a href="https://learn.microsoft.com/azure/ai-services/speech-service/text-to-speech-avatar/what-is-text-to-speech-avatar" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Text to speech Avatar</span></a><span style="font-weight:400;">, a capability of </span><a href="https://aka.ms/azure-speech" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">Azure AI speech</span></a><span style="font-weight:400;"> service, which brings natural-sounding voices and photorealistic avatars to life, across diverse languages and voices, enhancing customer engagement and overall experience</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">GA of </span><a href="https://marketplace.visualstudio.com/items?itemName=ms-toolsai.vscode-ai" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">VS Code extension for Azure Virtual Machine Learning</span></a><span style="font-weight:400;">.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">The GA of Conversational PII detection Service in Azure AI Language</span></li>
</ul>
</li>
</ul>
<h2><span style="font-weight:400;">Closing</span></h2>
<p><span style="font-weight:400;">And that is the week in the cloud! Visit our website, the home of the Cloud Pod where you can join our newsletter, slack team, send feedback or ask questions at theCloud Pod.net or tweet at us with hashtag #theCloudPod</span></p>
]]>
                </content:encoded>
                                    <enclosure url="https://episodes.castos.com/5e2d2c4b117f29-10227663/1828808/c1e-60w0c2p7gqc5vn19-9j5z806piomo-whulvo.mp3" length="80817026"
                        type="audio/mpeg">
                    </enclosure>
                                <itunes:summary>
                    <![CDATA[Welcome to episode 273 of The Cloud Pod, where the forecast is always cloudy! Hold onto your butts – this week your hosts Justin, Ryan, Matthew and (eventually) Jonathan are bringing you two weeks worth of cloud and AI news. We’ve got Karpenter, Kubernetes, and Secrets, plus news from OpenAI, MFA changes that are going to be super fun for Matthew, and Azure Phi. Get comfy – it’s going to be a doozy!
Titles we almost went with this week:

The Cloud Pod Teaches Azure-normalized Camel Casing
The Cloud Pod Travels to Malaysia
⚖️Azure Detaches Itself From its Own Scale Sets
✍️The Cloud Pod Conditionally Writes Show Notes 
You got MFA!
⛔The Cloud Pod Delays Deleting Itself
The Cloud Pod is Now the Cloud Pod Podcast!

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
General News
01:37 Terraform AzureRM provider 4.0 adds provider-defined functions 

Terraform is announcing the GA of Terraform AzureRM provider 4.0.  The new version improves the extensibility and flexibility in the provider. 
Since the Providers’ Last major release in March 2022, Hashi has added support for some 340 resources and 120 data sources, bringing the total Azure resources to 1,101 resources and almost 360 data sources. 
The provider has topped 660M downloads, MS and Hashi continue to develop new, innovative integrations that further ease the cloud adoption journey to enterprise organizations. 
With Terraform 1.8, providers can implement custom functions that you can call from the Terraform configuration. The new provider adds two Azure-specific provider functions to let users correct the casing of their resource IDs or access the individual components of it. 
Previously, the Azure RM provider took an all-or-nothing approach to Azure resource provider registration, where the Terraform provider would either attempt to register a fixed set of 68 providers upon initialization or registration or be skipped. 
This didn’t match Microsoft’s recommendations, which are to register resource providers only as needed, and to enable the services you’re actively using. 
With adding two new feature flags, ]]>
                </itunes:summary>
                                    <itunes:image href="https://episodes.castos.com/5e2d2c4b117f29-10227663/images/1828808/c1a-k5d5-v61mr201uqn4-h8jdko.jpg"></itunes:image>
                                                                            <itunes:duration>01:07:21</itunes:duration>
                                                    <itunes:author>
                    <![CDATA[Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News]]>
                </itunes:author>
                            </item>
                    <item>
                <title>
                    <![CDATA[272: AI: Now with JSON Schemas!]]>
                </title>
                <pubDate>Sat, 24 Aug 2024 14:16:08 +0000</pubDate>
                <dc:creator>Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn | Cloud Computing &amp; AI News</dc:creator>
                <guid isPermaLink="true">
                    https://permalink.castos.com/podcast/7740/episode/1820812</guid>
                                    <link>https://tcpfm.castos.com/episodes/272-ai-now-with-json-schemas</link>
                                <description>
                                            <![CDATA[<p><span style="font-weight:400;">Welcome to episode 272 of The Cloud Pod! This week, Matthew and Justin are bringing you all the latest in cloud and AI news, including new updates to the ongoing Crowdstrike drama, JSON schemas, AWS vaults, and IPv6 addresses – even some hacking opportunities! All this and more, this week in the cloud. </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">️The cloud pod is now logically air-gapped</span></li>
<li><span style="font-weight:400;">The Cloud Pod has continuous snark</span></li>
<li><span style="font-weight:400;">The Cloud Pod points the finger at delta</span></li>
<li><span style="font-weight:400;">AI now with JSON SCHEMAS!!! </span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><span style="font-weight:400;">Follow Up</span></h2>
<p><b>00:35 </b><a href="https://www.crowdstrike.com/wp-content/uploads/2024/08/Channel-File-291-Incident-Root-Cause-Analysis-08.06.2024.pdf" target="_blank" rel="noreferrer noopener"><b>Crowdstrike RCA</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">The final RCA is out from Crowdstrike, and as we talked during the </span><a href="https://www.crowdstrike.com/blog/falcon-content-update-preliminary-post-incident-report/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">preliminary report</span></a><span style="font-weight:400;">, this was an issue with a channel file that had 21 input parameters. No update previously had more than 20, and it was not caught in earlier testing. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Crowdstrike has several findings, and mitigating actions that they are taking. They go into detail on each of them, and you can read through all of them at the linked </span><a href="https://www.crowdstrike.com/wp-content/uploads/2024/08/Channel-File-291-Incident-Root-Cause-Analysis-08.06.2024.pdf" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">document</span></a><span style="font-weight:400;">. </span></li>
</ul>
<p><i><span style="font-weight:400;">02:31  Justin – “…the one thing I would say is this would be a perfect RCA if it included a timeline, but it lacks, it lacks a timeline view.”</span></i></p>
<p><i><span style="font-weight:400;">12:06  Justin – “…their mitigations don’t have any dates on them of when they’re going to be done or implemented, which, in addition to a timeline, it would be nice to see in this process.”</span></i></p>
<p><b>15:46</b> <a href="https://www.ciodive.com/news/microsoft-delta-crowdstrike-letter-IT/723507/" target="_blank" rel="noreferrer noopener"><b>Microsoft joins CrowdStrike in pushing IT outage recovery responsibility </b></a></p>
<p><a href="https://www.ciodive.com/news/microsoft-delta-crowdstrike-letter-IT/723507/" target="_blank" rel="noreferrer noopener"><b>back to Delta</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft has joined Crowdstrike in throwing Delta under the bus. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Delta Airlines has been blaming Crowdstrike and MS for their recent IT woes, which the company claims cost them over $</span><a href="https://www.ciodive.com/news/delta-crowdstrike-outage-costs/722970/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">500 million</span></a><span style="font-weight:400;">.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft says “</span><i><span style="font-weight:400;">Our preliminary review suggests that Delta, unlike its competitors, has not modernized its IT infrastructure, either for t...</span></i></li></ul>]]>
                                    </description>
                <itunes:subtitle>
                    <![CDATA[Welcome to episode 272 of The Cloud Pod! This week, Matthew and Justin are bringing you all the latest in cloud and AI news, including new updates to the ongoing Crowdstrike drama, JSON schemas, AWS vaults, and IPv6 addresses – even some hacking opportunities! All this and more, this week in the cloud. 
Titles we almost went with this week:

️The cloud pod is now logically air-gapped
The Cloud Pod has continuous snark
The Cloud Pod points the finger at delta
AI now with JSON SCHEMAS!!! 

A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. 
Follow Up
00:35 Crowdstrike RCA

The final RCA is out from Crowdstrike, and as we talked during the preliminary report, this was an issue with a channel file that had 21 input parameters. No update previously had more than 20, and it was not caught in earlier testing. 
Crowdstrike has several findings, and mitigating actions that they are taking. They go into detail on each of them, and you can read through all of them at the linked document. 

02:31  Justin – “…the one thing I would say is this would be a perfect RCA if it included a timeline, but it lacks, it lacks a timeline view.”
12:06  Justin – “…their mitigations don’t have any dates on them of when they’re going to be done or implemented, which, in addition to a timeline, it would be nice to see in this process.”
15:46 Microsoft joins CrowdStrike in pushing IT outage recovery responsibility 
back to Delta

Microsoft has joined Crowdstrike in throwing Delta under the bus. 
Delta Airlines has been blaming Crowdstrike and MS for their recent IT woes, which the company claims cost them over $500 million.
Microsoft says “Our preliminary review suggests that Delta, unlike its competitors, has not modernized its IT infrastructure, either for t...]]>
                </itunes:subtitle>
                                    <itunes:episodeType>full</itunes:episodeType>
                                <itunes:title>
                    <![CDATA[272: AI: Now with JSON Schemas!]]>
                </itunes:title>
                                    <itunes:episode>272</itunes:episode>
                                                <itunes:explicit>false</itunes:explicit>
                <content:encoded>
                    <![CDATA[<p><span style="font-weight:400;">Welcome to episode 272 of The Cloud Pod! This week, Matthew and Justin are bringing you all the latest in cloud and AI news, including new updates to the ongoing Crowdstrike drama, JSON schemas, AWS vaults, and IPv6 addresses – even some hacking opportunities! All this and more, this week in the cloud. </span></p>
<h3><b>Titles we almost went with this week:</b></h3>
<ul>
<li><span style="font-weight:400;">️The cloud pod is now logically air-gapped</span></li>
<li><span style="font-weight:400;">The Cloud Pod has continuous snark</span></li>
<li><span style="font-weight:400;">The Cloud Pod points the finger at delta</span></li>
<li><span style="font-weight:400;">AI now with JSON SCHEMAS!!! </span></li>
</ul>
<h3><b>A big thanks to this week’s sponsor:</b></h3>
<h3><span style="font-weight:400;">We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our slack channel for more info. </span></h3>
<h2><span style="font-weight:400;">Follow Up</span></h2>
<p><b>00:35 </b><a href="https://www.crowdstrike.com/wp-content/uploads/2024/08/Channel-File-291-Incident-Root-Cause-Analysis-08.06.2024.pdf" target="_blank" rel="noreferrer noopener"><b>Crowdstrike RCA</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">The final RCA is out from Crowdstrike, and as we talked during the </span><a href="https://www.crowdstrike.com/blog/falcon-content-update-preliminary-post-incident-report/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">preliminary report</span></a><span style="font-weight:400;">, this was an issue with a channel file that had 21 input parameters. No update previously had more than 20, and it was not caught in earlier testing. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Crowdstrike has several findings, and mitigating actions that they are taking. They go into detail on each of them, and you can read through all of them at the linked </span><a href="https://www.crowdstrike.com/wp-content/uploads/2024/08/Channel-File-291-Incident-Root-Cause-Analysis-08.06.2024.pdf" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">document</span></a><span style="font-weight:400;">. </span></li>
</ul>
<p><i><span style="font-weight:400;">02:31  Justin – “…the one thing I would say is this would be a perfect RCA if it included a timeline, but it lacks, it lacks a timeline view.”</span></i></p>
<p><i><span style="font-weight:400;">12:06  Justin – “…their mitigations don’t have any dates on them of when they’re going to be done or implemented, which, in addition to a timeline, it would be nice to see in this process.”</span></i></p>
<p><b>15:46</b> <a href="https://www.ciodive.com/news/microsoft-delta-crowdstrike-letter-IT/723507/" target="_blank" rel="noreferrer noopener"><b>Microsoft joins CrowdStrike in pushing IT outage recovery responsibility </b></a></p>
<p><a href="https://www.ciodive.com/news/microsoft-delta-crowdstrike-letter-IT/723507/" target="_blank" rel="noreferrer noopener"><b>back to Delta</b></a></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft has joined Crowdstrike in throwing Delta under the bus. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Delta Airlines has been blaming Crowdstrike and MS for their recent IT woes, which the company claims cost them over $</span><a href="https://www.ciodive.com/news/delta-crowdstrike-outage-costs/722970/" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;">500 million</span></a><span style="font-weight:400;">.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Microsoft says “</span><i><span style="font-weight:400;">Our preliminary review suggests that Delta, unlike its competitors, has not modernized its IT infrastructure, either for the benefit of its customers or for its pilots and flight attendants</span></i><span style="font-weight:400;">” Mark Cheffo from law firm Dechert representing MS. </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Gonna get ugly before this all gets settled. *Insert Michael Jackson eating popcorn gif here*</span></li>
</ul>
<p><i><span style="font-weight:400;">16:43  Justin – “The struggle with, you know, offering to send someone on site to help you is, you know, you, you can’t vet them that quickly. And so you also have an obligation to your shareholders. You have obligations to your security controls and your SOC and ISO and all the things that you’re doing, you know, to, to allow some strangers into your network and then give them access required to fix this issue, which in some cases required you to provide local encryption keys, and local administrator passwords, like you’re, you’re basically saying, you know, here’s the keys. Cause we’re in a, you know, everything’s in crisis and we’re going to throw security out the window to allow these people to come in and touch my environment to get us back up and running. I could see, I can see the argument both ways.”</span></i></p>
<h2><span style="font-weight:400;">AI Is Going Great – Or How ML Makes All It’s Money </span></h2>
<p><b>20:16  </b><a href="https://www.theinformation.com/briefings/anthropic-offers-15-000-to-break-new-ai-safety-system?rc=3t8xtd" target="_blank" rel="noreferrer noopener"><b>Anthropic Offers $15,000 to Break New AI Safety System</b></a><b> </b></p>
<ul>
<li style="font-weight:400;"><span style="font-weight:400;">With Defcon occurring this week, Anthropic is poking the hackers, offering up to $15,000 for “jailbreaks” that bypass the Anthropic AI safeguard and elicit prohibited content from the Claude chatbots.</span></li>
<li style="font-weight:400;"><span style="font-weight:400;">By</span><a href="https://www.anthropic.com/news/model-safety-bug-bounty" target="_blank" rel="noreferrer noopener"><span style="font-weight:400;"> inviting outside researchers</span></a><span style="font-weight:400;"> to test the models, Anthropic is hoping to identify problems the company couldn’t find on its own.  </span></li>
<li style="font-weight:400;"><span style="font-weight:400;">Anthropic is hoping to attract hacker groups who post jailbreaks on Twitter to recruit for the program.</span></li>
</ul>
<p><b>21:14</b> <a href="https://www.databricks.com